Has anyone integrated BeautifulSoup with ASP.NET/C# (possibly using IronPython or otherwise)?
Is there a BeautifulSoup alternative or a port that works nicely with ASP.NET/C#?
I plan to use the library to extract readable text from an arbitrary URL.
Thanks
Html Agility Pack is a similar project, but for C# and .NET.
EDIT:
To extract all readable text:
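The original snippet seems to have been lost here, so the following is only a minimal sketch of the idea, assuming the HtmlAgilityPack NuGet package is installed. The URL is a placeholder, not something from the original post.

```csharp
using System;
using HtmlAgilityPack; // NuGet package: HtmlAgilityPack

class Program
{
    static void Main()
    {
        // Placeholder URL (assumption); substitute the page you want to read.
        var web = new HtmlWeb();
        HtmlDocument doc = web.Load("https://example.com/");

        // InnerText on the root node concatenates the text of all descendant nodes.
        string text = doc.DocumentNode.InnerText;
        Console.WriteLine(text);
    }
}
```

If you already have the markup as a string, `doc.LoadHtml(html)` on a new `HtmlDocument` should work in place of `HtmlWeb.Load`.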
Note that this will also return the text content of <script> tags. To fix that, you can remove all of the <script> tags, like this (credit: SLaks):
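The code for this step also appears to be missing, so here is a hedged sketch of what removing the tags might look like with Html Agility Pack. The sample markup is made up for illustration, and also stripping `<style>` elements is an extra assumption beyond what the text above mentions.

```csharp
using System;
using System.Linq;
using HtmlAgilityPack; // NuGet package: HtmlAgilityPack

class Program
{
    static void Main()
    {
        // Made-up sample markup (assumption), loaded from a string to keep the sketch offline.
        var doc = new HtmlDocument();
        doc.LoadHtml("<html><head><style>p { color: red; }</style></head>" +
                     "<body><p>Hello</p><script>var x = 1;</script></body></html>");

        // ToArray() takes a snapshot so nodes can be removed while looping.
        foreach (var script in doc.DocumentNode.Descendants("script").ToArray())
            script.Remove();

        // Assumption: CSS text is usually unwanted too, so drop <style> as well.
        foreach (var style in doc.DocumentNode.Descendants("style").ToArray())
            style.Remove();

        // With the script and style nodes gone, InnerText only covers the remaining markup.
        Console.WriteLine(doc.DocumentNode.InnerText);
    }
}
```

The result may still contain HTML entities such as `&amp;`; `HtmlEntity.DeEntitize(text)` can decode them if plain text is needed.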