as titled, am trying to build a small application that will aggregate RSS from different blogs. Am trying to test out and explore feedparser for this operation, am stuck though trying to write a peace of code that would detect the rss feed.
Most people would just enter http://www.mysite.com/blog which is not exactly the URL to the RSS feed. If there a way for me to detect the RSS feed, am trying to replicate the browser behavior where it can see the RSS URL.
any ideas?
Browsers use RSS feed auto-discovery and Atom feed auto-discovery to find feeds on a given web page.
For example, the django question lists are available via an Atom feed which is linked in the HTML header of the associated pages with:
You’ll need to parse out the
<link rel="alternate">tags in a given page to discover these; anything with anapplication/atom+xmlorapplication/rss+xmltype fits.