Ofcourse you can do it with Java Swings. All you…

Question

0

Editorial Team

Asked: May 13, 20262026-05-13T21:09:32+00:00 2026-05-13T21:09:32+00:00

Can anyone point me towards a ready made RSS screen scraper, preferably in Python

0

Can anyone point me towards a ready made RSS screen scraper, preferably in Python in order to get full text RSS feeds?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-13T21:09:32+00:00

Sorry but it doesn’t exist in python, though they do in php. You are more then welcome to use and improve the one I made named scraped. Though it does not do all sites, it is a recipe based system that currently only handles the NYT, WSJ and the Economist. I am working on an all inclusive algorithm, but its a major undertaking. It includes a ton of analysis to the different types of html and xml. Even the 3 sites mentioned above, have vastly different algorithms on how to scrape their sites WSJ being the most complex by far. They screw their HTML up with so much useless crap, mainly to just stop you.

Here is the program I was talking about, it requires lxml but it explains everything in the readme. It reads the config files, parses partial rss feeds, takes links and then scrapes those links, formulating in the end a RSS 2.0 xml file. Which I mainly convert into a ebook for my kindle. I utilize lxml, BeautifulSoup and feedparser.

http://tinyurl.com/yh3s9pa

You can also look at the calibre project, which uses a similar method to the way I do it, on recipes.

How to approach applying for a job at a company ...

What is a programmer’s life like?

How to handle personal stress caused by utterly incompetent and ...

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions