Is there a way to programmatically list all geo-tagged Wikipedia entries within a radius of a long/lat point? I’m thinking this is possible with the google maps API but I am interested in any method. NOTE: I do not want to display a googlemap.
Is there a way to programmatically list all geo-tagged Wikipedia entries within a radius
Share
Yes, it’s possible. The hard part is either:
(1) Screen-scraping Wikipedia (bad idea, unless you already have a (small) list of target pages)
(2) Downloading and parsing the massive Wikipedia data sets (better idea)
Once you have lat/long coordinates, which I assume are in the wiki page’s geotag format, you can use the great circle formula to compute relative distances, and bypass Google’s API entirely.
The moral of this story? When you’ve dealing with datasets this massive, you’re going to want to do as much of it offline as possible.