I want to use wikipedia dump for my project. The below information is required for my project.
- For an wikipedia entry, I want to know which other language contain the page?
- I want an downloadable data in csv or other common format.
Is there a way to get this data?
Thanks
Bala
The Wikimedia foundation provides XML dumps of all of its projects, including the English language Wikipedia.
Parsing an English-language wiki article for inter-language links is fairly easy: the syntax for such links are
[[language_code:Name of other language Wikipedia article]], where language_code is usually a two or three letter code (such astlhfor Klingon), based on an ISO standard except for a few exceptions, such assimplefor Simple English.