I am working on a project for which I need the names of all Wikipedia articles (I don’t need the content). Is there a place where I can download this data?
Check out this page here on Wikipedia – there is an option to just download an archive with the names of the articles. Here’s the actual path to the download page:
Edit:
You may notice non-English titles (and some profanity, be advised) in the list contained in enwiki-latest-all-titles-in-ns0.gz. This is because most people create content on the main English wiki (language code en) by default. If you investigate other language dumps, you will see that they contain different sets of articles.
Reading the main download page, there are references to using the Wikipedia API to perform some types of queries on Wikipedia, but I’m not sure this will resolve your problem (the taxonomy of the pages doesn’t seem to provide a simple way to differentiate “English” content from “content on English wiki”).
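Once the archive is downloaded, reading the titles could look roughly like the sketch below. It assumes the file is gzipped UTF-8 text with one title per line, underscores in place of spaces, and possibly a first line reading page_title; check your copy before relying on that. The names iter_titles and is_ascii_title are made up for this example, and the ASCII check is only a crude heuristic for skipping non-English titles: it also drops legitimate English-wiki titles that contain accented characters.

```python
import gzip
from typing import Iterator


def iter_titles(path: str, skip_header: bool = True) -> Iterator[str]:
    """Yield article titles from a gzipped, one-title-per-line dump file.

    Assumption: titles use underscores for spaces, and the first line
    may be a header reading "page_title".
    """
    with gzip.open(path, mode="rt", encoding="utf-8", errors="replace") as handle:
        for line_number, line in enumerate(handle):
            title = line.rstrip("\n")
            # Skip the header only if it is actually present on the first line.
            if skip_header and line_number == 0 and title == "page_title":
                continue
            if title:
                yield title.replace("_", " ")


def is_ascii_title(title: str) -> bool:
    """Crude heuristic (an assumption, not a language detector):
    treat ASCII-only titles as likely English."""
    return title.isascii()
```

For example, `sum(1 for _ in iter_titles("enwiki-latest-all-titles-in-ns0.gz"))` counts the titles without loading the whole file into memory, and `(t for t in iter_titles(path) if is_ascii_title(t))` applies the rough filter.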