I’m building a website in django that needs to extract key words from short

Question

0

Asked: May 14, 20262026-05-14T01:59:29+00:00 2026-05-14T01:59:29+00:00

I’m building a website in django that needs to extract key words from short

0

I’m building a website in django that needs to extract key words from short (twitter-like) messages.

I’ve looked at packages like topia.textextract and nltk – but both seem to be overkill for what I need to do. All I need to do is filter words like “and”, “or”, “not” while keeping nouns and verbs that aren’t conjunctives or other parts of speech. Are there any “simpler” packages out there that can do this?

EDIT: This needs to be done in near real-time on a production website, so using a keyword extraction service seems out of the question, based on their response times and request throttling.

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-14T01:59:29+00:00

You can make a set sw of the “stop words” you want to eliminate (maybe copy it once and for all from the stop words corpus of NLTK, depending how familiar you are with the various natural languages you need to support), then apply it very simply.

E.g., if you have a list of words sent that make up the sentence (shorn of punctuation and lowercased, for simplicity), [word for word in sent if word not in sw] is all you need to make a list of non-stopwords — could hardly be easier, right?

To get the sent list in the first place, using the re module from the standard library, re.findall(r'\w+', sentstring) might suffice if sentstring is the string with the sentence you’re dealing with — it doesn’t lowercase, but you can change the list comprehension I suggest above to [word for word in sent if word.lower() not in sw] to compensate for that and (btw) keep the word’s original case, which may be useful.

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I’m building a website in django that needs to extract key words from short

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply