I want to remove stop words. Here is my code import nltk from nltk.corpus

Question

0

Asked: June 12, 20262026-06-12T11:55:51+00:00 2026-06-12T11:55:51+00:00

I want to remove stop words. Here is my code import nltk from nltk.corpus

0

I want to remove stop words. Here is my code

import nltk
from nltk.corpus import stopwords
import string

u="The apple is the pomaceous fruit of the apple tree, species Malus domestica in the rose family (Rosaceae). It is one of the most widely cultivated tree fruits, and the most widely known of the many members of genus Malus that are used by humans."

v="An orange is a fruit of the orangle tree. it is the most cultivated tree fruits"

u=u.lower()
v=v.lower()

u_list=nltk.word_tokenize(u)
v_list=nltk.word_tokenize(v)

for word in u_list:
    if word in stopwords.words('english'):
        u_list.remove(word)
for word in v_list:
    if word in stopwords.words('english'):
        v_list.remove(word)

print u_list
print "\n\n\n\n"
print v_list

But only some stop words are removed. Please help me with this

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-12T11:55:52+00:00

Editorial Team

2026-06-12T11:55:52+00:00Added an answer on June 12, 2026 at 11:55 am

The problem with what you are doing is list.remove(x) only removes the first occurrence of x, not every x. To remove every instance, you could use filter, but I would opt for something like this:

u_list = [word for word in u_list if word not in stopwords.words('english')]

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I want to remove stop words. Here is my code import nltk from nltk.corpus

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply