I have a list of strings from which I need to remove all elements

Question

0

Asked: May 31, 20262026-05-31T06:49:46+00:00 2026-05-31T06:49:46+00:00

I have a list of strings from which I need to remove all elements

0

I have a list of strings from which I need to remove all elements that match a substring from another list. I am trying to do this with lists, nested loops, and regex.

The output from the following snippet produces [“We don’t”, “need no”, “education”] instead of the desired [“education”]. I’m new to Python and this is my first experiment with regex, and I’m stuck on the sytax.

import re

testfile = ["We don't", "need no", "education"]
stopwords = ["We", "no"]
dellist = []

for x in range(len(testfile)):
    for y in range(len(stopwords)):
        if re.match(r'\b' + stopwords[y] + '\b', testfile[x], re.I):
            dellist.append(testfile[x])

for x in range(len(dellist)):
    if dellist[x] in testfile:
        del testfile[testfile.index(dellist[x])]

print testfile

The line

if re.match(r'\b' + stopwords[y] + '\b', testfile[x], re.I):

returns “None” for all iterations through the loop, so I’m guessing this is where my problem lies…

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-31T06:49:47+00:00

It’s because re.match tests for a match from the start of the string.

Try re.search instead. Also, you’re missing the r on your second '\b':

if re.search(r'\b' + stopwords[y] + r'\b', testfile[x], re.I):

Also, you could just use list comprehension to build up dellist (you could probably use list comprehension to build up the new testfile entirely, but it escapes me at the moment):

dellist = [w for w in testfile for test in stopwords if re.search(test,w,re.I)]

Another thought – since you’re using re module anyway, why don’t you combine your stopwords into \b(We|no)\b and then you can just test testfile against the one regex?

regex = r'\b(' + '|'.join(stopwords) + r')\b'  # r'\b(We|no)\b'

Now you just have to look for words that don’t match that regex:

newtestfile = [w for w in testfile if re.search(regex,w,re.I) is None]
# newtestfile is ['education']

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I have a list of strings from which I need to remove all elements

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply