Im new to Python NLTK and really need your advise. I want to open

Question

0

Asked: June 11, 20262026-06-11T23:34:04+00:00 2026-06-11T23:34:04+00:00

Im new to Python NLTK and really need your advise. I want to open

0

Im new to Python NLTK and really need your advise.
I want to open my own txt file and do some preprocessing like replacing words with its regex.
I’ve tried to do it as in NLTK 2.0 Cookbook

import re
replacement_patterns = [
        (r'won\'t', 'will not'),
        (r'can\'t', 'cannot'),
        (r'i\'m', 'i am'),
        (r'ain\'t', 'is not'),
        (r'(\w+)\'ll', '\g<1> will'),
        (r'(\w+)n\'t', '\g<1> not'),
        (r'(\w+)\'ve', '\g<1> have'),
        (r'(\w+t)\'s', '\g<1> is'),
        (r'(\w+)\'re', '\g<1> are'),
        (r'(\w+)\'d', '\g<1> would'),
]
class RegexpReplacer(object):
    def __init__(self, patterns=replacement_patterns):
                self.patterns = [(re.compile(regex), repl) for (regex, repl) in patterns]

    def replace(self, line):
                s = line

                for (pattern, repl) in self.patterns:
                        (s, count) = re.subn(pattern, repl, s)

                return s

it works perfect but how can I use it with my txt file?
I’ve tried to do my own way but I think its wrong

    import nltk
f=open("C:/nltk_data/file.txt", "rU")
raw=f.readlines()
from replacers import RegexpReplacer
replacer=RegexpReplacer()
replacer.replace(raw)

thx in advance!!!

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-11T23:34:05+00:00

Editorial Team

2026-06-11T23:34:05+00:00Added an answer on June 11, 2026 at 11:34 pm

I think you want to use the read method to read all the file contents into a string first.

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

Im new to Python NLTK and really need your advise. I want to open

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply