i’m trying to read a file into my python program and apply tokenizer on

Question

0

Asked: May 26, 20262026-05-26T13:25:03+00:00 2026-05-26T13:25:03+00:00

i’m trying to read a file into my python program and apply tokenizer on

0

i’m trying to read a file into my python program and apply tokenizer on it to split the text into a set of sentences. However, in my output i’m getting the ‘/n’ character that i’d like to avoid in the output, as it might hinder my further processes on the sentences.
I read the input using the read() command. Also tried readline(). i’m still getting the newline characters on my output. Any suggestions on avoiding this?

file_sent = open(path,'r')
all_sents = file_sent.read()
sent_all = print all_sents
tokenized_sents = sent_tokenize(sent_all)

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-26T13:25:04+00:00

Editorial Team

2026-05-26T13:25:04+00:00Added an answer on May 26, 2026 at 1:25 pm

If you want to remove the newlines entirely:

all_sents = file_sent.read().replace('\n', '')

If you want to replace them with spaces:

all_sents = file_sent.read().replace('\n', ' ')

Obviously you could replace them with something else if you wanted.

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

i’m trying to read a file into my python program and apply tokenizer on

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply