Doh! My designer had a single tag wrapping each group…

Question

0

Asked: May 12, 20262026-05-12T12:32:49+00:00 2026-05-12T12:32:49+00:00

I have a CSV-like text file that has about 1000 lines. Between each record

0

I have a CSV-like text file that has about 1000 lines. Between each record in the file is a long series of dashes. The records generally end with a \n, but sometimes there is an extra \n before the end of the record. Simplified example:

"1x", "1y", "Hi there"
-------------------------------
"2x", "2y", "Hello - I'm lost"
-------------------------------
"3x", "3y", "How ya
doing?"
-------------------------------

I want to replace the extra \n’s with spaces, i.e. concatenate the lines between the dashes. I thought I would be able to do this (Python 2.5):

text = open("thefile.txt", "r").read()    
better_text = re.sub(r'\n(?!\-)', ' ', text)

but that seems to replace every \n, not just the ones that are not followed by a dash. What am I doing wrong?

I am asking this question in an attempt to improve my own regex skills and understand the mistakes that I made. The end goal is to generate a text file in a format that is usable by a specific VBA for Word macro that generates a styled Word document which will then be digested by a Word-friendly CMS.

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-12T12:32:50+00:00

Editorial Team

2026-05-12T12:32:50+00:00Added an answer on May 12, 2026 at 12:32 pm

You need to exclude the line breaks at the end of the separating lines. Try this:

\n(?<!-\n)(?!-)

This regular expression uses a negative look-behind assertion to exclude \n that’s preceeded by an -.

0

Reply
Share
Share

- Report

How to approach applying for a job at a company ...

What is a programmer’s life like?

How to handle personal stress caused by utterly incompetent and ...

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions