I want multi-line strings in Java, so I'm looking for a simple preprocessor that converts C-style multi-line strings into single lines with a literal ‘\n’.
Before:
System.out.println("convert trailing backslashes\
this is on another line\
\
\
above are two blank lines\
But don't convert non-trailing backslashes, like: \"\t\" and \'\\\'");
After:
System.out.println("convert trailing backslashes\nthis is on another line\n\n\nabove are two blank lines\nBut don't convert non-trailing backslashes, like: \"\t\" and \'\\\'");
I thought sed would do it well, but sed is line-based, so replacing the ‘\’ and the newline that follows it (effectively joining the two lines) is not very natural in sed. I adapted sredden79’s one-liner to the following – it works, it’s clever, but it’s not clear:
sed ':a { $!N; s/\\\n/\\n/; ta }'
The substitution replaces an escaped literal backslash followed by a newline with an escaped literal backslash followed by n. :a is a label, and ta jumps to that label if the substitution found a match; $ means the last line, and $! is the opposite (i.e. all lines but the last). N appends the next line to the pattern space (thus making the \n character visible).
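For comparison, the same substitution can be written without sed's pattern-space loop. This is only a sketch in Python, not part of the original one-liner, and the function name join_continuations is made up for illustration. It works on the whole text at once, so it does not depend on how N pairs lines up in the pattern space. Like the sed version, it rewrites every backslash-newline pair, whether or not the pair sits inside a string literal.

```python
def join_continuations(source: str) -> str:
    """Replace each backslash-newline pair with the two characters \\ and n."""
    # "\\\n" is a backslash followed by a real newline character;
    # "\\n" is a backslash followed by the letter n.
    return source.replace("\\\n", "\\n")


if __name__ == "__main__":
    # Hypothetical input: a Java statement whose string spans several lines.
    before = 'System.out.println("first\\\nsecond\\\n\\\nlast: \\"\\t\\"");\n'
    print(join_continuations(before), end="")
```

Non-trailing backslashes such as \" and \t are left alone because they are not followed by a newline.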
EDIT: Here’s a variation that keeps compiler error line numbers etc. accurate: it turns each extended line into "..."+\n (and handles the first and last lines of the String correctly):
sed ':a { $!N; s/\\\n/\\n"+\n"/; ta }'
giving:
System.out.println("convert trailing backslashes\n"+
"this is on another line\n"+
"\n"+
"\n"+
"above are two blank lines\n"+
"But don't convert non-trailing backslashes, like: \"\t\" and \'\\\'");
EDIT: Actually, it would be better to have Perl/Python-style multi-line strings, which start and end with a special marker on a line of its own (""" for Python, I think).
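One way such a preprocessor might look is sketched below in Python. The syntax it accepts is an assumption made for this example, not an existing Java or sed feature: a block opens on a line that ends with """ and closes on a line that begins with """. The function name convert_triple_quoted is likewise hypothetical. Each content line becomes its own "..."+ line, in the same shape as the line-number-preserving variation above, so the output has exactly as many lines as the input.

```python
def convert_triple_quoted(source: str) -> str:
    """Rewrite assumed triple-quoted blocks as concatenated Java string literals."""
    out = []
    inside = False
    for line in source.split("\n"):
        if not inside and line.rstrip().endswith('"""'):
            # Opening line: swap the marker for an empty literal plus '+'.
            out.append(line.rstrip()[:-3] + '""+')
            inside = True
        elif inside and line.lstrip().startswith('"""'):
            # Closing line: end the concatenation with an empty literal.
            out.append('""' + line.lstrip()[3:])
            inside = False
        elif inside:
            # Content line: escape backslashes and quotes, then add a literal \n.
            escaped = line.replace("\\", "\\\\").replace('"', '\\"')
            out.append('"' + escaped + '\\n"+')
        else:
            out.append(line)
    return "\n".join(out)


if __name__ == "__main__":
    demo = 'String s = """\nhe said "hi"\nsecond line\n""";\n'
    print(convert_triple_quoted(demo), end="")
```

Because content lines are taken verbatim, backslashes and double quotes inside the block are escaped rather than interpreted. A real tool would have to decide whether that is the desired behaviour.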
Is there a simpler, saner, clearer way (maybe not using sed)?
A Perl one-liner:
This will read either stdin or the file(s) named after it on the command line and write the output to stdout.
If you’re using an editor that supports filtering, like vi or emacs, just filter your text through the above command and you’re done:
If you’re using Windows and have to worry about \r:
although I think win32 Perl handles \r itself, so this may be unnecessary.
The -0777 option is a special case of the -0 (that’s a zero) option that defines the line or record separator. In this case, it means that we don’t want any separator, so the entire file is read in as a single string.
The -pe option is a combination of -p (process line-by-line and print the result) and -e (the next argument is (a line of) the program to execute).
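To make the slurp idea behind -0777 concrete, here is a rough Python analogue: read the entire input as one string, apply a single substitution, and write the result. It is not the Perl one-liner itself. filter_stream and CONTINUATION are names chosen for this sketch, and the optional \r in the pattern is an assumption about how Windows line endings might be tolerated.

```python
import io
import re
import sys

# A backslash, an optional carriage return, then a newline.
CONTINUATION = re.compile(r"\\\r?\n")


def filter_stream(reader, writer) -> None:
    """Slurp all of reader, join continuation lines, and write the result."""
    text = reader.read()  # whole input as one string, like perl -0777
    # The replacement template r"\\n" produces a backslash followed by 'n'.
    writer.write(CONTINUATION.sub(r"\\n", text))


if __name__ == "__main__":
    # Demo on an in-memory stream; pass sys.stdin instead to use it as a filter.
    sample = io.StringIO('print("one\\\ntwo\\\n\\\nthree");\n')
    filter_stream(sample, sys.stdout)
```

Calling filter_stream(sys.stdin, sys.stdout) would turn this into a stdin-to-stdout filter that can be used from an editor in the same way as the Perl command.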