Here is my string: String str = <pre>LVI . The Day of Battle

Question

0

Asked: May 30, 20262026-05-30T17:39:32+00:00 2026-05-30T17:39:32+00:00

Here is my string: String str = <pre>LVI . The Day of Battle

0

Here is my string:

String str = "<pre><font size="5"><strong><u>LVI . The Day of Battle</u></strong></font>        
<font
size="4"><strong>";

I want to remove all html tags in a string with using StringTokenizer. But I don’t understand how to use StringTokenizer for this situation. Because when I use str.replaceAll("\\<.*?>",""), it is not efficient to remove all tags because some tags will be on the next line of string, as seen the string above. But I want to do it for all situations between < and >. How can I do it? (I want to achieve it using StringTokenizer). Thanks..

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-30T17:39:34+00:00

Editorial Team

2026-05-30T17:39:34+00:00Added an answer on May 30, 2026 at 5:39 pm

Trying to process HTML with regexes or StringTokenizer alone is… painful.

This answer is compulsory reading before you go any further.

If your HTML files are simple, you might get away with removing the newlines, then applying a regex, then reformatting the HTML – or try multiline regexes.

But you should really look at using a proper HTML parser. See this question (and probably many others…)

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

Here is my string: String str = <pre><font size=5><strong><u>LVI . The Day of Battle</u></strong></font>

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply