Question

Asked: May 13, 2026 (2026-05-13T12:01:02+00:00)

I want to parse HTML code and create objects from its text representation

I want to parse HTML code and create objects from the text representation of a table. The table has several columns, and I want to save the content of certain columns for every row.
I already have the HTML code, and I understand I should use Pattern and Matcher to get those strings, but I don’t know how to write the required regular expression.

This is a row I will be parsing:

<tr><td><a href="delirium.htm">Delirium</a></td><td>65...</tr>

So, I want to extract Delirium from that string. How do I write a regular expression that says:

get me the string that is between the string htm"> and </a></td>?

1 Answer

Editorial Team · Answer 1 · 2026-05-13T12:01:03+00:00

This is a common question on SO and the answer is always the same: regular expressions are a poor and limited tool for parsing HTML because HTML is not a regular language.

You should be using an HTML parser, for example the HTML Parser library.

If you’re curious what I mean by “regular language”, have a look at JMD, Markdown and a Brief Overview of Parsing and Compilers. Basically, a regular expression is equivalent to a DFA (deterministic finite automaton, or deterministic finite state machine). HTML requires a PDA (pushdown automaton) to parse. A PDA is a finite automaton with a stack, and the stack is how it handles recursive (nested) elements.
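To make the limitation concrete, here is a minimal sketch using only `java.util.regex`. The class name `LinkTextDemo`, the method `linkText`, and the pattern itself are illustrative assumptions, not part of any library. The pattern handles the flat row from the question, but it finds nothing as soon as the link text is wrapped in another element, which is the kind of nesting a parser handles and a regular expression does not.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class LinkTextDemo {
    // Illustrative pattern (an assumption): captures the text between
    // the end of an <a href="..."> tag and the following </a></td>.
    // [^<]* only accepts plain text, so nested tags break the match.
    private static final Pattern LINK_TEXT =
            Pattern.compile("<a\\s+href=\"[^\"]*\">([^<]*)</a>\\s*</td>");

    // Returns the captured link text, or an empty Optional if the row
    // does not have exactly the flat shape the pattern expects.
    public static Optional<String> linkText(String row) {
        Matcher m = LINK_TEXT.matcher(row);
        return m.find() ? Optional.of(m.group(1)) : Optional.empty();
    }

    public static void main(String[] args) {
        // The row from the question: flat markup, so the regex works.
        String flat =
                "<tr><td><a href=\"delirium.htm\">Delirium</a></td><td>65...</tr>";
        // Same data with one extra level of nesting: the regex finds nothing.
        String nested =
                "<tr><td><a href=\"delirium.htm\"><b>Delirium</b></a></td></tr>";

        System.out.println(linkText(flat));   // Optional[Delirium]
        System.out.println(linkText(nested)); // Optional.empty
    }
}
```

If every row is guaranteed to have this flat shape, the pattern may be enough for a one-off script. For anything that has to survive changes in the markup, a real HTML parser is the safer choice.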
