I am curling a webpage with erlang and naturally getting html code, so far so good!
This is how my code looks like:
startCURL()->
Cmd = "curl \"" ++ "http://mywebsite.com/something.html" ++ "\"",
Output = os:cmd(Cmd),
io:format("The HTML output: ~s~n", [Output]).
Now to the problem, I want to exclude various tags in the html via erlang. Example:
<b>Hello</b> to “Hello” without the <b> and </b>.
I really appreciate all help I can get!
Thanks!
You can use the replace function in re module to replace unwanted tags with empty strings.