I’m on a strict time limit and I really need a regex to parse this type of anchor (they’re all in this format)
<a href="20120620_0512_c2_1024.jpg">20120620_0512_c2_102..></a>
for the URL
20120620_0512_c2_1024.jpg
I know its not a full URL, it’s relative, please help
Here’s my code so far
year = datestr(now,'yyyy');
timestamp = datestr(now,'yyyymmdd');
html = urlread(['http://sohowww.nascom.nasa.gov//data/REPROCESSING/Completed/' year '/c2/' timestamp '/']);
links = regexprep(html, '<a href=.*?>', '');
Try the following:
The resulting cell array (truncated):