I have seen a number of posts here that describe how to parse HTML tables using the XML package. That said, I have got my code to work except that my first data row gets read in as my column names.
My code is taken from the answser at this link
How can I get around this?
Many thanks,
Brock
From the XML package documentation for “readHTMLTable”:
So running the same command but passing in
header=FALSEshould do what you want. Alternatively, if you have a vector of column names you want to use instead, you can useheader=your_vector_of_column_namesto set the column names instead of setting them to empty strings.