This is the page in question: http://phoenix.craigslist.org/cpg/
What I would like to do is to create an array that looks like this:
Date (as captured by the h4 tag on that page) => in cell [0][0][0],
Link Text => in cell [0][1][0]
Link href => in cell [0][1][1]
i.e. in each row, I store each of those items per row.
What I have done is simply pulled all the h4 tags in and stored them in a hash like this:
contents2[link[:date]] = content_page.css("h4").text
The problem with this is that one cell stores all the text from the h4 tags on the entire page…whereas I would like to have 1 date to 1 cell.
So as an example:
0 => Mon May 28 - Leads need follow up - (Phoenix) - http://phoenix.craigslist.org/wvl/cpg/3043296202.html
1=> Mon May 28 - .Net/Java Developers - (phoenix) - http://phoenix.craigslist.org/cph/cpg/3043067349.html
Any thoughts on how I might approach this, with code would be greatly appreciated.
How’s this?