On Wed, Dec 31, 2008 at 08:33:26AM +0900, Cs Webgrl wrote:
> Hi.
>
> I am currently scraping a page with scRUBYt and am not getting the
> results as expected.
>
> Instead of the correctly formatted xml document I'm getting the
> following.
>
> <record>
> <1>book a</1>
> <1>book b</1>
> <1>book c</1>
> <2>chapter aa</2>
> <2>chapter bb</2>
> <2>chapter cc</2>
> <3>verse aaa</3>
> <3>verse bbb</3>
> <3>verse ccc</3>
> </record>
This is a correctly formatted XML document. You just have numbers for
tag names.
> My code looks like this:
>
> listing "//a[@id*='volume'>" do
> book "//a[@class='1']"
> chapter "//span[@class='2']"
> verse "//a[@id*='3']"
> end
>
> Any ideas?
Have you tried something like this:
book "//2[@id='whatevs']"
That should get you access to the tags.
Hope that helps!
--
Aaron Patterson
http://tenderlovemaking.com/