YQL and HTML5

Hi,
I was querying the html5 doctor website today, trying to return all article elements on the page.
From within the console I ran:
select * from html where url='http://html5doctor.com/' and xpath="//article"
which returned no results

select * from html where url='http://html5doctor.com/' and xpath="//h1"
returned all the results I would expect - all the h1 elements are housed within an article element.

I think it is most likely that I am just doing something wrong rather than the new elements being unsupported, but I can't see what.

Many thanks in advance,
Ant

2 Replies
  • QUOTE (kalliste_uk @ Apr 5 2010, 09:03 AM) <{POST_SNAPBACK}>
    Hi,
    I was querying the html5 doctor website today, trying to return all article elements on the page.
    From within the console I ran:
    select * from html where url='http://html5doctor.com/' and xpath="//article"
    which returned no results

    select * from html where url='http://html5doctor.com/' and xpath="//h1"
    returned all the results I would expect - all the h1 elements are housed within an article element.

    I think it is most likely that I am just doing something wrong rather than the new elements being unsupported, but I can't see what.

    Many thanks in advance,
    Ant


    We use a standard "tidy" library for HTML. It's likely that the new HTML5 elements aren't supported yet in that library. We'll take a look and see what can be done.

    Jonathan
  • QUOTE (Jonathan @ Apr 12 2010, 09:55 AM) <{POST_SNAPBACK}>
    We use a standard "tidy" library for HTML. It's likely that the new HTML5 elements aren't supported yet in that library. We'll take a look and see what can be done.

    Jonathan


    Ah awesome, in both ways: that you'll look into it, and that I wasn't doing something glaringly stupid :) Many thanks.
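
For anyone who wants to repeat the comparison outside the console, here is a rough sketch that only builds the request URLs for the two queries in this thread. The endpoint address is an assumption (the public YQL REST endpoint as it was documented around the time of this thread), and `build_yql_url` is a made-up helper name, not part of any YQL library. It sends nothing over the network.

```python
from urllib.parse import urlencode, urlparse, parse_qs

# Assumption: the public YQL REST endpoint as documented around 2010.
YQL_ENDPOINT = "http://query.yahooapis.com/v1/public/yql"

PAGE = "http://html5doctor.com/"

# The two queries from the original post, copied as written.
ARTICLE_QUERY = "select * from html where url='" + PAGE + "' and xpath=\"//article\""
H1_QUERY = "select * from html where url='" + PAGE + "' and xpath=\"//h1\""


def build_yql_url(query, endpoint=YQL_ENDPOINT, fmt="xml"):
    """Hypothetical helper: return a GET URL carrying `query` in the `q` parameter."""
    return endpoint + "?" + urlencode({"q": query, "format": fmt})


if __name__ == "__main__":
    # Print both URLs so the results can be compared side by side.
    for query in (ARTICLE_QUERY, H1_QUERY):
        print(build_yql_url(query))
```

`urlencode` takes care of escaping the quotes and slashes in the query, so the statements can stay exactly as typed in the console.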