hi..i have a query that is capturing html content from a web page. The page has simple divs with ptags , but in the divs, there are line breaks using br tags. Example;
CODE
<div>
<p>This is a category<br/>
here is content<br/>
here is more content<br/>
other content<br/>
end of content <br/>
</p>
...
</div>
my YQL query looks something like:
CODE
select * from html where url="http://www.website.com/detail/" and xpath='//div[@id="collumn-01"]'
but my json results come out like:
CODE
"1999",
{
"strong": "2009",
"br": [
null,
null,
null,
null,
null,
null
],
"content": "\n7th, end of content"
},
{
"strong": "2008",
"br": [
null,
null,
null,
null
],
"content": "\n4th, end of content"
},
{
"strong": "2007",
"br": [
null,
null,
null,
null
],
"content": "\n7th, end of content"
},
As you can see, the results refelect the content with <br> tags, but only gives the last line of that bunch....each null represents content that is not parsing.
Any ideas how to get the null to show the actual contnet between break tags? The xml format, of which i would rather not fall back on, shows the expected results