It pulls out all of the paragraph ('<p>') tags on the page. The fourth paragraph on the page contains the date and time the post was created (in the above case, the string "Date: 2010-11-19, 11:41AM EST"). That text shows up in the XML output. However, if I switch to JSON output, the same date/time text is missing from that same paragraph. What could cause that?
And before anyone asks, unfortunately I have to use JSON and can't use XML output.