Internationalization Support

Character Encoding

YQL supports most of the character sets in the IANA Character Sets Registery. YQL uses the HTTP header Content-Type in the request to determine the character encoding for the response body. If no character encoding is specified, YQL uses the default UTF-8. The YQL statement can also determine the character encoding for the body with the key charset. If the character encoding is specified in both places, the character set specified by charset has precedence.

For example, to request YQL use ISO/IEC 8859-1 to encode the response body, do one of the following:

  • In your request, set the HTTP header Content-Type as shown below:

    Content-Type: text/html; charset=iso-8859-1

  • In the YQL statement, specify the character set with the key charset as shown below:

    select * from html where url='http://example.com' and charset='iso-8559-1'

Note

The YQL built-in function sort only correctly sorts results in English.

Table of Contents