I am happy to announce that, based on some research and a Greasemonkey hack to make people aware of the consequences, Yahoo! is now a search engine with natural language search results.
HTML has a wonderful attribute called lang that allows you to define the language of the text in the current HTML element. This may seem a bit superfluous, as it has nothing to do with displaying the language-specific character set (that is the encoding, which is another issue). However, defining the language has other benefits.
The first one is that search engines and other robots know what language the text is in and thus have a much easier job telling keywords from stopwords.
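To make the robot's side of this concrete, here is a minimal sketch in Python of how a crawler might pair each piece of text with its declared language. The sample markup, the LangCollector class and the text_by_language function are hypothetical names made up for illustration; this is not how Yahoo! or any particular search engine does it, only one way the inherited value of lang could be read with the standard library's html.parser.

```python
from html.parser import HTMLParser

# Hypothetical sample markup: an English page containing one German word.
SAMPLE = """<html lang="en">
<body>
<p>The German word for hospital is <span lang="de">Krankenhaus</span>.</p>
</body>
</html>"""

# Elements that never have a closing tag, so they must not be pushed on the stack.
VOID_ELEMENTS = {"area", "base", "br", "col", "embed", "hr", "img",
                 "input", "link", "meta", "source", "track", "wbr"}


class LangCollector(HTMLParser):
    """Pairs each run of text with the lang value in effect where it appears."""

    def __init__(self):
        super().__init__()
        self.stack = []      # language in effect for each currently open element
        self.segments = []   # (language, text) pairs in document order

    def handle_starttag(self, tag, attrs):
        if tag in VOID_ELEMENTS:
            return
        inherited = self.stack[-1] if self.stack else None
        # An element without its own lang inherits the language of its parent.
        self.stack.append(dict(attrs).get("lang") or inherited)

    def handle_endtag(self, tag):
        if tag not in VOID_ELEMENTS and self.stack:
            self.stack.pop()

    def handle_data(self, data):
        text = data.strip()
        if text:
            language = self.stack[-1] if self.stack else None
            self.segments.append((language, text))


def text_by_language(html):
    """Return a list of (language, text) pairs found in the given HTML string."""
    collector = LangCollector()
    collector.feed(html)
    collector.close()
    return collector.segments


if __name__ == "__main__":
    for language, text in text_by_language(SAMPLE):
        print(language, repr(text))
```

With the language attached to every text segment, a robot could pick the matching stopword list per segment instead of guessing the language from the words themselves.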
The second, and most important, has to do with accessibility. If you cannot see the text and have it read out to you instead, the pronunciation is very important. Visually impaired surfers use screen readers to tell them what is on the current page, and by defining the language, you make this…