Home | Index

SearchMonkey Guide

Chapter 3. Site Owner Guide

Table of Contents

Factors to Consider When Choosing a Data Delivery Method
Comparing Data Delivery Methods
XML-Based Atom Feeds
eRDF/RDFa Markup
Microformats
OpenSearch
Custom Data Services
Submitting Feeds
The Process of Creating a Feed
Selecting Content for the Feed
Selecting Pages (URLs)
General Content Guidelines
Ineligible Content
Understanding DataRSS
DataRSS Elements and Attributes
DataRSS Vocabularies
DataRSS in SearchMonkey Data Services
Creating the Feed
Standard Feed Requirements
Constructing the Feed
Submitting the Feed
Leveraging the Data Web
Microformats
Supported Microformats
RDF
Enhanced Results User Agent

Site Owners have web sites containing the data retrieved by SearchMonkey applications. To be used by SearchMonkey applications, this data must be structured. Site owners can make structured data available to Yahoo! in any of the following ways:

[Note] Note

RDFa support is not currently available. This will become available when the specification has stabilized in the community.

This chapter contains the following sections:

Factors to Consider When Choosing a Data Delivery Method

There are several factors to consider when choosing one of these methods to provide your data to SearchMonkey.

Freshness

If the freshness of the data is a concern, custom data services can retrieve the most current data from your website. It can take several days for Yahoo! to crawl the markup on your web pages. Feeds require that you set up a process to ensure the data is current and valid, and are downloaded periodically.

User Experience

Since markup and feed data is already present in the Yahoo! system, the display of search results to the user are almost immediate, while custom data services can be slow.

Human Resources

The amount of work required by staff to set up and provide the data sources depends on your available resources. Providing data to Yahoo! as markup requires engineering work on the front-end, while data through feeds and custom data services requires back-end work.

Privacy of Data

If privacy of your data is a concern, then feeds are preferable; marking up your pages in microformats or RDFa/eRDF makes your data available to anyone on the Internet.

Standards

Microformats are the leading established standard for web page markup. RDFa and eRDF are also widely accepted standards. Since these are all open standards, marking up your pages using any of these formats makes your content more easily reusable.

Maintenance

Maintaining the data you provide for SearchMonkey is important to ensure that the data is current and valid. Feeds typically require that you set up a process to send feed updates whenever there is new information available. Markup means that changes to your front end can affect the data Yahoo! gets when it crawls your pages. In the case of custom data services, you must implement them to ensure low latency in the search results.