Yahoo! Search
How can I reduce the number of requests you make on my web site?
Last Updated: 24 August 2007
Text Size: A A A

Save to My Help

Save this article to My Help for easy reference. You can visit the article at any time from any computer.

Replace an article

You have reached the maximum number of saved articles. Your oldest saved article will be replaced with the new one.

Since we crawl billions of pages from the entire Web, we use a large number of systems for web crawling. Therefore, your web server might log requests from a number of different Yahoo! crawler client IP addresses. The different crawler systems are coordinated to limit the activity on any single web server. We determine a single "web server" by IP address, so if your host is serving multiple IPs, it might see higher levels of activity.

If there are directories on your web server which you do not want represented in web search results, use robot exclusion rules as described in "How do I prevent my site or certain subdirectories from being crawled?" An exclusion rule can reduce the number of pages [property_slurp/] reads from your server.

There is a Yahoo! Slurp-specific extension to robots.txt that allows you to set a lower limit on our crawler request rate.

You can add a "Crawl-delay: x.x" instruction, where "x.x" is a delay value between successive crawler accesses. If the crawler rate is a problem for your server, you can set the delay to a maximum of 10, but we suggest you start with small values (0.5–1), and increase as needed to an acceptable value for your server. Larger delay values add more delay between successive crawl accesses and decrease the maximum crawl rate to your web server.

For example, a robots.txt rule to set a crawl-delay of 1 for Yahoo! Slurp looks like:

User-agent: Slurp

Crawl-delay: 1

A shorter delay value of 0.5 would look like:

User-agent: Slurp

Crawl-delay: 0.5

In general you should restrict total crawler activity to your server by disallowing unimportant content with a robots.txt rule. Setting a crawl-delay may limit the coverage and freshness of your content representation in Yahoo! Search results. If you do feel that a crawl-delay is necessary, use small values (0.5–1) to avoid blocking Slurp discovery and refresh of your key content.

Was this information helpful?      

My Help

Forgot your ID or password?

Sign In

Sign in to see your account information saved articles and more.
  1. Recent Searches

  2. Saved Articles

    Sign in to see your account information saved articles and more.

Top Questions

Still Need Help?

Related Topics

Copyright © 2009 Yahoo! UK Limited. All rights reserved.

Privacy Policy - Terms of Service