Yahoo!7 recognises and honours the access policy for robots specified in the robots.txt file that lives (or can be created) on any web server.
To exclude your entire web site or a specific section (directory) of your server from the Yahoo!7 News Search index, simply place a file called robots.txt at the root of your server.
To prevent most robots, including Yahoo!'s, from scanning your site, you can add these lines to the /robots.txt file on your server:
User-agent: *
Disallow: /
To prevent just Yahoo!7 News from crawling your site, add these lines to the /robots.txt file on your server:
User-agent: yahoo-newscrawler
Disallow: /
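Before publishing either rule set, it can help to check locally how a standards-following parser reads it. The snippet below is a minimal sketch using Python's standard urllib.robotparser module. The helper name is_allowed, the agent name some-other-bot and the example.com URL are illustrative assumptions, and Python's parser is only a stand-in: it does not show how Yahoo!7's own crawler interprets the file.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt lets user_agent fetch url (hypothetical helper)."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# The two rule sets described above.
BLOCK_ALL = "User-agent: *\nDisallow: /\n"
BLOCK_YAHOO_NEWS = "User-agent: yahoo-newscrawler\nDisallow: /\n"

# Placeholder URL, used only for illustration.
URL = "https://www.example.com/news/story.html"

if __name__ == "__main__":
    # The wildcard rule blocks every robot that honours robots.txt.
    print(is_allowed(BLOCK_ALL, "yahoo-newscrawler", URL))         # False
    print(is_allowed(BLOCK_ALL, "some-other-bot", URL))            # False
    # The targeted rule blocks only the named crawler.
    print(is_allowed(BLOCK_YAHOO_NEWS, "yahoo-newscrawler", URL))  # False
    print(is_allowed(BLOCK_YAHOO_NEWS, "some-other-bot", URL))     # True
```

The last two results show the difference between the rule sets: with the targeted rule, robots other than yahoo-newscrawler are still allowed.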
You can also be more selective and indicate which parts of your server cannot be accessed by robots. Learn more about robots.txt here.
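As a rough illustration of a selective policy, the sketch below embeds a robots.txt that blocks only two directories for the news crawler and checks it with Python's standard urllib.robotparser. The directory names /private/ and /archive/ and the example.com URLs are hypothetical, and the check reflects Python's parser rather than confirmed behaviour of Yahoo!7's crawler.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical selective policy: only two directories are off limits
# to the news crawler; everything else on the server stays crawlable.
SELECTIVE_RULES = """\
User-agent: yahoo-newscrawler
Disallow: /private/
Disallow: /archive/
"""


def blocked_paths(robots_txt: str, user_agent: str, urls: list[str]) -> list[str]:
    """Return the URLs from urls that robots_txt forbids for user_agent."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())  # parsed in memory, no network access
    return [u for u in urls if not parser.can_fetch(user_agent, u)]


# Placeholder URLs, used only for illustration.
CANDIDATES = [
    "https://www.example.com/news/story.html",
    "https://www.example.com/private/memo.html",
    "https://www.example.com/archive/2009/old-story.html",
]

if __name__ == "__main__":
    for url in blocked_paths(SELECTIVE_RULES, "yahoo-newscrawler", CANDIDATES):
        print("blocked:", url)
```

Each Disallow line is matched as a path prefix, so everything beneath /private/ and /archive/ is excluded while /news/ remains accessible.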
Blocking RSS Feed Only
If you don't want to be included in the RSS feed offered by Yahoo!7 News Search, but want your content to remain accessible in the normal news results, please use the News feedback form. Select the "Remove Site From RSS Feed" option from the pull-down menu. Make sure to include the following information in the feedback window to ensure that we can process your request:
After you complete the form with the information requested, we'll need to verify that you're the content owner or authorised by the content owner to make the request.
Yahoo!7 will make every effort to remove your products and domain within seven (7) business days after verifying your authorisation.
After you make this request, all news articles from your source will be removed from the RSS index.
Please note: Articles removed from RSS may otherwise be available through Yahoo!7 News Search or Yahoo!7 Web Search.