robots.txt in Rails
# See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
# To ban all spiders from the entire site uncomment the next two lines:
# User-agent: *
# Disallow: /
"User-agent: *" means that the section applies to all robots. "Disallow: /" tells a robot that it should not visit any page on the site.
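To see what these two lines do once uncommented, here is a minimal sketch that feeds them to Python's standard-library `urllib.robotparser`. The crawler names and paths passed to `can_fetch` are illustrative assumptions and do not come from the Rails file.

```python
from urllib.robotparser import RobotFileParser

# The two lines from the Rails default file, with the leading "# " removed.
BAN_ALL = """\
User-agent: *
Disallow: /
"""

ban_all_parser = RobotFileParser()
ban_all_parser.parse(BAN_ALL.splitlines())

# Every crawler falls under "*", and every path starts with "/",
# so nothing on the site may be fetched.
googlebot_root = ban_all_parser.can_fetch("Googlebot", "/")
other_bot_page = ban_all_parser.can_fetch("ExampleBot", "/products/1")

print(googlebot_root)   # False
print(other_bot_page)   # False
```

Note that robots.txt is only a request to crawlers: well-behaved robots honor it, but it does not enforce access control.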
# The https://www.taobao.com/robots.txt page
User-agent: Baiduspider
Allow: /article
Allow: /oshtml
Allow: /wenzhang
Disallow: /product/
Disallow: /
User-Agent: Googlebot
Allow: /article
Allow: /oshtml
Allow: /product
Allow: /spu
Allow: /dianpu
Allow: /wenzhang
Allow: /oversea
Allow: /list
Disallow: /
User-agent: Bingbot
Allow: /article
Allow: /oshtml
Allow: /product
Allow: /spu
Allow: /dianpu
Allow: /wenzhang
Allow: /oversea
Allow: /list
Disallow: /
User-Agent: 360Spider
Allow: /article
Allow: /oshtml
Allow: /wenzhang
Disallow: /
User-Agent: Yisouspider
Allow: /article
Allow: /oshtml
Allow: /wenzhang
Disallow: /
User-Agent: Sogouspider
Allow: /article
Allow: /oshtml
Allow: /product
Allow: /wenzhang
Disallow: /
User-Agent: Yahoo! Slurp
Allow: /product
Allow: /spu
Allow: /dianpu
Allow: /wenzhang
Allow: /oversea
Allow: /list
Disallow: /
User-Agent: *
Disallow: /
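The per-crawler groups above can be checked the same way. The following is a rough sketch, again using Python's `urllib.robotparser`, on an excerpt of the file (the Baiduspider and Googlebot groups plus the `*` fallback). The test paths such as `/article/123` and `/cart` are made up for illustration. Python's parser applies the first matching rule in file order, while some crawlers pick the most specific (longest) match instead; for the paths used here both strategies should give the same answer.

```python
from urllib.robotparser import RobotFileParser

# Excerpt of the rules quoted above; the other groups follow the same pattern.
TAOBAO_EXCERPT = """\
User-agent: Baiduspider
Allow: /article
Allow: /oshtml
Allow: /wenzhang
Disallow: /product/
Disallow: /
User-Agent: Googlebot
Allow: /article
Allow: /oshtml
Allow: /product
Allow: /spu
Allow: /dianpu
Allow: /wenzhang
Allow: /oversea
Allow: /list
Disallow: /
User-Agent: *
Disallow: /
"""

taobao_parser = RobotFileParser()
taobao_parser.parse(TAOBAO_EXCERPT.splitlines())

def is_allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the excerpt above."""
    return taobao_parser.can_fetch(agent, path)

print(is_allowed("Baiduspider", "/article/123"))  # True: matches Allow: /article
print(is_allowed("Baiduspider", "/product/1"))    # False: matches Disallow: /product/
print(is_allowed("Googlebot", "/product/1"))      # True: matches Allow: /product
print(is_allowed("Googlebot", "/cart"))           # False: only Disallow: / matches
print(is_allowed("ExampleBot", "/article/123"))   # False: falls through to the * group
```

The pattern in each group is a whitelist: a few `Allow` prefixes followed by a catch-all `Disallow: /`, with the final `*` group banning every crawler that is not named explicitly.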