# file: robots.txt,v 1.0 2009/11/23 created by AN # JD108.CN # 按照robots.txt的标准写法,规定一些不允许爬虫爬的页面或目录。 # robots.txt 的写法参照 # Format is: # User-agent: # Disallow: | # ----------------------------------------------------------------------------- User-agent: Yahoo Slurp Disallow: / User-agent: Msnbot Disallow: / User-agent: Scooter Disallow: / User-agent: Sogou web spider Disallow: / User-agent: sogou spider2 Disallow: / User-agent: hl_ftien_spider Disallow: / User-agent: YodaoBot Disallow: /