本文记录了全世界比较出名的Robots.txt 列表需要设置的搜索蜘蛛。如何设置那个目录不想被搜索引擎收录的可参照下去设置。 当然也必须从Robots.txt 去设置 Google的蜘蛛: Googlebot 如需要参考的可以参照本文: User-agent: Black Hole User-agent: Titan User-agent: WebStripper User-agent: NetMechanic User-agent: CherryPicker User-agent: EmailCollector User-agent: EmailSiphon User-agent: WebBandit User-agent: EmailWolf User-agent: ExtractorPro User-agent: CopyRightCheck User-agent: Crescent User-agent: NICErsPRO User-agent: SiteSnagger User-agent: ProWebWalker User-agent: CheeseBot User-agent: mozilla/4 User-agent: mozilla/5 User-agent: Mozilla/4.0 (compatible; MSIE 4.0; Windows 95) User-agent: Mozilla/4.0 (compatible; MSIE 4.0; Windows 9 User-agent: ia_archiver User-agent: ia_archiver/1.6 User-agent: Alexibot User-agent: Teleport User-agent: TeleportPro User-agent: Wget User-agent: MIIxpc User-agent: WebZip User-agent: WebZip/4.0 User-agent: WebStripper User-agent: WebSauger User-agent: WebCopier User-agent: NetAnts User-agent: Mister PiX User-agent: WebAuto User-agent: TheNomad User-agent: RMA User-agent: libWeb/clsHTTPDisallow: / User-agent: asterias User-agent: spanner User-agent: InfoNaviRobot User-agent: Mozilla/4.0 (compatible; BullsEye; Windows 95) User-agent: Crescent Internet ToolPak HTTPOLE Control v.1.0 User-agent: CherryPickerSE/1.0 User-agent: CherryPickerElite/1.0 User-agent: WebBandit/3.50 User-agent: NICErsPRO User-agent: Foobot User-agent: WebmasterWorldForumBot User-agent: SpankBot User-agent: BotALot User-agent: lwp-trivial User-agent: Microsoft URL Control – 6.00.8169 User-agent: URLy Warning User-agent: Wget User-agent: LinkWalker User-agent: cosmos User-agent: moget User-agent: hloader User-agent: humanlinks User-agent: LinkextractorPro User-agent: Offline Explorer 编辑:Windear User-agent: LexiBot User-agent: Offline Explorer User-agent: The Intraformant User-agent: True_Robot/1.0 User-agent: True_Robot User-agent: BlowFish/1.0 User-agent: JennyBot User-agent: MIIxpc/4.2 User-agent: BuiltBotTough User-agent: BackDoorBot/1.0 User-agent: WebEnhancer User-agent: suzuran User-agent: VCI WebViewer VCI WebViewer Win32 User-agent: VCI User-agent: Szukacz/1.4 User-agent: QueryN Metasearch User-agent: Openfind data gathere User-agent: Openfind User-agent: Xenus Link Sleuth 1.1c User-agent: Xenus User-agent: Zeus User-agent: RepoMonkey Bait & Tackle/v1.01 User-agent: RepoMonkey User-agent: Zeus 32297 Webster Pro V2.9 Win32 User-agent: Webster Pro User-agent: EroCrawler User-agent: LinkScan/8.1a Unix Disallow: / User-agent: Keyword Density/0.9 User-agent: Kenjin Spider User-agent: Cegbfeieh Different: User-agent: larbin User-agent: b2w/0.1 User-agent: Copernic User-agent: URL_Spider_Pro User-agent: CherryPicker 编辑:Windear User-agent: EmailCollector User-agent: EmailSiphon User-agent: ExtractorPro User-agent: CopyRightCheck User-agent: Crescent User-agent: SiteSnagger User-agent: ProWebWalker User-agent: CheeseBot User-agent: LNSpiderguy User-agent: mozilla User-agent: mozilla/3 User-agent: mozilla/4 User-agent: TheNomad User-agent: WWW-Collector-E User-agent: libWeb/clsHTTP User-agent: httplib User-agent: turingos User-agent: InfoNaviRobot User-agent: Harvest/1.5 User-agent: Crescent Internet ToolPak HTTP OLE Control v.1.0 User-agent: CherryPickerElite/1.0 User-agent: NICErsPRO User-agent: DittoSpyder User-agent: Foobot User-agent: BotALot User-agent: lwp-trivial/1.34 User-agent: lwp-trivial User-agent: LinkextractorPro User-agent: Offline Explorer User-agent: Mata Hari User-agent: LexiBot User-agent: Web Image Collector User-agent: True_Robot User-agent: BlowFish/1.0 User-agent: MIIxpc/4.2 User-agent: BackDoorBot/1.0 User-agent: toCrawl/UrlDispatcher User-agent: WebEnhancer User-agent: VCI WebViewer VCI WebViewer Win32 User-agent: Szukacz/1.4 User-agent: QueryN Metasearch User-agent: Openfind data gathere User-agent: Openfind User-agent: Zeus User-agent: RepoMonkey Bait & Tackle/v1.01 User-agent: Openbot User-agent: Zeus Link Scout User-agent: Zeus 32297 Webster Pro V2.9 Win32 User-agent: EroCrawler User-agent: LinkScan/8.1a Unix User-agent: Kenjin Spider User-agent: Iron33/1.0.2 User-agent: GetRight/4.2 User-agent: FairAd Client User-agent: Aqua_Products User-agent: Radiation Retriever 1.1 User-agent: WebmasterWorld Extractor User-agent: Oracle Ultra Search User-agent: MSIECrawler User-agent: PerMan User-agent: searchpreview User-agent: naver User-agent: dumbot User-agent: Hatena Antenna User-agent: grub-client User-agent: grub User-agent: b2w/0.1 User-agent: psbot User-agent: Python-urllib User-agent: Crescent User-agent: SiteSnagger User-agent: ProWebWalker User-agent: CheeseBot User-agent: Mister PiX User-agent: WebAuto User-agent: TheNomad User-agent: WWW-Collector-E User-agent: RMA User-agent: httplib User-agent: InfoNaviRobot User-agent: Harvest/1.5 User-agent: Bullseye/1.0 User-agent: Mozilla/4.0 (compatible; BullsEye; Windows 95) User-agent: CherryPickerElite/1.0 User-agent: URLy Warning User-agent: humanlinks User-agent: The Intraformant User-agent: True_Robot/1.0 User-agent: BlowFish/1.0 User-agent: JennyBot User-agent: MIIxpc/4.2 User-agent: BuiltBotTough User-agent: ProPowerBot/2.14 User-agent: BackDoorBot/1.0 User-agent: WebEnhancer User-agent: VCI WebViewer VCI WebViewer Win32 User-agent: QueryN Metasearch User-agent: Openfind data gathere User-agent: Openfind User-agent: Xenus Link Sleuth 1.1c User-agent: Zeus User-agent: RepoMonkey Bait & Tackle/v1.01 User-agent: RepoMonkey User-agent: Microsoft URL Control User-agent: Openbot User-agent: URL Control User-agent: Webster Pro User-agent: EroCrawler User-agent: LinkScan/8.1a Unix User-agent: Keyword Density/0.9 User-agent: Bookmark search tool User-agent: GetRight/4.2 User-agent: FairAd Client User-agent: Aqua_Products User-agent: WebmasterWorld Extractor User-agent: Flaming AttackBot User-agent: MSIECrawler User-agent: PerMan User-agent: sootle User-agent: es User-agent: Enterprise_Search/1.0 User-agent: Enterprise_Search
下列为比较出名的搜索引擎蜘蛛名称:
百度的蜘蛛:baiduspider
Yahoo的蜘蛛:Yahoo Slurp
MSN的蜘蛛:Msnbot
Altavista的蜘蛛:Scooter
Lycos的蜘蛛: Lycos_Spider_(T-Rex)
Alltheweb的蜘蛛: FAST-WebCrawler/
INKTOMI的蜘蛛: Slurp
User-agent(用户代理设置):(蜘蛛名字)
拒绝:(文件名字)
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: Wget
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: Mozilla/4.0 (compatible; MSIE 4.0; Windows NT)
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: Telesoft
Disallow: /
User-agent: Website Quester
Disallow: /
Disallow: /
User-agent: moget/2.1
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: WWW-Collector-E
Disallow: /
Disallow: /
Disallow: /
User-agent: turingos
Disallow: /
Disallow: /
Disallow: /
User-agent: Harvest/1.5
Disallow: /
User-agent: ExtractorPro
Disallow: /
User-agent: Bullseye/1.0
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: Microsoft URL Control – 5.01.4511
Disallow: /
User-agent: DittoSpyder
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: lwp-trivial/1.34
Disallow: /
Disallow: /
User-agent: BunnySlippers
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: Wget/1.5.3
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
首发:SEO、SEM密谋、站长资源信息网
地址:http://www.seoplot.com,/
转载请出处和保留连接!否则必究
User-agent: Mata Hari
Disallow: /
Disallow: /
Disallow: /
User-agent: Web Image Collector
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: ProPowerBot/2.14
Disallow: /
Disallow: /
User-agent: toCrawl/UrlDispatcher
Disallow: /
Disallow: /
User-agent: TightTwatBot
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: Kenjin Spider
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: psbot
Disallow: /
User-agent: Python-urllib
Disallow: /
User-agent: NetMechanic
Disallow: /
Disallow: /
Disallow: /
首发:SEO、SEM密谋、站长资源信息网
地址:http://www.seoplot.com,/
转载请出处和保留连接!否则必究
Disallow: /
Disallow: /
User-agent: WebBandit
Disallow: /
User-agent: EmailWolf
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: Mozilla
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: mozilla/5
Disallow: /
User-agent: WebAuto
Disallow: /
Disallow: /
Disallow: /
User-agent: RMA
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: CherryPickerSE/1.0
Disallow: /
Disallow: /
User-agent: WebBandit/3.50
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: URLy Warning
Disallow: /
User-agent: hloader
Disallow: /
User-agent: humanlinks
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: The Intraformant
Disallow: /
User-agent: True_Robot/1.0
Disallow: /
Disallow: /
Disallow: /
User-agent: JennyBot
Disallow: /
Disallow: /
User-agent: BuiltBotTough
Disallow: /
User-agent: ProPowerBot/2.14
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: suzuran
Disallow: /
Disallow: /
User-agent: VCI
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: Xenus Link Sleuth 1.1c
Disallow: /
User-agent: Xenus
Disallow: /
Disallow: /
Disallow: /
User-agent: RepoMonkey
Disallow: /
Disallow: /
User-agent: URL Control
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: Keyword Density/0.9
Disallow: /
Disallow: /
Disallow: /
User-agent: Bookmark search tool
Disallow: /
Disallow: /
Disallow: /
User-agent: Gaisbot
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: Flaming AttackBot
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: larbin
Disallow: /
Disallow: /
User-agent: Copernic
Disallow: /
Disallow: /
Disallow: /
User-agent: EmailWolf
Disallow: /
User-agent: ExtractorPro
Disallow: /
User-agent: CopyRightCheck
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: LNSpiderguy
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: turingos
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: Crescent Internet ToolPak HTTP OLE Control v.1.0
Disallow: /
User-agent: CherryPickerSE/1.0
Disallow: /
Disallow: /
User-agent: NICErsPRO
Disallow: /
Disallow: /
Disallow: /
User-agent: Web Image Collector
Disallow: /
Disallow: /
Disallow: /
User-agent: True_Robot
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: toCrawl/UrlDispatcher
Disallow: /
Disallow: /
User-agent: suzuran
Disallow: /
Disallow: /
User-agent: VCI
Disallow: /
User-agent: Szukacz/1.4
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: Xenus
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: Zeus Link Scout
Disallow: /
User-agent: Zeus 32297 Webster Pro V2.9 Win32
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: Kenjin Spider
Disallow: /
User-agent: Iron33/1.0.2
Disallow: /
Disallow: /
Disallow: /
Disallow: /
User-agent: Gaisbot
Disallow: /
Disallow: /
User-agent: Radiation Retriever 1.1
Disallow: /
Disallow: /
Disallow: /
User-agent: Oracle Ultra Search
Disallow: /
Disallow: /
Disallow: /
User-agent: searchpreview
Disallow: /
Disallow: /
Disallow: /
Disallow: /
Disallow: /
世界各大搜索引擎的蜘蛛名称列表_seo网站优化
版权申明:本站文章部分自网络,如有侵权,请联系:west999com@outlook.com 特别注意:本站所有转载文章言论不代表本站观点! 本站所提供的图片等素材,版权归原作者所有,如需使用,请与原作者联系。未经允许不得转载:IDC资讯中心 » 世界各大搜索引擎的蜘蛛名称列表_seo网站优化
相关推荐
-      关于seo最佳的实践方法_seo网站优化
-      seo中十大影响链接权重的因素浅析_seo网站优化
-      seo新手教程:title的写法_seo网站优化
-      seo:刚入门还不如不入门的_seo网站优化
-      google补充材料没消失,内链优化很重要_seo网站优化
-      献给想我一样初基础网站优化的朋友们_seo网站优化
-      田锋林:seo博客细节调整_seo网站优化
-      seo策略之大型网站_seo网站优化