How to list multiple crawlers in robots.txt?
04-14-2007, 02:30 PM
Moldarin
Hi,

What is the proper way of listing multiple "User-agent: " in a robots.txt file?

a)
Code:
User-agent: *
Disallow: /
User-agent: Slurp, Googlebot
Disallow: /private
In robots.txt example a (above): are all engines blocked from seeing everything, including Slurp and Googlebot? Since the wildcard says every crawler is disallowed from accessing anything, does that include Slurp and Googlebot as well? And is it correct to list multiple crawlers separated by commas?

In robots.txt example b (below): are all engines blocked from seeing everything, including Slurp and Googlebot? The User-agent lines (one per line) say that Googlebot and Slurp can access everything but /private, but then again the wildcard says all engines are blocked from seeing anything. And is it correct to list multiple crawlers on separate lines?

b)
Code:
User-agent: Slurp
User-agent: Googlebot
Disallow: /private
User-agent: *
Disallow: /
Please advise.
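One way to see how a parser reads the two layouts above is to feed both files to Python's standard-library `urllib.robotparser`. This is only a sketch of how that one parser behaves, not a statement about how Slurp or Googlebot themselves interpret the file; the helper name `allowed`, the agent name `OtherBot`, and the test paths are made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# Example a from the post: wildcard first, crawlers comma-separated.
EXAMPLE_A = """\
User-agent: *
Disallow: /
User-agent: Slurp, Googlebot
Disallow: /private
"""

# Example b from the post: one User-agent line per crawler, wildcard last.
EXAMPLE_B = """\
User-agent: Slurp
User-agent: Googlebot
Disallow: /private
User-agent: *
Disallow: /
"""


def allowed(robots_txt, agent, path):
    """Return True if this parser lets `agent` fetch `path` (hypothetical helper)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    for name, text in (("a", EXAMPLE_A), ("b", EXAMPLE_B)):
        for agent in ("Googlebot", "Slurp", "OtherBot"):
            for path in ("/index.html", "/private/page.html"):
                print(name, agent, path, allowed(text, agent, path))
```

With this parser, the comma-separated line in example a is treated as a single agent name, so Googlebot falls back to the wildcard record and is blocked everywhere. In example b the two consecutive User-agent lines share the `Disallow: /private` rule, and every other agent gets the wildcard block. Other parsers and real crawlers may differ, so test against the crawler you care about.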
04-14-2007, 06:19 PM Re: How to list multiple crawlers in robots.txt?
chrishirst
List them individually.

You can group Disallow: entries for each User-agent: entry:

Code:
User-agent: Googlebot
Disallow: /folder/
Disallow: /folder/filename.ext
But you can NOT group the User-agent: entries.

If you want to allow certain bots and then ban all other compliant bots, your wildcard block should be the last entry. The file should be read and applied top to bottom, and the least restrictive entry should be applied by the bot.
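The layout described above (one record per crawler, each with its own Disallow: lines, and the wildcard block last) can be generated mechanically. The following is a minimal sketch, assuming a hypothetical helper `build_robots_txt` that takes a mapping of crawler name to disallowed paths; the function name and its parameters are illustrative and not part of any robots.txt tooling.

```python
def build_robots_txt(rules, default_disallow=("/",)):
    """Build robots.txt text with one record per crawler and the wildcard last.

    rules: dict mapping a crawler name to a list of paths to disallow.
    default_disallow: paths disallowed for every other crawler ("*").
    """
    lines = []
    for agent, paths in rules.items():
        # One User-agent line per record, followed by its grouped Disallow lines.
        lines.append(f"User-agent: {agent}")
        lines.extend(f"Disallow: {path}" for path in paths)
        lines.append("")  # blank line separates records
    # The wildcard record goes last, as suggested in the post.
    lines.append("User-agent: *")
    lines.extend(f"Disallow: {path}" for path in default_disallow)
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(build_robots_txt({"Slurp": ["/private"], "Googlebot": ["/private"]}))
```

Repeating the same Disallow: lines under each crawler is more verbose than sharing them, but it avoids relying on any particular crawler's handling of grouped User-agent lines.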
04-14-2007, 06:37 PM Re: How to list multiple crawlers in robots.txt?
Moldarin
Why can I not group different user-agents?
04-14-2007, 08:41 PM Re: How to list multiple crawlers in robots.txt?
chrishirst
Because you can't! The protocol doesn't allow for it.

If you want to rewrite the protocol and get everyone to follow the new spec, fine. But for now you will have to follow the existing protocol specifications.