Tycoon Talk
Become a Big fish!
The number 1 forum for online business!
Post topics, ask questions, share your knowledge.
Tycoon Talk is part of Freelancer.com - find skilled workers online at a fraction of the cost.

The Other Search Engines


You are currently viewing our The Other Search Engines as a guest. Please register to participate.
Login



Reply
Robots.txt (allow as opposed to exclusion)
Old 06-02-2007, 02:35 PM Robots.txt (allow as opposed to exclusion)
Banned

Posts: 253
Name: Michel Samuel
Trades: 0
I probably already know the answer to this question but what the hell...

Can I do this in my robots.txt file ?

User-agent: robot-I-want-to-crawl-my-site.
allow: /

User-agent: *
Disallow: /


The objective is to give permission to the robots I like.
And disallow all the rest.
Michel Samuel is offline
Reply With Quote
View Public Profile
 
 
Register now for full access!
Old 06-02-2007, 09:28 PM Re: Robots.txt (allow as opposed to exclusion)
chrishirst's Avatar
Missing! presumed drunk.

Posts: 41,528
Name: Chris Hirst
Location: Blackpool. UK
Trades: 0
Nope

robots.txt is an exclusion protocol only. It is also a voluntary protocol to follow so not all bots honour the exclusions.
__________________
Chris. ->> Links are advertising NOT optimising!! <<-
A foolish consistency is the hobgoblin of little minds
Thought for today:- I SEO the only industry where all the cowboys are Indians?
chrishirst is online now
Reply With Quote
View Public Profile Visit chrishirst's homepage!
 
Old 06-03-2007, 06:44 AM Re: Robots.txt (allow as opposed to exclusion)
Banned

Posts: 253
Name: Michel Samuel
Trades: 0
Quote:
Originally Posted by chrishirst View Post
Nope

robots.txt is an exclusion protocol only. It is also a voluntary protocol to follow so not all bots honour the exclusions.
I had hope that I was wrong.
C'est la vie.

OK,
In this case anyone have a master list of american search engines ?
I'm going to have to do this the hard way.
Michel Samuel is offline
Reply With Quote
View Public Profile
 
Old 06-05-2007, 09:49 AM Re: Robots.txt (allow as opposed to exclusion)
Average Talker

Posts: 21
Trades: 0
What is the benefit of Robots.txt file?
Lucy is offline
Reply With Quote
View Public Profile
 
Old 06-05-2007, 10:17 AM Re: Robots.txt (allow as opposed to exclusion)
tripy's Avatar
Do not try this at home!

Posts: 3,621
Name: Thierry
Location: I'm the uber Spaminator !
Trades: 0
[quote]
What is the benefit of Robots.txt file?
[/quotes]

You can use it to give hints on the search engines agents, and blacklist some parts of your site, for example.
You can see the robots.txt of this site there:
http://www.webmaster-talk.com/robots.txt
__________________
Only a biker knows why a dog sticks his head out the window.
tripy is offline
Reply With Quote
View Public Profile Visit tripy's homepage!
 
Old 06-08-2007, 03:55 PM Re: Robots.txt (allow as opposed to exclusion)
Learning Newbie's Avatar
Defies a Status

Latest Blog Post:
Astounding Republican Paranoia
Posts: 5,662
Name: John Alexander
Trades: 0
Quote:
Originally Posted by Michel Samuel View Post
In this case anyone have a master list of american search engines ?
I'm going to have to do this the hard way.
No. But can you use geo-IP blocking instead? You can actually ban robots (return an unauthorized message) as opposed to hanging a "do not enter" sign.
__________________

Please login or register to view this content. Registration is FREE


Please login or register to view this content. Registration is FREE
Learning Newbie is offline
Reply With Quote
View Public Profile
 
Old 06-20-2007, 10:40 AM Re: Robots.txt (allow as opposed to exclusion)
Skilled Talker

Posts: 58
Trades: 0
Quote:
Originally Posted by chrishirst View Post
Nope

robots.txt is an exclusion protocol only. It is also a voluntary protocol to follow so not all bots honour the exclusions.

Although Google, Yahoo, Ask etc. have some extensions.

Here's my blog post about having the robots.txt link xml sitemap file.

Basicly you can now write: Sitemap: http://www.example.com/sitemap.xml
__________________

Please login or register to view this content. Registration is FREE
|
Please login or register to view this content. Registration is FREE
|
Please login or register to view this content. Registration is FREE
|
Please login or register to view this content. Registration is FREE

Please login or register to view this content. Registration is FREE
Thomas Schulz is offline
Reply With Quote
View Public Profile Visit Thomas Schulz's homepage!
 
Old 06-26-2007, 01:42 AM Re: Robots.txt (allow as opposed to exclusion)
Banned

Posts: 510
Name: CHRIS
Location: I live in Google's Home State
Trades: 0
Why would you want to disclude something from showing from your website. please let me know as I would like to know the functions of this protyocol.
Vasity is offline
Reply With Quote
View Public Profile Visit Vasity's homepage!
 
Old 06-26-2007, 04:29 AM Re: Robots.txt (allow as opposed to exclusion)
chrishirst's Avatar
Missing! presumed drunk.

Posts: 41,528
Name: Chris Hirst
Location: Blackpool. UK
Trades: 0
There are many reason you would want to exclude pages or folders from a site.

eg:
Many forums have several ways of getting to the same page, using the disallow you can stop compliant bots from accessing the print versions

If you have tracking links attached to many of your external links you can exclude these with;
disallow: /folder/pagename.ext?track=*

see http://www.robotstxt.org for more on the protocol and http://www.highrankings.com/forum/in...p?showforum=62 for more examples and instances.
__________________
Chris. ->> Links are advertising NOT optimising!! <<-
A foolish consistency is the hobgoblin of little minds
Thought for today:- I SEO the only industry where all the cowboys are Indians?
chrishirst is online now
Reply With Quote
View Public Profile Visit chrishirst's homepage!
 
Old 07-01-2007, 03:49 AM Re: Robots.txt (allow as opposed to exclusion)
EGS
EGS's Avatar
Banned

Posts: 862
Name: Justice McCay
Location: New Jersey
Trades: 2
Quote:
Originally Posted by chrishirst View Post
Nope

robots.txt is an exclusion protocol only. It is also a voluntary protocol to follow so not all bots honour the exclusions.
Ditto. Can't include permission on robots.txt - they open themselves to anything and everything on your server if you don't exclude it!
EGS is offline
Reply With Quote
View Public Profile
 
Reply     « Reply to Robots.txt (allow as opposed to exclusion)
 

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off
Trackbacks are Off
Pingbacks are Off
Refbacks are Off





   
RSS Feed  Feeds: RSS   JS   XML
RSS Feed  Feeds for this forum: RSS   JS   XML



Page generated in 0.30928 seconds with 12 queries