Tycoon Talk
Become a Big fish!
The number 1 forum for online business!
Post topics, ask questions, share your knowledge.
Tycoon Talk is part of Freelancer.com - find skilled workers online at a fraction of the cost.

The Google Forum


You are currently viewing our The Google Forum as a guest. Please register to participate.
Login



Reply
Page restricted by robots.txt that shouldn't be
Old 09-21-2011, 02:17 PM Page restricted by robots.txt that shouldn't be
Super Talker

Posts: 142
Name: Jim
Location: Nottinghamshire
Trades: 0
Hi Guys

I noticed a while back this page http://www.flypark.co.uk/testimonials.html was no longer cached, I thought nothing of it at the time, but I just noticed webmaster tools is reporting the page is being restricted by the robots.txt file. I checked the file and there is no restriction, even stranger I deleted the robots file from the server and fetched as googlebot and it still returned it being restricted by robots.txt even when I had deleted the file.

I posted this on Google forum but got no response
jim25 is offline
Reply With Quote
View Public Profile Visit jim25's homepage!
 
 
Register now for full access!
Old 09-21-2011, 04:56 PM Re: Page restricted by robots.txt that shouldn't be
vangogh's Avatar
Post Impressionist

Latest Blog Post:
Why Responsive Design?
Posts: 10,815
Name: Steven Bradley
Location: Boulder, Colorado
Trades: 0
The page may not have a cached link, but it seems to be listed in Google's index. I'm not sure why it's being reported as being restricted by robots.txt. I see a small robots.txt file on the site, but nothing in there should block that page.
__________________
l Search Engine Friendly Web Design |
Please login or register to view this content. Registration is FREE

l Tips On Marketing, SEO, Design, and Development |
Please login or register to view this content. Registration is FREE

l
Please login or register to view this content. Registration is FREE
|
Please login or register to view this content. Registration is FREE
vangogh is offline
Reply With Quote
View Public Profile Visit vangogh's homepage!
 
Old 09-21-2011, 06:52 PM Re: Page restricted by robots.txt that shouldn't be
chrishirst's Avatar
Missing! presumed drunk.

Posts: 42,385
Name: Chris Hirst
Location: Blackpool. UK
Trades: 0
if you want it crawled and indexed get some links pointing to that URI
__________________
Chris. ->>
Please login or register to view this content. Registration is FREE
<<-

A foolish consistency is the hobgoblin of little minds
Thought for today:- Is SEO the only industry where all the cowboys are Indians?
chrishirst is online now
Reply With Quote
View Public Profile Visit chrishirst's homepage!
 
Old 09-22-2011, 04:03 AM Re: Page restricted by robots.txt that shouldn't be
Super Talker

Posts: 142
Name: Jim
Location: Nottinghamshire
Trades: 0
I know the page is indexed, but its not indexed properly. The page even has a page rank 4 but for some reason fetch as googlebot is saying it is restricted by robots. I am worried this may happen to my main landing pages.

It has been like this for a while.

I am going to put a fresh post on Google forum and see if I can get an employee to take a look.
jim25 is offline
Reply With Quote
View Public Profile Visit jim25's homepage!
 
Old 09-22-2011, 05:02 AM Re: Page restricted by robots.txt that shouldn't be
chrishirst's Avatar
Missing! presumed drunk.

Posts: 42,385
Name: Chris Hirst
Location: Blackpool. UK
Trades: 0
The "fetch as Googlebot" doesn't.

If the page is indexed and is showing a SGB "value" then it obviously ISN'T blocked by robots.txt and Webmaster Tools is getting it wrong (and not for the first time either).

Just ignore things that are obviously wrong, put common sense first and rely on "tools" last, if at all.
__________________
Chris. ->>
Please login or register to view this content. Registration is FREE
<<-

A foolish consistency is the hobgoblin of little minds
Thought for today:- Is SEO the only industry where all the cowboys are Indians?
chrishirst is online now
Reply With Quote
View Public Profile Visit chrishirst's homepage!
 
Old 09-22-2011, 05:23 AM Re: Page restricted by robots.txt that shouldn't be
Super Talker

Posts: 142
Name: Jim
Location: Nottinghamshire
Trades: 0
Ok I will put it doen to an error but the fact the page isn't cached or showing the correct title and meta data and hasn't done so for some time does indicate there is a problem.
jim25 is offline
Reply With Quote
View Public Profile Visit jim25's homepage!
 
Old 09-22-2011, 11:10 AM Re: Page restricted by robots.txt that shouldn't be
chrishirst's Avatar
Missing! presumed drunk.

Posts: 42,385
Name: Chris Hirst
Location: Blackpool. UK
Trades: 0
Quote:
hasn't done so for some time does indicate there is a problem.
Not really, it simply indicates that the page does not have enough links pointing to it and therefore not enough REAL PageRank to warrant regular crawling and indexing.
It is NOT a problem it is a SYMPTOM of poor promotional efforts.
__________________
Chris. ->>
Please login or register to view this content. Registration is FREE
<<-

A foolish consistency is the hobgoblin of little minds
Thought for today:- Is SEO the only industry where all the cowboys are Indians?
chrishirst is online now
Reply With Quote
View Public Profile Visit chrishirst's homepage!
 
Old 09-22-2011, 11:14 AM Re: Page restricted by robots.txt that shouldn't be
Super Talker

Posts: 142
Name: Jim
Location: Nottinghamshire
Trades: 0
We will have to disagree here Chris, the page is well linked throughout the site, why would it have sgb 4 if not? I just saved the page under a different filename uploaded, fetched as GBot and it worked.

experiment
testimonials.html restricted by robots.txt
testimonials1.html restricted by robots.txt
form23.html passed no problems

How do you explain this?
jim25 is offline
Reply With Quote
View Public Profile Visit jim25's homepage!
 
Old 09-22-2011, 12:05 PM Re: Page restricted by robots.txt that shouldn't be
chrishirst's Avatar
Missing! presumed drunk.

Posts: 42,385
Name: Chris Hirst
Location: Blackpool. UK
Trades: 0
Ignore what the SGB shows that data could be from six months ago and is from a different DC or server cluster that the WMT data is from as are pages from the SERPs.

It doesn't really matter how "well linked" the URI is throughout the site if the only real PR that enters the network is via a single node, that of the "home" URL.
__________________
Chris. ->>
Please login or register to view this content. Registration is FREE
<<-

A foolish consistency is the hobgoblin of little minds
Thought for today:- Is SEO the only industry where all the cowboys are Indians?
chrishirst is online now
Reply With Quote
View Public Profile Visit chrishirst's homepage!
 
Old 09-23-2011, 04:10 AM Re: Page restricted by robots.txt that shouldn't be
Super Talker

Posts: 142
Name: Jim
Location: Nottinghamshire
Trades: 0
I figured out what it was,

I had a folder called /test disallowed in the robots. Because I didn't have it stated like this /test/ it was restricting all urls in the test folder and urls that began with "test".

A lesson learned here
jim25 is offline
Reply With Quote
View Public Profile Visit jim25's homepage!
 
Old 09-23-2011, 06:39 PM Re: Page restricted by robots.txt that shouldn't be
vangogh's Avatar
Post Impressionist

Latest Blog Post:
Why Responsive Design?
Posts: 10,815
Name: Steven Bradley
Location: Boulder, Colorado
Trades: 0
That makes sense. I'm glad you figured it out. Something so simple that it gets overlooked is usually the problem.
__________________
l Search Engine Friendly Web Design |
Please login or register to view this content. Registration is FREE

l Tips On Marketing, SEO, Design, and Development |
Please login or register to view this content. Registration is FREE

l
Please login or register to view this content. Registration is FREE
|
Please login or register to view this content. Registration is FREE
vangogh is offline
Reply With Quote
View Public Profile Visit vangogh's homepage!
 
Reply     « Reply to Page restricted by robots.txt that shouldn't be
 

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are Off
Pingbacks are Off
Refbacks are Off





   
RSS Feed  Feeds: RSS   JS   XML
RSS Feed  Feeds for this forum: RSS   JS   XML



Page generated in 0.51318 seconds with 12 queries