Tycoon Talk
Become a Big fish!
The number 1 forum for online business!
Post topics, ask questions, share your knowledge.
Tycoon Talk is part of Freelancer.com - find skilled workers online at a fraction of the cost.

Coding Forum


You are currently viewing our Coding Forum as a guest. Please register to participate.
Login



Reply
Old 06-12-2007, 01:20 PM Tip - robots.txt
Learning Newbie's Avatar
Defies a Status

Latest Blog Post:
Astounding Republican Paranoia
Posts: 5,662
Name: John Alexander
Trades: 0
I don't know if Blogger will allow this, but I've been thinking it might be a good idea to add a robots.txt file to my blog, and make sure Google isn't crawling my RSS feed and seeing duplicate content. Have been doing homework on this, and I found something interesting that I didn't know.

robots.txt ( all lower case ) has to be in Unix format, with a LF or for VBScript people Chr$(10) for line endings. Not CR+LF - Chr$(13) + Chr$(10) like in Windows - and not simply CR or Chr$(13) like on the Mac, who just had to be different. Your html can be in whatever format you want, Unicode or ASCII, but robots.txt is expected to be a particular way, and while some programs will recognize other formats, a lot of spiders won't.
__________________

Please login or register to view this content. Registration is FREE


Please login or register to view this content. Registration is FREE
Learning Newbie is offline
Reply With Quote
View Public Profile
 
 
Register now for full access!
Reply     « Reply to Tip - robots.txt
 

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are Off
Pingbacks are Off
Refbacks are Off





   
RSS Feed  Feeds: RSS   JS   XML
RSS Feed  Feeds for this forum: RSS   JS   XML



Page generated in 0.23244 seconds with 12 queries