Tycoon Talk

The Google Forum


Help with GSiteCrawler
03-28-2007, 09:15 AM Help with GSiteCrawler
Junior Talker

Posts: 1
Name: Eric
Location: Boston, MA
Trades: 0
I've been using GSiteCrawler for a few weeks now and have made two successful sitemaps, but now it seems to be acting up. It doesn't heed my filter/ban URL requests. I'm trying to filter out our community discussion boards by banning all links containing /community/, but those pages keep coming up in my URL table. The crawler seems to get lost in all these pages. I've even left it running overnight, and it still wasn't finished in the morning.

Has anybody had these issues before, or have any suggestions? The filters have worked for me before, but it seems like I'm gambling each time I try running it - or I'm missing a preference option somewhere.
03-28-2007, 11:09 AM Re: Help with GSiteCrawler
chrishirst's Avatar
Missing! presumed drunk.

Posts: 42,385
Name: Chris Hirst
Location: Blackpool. UK
Trades: 0
robots.txt is the place to restrict crawling, where it will be in force for all compliant bots.

A Google sitemap is only there to help Google crawl the site. It certainly should not be relied on to stop some pages from being accessed.
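As a rough illustration of the robots.txt approach, the sketch below checks a rule that blocks everything under /community/ using Python's standard urllib.robotparser module. The rule text, the example.com URLs, and the is_allowed helper are assumptions made up for this example; they are not taken from the OP's site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content: block every compliant bot
# from anything under /community/.
ROBOTS_TXT = """\
User-agent: *
Disallow: /community/
"""


def is_allowed(url: str, user_agent: str = "*") -> bool:
    """Return True if the rules in ROBOTS_TXT let user_agent fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network access is needed.
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # example.com is a placeholder host, not the OP's site.
    for url in (
        "http://example.com/community/forum/thread-1",
        "http://example.com/products/index.html",
    ):
        print(url, "allowed" if is_allowed(url) else "blocked")
```

Only bots that honor robots.txt will respect such a rule, which is the "compliant bots" caveat above.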
__________________
A foolish consistency is the hobgoblin of little minds
Thought for today:- Is SEO the only industry where all the cowboys are Indians?
03-30-2007, 11:37 AM Re: Help with GSiteCrawler
charlesgan's Avatar
hosting-rebate.com

Posts: 279
Location: hosting-rebate.com
Trades: 0
I've noticed that Googlebot is getting more efficient compared to a few months ago.
Submit your URL, and the site will get indexed very quickly.
03-30-2007, 02:08 PM Re: Help with GSiteCrawler
chrishirst's Avatar
Missing! presumed drunk.

Posts: 42,385
Name: Chris Hirst
Location: Blackpool. UK
Trades: 0
Read the post, please!

The OP is trying to restrict the crawling.
04-03-2007, 03:43 AM Re: Help with GSiteCrawler
Junior Talker

Posts: 1
Name: John Mueller
Location: Switzerland
Trades: 0
Can you post some specifics (or mail me through the form on the program's site)? I use the filters a lot, including to exclude whole sections of sites, and so far I have had few problems with them. One thing that will not work, however, is adjusting the filters on the fly while it is crawling. You will have to stop the crawlers, wait until they're empty, re-filter the URLs, and ideally restart the program for the adjusted filters to take effect (once URLs are in the crawler, in the URL table, or in the crawler cache, they need to be flushed out before you continue).
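To make the "re-filter the URLs" step concrete, here is a minimal sketch of dropping already-collected URLs that match a ban pattern before the crawl resumes. This is not GSiteCrawler's code or its filter syntax; the BANNED_PATH_PARTS list, the refilter function, and the example.com URLs are hypothetical names used only for illustration.

```python
from urllib.parse import urlsplit

# Hypothetical ban list: any URL whose path contains one of these is dropped.
BANNED_PATH_PARTS = ["/community/"]


def refilter(urls, banned=BANNED_PATH_PARTS):
    """Return only the URLs whose path contains none of the banned parts."""
    kept = []
    for url in urls:
        path = urlsplit(url).path
        if not any(part in path for part in banned):
            kept.append(url)
    return kept


if __name__ == "__main__":
    # Placeholder URL table; example.com stands in for the real site.
    url_table = [
        "http://example.com/index.html",
        "http://example.com/community/forum/thread-1",
        "http://example.com/about.html",
    ]
    for url in refilter(url_table):
        print(url)
```

The point of the sketch is the ordering: URLs that were queued before the filter changed stay in the table until something explicitly removes them, which is why the crawlers have to be stopped and emptied first.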