Tycoon Talk
Become a Big fish!
The number 1 forum for online business!
Post topics, ask questions, share your knowledge.
Tycoon Talk is part of Freelancer.com - find skilled workers online at a fraction of the cost.

The Google Forum


Scraping Google results without violating TOS?
Old 06-30-2011, 01:31 PM Scraping Google results without violating TOS?
Super Talker

Posts: 113
Trades: 0
I made a little script that scrapes Google results so I could check my site's keyword positions in Google. It works great, but I only recently found out that what I'm doing is against Google's TOS.

However, there are tools out there (such as the DigitalPoint tracker tool) that have tons of users and presumably aren't violating Google's TOS, since Google could easily identify them and shut them down. How can I scrape Google search results without violating the TOS?

I found the JSON/Atom Custom Search API which appears to be a paid service for over 100 queries/day. Would using this API, or another Google API, allow me to use my tool without violating Google's TOS?
Learnin' n00b is offline
Old 06-30-2011, 02:12 PM Re: Scraping Google results without violating TOS?
chrishirst's Avatar
Missing! presumed drunk.

Posts: 42,385
Name: Chris Hirst
Location: Blackpool. UK
Trades: 0
Yep, that's why Google created the APIs.

If you used the DP tool, you would have had to register your OWN API key, which allows YOU, as the logged-in user, to perform up to a thousand queries via the API per twenty-four hours.
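
For what it's worth, here is a minimal sketch of how a position check through the Custom Search JSON API might look in Python. The API key and search engine ID are placeholders you would get from your own Google account, `build_request_url`, `find_position` and `check_rank` are made-up helper names, and the response shape (an `items` list whose entries carry a `link` field) is as I understand the API documentation, so check the docs before relying on it.

```python
import json
from urllib.parse import urlencode, urlparse
from urllib.request import urlopen

API_ENDPOINT = "https://www.googleapis.com/customsearch/v1"


def build_request_url(api_key, engine_id, query, start=1):
    """Build the request URL for one page of results."""
    params = {"key": api_key, "cx": engine_id, "q": query, "start": start}
    return API_ENDPOINT + "?" + urlencode(params)


def find_position(items, domain, offset=0):
    """Return the 1-based rank of the first result on `domain`, or None."""
    for index, item in enumerate(items, start=1):
        host = urlparse(item.get("link", "")).netloc.lower()
        if host == domain or host.endswith("." + domain):
            return offset + index
    return None


def check_rank(api_key, engine_id, query, domain):
    """Fetch the first page of results and look for `domain` in it."""
    url = build_request_url(api_key, engine_id, query)
    with urlopen(url) as response:  # each call counts against your daily quota
        data = json.load(response)
    return find_position(data.get("items", []), domain)
```

Something like `check_rank(MY_KEY, MY_ENGINE_ID, "blue widgets", "example.com")` would then give the position on the first page, or `None` if the site isn't there. Every call uses up one query from the daily allowance.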
__________________

A foolish consistency is the hobgoblin of little minds
Thought for today:- Is SEO the only industry where all the cowboys are Indians?
Old 06-30-2011, 02:20 PM Re: Scraping Google results without violating TOS?
chrishirst's Avatar
Missing! presumed drunk.

Posts: 42,385
Name: Chris Hirst
Location: Blackpool. UK
Trades: 0
Don't forget, though, that the API queries do NOT come from the same datacentres as the web search or toolbar queries.
 
Old 06-30-2011, 03:42 PM Re: Scraping Google results without violating TOS?
Super Talker

Posts: 113
Trades: 0
Thanks, Chris!
Learnin' n00b is offline
 
Old 07-01-2011, 12:09 PM Re: Scraping Google results without violating TOS?
Super Talker

Posts: 113
Trades: 0
I was reading the API TOS today and noticed the following:

Quote:
2.2 Interaction with CS Servers. The CS Implementation must not interfere with the normal functioning of the CS Servers, and Customer acknowledges that Google may monitor CS API activity in order to prevent such interference. Except as expressly permitted by the API Documentation, Customer will not use any automated means (such as the use of scraping, robots or load testing) to access, query or otherwise collect CS related information. Google may set usage limits on features of the CS API.
Does that mean I would be violating the TOS (and the other trackers would be too), since such a tool would be considered "automated"?
Learnin' n00b is offline
 
Old 07-01-2011, 03:18 PM Re: Scraping Google results without violating TOS?
chrishirst's Avatar
Missing! presumed drunk.

Posts: 42,385
Name: Chris Hirst
Location: Blackpool. UK
Trades: 0
The API documentation tells you what is allowed and what isn't. But basically "automated" means repetitive queries or bulk queries.

The idea of using the API is not for firing off massive amounts of queries, but rather for getting search data in different formats.
 
Old 07-01-2011, 05:31 PM Re: Scraping Google results without violating TOS?
Extreme Talker

Posts: 219
Name: Tom
Trades: 0
If you are going to write your own, you'll definitely want to add some "human emulation" features. Most ranking tools have this. Basically you have one setting for the minimum number of seconds to wait between search queries (or before moving to the next page), then a second setting for the maximum random delay added on top of that minimum.

For example, if MIN SEC BETWEEN QUERIES is 15 seconds and MAX RANDOM DELAY is 20 seconds, then your software would wait between 15 and 35 seconds between search queries (or moving between pages).
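
The arithmetic above can be sketched in a few lines of Python. This is only an illustration under my own assumptions: `next_delay` and `run_throttled` are hypothetical names, and `fetch` stands for whatever function actually performs one query.

```python
import random
import time


def next_delay(min_seconds=15.0, max_random_delay=20.0, rng=random):
    """Pick a wait of min_seconds plus a random extra up to max_random_delay."""
    return min_seconds + rng.uniform(0.0, max_random_delay)


def run_throttled(queries, fetch, min_seconds=15.0, max_random_delay=20.0,
                  sleep=time.sleep):
    """Call fetch(query) for each query, pausing between consecutive calls."""
    results = []
    for i, query in enumerate(queries):
        if i > 0:  # no need to wait before the very first query
            sleep(next_delay(min_seconds, max_random_delay))
        results.append(fetch(query))
    return results
```

With the defaults, every pause falls somewhere between 15 and 35 seconds, which matches the example settings. Passing `sleep` in as a parameter is just a convenience so the loop can be tried out without actually waiting.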

It's silly, though, to try to write your own these days. With all of the personalization that goes on at Google now (like geo-targeting), it's MUCH better to buy a tool written by someone who knows what they are doing.
Social-Media is offline
 
Old 07-01-2011, 06:04 PM Re: Scraping Google results without violating TOS?
Super Talker

Posts: 113
Trades: 0
Quote:
Originally Posted by Social-Media View Post
If you are going to write your own, you'll definitely want to add some "human emulation" features. Most ranking tools have this. Basically you have one setting that states the minimum number of seconds to wait between search queries (or moving to the next page). Then a second setting for a maximum random delay to wait between search queries (or moving to the next page).

For example, if MIN SEC BETWEEN QUERIES is 15 seconds and MAX RANDOM DELAY is 20 seconds, then your software would wait between 15 and 35 seconds between search queries (or moving between pages).

It's silly though to try to write your own these days. With all of the personalization that goes on at Google these days (like geo-targeting) it's MUCH better to buy a tool which was written by someone who knows what they are doing.
I'm not trying to do anything shady, though. That's why I'm trying to make sure I don't violate Google's TOS.
Learnin' n00b is offline
 
Old 07-05-2011, 03:56 PM Re: Scraping Google results without violating TOS?
Super Talker

Posts: 113
Trades: 0
Quote:
Originally Posted by chrishirst View Post
Don't forget though, that the API queries do NOT come from the same datacentres as the web search or toolbar queries come from.
As you said would happen, the data from the API seems to differ from what a Google.com search returns. Does this happen because the API datacenters are not updated as often or something?
Learnin' n00b is offline
 
Old 07-05-2011, 04:52 PM Re: Scraping Google results without violating TOS?
chrishirst's Avatar
Missing! presumed drunk.

Posts: 42,385
Name: Chris Hirst
Location: Blackpool. UK
Trades: 0
The whole database and index system is in a permanent state of churn called everflux, where there is no "master" database, and the servers and datacentres are continually receiving data from crawlers and from other database and index servers.
The system is never synchronised (see Rankings and the Google "Dance").

The API query servers/datacentres are in the same system, but API and toolbar queries are routed away to keep the load balanced.

Do nslookup queries on

google.com
search.google.com
toolbarqueries.google.com

and you will see they have low TTL and expire periods, which allows Google to rotate the servers/DCs very quickly (use set q=soa to see the time values).
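
If you would rather script those lookups than type them by hand, a rough Python sketch could look like this. It assumes the `nslookup` program is on your PATH and that your build accepts `-type=soa` as the one-shot equivalent of `set q=soa`; `soa_command` and `query_soa` are names I made up, and some of these hostnames may not resolve any more.

```python
import subprocess

# Hostnames taken from the post above.
HOSTS = ["google.com", "search.google.com", "toolbarqueries.google.com"]


def soa_command(host):
    """Build the nslookup command that asks for a host's SOA record."""
    return ["nslookup", "-type=soa", host]


def query_soa(host, timeout=10):
    """Run nslookup and return its raw text output (needs network access)."""
    completed = subprocess.run(soa_command(host), capture_output=True,
                               text=True, timeout=timeout)
    return completed.stdout
```

Looping over `HOSTS` and printing `query_soa(host)` should show the refresh, retry, expire and minimum TTL fields for each zone, in whatever layout your nslookup version uses.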
 
Old 07-10-2011, 07:32 AM Re: Scraping Google results without violating TOS?
Average Talker

Posts: 14
Name: al re
Trades: 0
I use a free Yahoo API for the same type of work, and I use other tools for each of the other search engines. It works like a charm.
rezax123 is offline