How To Make A Spider?
Physicsguy, 12-23-2009, 10:51 AM
Hello,

I've seen posts all over these forums, some from so long ago that the site was still called 'EarnersForums'. They all say that spiders are easy to make: all they do is grab the contents of a web page and stick it in a database, which is then searched.

I, however, have no idea on how to make one.

I've heard of Sphider, and it looks pretty good, but I don't need anything that complicated. All I need it to do is look through a flat-file database (let's call it database.php), take all the entries in it (separated by a '|'), and index them. I'll work out keywords afterwards. For now, I just need the title.

The problem, though, is that the content in database.php shows up on a page that uses URL parameters to decide which chunk of content to display. Example: http://www.example.com/page.php?content=2
That URL would display my second content 'chunk'.

I need the spider to index that page, with that URL, parameters and all, because if the parameters aren't there, then the link would lead to the home page of page.php, not page.php?content=2.
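
A rough sketch of that requirement, in Python, might look like the following. It is only an illustration under loud assumptions: that each '|'-separated entry in database.php is just a title, and that entry number N (counting from 1) is the one shown at page.php?content=N. The name build_index is made up for this example.

```python
from urllib.parse import urlencode

BASE_URL = "http://www.example.com/page.php"  # example URL from the post


def build_index(raw, base_url=BASE_URL):
    """Map each '|'-separated entry to the full URL that displays it."""
    index = []
    # Assumption: entry N (counting from 1) is shown at ?content=N.
    for number, entry in enumerate(raw.split("|"), start=1):
        title = entry.strip()
        if not title:
            continue  # skip empty chunks, e.g. from a trailing '|'
        # Keep the query string, so the link never falls back to the
        # home page of page.php.
        url = base_url + "?" + urlencode({"content": number})
        index.append({"title": title, "url": url})
    return index
```

With the contents "First chunk|Second chunk", the second row would carry the URL http://www.example.com/page.php?content=2, parameters and all.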

So far, I have found no tutorials, no hints, nothing. In case you don't know, I'm the kind of person who has to make everything for themselves. I don't like to download others' work. I make it myself, as part of a learning experience, I guess.

I'm not asking for a spider to be written for me in a forum post (and I wasn't asking about the CMS thing, either), but for some simple framework, in English, not code.

Thank you, I hope this isn't too big of a task to ask for.

-PG

Last edited by Physicsguy; 12-23-2009 at 10:52 AM..
 
chrishirst, 12-23-2009, 11:43 AM
You are mixing up TWO things: indexing and crawling.
All a "crawler" needs to do is:

Open the URL.
Read the source code.
Store the source code in a database.

That's it. Your crawler is done.
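
The three steps above can be sketched in a few lines of Python. This is a minimal illustration, not a finished crawler: the SQLite table name pages and the function names fetch_source and crawl are assumptions made up for the example, and there is no error handling, politeness delay, or robots.txt check.

```python
import sqlite3
from urllib.request import urlopen


def fetch_source(url):
    """Steps 1 and 2: open the URL and read its source as text."""
    with urlopen(url, timeout=10) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


def crawl(url, conn, fetch=fetch_source):
    """Step 3: store the source code in the database, keyed by URL."""
    source = fetch(url)
    # Hypothetical schema: one row per URL, overwritten on re-crawl.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS pages (url TEXT PRIMARY KEY, source TEXT)"
    )
    conn.execute(
        "INSERT OR REPLACE INTO pages (url, source) VALUES (?, ?)",
        (url, source),
    )
    conn.commit()
    return len(source)
```

Calling crawl("http://www.example.com/page.php?content=2", sqlite3.connect("pages.db")) would store that one page. The fetch parameter is there only so the storage step can be tried without touching the network.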
 
chrishirst, 12-23-2009, 11:53 AM
BTW, if you want to pick through VBScript, I have a load of routines already built for scraping a page and extracting various elements from it,

such as this one, which extracts all the links and their anchor text (the alt attribute for image links): http://www.candsdesign.co.uk/tools/s...ck/text-links/
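
For readers who would rather not pick through VBScript, the same idea can be sketched in Python with the standard html.parser module. This is not a port of the routine linked above (its source is not shown in this thread), just a small illustration of the technique; LinkExtractor and extract_links are names invented for the example.

```python
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collect (href, anchor text) pairs; image links use the alt text."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None  # href of the <a> we are currently inside
        self._text = []    # text pieces seen inside that <a>

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "a" and attrs.get("href"):
            self._href = attrs["href"]
            self._text = []
        elif tag == "img" and self._href is not None:
            # Image inside a link: use its alt attribute as the text.
            self._text.append(attrs.get("alt") or "")

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            # Collapse runs of whitespace in the collected text.
            text = " ".join("".join(self._text).split())
            self.links.append((self._href, text))
            self._href = None


def extract_links(html):
    parser = LinkExtractor()
    parser.feed(html)
    parser.close()
    return parser.links
```

Feeding it the source of a stored page gives back a list of (URL, text) tuples that an indexer can work from.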
 
Physicsguy, 12-23-2009, 12:12 PM
Ah, then all I need is an indexer! I already have a 'crawler'; it simply adds the info to the database whenever an entry is added.
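
One common way to build a simple indexer is an inverted index: a map from each word to the pages whose titles contain it. The Python sketch below assumes the titles are already available as a dict of URL to title. The function names are invented for the example, and nothing here is specific to Sphider or any other existing tool.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_inverted_index(pages):
    """pages: dict of url -> title. Returns word -> set of urls."""
    index = defaultdict(set)
    for url, title in pages.items():
        for word in tokenize(title):
            index[word].add(url)
    return index


def search(index, query):
    """Return the URLs whose titles contain every word in the query."""
    words = tokenize(query)
    if not words:
        return []
    matches = set.intersection(*(index.get(w, set()) for w in words))
    return sorted(matches)
```

Searching then never touches the pages themselves, only the index, which is why the crawling and indexing steps can be kept separate.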
 
chrishirst, 12-23-2009, 05:02 PM
That's where it can get complicated.

An indexer has to deal with so MANY possible kinds of "messed up" code.
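
One way to cope with sloppy markup is to use a forgiving parser instead of string matching. As a small illustration, Python's standard html.parser does not insist on well-formed input, so a title can still be pulled from a page with uppercase tags, unclosed elements, and stray closing tags. TitleFinder and page_title are made-up names, and this covers only the title, not every kind of broken page.

```python
from html.parser import HTMLParser


class TitleFinder(HTMLParser):
    """Pull the <title> text out of a page, tolerating sloppy markup."""

    def __init__(self):
        super().__init__()
        self.in_title = False
        self.parts = []

    def handle_starttag(self, tag, attrs):
        # html.parser lowercases tag names, so <TITLE> matches too.
        if tag == "title":
            self.in_title = True

    def handle_endtag(self, tag):
        if tag == "title":
            self.in_title = False

    def handle_data(self, data):
        if self.in_title:
            self.parts.append(data)


def page_title(html, default="(untitled)"):
    """Return the whitespace-normalized title, or a default if none."""
    finder = TitleFinder()
    finder.feed(html)
    finder.close()
    title = " ".join("".join(finder.parts).split())
    return title or default
```

Falling back to a default keeps the indexer running when a page has no usable title at all.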