Tycoon Talk
Become a Big fish!
The number 1 forum for online business!
Post topics, ask questions, share your knowledge.
Tycoon Talk is part of Freelancer.com - find skilled workers online at a fraction of the cost.

The Database Forum


You are currently viewing our The Database Forum as a guest. Please register to participate.
Login



Reply
2 Simple Concepts I'd Appreciate if Clarified...
Old 11-07-2010, 12:14 PM 2 Simple Concepts I'd Appreciate if Clarified...
Junior Talker

Posts: 1
Trades: 0
Hi,

I am looking to start a website that, basically, is a database-driven search engine. So basically, i will have a database of all the data (via MySQL) and i will have a search engine script that searches through this database and outputs relevant info.

Now, 2 issues i'd greatly appreciate if someone can clarify;

1. The data that i want on the database is all on one website (www.yorku.ca). What im pondering about is how would i go on about extracting all that data- ie. the yorku.ca website has a listing of all their courses, how would i go on about getting only their courses and their course descriptions?

Do i have to do it manually? Or would i have to make some sort of bot? If a bot, are there any sources that can point me in the right direction for developing something like this?

2. Now, the real issue that is more or less just based on experience (which i don't have on this specific topic). So say i got all the data on my database. Now i need to make the search engine, what kind of search engine do i need (lets assume the data is not alot its just of about 100 000 entities).

Now i know the search engine is not like a Google-like script, what i need is a search engine that just searches through a database and nothing else. But after researching a bit, it was not only hard to find database-driven search engine scripts but kind of confusing. One i found was:

Arch Intranet Search Engine: http://www.atnf.csiro.au/computing/software/arch/

But i'm not quite sure if this is the best solution out there.

If anyone can give me resources or point me in the right direction, i'd greatly appreciate it. Thanks!
infinitone is offline
Reply With Quote
View Public Profile
 
 
Register now for full access!
Old 11-07-2010, 04:57 PM Re: 2 Simple Concepts I'd Appreciate if Clarified...
chrishirst's Avatar
Missing! presumed drunk.

Posts: 42,385
Name: Chris Hirst
Location: Blackpool. UK
Trades: 0
Quote:
how would i go on about getting only their courses and their course descriptions?
Ask them if the provide an XML feed or something.
__________________
Chris. ->>
Please login or register to view this content. Registration is FREE
<<-

A foolish consistency is the hobgoblin of little minds
Thought for today:- Is SEO the only industry where all the cowboys are Indians?
chrishirst is online now
Reply With Quote
View Public Profile Visit chrishirst's homepage!
 
Old 11-25-2010, 06:53 AM Re: 2 Simple Concepts I'd Appreciate if Clarified...
Super Spam Talker

Posts: 880
Name: Paul W
Trades: 0
What he says! Even if you do work out a satisfactory program to spider the site and extract (reliably) course-only data you'd still face the prospect of doing this regularly and frequently to cater for changes in data. Changes in format of presentation could also cause problems.For the second question, I think you need to look at some basic concepts about databases and database searches, ie queries, plus a look at what programming you'll need to accept queries and present results. http://www.tizag.com/mysqlTutorial/ is one handy and understandable quide.
PaulW is online now
Reply With Quote
View Public Profile
 
Reply     « Reply to 2 Simple Concepts I'd Appreciate if Clarified...
 

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are Off
Pingbacks are Off
Refbacks are Off





   
RSS Feed  Feeds: RSS   JS   XML
RSS Feed  Feeds for this forum: RSS   JS   XML



Page generated in 0.42621 seconds with 12 queries