|
These comments are getting absurd.
To see how it's done, build your own search engine. It's not that hard (you'll also see that Chris was right in his post).
So what are the steps?
Well first, you'll need something to visit websites and follow links. Let's just call this a crawler.
Next, you'll need the crawler to store copies of the pages it find so they can be indexed somewhere, let's call this storage place a cache.
You're going to need something to build a search tree from the copies of the pages in the cache, let's call this, the index.
Since a lot of websites go offline and change frequently, a nice service to your users could be an option to display the pages that you keep in your cache (that matched the search), let's call this the public cache.
Sometimes the simplest way to figure out what's going to is to go through the steps of a re-implementation.
|