Web document addresses are URLs for the HTTP URI schema.
HTML is the language that defines the structure and layout of a web document.
Each post does NOT have its own HTML document it is the SAME "HTML" that is accessible by multiple URLs. (Addresses).
The analogy of "robots" to spiders is simply because most species of spider create a "web" to inhabit or catch their prey in, and as SE bots "live" on the World Wide Web the name stuck.
The name of "crawler" is also incorrect as SE bots do not "crawl" from point to point they are instructed by other software agents to go to a particular location and retrieve the data.
The correct term for a software agent that retrieves data is a "user agent", as it is controlled by a user and in the case of SEs the "user" is the data retrieval scheduler (crawl queue in SE parlance)
Technically they are "robots" in the truest meaning of robot as they are independant automatons reacting to a programmed sequence of triggers and instructions.
__________________
Chris. ->> Please login or register to view this content. Registration is FREE <<-
A foolish consistency is the hobgoblin of little minds
Thought for today:- Is SEO the only industry where all the cowboys are Indians?
|