|
It's probably because, after looking through some logs on a static site, I have not seen an instance of any crawlers getting a 304 response from a page where browsers do.
crawlers don't have a cache to update so probably don't request an age response. Crawlers don't "read" pages in the same way browser do, they only grab the page source and store it in the DB. Here's a bit of educated guesswork based on my understanding of inverted index storage (which is probably what is used) and how to index multi-terabytes of data in the shortest possible time.
The crawler will generate a checksum from the retrieved data and store this with the data, The indexer will pull the checksum and compare it to the previously stored one. If they match discard the new data. This way processing priority is give to new data.
if-modified headers are in the control of the server operator and can be manipulated using a self generated checksum cannot.
don't take this as gospel BTW. Simply experience and observation and as a programmer it makes sense.
__________________
Chris. ->> Links are advertising NOT optimising!! <<-
A foolish consistency is the hobgoblin of little minds
Thought for today:- I SEO the only industry where all the cowboys are Indians?
|