|
My site is an off root PHP generated ecomerce store, I place html versions of those pages in www root that constitute the catalog, so to speak, to increase site speed and customer experience. These are the pages included in my sitemap.xml. To prevent duplicate content penalties, I have used robots.txt to disallow all PHP versions of those pages.
The problem:
Although I have a good amount of backlinks to my html pages, I just noticed that those that are directed to the PHP pages are being disallowed by robots.txt thus not listed or counted. With what I just read linked from the sticky post on thids subject is it safe for me to assume that:
If I allow the php versions of my catalog pages to be indexed I will suffer no duplicate content penalty and since the html catalog IS my sitemap.xml, they will most likely be the ones chosen for listing. Correct?
I am worried that allowing the php versions to be indexed may cause my site to seem to the search engines to be of poor structure and content, thus hurting my rankings. Something to worry about?
Last edited by dogbyteman; 09-20-2008 at 06:46 PM..
|