I am trying to develop one software for finding out the non indexed pages for the particular site. I wanna know suggestion from you people , how should I start , any logic in your mind, please suggest me.......
I would start by parsing the XMl sitemap for a site then scrape the pages off Google and cross reference. Pretty sure there's already a tool for doing this mind you, but can't remember the name.