Need a Java developer who can implement web crawling of Google search results. The crawling should work like this:
There would be a db table with the following fields:
Sl. No, Website, Keyword, Crawl interval, updated_links (comma-separated)
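To make the table layout concrete, one row could be modeled roughly as below. This is only a sketch under assumptions: the class name `CrawlTarget`, its method names, the use of Google's `site:` operator to target the website, and passing the last crawl time in as a parameter are illustrative choices, not part of the requirement. The unit of the crawl interval is not specified here, so the sketch takes a `Duration`.

```java
import java.time.Duration;
import java.time.Instant;
import java.util.LinkedHashSet;
import java.util.Set;

// Hypothetical in-memory model of one table row (names are assumptions).
final class CrawlTarget {
    final int slNo;
    final String website;
    final String keyword;
    final Duration crawlInterval;
    final String updatedLinks; // comma-separated URLs, may be empty

    CrawlTarget(int slNo, String website, String keyword,
                Duration crawlInterval, String updatedLinks) {
        this.slNo = slNo;
        this.website = website;
        this.keyword = keyword;
        this.crawlInterval = crawlInterval;
        this.updatedLinks = updatedLinks == null ? "" : updatedLinks;
    }

    // Assumption: the search is restricted to the website with "site:".
    String searchQuery() {
        return keyword + " site:" + website;
    }

    // True once at least one full crawl interval has passed since lastCrawled.
    boolean isDue(Instant lastCrawled, Instant now) {
        return !now.isBefore(lastCrawled.plus(crawlInterval));
    }

    // Returns the new updated_links value: existing URLs first, then any
    // new ones, with duplicates dropped and insertion order preserved.
    String withNewLinks(Iterable<String> newUrls) {
        Set<String> all = new LinkedHashSet<>();
        for (String url : updatedLinks.split(",")) {
            if (!url.trim().isEmpty()) {
                all.add(url.trim());
            }
        }
        for (String url : newUrls) {
            all.add(url.trim());
        }
        return String.join(",", all);
    }
}
```

Note that a comma-separated field cannot safely hold URLs that themselves contain commas, so such URLs would need encoding before being stored.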
Based on the Crawl interval, the crawler should automatically perform a Google search for the Keyword, targeting the Website given for that particular Keyword. In the search results, Google shows when each URL within the website was updated (e.g., 45 minutes ago, 3 hours ago, 2 days ago, Aug 22, 2013, and so on). If the search results contain a URL (or URLs) updated after the last crawled time (say n minutes ago, n days ago, etc.), then those URLs need to be inserted into the updated_links field.
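The comparison against the last crawled time could be sketched as follows. This is a minimal illustration, not a complete crawler: it assumes the age labels have already been extracted from the result page as plain strings, that they follow the formats quoted above, and that a date-only label such as "Aug 22, 2013" means midnight UTC on that day. The names `ResultAgeParser`, `parse`, and `updatedSince` are hypothetical.

```java
import java.time.Duration;
import java.time.Instant;
import java.time.LocalDate;
import java.time.ZoneOffset;
import java.time.format.DateTimeFormatter;
import java.time.format.DateTimeParseException;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class ResultAgeParser {
    // Matches labels like "45 minutes ago", "1 hour ago", "2 days ago".
    private static final Pattern RELATIVE = Pattern.compile(
        "(\\d{1,6})\\s+(minute|hour|day)s?\\s+ago", Pattern.CASE_INSENSITIVE);
    // Matches labels like "Aug 22, 2013".
    private static final DateTimeFormatter ABSOLUTE =
        DateTimeFormatter.ofPattern("MMM d, yyyy", Locale.ENGLISH);

    // Converts an age label to an approximate update time, or empty if
    // the label is in a format this sketch does not recognize.
    static Optional<Instant> parse(String label, Instant now) {
        String text = label.trim();
        Matcher m = RELATIVE.matcher(text);
        if (m.matches()) {
            long n = Long.parseLong(m.group(1));
            switch (m.group(2).toLowerCase(Locale.ROOT)) {
                case "minute": return Optional.of(now.minus(Duration.ofMinutes(n)));
                case "hour":   return Optional.of(now.minus(Duration.ofHours(n)));
                default:       return Optional.of(now.minus(Duration.ofDays(n)));
            }
        }
        try {
            LocalDate date = LocalDate.parse(text, ABSOLUTE);
            // Assumption: a date-only label is treated as midnight UTC.
            return Optional.of(date.atStartOfDay(ZoneOffset.UTC).toInstant());
        } catch (DateTimeParseException e) {
            return Optional.empty();
        }
    }

    // Returns the URLs whose label resolves to a time after lastCrawled.
    // Keys of labelByUrl are URLs, values are their age labels.
    static List<String> updatedSince(Map<String, String> labelByUrl,
                                     Instant lastCrawled, Instant now) {
        List<String> fresh = new ArrayList<>();
        for (Map.Entry<String, String> e : labelByUrl.entrySet()) {
            parse(e.getValue(), now)
                .filter(updated -> updated.isAfter(lastCrawled))
                .ifPresent(updated -> fresh.add(e.getKey()));
        }
        return fresh;
    }
}
```

Because the labels are coarse ("2 days ago" covers a whole day), a result updated shortly after the previous crawl may be missed or reported twice near the boundary; how to treat such cases would need to be agreed on.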