i have a number of existing sites that want to aggregate into one parent site. looking for someone that has extensive experience in scraping content and aggregating it ideally in php. (i would like use wordpress to publish the parent site)
the script/application should be web based, and run off either a windows or linux server (your preference, I have both).
it should be able to manage an unlimited number of pages (some of the sub sites have 10k+ pages), and also scrape the images if there are any.
multi-threading would be helpful to increase speed.