go to a specific site
pick a first option from dropdown menu
each option offers from 1up to 300 pages
each page has 24 subpages
each subpage contains information we need
- location of the image or direct image download
- small text 1
- small text 2
- small text 3
- small text 4
- small text 5
all this info is found in a table with a particular string
- links that we need from 2 to 50
some links are not needed they can be detected by particular string
There are at least 50.000 subpages with data to extract
but when extracting pattern is set there shouldn't be a problem
as the computer does it itself.
My friends tell me the fastest way is to use regular expressions so I guess I recommend that method.
14 freelancers are bidding on average $472 for this job
Hello Sir , I am highly interested in this project i have 3 years++ experience with web crawling .... please provide me a chance to work for you............Thanks
Hello, i have expertise in web scraping. Please disclose the website URL. The data will be extracted with xpath and regex. If you are interested in my bid, please contact me. Thank You. Best Regards.
i already made script for extracting phone numbers from around 30 pages , i can show you the demo on my system , i can make this script for you easily.