Hi,
I need software that scrapes the Amazon bestsellers (top 100) from any category I choose and outputs the URL, title, author, description, and tags of each item to a CSV file. I should be able to preview the output before it is saved.
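A minimal sketch of the CSV step with a preview, assuming the scraped items arrive as a list of dictionaries. The function names (`rows_to_csv_text`, `preview`, `save_csv`) are only illustrative, and the scraping itself is not shown:

```python
import csv
import io

# Column order taken from the brief: url/title/author/description/tags.
FIELDS = ["url", "title", "author", "description", "tags"]

def rows_to_csv_text(rows):
    """Render the scraped rows as CSV text without touching the disk."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    for row in rows:
        # Missing fields become empty cells instead of raising an error.
        writer.writerow({name: row.get(name, "") for name in FIELDS})
    return buffer.getvalue()

def preview(rows, limit=5):
    """Return the header plus the first `limit` rows for on-screen review."""
    return rows_to_csv_text(rows[:limit])

def save_csv(rows, path):
    """Write all rows to `path` once the preview has been accepted."""
    with open(path, "w", newline="", encoding="utf-8") as handle:
        handle.write(rows_to_csv_text(rows))
```

Rendering to text first means the preview and the saved file come from the same code, so what I see is what gets written.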
As soon as the program accesses the URL of any of the top 100, it should take a screenshot of the page (above the fold only, 1024 x 768, not the full page) and save it as a JPG, using the title of the product or book as the file name, e.g. '[login to view URL]'. The JPG will be stored on the local PC in a predetermined folder, and the local path to the file should be added to the CSV output, e.g. c:/mypc/software/mypics/my-cool-book(1).jpg
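A rough sketch of how the JPG file name and path could be built from the title. The helper names are hypothetical, and the numbering rule is an assumption: the first file gets no counter and later duplicates get (1), (2), and so on. Taking the screenshot itself would need a browser automation tool and is not shown:

```python
import re
from pathlib import PureWindowsPath

def slugify(title):
    """Turn 'My Cool Book' into 'my-cool-book' so it is safe as a file name."""
    slug = re.sub(r"[^a-z0-9]+", "-", title.lower()).strip("-")
    return slug or "untitled"

def screenshot_path(title, folder, used):
    """Build the JPG path for a title; `used` counts names already handed out."""
    base = slugify(title)
    count = used.get(base, 0) + 1
    used[base] = count
    # Assumption: first file has no counter, repeats get (1), (2), ...
    name = f"{base}.jpg" if count == 1 else f"{base}({count - 1}).jpg"
    return PureWindowsPath(folder, name).as_posix()
```

The returned string is what would go into the CSV column for the screenshot path.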
I should be able to define an optional prefix and/or suffix for any output field. For example, if the title is 'my cool book' and I choose the suffix 'today', the output would be 'my cool book today' for every title scraped. The same applies to the URL or any other field.
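The prefix/suffix option could look something like this. It is only a sketch, and `decorate` and the `affixes` mapping are made-up names:

```python
def decorate(row, affixes):
    """Return a copy of `row` with the chosen prefix/suffix added per field.

    `affixes` maps a field name to a (prefix, suffix) pair; fields that are
    not listed are left unchanged.
    """
    result = dict(row)
    for field, (prefix, suffix) in affixes.items():
        if field in result:
            result[field] = f"{prefix}{result[field]}{suffix}"
    return result

# Example: add the word 'today' to every title.
row = {"title": "my cool book", "url": "https://example.com/item"}
print(decorate(row, {"title": ("", " today")})["title"])  # my cool book today
```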
Please check the page source carefully, because books and products use different tags (maybe you can add a way to detect which is which).
We should find a way to use a setting that I can change to match the required field. For example, if we are looking for <div style='whatever'>any content I want</div> and Amazon changes 'whatever' to 'idontcare', the program should have an option that lets me update it (the new tag would then be <div style='idontcare'>any content I want</div>), so that I can keep using the program normally after checking the source code. Alternatively, the program could raise an alert when the expected tag is no longer found, or look at the CSS file. Is that clear? You decide.
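One possible shape for the changeable setting, sketched with Python's standard html.parser. The `SELECTORS` table and its 'whatever' value are placeholders from the example above, not Amazon's real markup, and `extract` returns a list of missing fields that the program could use to raise the alert:

```python
from html.parser import HTMLParser

# Placeholder rules I can edit: field -> (tag, attribute, expected value).
SELECTORS = {"title": ("div", "style", "whatever")}

class FieldExtractor(HTMLParser):
    """Collect the text inside the first tag that matches each rule."""

    def __init__(self, selectors):
        super().__init__()
        self.selectors = selectors
        self.found = {}
        self._field = None  # field currently being captured, if any
        self._tag = None
        self._depth = 0     # nesting depth of same-named tags

    def handle_starttag(self, tag, attrs):
        if self._field is not None:
            if tag == self._tag:
                self._depth += 1
            return
        attr_map = dict(attrs)
        for field, (want_tag, attr, value) in self.selectors.items():
            if field not in self.found and tag == want_tag and attr_map.get(attr) == value:
                self._field, self._tag, self._depth = field, tag, 1
                self.found[field] = ""
                return

    def handle_endtag(self, tag):
        if self._field is not None and tag == self._tag:
            self._depth -= 1
            if self._depth == 0:
                self.found[self._field] = self.found[self._field].strip()
                self._field = None

    def handle_data(self, data):
        if self._field is not None:
            self.found[self._field] += data

def extract(html, selectors):
    """Return (found fields, list of fields whose rule matched nothing)."""
    parser = FieldExtractor(selectors)
    parser.feed(html)
    parser.close()
    missing = [field for field in selectors if field not in parser.found]
    return parser.found, missing
```

If `missing` is not empty, the program can warn me that the page changed, and I only need to edit the value in `SELECTORS` (for example 'whatever' to 'idontcare') to carry on.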
I will add more things to this software later; this is only the first part. The budget is very low, so don't expect to get rich. It is a small project, a test, so bid accordingly.
Sample URLs:
[login to view URL]
or
[login to view URL]écor/zgbs/toys-and-games/166210011/
or
[login to view URL]
Regards,
J