Please tell me:
2) Are you capable of developing without a browser, or with a headless browser if necessary? (Once the RDP session is closed, the GUI will terminate.)
3) If you are using Python, please tell me which libraries you are proficient with (examples: BS4, Scrapy, Selenium, lxml, Requests). If you are using Java, let me know which libraries you use (examples: jsoup, Selenium). If you are using another language, let me know which libraries you use for it.
• 7 root sites and in total 48 subsites.
• The script will be run on Windows Server 2016.
• As you know, the RDP session will terminate, so the script will sometimes need to use a headless browser in the background.
• The script takes a list of names and generates direct links for 99%-100% of them (and crawls the sites for the remainder). There will be many different input files; the format always remains the same, but the data/names will differ.
• All of the data is in a table on the site.
• All output formats and documentation are written.
• Basic features such as enabling/disabling sites, custom crawl delay, pause, play, skip, on-screen status display, and custom timeout limits/retry attempts are required.
• Proxy rotation functionality is required.
• 1 site has a login.
• Should be optimized for efficient use of memory and CPU, and should use API links when possible.
• 5 Root sites, 0 subsites.
• The script will submit the same input file to the sites and use each site's "download to Excel" feature.
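To illustrate the direct-link generation, crawl delay, retry and proxy rotation requirements listed above, here is a minimal Python sketch. It is only an illustration under assumptions: the URL template, the function names (`build_direct_links`, `fetch_with_retries`) and the injected `fetch` callable are hypothetical and are not taken from the unfinished script or from any of the target sites.

```python
import itertools
import time
from urllib.parse import quote_plus


def build_direct_links(names, url_template):
    """Turn each non-empty input name into a direct URL.

    url_template is a hypothetical per-site pattern with a {query} placeholder.
    """
    return [
        url_template.format(query=quote_plus(name.strip()))
        for name in names
        if name.strip()
    ]


def fetch_with_retries(url, fetch, proxy_pool, max_retries=3,
                       crawl_delay=1.0, sleep=time.sleep):
    """Call fetch(url, proxy), taking the next proxy from the pool on each try.

    fetch is any callable supplied by the caller (for example a thin wrapper
    around an HTTP client); it is injected so the retry logic stays testable.
    """
    last_error = None
    for attempt in range(max_retries):
        proxy = next(proxy_pool)  # rotate to the next proxy
        try:
            return fetch(url, proxy)
        except Exception as exc:  # sketch only: real code should narrow this
            last_error = exc
            if attempt < max_retries - 1:
                sleep(crawl_delay)  # wait before the next attempt
    raise RuntimeError(
        f"giving up on {url} after {max_retries} attempts"
    ) from last_error


# Example wiring (placeholder values, not real endpoints or proxies):
proxy_pool = itertools.cycle(["proxy-a:8080", "proxy-b:8080"])
links = build_direct_links(
    ["John Smith", "Jane Doe"],
    "https://example.com/search?q={query}",
)
```

Sharing one `itertools.cycle` pool across all calls keeps the rotation going from request to request, and passing `fetch` and `sleep` in as arguments lets the same logic be used with either plain HTTP requests or a headless browser.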
I am the project manager and a Windows System/Networking Administrator with strong IT expertise, high project feedback, and 5 years of experience here. I'll provide plenty of testing and system resources, such as a few Windows VPSs. Contact me if you are serious about the project. Python is preferred but not required. Long-term work and more projects are available. An unfinished script is available.
24 freelancers are bidding an average of $321 for this job
With respect to this project, I would like to present myself as a candidate for your consideration. 1) Development language: Java. 2) Yes, I am capable of developing without a browser. 3) I will be using Java.
Hi! I am very interested in your project. As a scraping expert, I will be the best candidate for your project. If you assign the project to me, I will deliver it with high quality and on time. Thanks!