I have betwen 700-800 HTML files (poorly formatted). The content varies, but the layout is pretty much the same (but they do not use table HTML tags unfortunately).
I need someone that can scrap very specific data out of these files and present it to me in an Excel, CSV or MySQL dump file with the specific columns that I want.
Attached is an example of what the HTML files look like, and an example of the data I would like to be extracted and how the information extracted would go into specific columns.
Please take a careful look at the example HTML file and the example Excel file.
Also, I am an individual, this is for personal purposes, I am not a company, and my budget is limited.
I take it this would involve you writing a script (Python, PHP, whatver) to automatiaclly go through all the HTML files, extracting the data and putting it into a CSV file or similar. So the question about how hard it is and how long it takes, I would say depends on how quickly would you be able to write a script that does this. Once the script is done, as you know, the work would be done for you of course.
I am not interested in whatever script you write, just on the data that would be collected.
43 freelancers estão ofertando em média $128 para esse trabalho
Hi sir, I'm professional web data extractor, I checked your files, I can write a script to scrape all data from html files, wait for your reply, I can complete this project within 1 day. thanks. Regards Jianhua.
hey, I have seen the attached file and I can get data from these HTML files. I will make a BOT to get this data but the data will be properly formated. Lets talk more in chat.
Hi, I can write a script for you in python that will extract the data you need from these html files. I am availble to start working on this task immediately, Please do not hesitate to contact me, Regards.
Hi, If the HTML files have the same structure that can be done. I can do that very quickly within a day. Contact me for more information thanks, Pandelis