Em Andamento

Scrape Content from [url removed, login to view] - Repost

Hi Expertio,

i just saw that you already did the following, maybe you want to do the same thing for me. We are talking about a small site: [url removed, login to view]://[url removed, login to view]

I need a script written to scrap a website from archive.org. The script will remove all [url removed, login to view] tags/ads in the code, and download all files int the same as original folders and sub folders.

The downloaded website should be complete as it is on [url removed, login to view], and able to be uploaded without further code modification.

Example:

I provide an URL like [url removed, login to view]://[url removed, login to view]; to the script, and it will get ALL content on the page (including subpages)

The URL Structure of the site musn't change.

Need simple web interface, where I enter the starting [url removed, login to view] URL

Each site recovery should contain all pages in HTML format,

All images that the sites was using should e downloaded.

URL structure of the sites should be exactly as it was with original site including links to images internal and outbound links.

Files passing variables (example ending with ?dvar=variable) should also be saved as original

Small budget.

Habilidades: HTML, MySQL, PHP, Captura de dados na web

Ver mais: get web content written, get html code url, archive, remove content, web site recovery, remove content url, scrap url, org website script, url structure, want scrap website, omagold, ads already written, web archive, interface simple interactive, scrap web page, ending script web page, interface simple android, scrap content, android interface simple, download repost script, archive files, scrap content website, gui interface simple

Acerca do Empregador:
( 4 comentários ) Berlin, Germany

ID do Projeto: #5128611

Premiar a:

Expertio

Hired by the Employer

$35 USD em 1 dia
(32 Avaliações)
6.2