Find Jobs
Hire Freelancers

Parallel Python Code That Counts How Many Websites Have Canvas

$10-30 USD

Concluído
Publicado há mais de 6 anos

$10-30 USD

Pago na entrega
I need a simple Python script that scrapes a list of websites in a csv file (e.g. top 500,000 Alexa sites attached), and checks if the website uses Canvas in the HTML (by checking for "<Canvas>") or in JavaScript (by checking for "createElement("canvas")" or "createElement('canvas')"). The code should output the number and percentage of websites using Canvas out of the list. It is recommended that the code uses the Python Libraries “Requests” and/or "BeautifulSoup4" with a similar logic as the one I started writing (attached). The following points need to be satisfied: • The code uses parallel computing for efficiency, so it doesn't run for so long • The http header has to look like it came from a real browser, so websites don't block it • The reading time of a website should not exceed 30 seconds, and should time out if no response for 30 seconds and go to the next website • The script needs to count and print the number of successfully read and unread sites from the csv file of top sites (as the one I am attaching does for the unread). The unread sites could be because a website is no longer available or responsive, or any other reason • The script needs to handle errors and doesn't crash • The script has to print the duration of execution (how many hours, minutes or seconds) • The script has to print the number and percentage of sites containing Canvas either in the HTML source code or JavaScript It would be great if we can have a version that is not parallel to compare the performance, but not super important
ID do Projeto: 15614099

Sobre o projeto

1 proposta
Projeto remoto
Ativo há 6 anos

Quer ganhar algum dinheiro?

Benefícios de ofertar no Freelancer

Defina seu orçamento e seu prazo
Seja pago pelo seu trabalho
Descreva sua proposta
É grátis para se inscrever e fazer ofertas em trabalhos
Concedido a:
Avatar do Usuário
I am a python expert and i can do your work. i can start immediately. and complete your work on time.
$30 USD em 1 dia
4,9 (161 avaliações)
6,2
6,2

Sobre o cliente

Bandeira do(a) SAUDI ARABIA
Alkhobar, Saudi Arabia
5,0
3
Método de pagamento verificado
Membro desde fev. 9, 2014

Verificação do Cliente

Obrigado! Te enviamos um link por e-mail para que você possa reivindicar seu crédito gratuito.
Algo deu errado ao enviar seu e-mail. Por favor, tente novamente.
Usuários Registrados Total de Trabalhos Publicados
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Carregando pré-visualização
Permissão concedida para Geolocalização.
Sua sessão expirou e você foi desconectado. Por favor, faça login novamente.