Find Jobs
Hire Freelancers

Fill in a Spreadsheet with Data

$30-250 USD

Cancelado
Publicado há mais de 7 anos

$30-250 USD

Pago na entrega
This task is about scraping information from a website using names (politician’s names): There are a total of 252728 names. Make sure to observe the 252728 rows. Also, make sure to be able to read the Ñ character. Some names have it. Instructions: 1. For each row of the excel file attached ([login to view URL] – this is a comma delimited text file), go to the link [login to view URL] and enter the key information: NOMBRES, APPELIDO PATERNO, and APELLIDO MATERNO. Then click on “buscar”. 2. Click on the person that exactly matches the key information entered before. Scrape information on FECHA DE NACIMIENTO and DNI (or DOCUMENTO NACIONAL DE IDENTIDAD), if available. These are next to the person’s picture. 3. Then click in “procesos electorales”. Scrape ALL information from every column but “HOJA DE VIDA” and “MAS DATOS”. (See [login to view URL] for tabulation details). Please add the key information (NOMBRES, APPELIDO PATERNO, and APELLIDO MATERNO) next to every row of information scrapped. 4. Then click in every “HOJA DE VIDA” available (these should only be available since 2006). Note that this is the link within the “procesos electorales” subtab. This not the upper right link that says “ver hoja de vida”. 5. Scrape ALL information from each “HOJA DE VIDA”. Copying and pasting this information in separate excel files is ok for me. This is what I have done in the example. Note that there is a clear identifier for each HOJA DE VIDA in example.xlsx. The identifier (in this case) is the DNI-electoralprocesss combination. Each DNI corresponds to a single person (it can be found in the HOJA DE VIDA). And each person submitted at most 1 HOJA DE VIDA per electoral process (“proceso electoral”). DNI works pretty well for me, but you can use whatever is easier for you as long as each “HOJA DE VIDA” can be uniquely identified. 6. I am attaching and example ([login to view URL]). 7. I expect than in (probably less than) 1% of the cases, you will find that identical NOMBRES, APELLIDO PATERNO, and APELLIDO MATERNO correspond to more than one individual (for instance, try APELLIDO PATERNO = “CHACON”, APELLIDO MATERNO = “VASQUEZ”, NOMBRES = “JOSE ELOY”). Record each case separately (i.e., as different people). They should have different DNIs if available. Also the values of the columns in the subtub “procesos electorales” (see bullet 3) in addition to APELLIDO PATERNO, APELLIDO MATERNO and NOMBRES most likely work as a unique identifier. Please, give a quote and estimated time that this project will take you. Javier.
ID do Projeto: 11208477

Sobre o projeto

Projeto remoto
Ativo há 8 anos

Quer ganhar algum dinheiro?

Benefícios de ofertar no Freelancer

Defina seu orçamento e seu prazo
Seja pago pelo seu trabalho
Descreva sua proposta
É grátis para se inscrever e fazer ofertas em trabalhos

Sobre o cliente

Bandeira do(a) UNITED STATES
Durham, United States
5,0
3
Método de pagamento verificado
Membro desde ago. 6, 2016

Verificação do Cliente

Obrigado! Te enviamos um link por e-mail para que você possa reivindicar seu crédito gratuito.
Algo deu errado ao enviar seu e-mail. Por favor, tente novamente.
Usuários Registrados Total de Trabalhos Publicados
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Carregando pré-visualização
Permissão concedida para Geolocalização.
Sua sessão expirou e você foi desconectado. Por favor, faça login novamente.