Encerrado

Wikipedia data dump miner

I 'm looking for wikipedia and machine learning expert.

- Are you an expert Wikipedia dump files?

- Do you love to write scripts that automate extractions?

- Which scripting languages do you already know? Python, Bash?

- Work closely with our teams building user experiences and collaborative machine learning algorithms.

What do you think of this fist task

1. given two languages , say en and zh.

2. and a page category , like Living people.

3. and a specified WP dump date.

4. generate a set of sets of name string.

where each set has all of the en redirects and zh redirects for a given pair of en-zh linked titles.

For example, the set for Vladimr Putin's page would have all his redirects in English as well as his page name in Chinese and all of its redirects.

If you like that as a starting task, please give me an hour estimate for it and we can start a contract with that as the first task going forward we have a bunch of tasks of this kind.

Habilidades: Aprendizado de Máquina , Python, Wikipedia

Ver mais: wikipedia backup, hadoop wikipedia, dbpedia, wikipedia miner, wikipedia dump, want write living, wikipedia data dump, xml wikipedia data, php data dump, importing wiktionary data dump, wikipedia html dump, wikipedia history dump, test wikipedia database dump, processing wikipedia database dump, wikipedia database dump

Acerca do Empregador:
( 0 comentários ) Beijing, China

ID do Projeto: #14845502

14 freelancers estão ofertando em média $22/hora para este trabalho

vorasiddh4it

You can see my last project which are based on Algorithm Development Machine Learning and I can complete your project perfectly. We have 10+ years experience in software development. We have developed 400+ projects Mais

$20 USD / hora
(7 Comentários)
4.2
usmanvardag

I can write a script using Python's request library that will generate a set of sets of name string, based on your specified criteria. The request library is really powerful and allows features such as persistent sess Mais

$25 USD / hora
(3 Comentários)
3.6
ramani86

I prefer python and have played around with wikipedia...mostly with wiki pages, categories, api etc. and not necessarily with redirects, but I understand redirects....

$25 USD / hora
(4 Comentários)
3.2
revival786

Hi, I am a professional in providing high quality Wikipedia pages. I am highly interested in your project. Please PM to discuss more. Revival My Portfolio & Reviews: https://www.freelancer.com/u/revival786 W Mais

$15 USD / hora
(1 Comentário)
3.1
$22 USD / hora
(1 Comentário)
2.2
anil348techwires

We are specialist in Python Development, PHP Development and Digital Marketing. Give me a chance to proof we are the best

$22 USD / hora
(0 Comentários)
0.0
webbookstudio

Hello, my name is Michael. I represent Ukrainian based IT-company Webbook Inc that provides services in the IT-sphere for international business. We were carefully reviewing the requirements of the job description, so Mais

$22 USD / hora
(0 Comentários)
0.0
asadkhanking22

i would like to offer you my expertiseas I have done number of my academic projects and I am a professional in the field Contact me and I’ll show you what i am capable of

$22 USD / hora
(0 Comentários)
0.0
$22 USD / hora
(0 Comentários)
0.0
mediaj

I have hands on expertise in python ( beautiful soup ) web crawling, I am also a data engineer where day job involves creating data pipelines for extraction, transformations.

$27 USD / hora
(0 Comentários)
0.0
$22 USD / hora
(0 Comentários)
0.0
$22 USD / hora
(0 Comentários)
0.0
DmytroF

I'm CTO at datascraping [dot] club, we provide data scraping and websites scrapping services, have a lot of experience with machine learning and data scrapping in general. Would love to chat about your project and shar Mais

$22 USD / hora
(0 Comentários)
0.0
AshotJanibekyan

I have been editing Wikipedia more then 3 years, also I have use Pywikibot with my own scripts. Beside that, I love everything related to Wikipedia and I will do this job with love :).

$15 USD / hora
(0 Comentários)
0.0