Completed

Programmatically searching a huge database and sorting results

The goal is to find all of the most common strings that precede each word or phrase in domain names. This is NOT a manual data entry job!

We have a very large database of ALL .com domain names and another list containing thousands of unique words or phrases. We need to take each word or phrase from the second list and search the whole domain database to find every instance of that word or phrase ONLY IF it starts within the first 7 characters of the domain name. The goal is to create a list of the most common words that precede each term, ordered from most popular to least popular, and show the quantity of each.

To give you an example, if the original term is “REBATE” (term number 1753), you would search the database of nearly 100 million domain names, finding any domain name that contains the term REBATE starting within the first 7 characters of the name. Then list all of the occurrences, grouped by the characters that precede the term and sorted from most frequent to least. So the outcome should look something like this (JUST AN EXAMPLE!):

1753,REBATE
THE-REBATE,18
YOUR-REBATE,9
CASH-REBATE,8
EASY-REBATE,8
RAPID-REBATE,7
HOME-REBATE,5
MAILIN-REBATE,5
TAX-REBATE,4
NOWAIT-REBATE,3
GETA-REBATE,2
ETC…
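As a rough illustration of the example above, here is a minimal Python sketch for a single term. It rests on several assumptions that the posting does not confirm: the hyphen in lines such as THE-REBATE is only a display separator and is not part of the domain, a domain that begins with the term itself (no preceding characters) is skipped, only the first qualifying occurrence in each domain is counted, and ties are broken alphabetically. The function names `count_prefixes` and `format_term` are hypothetical.

```python
from collections import Counter

# Assumption: "starts within the first 7 characters" means the term's
# 0-based start index is at most 6, so the prefix is 1 to 6 characters long.
MAX_START = 6


def count_prefixes(term, domains):
    """Count the strings that precede `term` in each domain name."""
    term = term.lower()
    counts = Counter()
    for domain in domains:
        name = domain.strip().lower()
        if name.endswith(".com"):
            name = name[:-4]
        # Search only start positions 1..MAX_START (0 would mean no prefix).
        pos = name.find(term, 1, MAX_START + len(term))
        if pos != -1:
            counts[name[:pos]] += 1
    return counts


def format_term(term_id, term, counts):
    """Render one term in the 'ID,TERM' then 'PREFIX-TERM,count' layout."""
    lines = [f"{term_id},{term.upper()}"]
    # Most frequent first; ties broken alphabetically (an assumption).
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    for prefix, n in ranked:
        lines.append(f"{prefix.upper()}-{term.upper()},{n}")
    return "\n".join(lines)
```

For example, `format_term(1753, "REBATE", count_prefixes("REBATE", domains))` would return one block in the layout shown above, where `domains` is any iterable of domain-name strings.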

You also have to be able to open a rar file.

The output should be a single text file listing multiple terms, one after another, like the above.
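One possible way to produce that single file is to scan the domain list once and check every term during that one pass, rather than rescanning 100 million names for each term. The Python sketch below is illustrative only. It assumes the terms arrive as a mapping of term number to term, that domains are plain strings ending in ".com", that a term must have at least one preceding character, and that each domain counts at most once per term. The names `count_all` and `write_report` are hypothetical.

```python
from collections import Counter, defaultdict

# Assumption: the term's 0-based start index must be between 1 and 6.
MAX_START = 6


def count_all(terms, domains):
    """terms: {term_id: term}. Returns {term: Counter of preceding strings}."""
    wanted = {t.lower() for t in terms.values()}
    lengths = sorted({len(t) for t in wanted})
    counts = defaultdict(Counter)
    for domain in domains:
        name = domain.strip().lower()
        if name.endswith(".com"):
            name = name[:-4]
        seen = set()  # count each term at most once per domain
        for start in range(1, MAX_START + 1):
            for n in lengths:
                if start + n > len(name):
                    break  # longer terms cannot fit either
                piece = name[start:start + n]
                if piece in wanted and piece not in seen:
                    seen.add(piece)
                    counts[piece][name[:start]] += 1
    return counts


def write_report(terms, counts, out):
    """Write every term, one block after another, to a text stream."""
    for term_id, term in terms.items():
        out.write(f"{term_id},{term.upper()}\n")
        found = counts.get(term.lower(), {})
        # Most frequent first; ties broken alphabetically (an assumption).
        for prefix, n in sorted(found.items(), key=lambda kv: (-kv[1], kv[0])):
            out.write(f"{prefix.upper()}-{term.upper()},{n}\n")
```

Passing a file opened with `open("results.txt", "w")` as `out` (the file name is a placeholder) would write all terms into one text file. The set lookup keeps the work per domain bounded by 6 start positions times the number of distinct term lengths, regardless of how many terms there are.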

We have thousands of terms to search. Our goal is to find who can do this the most efficiently, so we will award this to multiple freelancers and ask each of them to put in ONE HOUR of work. Then we will select whoever completes the most terms (and does it properly) to continue. We may work with more than one freelancer, or we may find that one stands out and give them all the work.

I have a good track record as an employer on Freelancer and an even bigger positive record on Upwork, which I’ve used for several years. I’m doing it this way because it’s impossible to tell how efficient and accurate a worker is just from their profile or from talking to them. This will be a lot of work for whoever is best at it. In the worst case, if someone else is faster and gets more done, you’ll still get a positive review and a completed job.

The first step is to answer some questions.

***No automatic proposals. If you don’t answer these questions fully, you won’t be considered.

What tools would you use to do this?

Have you worked with a database of 100 million records before?

What’s the biggest database you’ve worked with before?

What would be your goal or expectation in: Terms Searched and Sorted / hour?

Skills: Big Data Sales, Data Processing, Data Extraction, Data Analytics, Data Scraping, Web Scraping

About the Employer:
(5 reviews) Salt Point, United States

Project ID: #27394396

Awarded to:

obsurf

Hello. I've checked your description in detail. ***** I'm using Python for this, and have deal with oracle phone number database. ***** I am very experienced, honest, have good skills, and also have much availability t…

$35 USD / hour
(0 reviews)
0.0
(11 reviews)
5.4

29 freelancers are bidding on average $25/hour for this job

(4 reviews)
6.8
(43 reviews)
6.4
nvbishr

What tools would you use to do this? I will write js code with [login to view URL] to do the job Have you worked with a database of 100 million records before? No What’s the biggest database you’ve worked with before? 25m What would…

$30 USD / hour
(21 reviews)
5.1
(20 reviews)
4.8
ItMasterDev

you don't need a database for this type of job. if you have a dB you need just a plain csv output and some Linux bash command to filter and sort string inside your file. as long as the file is smaller than available…

$28 USD / hour
(6 reviews)
4.1
RusselExpert

Greetings. Thanks for your post. As a senior fullstack developer who has 10+ years of experience, I can deliver satisfactory product in a high quality. What tools would you use to do this? I will use c/c++ because it's…

$25 USD / hour
(3 reviews)
4.1
gargankit642

Nice to meet you I am a Machine Learning expert In all domains such as industry, economy and biomedicine, any hard problem can be resolved using Artificial Intelligence Techniques. In my PhD research, I have developed…

$34 USD / hour
(1 review)
3.0
kaloyan13

I have very good parallel programming skills, which we can use to solve your task. What tools would you use to do this? - C/C++ and OpenMP or Pthreads Have you worked with a database of 100 million records before? - No…

$20 USD / hour
(4 reviews)
3.2
rishabsingla003

Hello there, I throughly checked the requirements and really well understand it. First I need to know the type of DB we're using like: Mysql, Oracle, or any other. Frankly says, It can't be possible to traverse 1 Mill…

$10 USD / hour
(2 reviews)
2.8
burakozdiltr

Hello, how are you? Database manager is here. Your project is clear for me and can be started right now. Please feel free to contact me. Best regards. Burak.

$20 USD / hour
(1 review)
2.0
yanakhokhlova199

Hello there! Happy to bid here since I have the capability to build your project. I am a database expert and have rich experience in manipulating, sorting data. So I think you’d better discuss with me for clear require…

$25 USD / hour
(1 review)
1.6
nikolaytoplev

- I will use Selemium. - Yes, I have ever worked with that kind of database. - It's MongoDB - I don't make sense what your question is, but if you ask me in chat, I can answer whatever it is. I have read your detail.…

$25 USD / hour
(1 review)
1.2
nitinpathak14199

Hi, Myself Nitin Pathak from India. I have been working as Internal Auditor & i receive all data from clients in excel then i make that raw data into Meaningfull data using excel basics & advance Features like V lookup…

$10 USD / hour
(0 reviews)
0.0
piraprakash379

I have 9 years of experience in software development. I recently worked on something similar with client where they will search address based on some keywords like 'street 9' searched within 100million property address…

$22 USD / hour
(0 reviews)
0.0
bradleyglazer

Hi, I like your approach, nice. The will answers your questions are as follows: I would use python, I am learning HTML, php and javascript at the moment, but like coding at a lower level. I coded in ruby for Standard…

$20 USD / hour
(0 reviews)
0.0
aryacena12

I've skills in data entry , data processing , MS-Excel , MS-Word and many more. Plus I complete my work right on time and if you hire me I'll never let you down . It would be pleasure to work with you . Ping me if you…

$40 USD / hour
(0 reviews)
0.0
pmoleleki

I will be using SAS to perform the work I have got more 15 years in data analytics and data quality

$22 USD / hour
(0 reviews)
0.0
Lavish28

Please give me a chance i am beginner i will try my best i will make u happy to my work i alaways do my best in everything work

$25 USD / hour
(0 reviews)
0.0
prachishinkar25

Hello, I am experience in Data Analysis, Data cleaning, Data manipulation, Data mining. I performed many roles in Searching and Sorting as well as worked in dirty data. I can do all work using programming so it will b…

$25 USD / hour
(0 reviews)
0.0
KarlSvend

Hello! I provide data collection services, through web scraping and text mining, for data interpretation, comparison, composition, distribution and relationship. Feel free to contact me!

$25 USD / hour
(0 reviews)
0.0