Cancelado

Domain & Email Harvester(repost)

We Need a piece of software that will generate working internet domains /website addresses.

the software will also be able to produce working email addresses for the domains.

the software MUST be able to generate the above 2 by importing a feed of company names & zip/postcodes and telephone numbers and contact names that we already hold

## Deliverables

Some technical and business points...

?

? ? ? ? ? ? ? ? ? It will utilize a CSV list in the format? (ReferenceNumber, CompanyName, PostalCode, PhoneNumber, FirstName, Surname)

? ? ? ? ? ? ? ? ?

? ? ? ? ? ? ? ? ? It will utilize search engines (check the major engines to determine which offers the highest quality results)

? ? ? ? ? ? ? ? ? ? Will search for the company name, and information? provided in the csv file, to find the best matches.

?

? ? ? ? ? ? ? ? ? It will scan through the DNS information for each search result to determine if it is an accurate match, based on known information.

? ? ? ? ? ? ? ? ? ? ? If a match is found, it will store the information in the output file

?

? ? ? ? ? ? ? ? ? ? ? ? if a match is not found, it will scan other results to determine if they are a match (a max result option will be provided to optimize running time).

?

? ? ? ? ? ? ? ? ? ? ? ? If match is found, option will be provided to allow a 'mini harvester' to run on their web site to find email address's.

?

? ? ? ? ? ? ? ? ? ? ? ? If match is found, option will be provided to allow a 'mini harvester' to scan through the DNS info to find email address's.

?

? ? ? ? ? ? ? ? ? ? ? When application is finished running, it will output the results in this format (as csv):

? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ReferenceNumber, CompanyName, PostalCode, PhoneNumber, FirstName, Surname, URL, Email-1, Email-2, Email-3, etc

1. Domain searcher / mini harvester

?

Yes we do require these to be split so that we can switch the harvester on / off. We also require that the harvester can be run separately. i.e we would have 3 options: (i) run domain searcher with the harvester automatically,

(ii) run the domain searcher by itself and (iii) run the mini harvester by itself. This is from a legal perspective as much as for convenience.

?

2. Search Engine(s)

?

It would be useful if there was some way we could add / remove / edit the search engine urls. If we have the source code then this is possible, but importing them from a config file when the software starts would be useful.

?

3. Results file

?

It would be easier for us to process the results file if it contained one row per url / email found rather than one row per Reference. I suspect it would be easier to programme as well. I have suggested that the URL / Email Address status is included in the output file. For example, a url might return "200/OK", "302/Found", "401/Unauthorised" and still be valid for emails.

?

For example:

? ? ? ? ? Output from domain searcher (and optional input for mini-harvester):

?

? ? ? ? ? ? ? ? ? ? ? Reference, Company, Postcode, Telephone, FName, SName, URL, <blank>, StatusText

? ? ? ? ? Output from mini-harvester for contact match:

? ? ? ? ? ? ? ? ? ? ? Reference, Company, Postcode, Telephone, FName, SName, URL, Email, StatusText

? ? ? ? ? Output from mini-harvester for domain matches for different

contacts:

? ? ? ? ? ? ? ? ? ? ? Reference, Company, Postcode, Telephone, FName, SName, URL, Email, StatusText

?

CSV format is fine. We can also handle tab-separated or tilde-separated files with or without quotes to delimit the columns for both import and export. The important point of the formats above is that they are all the same structure.

?

4. Test results

?

We would require the results from our 1k test file so that we can analyse them before purchase.

?

5. Source code

?

A copy of the source code will be? neccessary

?

Which language will the programme be written in?

?

We will hold the rights to the software??

Habilidades: PHP

Ver mais: switch domain name, programme internet, valid email address, internet programme, internet import export business, need web searcher, hold domain, email valid, email address valid, domain hold, best website formats, best domain, mini tab, email search engine, programme site web, the harvester, scan website emails, find email, find email address, find domain, find company email, php split csv files, search company email format, email contacts search, visual search engine php

Acerca do Empregador:
( 0 comentários ) United Kingdom

ID do Projeto: #3026490