Em Andamento

Site Scrape Import into Database

We'd like to automate the import of DVD data into our database from partner sites. This data includes title, stars, box art photo link, dvd features, runtime, date released etc.

We'd like to scan DVD UPC's and have that data passed to a vendor site via a URL to find the product. This is supported by the partner site (eg. [url removed, login to view]).*contains adult content*

If that product is found, we'd like the the path (with our affiliate code) to be written to our database as well as the dvd information found there.

----So site #1 data to be scraped based on UPC scan:

Title

Product Photo

Affiliate link ([url removed, login to view];upc=BAR CODE #)

Price/format

Format (ie DVD)

Release Date

Cast/Stars

Studio Name

Runtime

To get other dvd info that is not found on site #1 we'd like the title to be used to search site #2 and scrape the information and saved to our database. (ie. [url removed, login to view];lid=listing)

----So site #2 data to be scraped based on title search from site #2:

Product ID

DVD Features

Price/format

Format (DVD, PayPerView, VideoOnDemand...)

DVD Features

Director

We will also need a regular feed setup to grab a zip file and import the data into our database from site #3. See project clarification board for format specifications. The matching Video on Demand title will be auto-populated when searching/scraping sites #1 & #2.

If the UPC scanned does not give a result from site #1, we'd like the interface to provide entering the title that will search site #2 and the data from site #2.

---So if no UPC match, scrape of site #2 based on manual entry of a title includes:

Title

Product ID

DVD Features

Price/format

Format (DVD, PPV, VOD, DIVX...)

DVD Features

Director

Cast/Stars

Runtime

Studio Name

Release Date

If no match is found on either site by UPC scanning or manual title entry, we'd like an option to either enter the data manually or skip and continue scanning.

This solution can be an online or client-based application. There must be a configuration setting for affiliate codes/paths that are written to the db, paths for photos being written, etc. Offline app. (Intel/PC) must be fault proof, meaning when end of scanning is completed and an upload of the data is performed, it can't choke on one record or entry being mal-formed. (we are not committed to any specific solution-so just an example).

Habilidades: Programação C, Javascript, PHP

Ver mais: scrape vod sites, scrape import, vod client, video into, upload video get url, skip searching, site vod, site scraping online, searching for art, scan code online, online photo video, it director, intel com, gamelink com, gamelink, find the art, find photo online, find art info, code offline, auto codes, scan dvd database, affiliate link code vod mallcom, scrape amazon dvd art, scrape imports, database vendor

Acerca do Empregador:
( 13 comentários ) Riverview, United States

ID do Projeto: #154008