The objective is to process a video and identify clusters of similar audio within it. For example, a video of two people talking would produce two clusters, each with a list of the segments (i.e. start time/end time) that belong to it.
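As a rough illustration of the expected output, the sketch below groups already-labelled segments into clusters. It assumes some upstream step (not specified in this posting) has assigned a cluster label to each segment; the names `Segment`, `group_segments`, and the labels `speaker_a`/`speaker_b` are hypothetical.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    start: float  # segment start time, in seconds
    end: float    # segment end time, in seconds


def group_segments(labeled):
    """Group (cluster_label, start, end) tuples into {label: [Segment, ...]}.

    The labels are assumed to come from an upstream clustering step.
    """
    clusters = defaultdict(list)
    for label, start, end in labeled:
        clusters[label].append(Segment(start, end))
    # Keep each cluster's segments in chronological order.
    for segments in clusters.values():
        segments.sort(key=lambda s: s.start)
    return dict(clusters)


# Hypothetical example: two people talking in turn.
labeled = [
    ("speaker_a", 0.0, 4.2),
    ("speaker_b", 4.2, 9.0),
    ("speaker_a", 9.0, 12.5),
]
result = group_segments(labeled)
```

Here `result` holds one entry per cluster, each listing its start/end segments in time order.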
A simple UI is required to take in video URLs; each video is then processed and the results are displayed. The UI will allow the user to perform some lightweight actions, such as accepting one segment or rejecting another. The user will also be able to look up entries in a database that stores audio fingerprints. Everything is to be built on Google’s cloud/App Engine platform.
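The accept/reject actions could be backed by something as small as the sketch below. It is only one possible shape, assuming each segment has a unique ID; the class names `Review` and `SegmentReview` and the IDs used are hypothetical, and persistence (for example in a cloud datastore) is left out.

```python
from enum import Enum


class Review(Enum):
    PENDING = "pending"
    ACCEPTED = "accepted"
    REJECTED = "rejected"


class SegmentReview:
    """Tracks the user's accept/reject decision for each segment ID."""

    def __init__(self, segment_ids):
        # Every segment starts out unreviewed.
        self._status = {sid: Review.PENDING for sid in segment_ids}

    def accept(self, sid):
        self._set(sid, Review.ACCEPTED)

    def reject(self, sid):
        self._set(sid, Review.REJECTED)

    def _set(self, sid, status):
        if sid not in self._status:
            raise KeyError(f"unknown segment: {sid}")
        self._status[sid] = status

    def with_status(self, status):
        """Return the segment IDs currently in the given status."""
        return [sid for sid, s in self._status.items() if s is status]


# Hypothetical segment IDs for one processed video.
review = SegmentReview(["seg-1", "seg-2", "seg-3"])
review.accept("seg-1")
review.reject("seg-3")
```

After these calls, `seg-1` is accepted, `seg-3` is rejected, and `seg-2` is still pending.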
8 freelancers are bidding on average $643 for this job
We are a team of four with more than 7 years of experience in web development, and we run our own small company. Just give us a chance and you will be happy with our services.