
Concluído
Publicado
Pago na entrega
I have a single-table Access file that holds roughly 47,000 live-music venue records. Each record has 40 fields. Years of inconsistent data entry mean the same club might appear ten different ways—and often with slightly different contact spellings as well. You may create additional tables (contacts, etc) if necessary. What I need from you • Run a smart, repeatable deduplication process that matches venues by a blend of name similarity and contact-detail similarity, not just one or the other. • Collapse every detected cluster into one master record while preserving every phone, email, address, or website tied to that venue. All contact info must survive the merge. • Return the cleaned data in the very same Access format so it can drop straight back into our internal tools. Helpful context – One table only, but up to four contacts per venue. – I want exactly one row per real-world venue when you’re done. – No data loss: if two duplicates list different bookers, both bookers must appear in the merged contact list. Deliverables 1. The deduplicated .accdb file. 2. A brief log or summary showing how many duplicates were detected and merged—helpful for audits and future imports. Acceptance criteria • Zero duplicate venue names left after fuzzy + contact match passes. • All original contact fields retained; nothing overwritten or dropped. • Database opens and runs with no broken relationships or field type changes. Let me know the tools or scripting language you plan to use (Python with fuzzywuzzy, VBA, SQL Server Integration Services, etc.) and the turnaround time you’d need. Please include your fixed price or milestone breakdown when you respond. I will provide the selected freelancer with the ACCESS database. Let me know if there are additional questions you need answered in order to provide a bid.
ID do Projeto: 40139163
88 propostas
Projeto remoto
Ativo há 2 meses
Defina seu orçamento e seu prazo
Seja pago pelo seu trabalho
Descreva sua proposta
É grátis para se inscrever e fazer ofertas em trabalhos

Hello, I am very interested in helping you clean and deduplicate your Access database. I have extensive experience working with large datasets in Microsoft Access and Python, including fuzzy matching, data normalization, and ensuring full data integrity. I plan to use Python with libraries like fuzzywuzzy for name matching and SQL queries for precise record consolidation. This approach allows for automation, repeatability, and auditing. After processing, I will provide the cleaned .accdb file along with a concise summary of duplicates detected and merged. The database will retain all original field types and relationships, ready to drop back into your internal tools. I am confident in my ability to deliver accurate, high-quality results for this dataset of 47,000 records. I can complete this work in 5 days. My proposed fixed price for the full project is 100 USD, which includes all deduplication, testing, and delivery of the cleaned database and summary report. Please let me know if you have any additional requirements or if you would like me to clarify the workflow before starting. I am ready to begin as soon as you provide the Access file. Thank you for considering my proposal.
$100 USD em 5 dias
2,0
2,0
88 freelancers estão ofertando em média $261 USD for esse trabalho

Youssef, Full-Time Freelancer and Python Programmer With a solid background in data extraction, web scraping, and automation, I've successfully tackled over 155 projects involving complex data challenges. Your Access Venue Database Deduplication project, particularly the task of unifying 47,000 records from inconsistent entries, sounds like a great fit. I see the critical need to identify duplicate venues using a blend of name and contact similarity, then consolidate them into a single master record without losing any associated phone, email, address, or website information. My approach would leverage Python, likely with Pandas for data handling and fuzzywuzzy for robust similarity matching, ensuring all contact details from up to four contacts per venue are preserved and returned in the original .accdb format with an audit log. Could you clarify how the up to four contacts per venue are structured within the 40 fields? Ready to start now.
$200 USD em 1 dia
7,3
7,3

Hello! I'm a Databases Developer, Access, Excel and VBA expert more than twenty years. I have completed many complex projects in my practice and I am ready to carry out your in the price and in the shortest possible time and with the highest quality. You can read reviews about my work at: https://www.freelancer.com/u/VladimirLilenko?w=f Regards, Vladimir
$100 USD em 4 dias
7,2
7,2

Hi I can deduplicate your 47,000-row Access database using a repeatable, audit-friendly process that blends fuzzy name matching with contact-detail similarity, not just one signal alone. The common failure in venue datasets is collapsing records too aggressively and losing bookers or alternate contacts; I’ll solve this by clustering first, then merging into a single master row while preserving all phones, emails, addresses, and websites. I typically use Python (RapidFuzz/FuzzyWuzzy + phonetic matching) with deterministic scoring rules, then write the results cleanly back to an .accdb so field types and compatibility stay intact. If needed, I’ll normalize contacts into helper tables during processing, then re-embed them so you still end with exactly one row per real-world venue. Every merge will be logged so you have a clear audit trail showing how many duplicates were detected and consolidated. The final database will open normally in Access with no broken relationships or overwritten data. I’ve handled similar deduplication jobs where accuracy mattered more than speed, and I’m comfortable working directly with Access files. Thanks, Hercules
$300 USD em 7 dias
6,9
6,9

Greetings, Thank you for considering my application for this project. As an AI Engineer and Python Developer with over 8+ years of experience, I bring a wealth of knowledge and expertise in the field of Python, Deep Learning. I have carefully reviewed the project description and am eager to discuss your specific needs and requirements in more detail. My commitment is to provide dedicated support and consistent follow-up throughout the project's lifecycle. Please feel free to reach out to me to further discuss how I can contribute to the success of your project. Looking forward to the opportunity of working together. Best regards, KuroKien
$160 USD em 1 dia
6,7
6,7

Hi, I can clean and deduplicate your Access file so each real-world venue appears once while keeping all contact info intact. I plan to use Python with fuzzy matching and SQL to ensure no data is lost. I can deliver the cleaned .accdb file along with a brief merge summary in [X] days. Could you clarify if any contacts are linked across multiple venues? Looking forward to helping you streamline this database!
$300 USD em 5 dias
6,5
6,5

Hi Derrick, Thank you for considering my proposal. With over 8 years of real-world experience and freelance work in Excel, I am equipped to assist you with this project. I have carefully reviewed the requirements and am eager to collaborate with you. I would like to connect with you in chat to discuss your project further. Regards,
$100 USD em 1 dia
6,3
6,3

Hi, I hope you're doing well. I understand you're looking for Access Venue Database Deduplication I am the ideal candidate for your project. I have read the provided job description and I understand what you are looking for. I have over 10+ years of experience Python, Data Processing, Data Entry, Excel, SQL, Microsoft Access, Data Analysis, Data Integration, Database Design, Database Management .Please feel free to further discuss the requirements and timeline for the project. I'd be happy to assist you. I am ready to start right now. ✅ No Upfront Payment ✅ Release Milestone After Completion ✅ 100% Project Completion Rate You can visit my Profile https://www.freelancer.com/u/HiraMahmood4072 Thank you
$160 USD em 7 dias
6,4
6,4

Dear Concerned I am an experienced MS ACCESS developer. I have done many projects through freelancer. You can review the feedbacks. If interested in hiring, please let me know. We can discuss the scope timelines and budger accordingly. Thanks
$100 USD em 7 dias
5,9
5,9

Hello, I understand you’re looking for a repeatable deduplication solution for a 47,000-record Microsoft Access venue database that identifies duplicates using both fuzzy venue-name similarity and contact-detail similarity. I have strong experience cleaning large Access datasets using Python + SQL logic, producing stable .accdb outputs that drop back into existing tools with no field-type changes or broken structure. I will run a structured workflow: normalize key fields (name, phone, email, website, address), then cluster likely duplicates using fuzzy matching combined with deterministic contact checks. Each cluster will be collapsed into one master venue record, and every unique phone, email, address, website, and booker/contact value will be preserved by consolidating into the existing contact fields or, where needed, adding a clean contacts table linked to the master venue—ensuring zero data loss and one row per real-world venue. You will receive the deduplicated .accdb plus an audit summary showing counts of detected duplicates, clusters merged, and merge rules used. The final database will open cleanly, remain refreshable for future imports, and meet your acceptance criteria with consistent naming, preserved contact details, and no remaining duplicates after the fuzzy + contact passes. Thanks, Asif
$300 USD em 2 dias
5,8
5,8

Hi client, I'm Denis Redzepovic, an experienced developer with expertise in Excel, Microsoft Access, Data Analysis, SQL, Data Processing, Database Design, Python, Data Integration, Database Management and Data Entry. I have worked extensively on diverse Python projects, ranging from backend development and automation to data processing and API integrations. My deep understanding of Python’s libraries and frameworks allows me to build efficient, scalable, and maintainable solutions. I pay close attention to code quality and performance to ensure your project runs flawlessly. With my solid experience, I’m confident I can deliver results that exceed your expectations. I focus on writing clean, maintainable, and scalable code because I know the difference between 99% and 100%. If you hire me, I’ll do my best until you’re completely satisfied with the result. Let’s discuss your project details so I can tailor the perfect Python solution for you. Thanks, Denis
$225 USD em 1 dia
5,5
5,5

Hello, I’m a Senior Software Engineer with extensive experience in Python automation and web scraping & C# WindowFormApp and WFP. I’ve carefully reviewed your requirements and I can deliver a reliable, production-ready solution — not a quick workaround. ✅ Clean and maintainable code ✅ Clear communication ✅ On-time delivery I’d be happy to discuss your project details and propose the best technical approach. Best regards, Samir
$180 USD em 1 dia
5,5
5,5

MS Access developer, SQL cettified, expert in data processing and ready to do the required deduplication, organize, restructuring unique row by club without loosing any part of original useful unique info. I understand the required exactly , the challenges and I know how to do it the best way. CHEERS
$250 USD em 3 dias
5,4
5,4

Using my extensive 33 years of global market experience, my sole-proprietorship style of work ensures you'll have a direct and dedicated partner on this project. Although my skills primarily revolve around marketing, advertising and app development, I have in-depth knowledge of data management including database deduplication. Understanding the value of preserving data integrity, I reassure you that no information will be overwritten or dropped during the course of deduplication. For this project, I'm suggesting the use of Python with fuzzywuzzy, a robust library we can leverage to effectively find name similarity and contact-detail similarity as you've specified. My aim is to give you a deduplicated .accdb file along with a detailed summary showing the exact number of merged duplicates for future reference. Given my tight schedule and to provide optimal focus on your project I'd need an estimated turnaround time of three days. My well-practiced availability is another advantage on this venture. As I'm accustomed to working across various time zones, any timing or scheduling needs you may have can be accommodated without issue. Overall, I offer you not only my pertinent skills but also my strong commitment and unwavering determination to deliver excellence in accuracy and timeliness on this Access Venue Database Deduplication project.
$300 USD em 79 dias
5,2
5,2

Dear Client, I am a skilled data professional with expertise in SQL, Python, and Excel. I have successfully completed similar projects in data processing and database management. I am confident in my ability to help you with the Access Venue Database Deduplication project. I propose to use a combination of fuzzy matching algorithms in Python, along with SQL queries for data manipulation, to efficiently deduplicate the venue records in your Access file. I will ensure that all contact details are preserved during the merging process and deliver the cleaned data back to you in the same Access format. I have a proven track record of delivering high-quality results within specified timelines. I am eager to discuss your project further and provide a tailored solution to meet your needs. Thank you for considering my proposal. Best regards, Ali Zahid
$200 USD em 7 dias
5,1
5,1

Hi, I have extensive experience cleaning and deduplicating large Access databases and can help with your 47,000-record live-music venue file. I would use Python with pandas and fuzzy matching libraries (RapidFuzz or FuzzyWuzzy) to identify duplicates based on both venue name similarity and contact details, then merge clusters into a single master record per venue while preserving all unique phone numbers, emails, addresses, and websites. I can create auxiliary tables if needed to handle multiple contacts per venue. The final deliverable will be a fully deduplicated .accdb file with all original contact fields retained, plus a concise log showing the number of duplicates merged. I can complete this efficiently and ensure the database opens cleanly with no broken relationships. I’m happy to provide a milestone breakdown and turnaround estimate once I have the file.
$2.300 USD em 13 dias
5,0
5,0

Hello I am working with MS-Access database and VBA for the past 22 years. I have also developed many important application using MS-Access as both front-end and back-end tool and have completed a good number of Access Projects through Freelancer.com. I would do your work and is ready to start right now. I have gone through your project specification. Hope to hear positively from u. Regards John
$100 USD em 7 dias
4,8
4,8

Hello, I am a Python Developer with 15+ years of experience in building secure, scalable, and high-performance applications. I specialize in Python-based backend development, automation scripts, API development, data processing, and integrating third-party services. My expertise includes Django, Flask, FastAPI, REST APIs, MySQL/PostgreSQL, and cloud deployment. I also recently worked on integrating the OpenAI API for auto-generated content, images, and automation features—showing my ability to adopt modern AI technologies. If you are looking for a dedicated Python Developer who delivers clean code, reliability, and fast results, I’d be glad to work on your project
$100 USD em 7 dias
4,8
4,8

Hi Derrick, Let's create a repeatable deduplication that collapses ~47,000 venue rows into one master record while preserving every phone, email, address and website. You need a dedupe that blends fuzzy name matching with contact-detail similarity to ensure exactly one row per real venue while keeping up to four contacts intact. Years of inconsistent entry and multiple booker variants are the real challenge—merges must never overwrite or drop distinct contacts. I've done cleanups on large Access tables and CSVs using Python (pandas + rapidfuzz), Access SQL and VBA to produce a normalized master venue table plus a contacts table that preserves all original fields and provenance. I move fast, keep logic transparent and audit-ready, and focus on zero data loss. I'm responsive, timezone-flexible, and open to your budget and timeline. If that fits, let’s chat about next steps. Best regards, Saad
$200 USD em 3 dias
4,2
4,2

I can clean and deduplicate your Access database safely while preserving every piece of contact information. I’ll run a repeatable, rule-based deduplication process that matches venues using a combination of fuzzy name similarity and contact details (phone, email, address, website), not just one field. Detected duplicates will be clustered and merged into a single master record per real-world venue, with all associated contacts retained—no overwriting, no data loss. If needed, I’ll normalize the structure by introducing supporting tables (e.g., contacts) while ensuring the final database opens cleanly in Access and drops straight back into your existing tools. Every phone, email, booker, and website tied to a venue will survive the merge. You’ll receive the cleaned .accdb file plus a short audit summary showing how many duplicates were detected and merged, useful for validation and future imports. I typically use Python (pandas + fuzzy matching) combined with Access-compatible exports, or VBA/SQL where appropriate, depending on what best fits your schema. Once I review the database, I can confirm the exact approach, turnaround time, and provide a clear fixed price or milestone plan. Happy to start as soon as you share the Access file.
$200 USD em 7 dias
4,3
4,3

Dear Client, Greetings!! I have gone through the project description, and found that all of the mentioned requirements fall over my expertise, as I have hands-on experience on python, AI/ML, Data Science, software building, etc.I’ve handled messy Access DB cleanups like this before , I’ll use Python and fuzzy match to cluster by venue name and contact detail, merge into one master row while keepng every phone/email/booker with zero data loss. I’ll return the same .accdb with a merge log and no broken fields quick, repeatable proces. One question: are the contact fields stored in fixed columns or as repeated groups per venue? Also,I have been coding on Machine Learning and Data Science with python from past 7 years. I have the experience of working with 4 giant tech companies, including freelancing on upwork, fiverr and freelancer. Hope to hear from you soon!!. Regards, Rojan
$220 USD em 7 dias
4,4
4,4

West Hills, United States
Método de pagamento verificado
Membro desde jun. 20, 2008
$30-250 USD
$30-35 USD
$35-100 USD
$30-250 USD
$30-100 USD
$35-60 AUD / hora
₹1500-12500 INR
₹1500-12500 INR
$10-70 USD
$1500-3000 CAD
$10-30 CAD
£10-30 GBP
₹12500-37500 INR
$30-250 USD
$15-25 USD / hora
£10-15 GBP / hora
$750-1500 USD
$250-750 USD
₹750-1250 INR / hora
$100 NZD
$10-30 USD
$750-1500 USD
₹1500-12500 INR
₹750-1250 INR / hora
₹600-1500 INR