
Fechado
Publicado
I need an OCR component that plugs directly into our existing order-processing software and reliably reads supplier purchase orders that arrive as JPEG scans. The engine only has to recognise English, but accuracy must be high enough to extract line-item tables, dates, totals and supplier IDs without manual correction. You are free to build on Tesseract, OpenCV, or a deep-learning stack such as TensorFlow or PyTorch, as long as the final module returns structured JSON we can map to our database. Deliverables I expect: • Source code with clear build instructions • A small training/validation dataset plus instructions for expanding it • API documentation and a quick demo script • Brief report on achieved accuracy and how to retrain the model if vendor layouts change I can supply a representative set of purchase-order images for development and testing the moment we start.
ID do Projeto: 40176309
19 propostas
Projeto remoto
Ativo há 6 dias
Defina seu orçamento e seu prazo
Seja pago pelo seu trabalho
Descreva sua proposta
É grátis para se inscrever e fazer ofertas em trabalhos
19 freelancers estão ofertando em média $9 USD/hora for esse trabalho

Hi, I understand you need a high-accuracy OCR module that integrates with your order-processing system, extracts structured data from JPEG purchase orders, and outputs reliable JSON for database mapping. I will share my portfolio on your first message. Implementation Approach: • Image preprocessing with OpenCV and layout detection • OCR pipeline using Tesseract or deep-learning model for table/text extraction • Structured JSON output, API layer, testing, and retraining guidelines Queries: • Approximate variation in supplier layout formats? • Expected monthly document volume? • Preferred deployment environment? I am confident about the delivery after this first meeting.
$9 USD em 40 dias
4,4
4,4

Hello Just read your post and it seems you are looking for someone skilled in OCR extraction form scanned purchase oders, including accurate table, dataee, total, and supplier ID recognition with structured JSON output. With my years of extensive experience and exceptiional experitse in OCR pipelines using Tesseract, OpenCV, and deep-learning-based text extraction, I am confident that I can build a reliable component that pulgs directly into your existing order-processing system. I have hands-on experience designing OCR workflows that handle noisy JPEG scans, normalize layouts, extract line-tehm tables, and return clean, database-ready JSON with high accuracy, Let's connect and see how great valueI can add to yoru busniess. Best Regards Raka
$4 USD em 40 dias
2,8
2,8

Hi There!!! The Goal of the project:- To develop a highly accurate OCR engine that integrates with your existing order-processing software and extracts structured JSON data from JPEG purchase orders. I have carefully read and understood your requirements for English text recognition, extraction of line-item tables, dates, totals, and supplier IDs, as well as the need for source code, API documentation, and training instructions for future dataset expansion. I am confident I am the best fit for this project because I have extensive experience in building OCR solutions using Tesseract, OpenCV, and deep learning frameworks for production-ready applications. I will provide: 1. OCR module with structured JSON output and source code with build instructions. 2. Small training/validation dataset with guidance for expansion and retraining. 3. API documentation, demo script, and accuracy report. I have 9+ years experience as a full stack developer and have completed similar projects involving document scanning, table extraction, and automated data processing. Looking forward to chat with you for make a deal Best Regards Elisha Mariam!
$3 USD em 40 dias
1,5
1,5

Hi There! ⚜⭐⭐⭐⭐⚜build a custom ocr module that reads jpeg purchase orders and returns structured json with tables, dates, totals, and supplier ids.⚜⭐⭐⭐⭐⚜ I HAVE BUILT OCR SOLUTIONS USING TESSERACT AND OPENCV THAT EXTRACT LINE ITEMS AND KEY FIELDS WITH HIGH ACCURACY, AND I CAN SHARE SIMILAR WORK IN CHAT. I WILL CREATE A NODE.JS OCR COMPONENT THAT PLUGS INTO YOUR ORDER PROCESSING SYSTEM, PROVIDING CLEAN JSON OUTPUT FOR TABLES, DATES, TOTALS, AND SUPPLIER IDS. THE MODULE WILL INCLUDE BUILD INSTRUCTIONS, A SMALL TRAINING AND VALIDATION DATASET, API DOCUMENTATION, AND A DEMO SCRIPT. I WILL ALSO PROVIDE A SHORT REPORT ON ACCURACY AND HOW TO RETRAIN WHEN VENDOR LAYOUTS CHANGE. Please share a sample set of purchase order images so we can start. Warm Regards, Farhin B.
$3 USD em 40 dias
1,2
1,2

Hi there! Are you expecting the OCR component to handle variations in image quality like rotated or skewed JPEG scans out of the box? Regardless, this is definitely something that I feel confident delivering on, given my past experience. I would love to discuss your project further! Looking forward hearing from you. Kind Regards, Corné
$2 USD em 14 dias
0,9
0,9

Hello, I understand that you're looking for an OCR engine that seamlessly integrates with your existing order-processing software to reliably read supplier purchase orders from JPEG scans. Your emphasis on accuracy for extracting tables, dates, totals, and IDs resonates well with my background. I have successfully developed OCR systems in previous projects, achieving high accuracy rates without the need for manual correction. My experience with Tesseract, OpenCV, and deep-learning frameworks like TensorFlow allows me to create tailored solutions that meet specific requirements. ✅My Plan - Develop the OCR engine using a suitable framework (Tesseract or TensorFlow) for optimal accuracy. - Provide clear source code and comprehensive build instructions. - Create a training dataset and easy-to-follow instructions for updating it. - Supply detailed API documentation and a demo script for easy integration. - Conduct a report on the achieved accuracy and guidelines for model retraining. Best regards, Hongqiang Chen
$20 USD em 14 dias
0,0
0,0

Hello. I read your requirement i will do that. Please come on chat we will discuss more about this. I will waiting your reply
$5 USD em 40 dias
0,0
0,0

Hi, We would like to grab this opportunity and will work till you get 100% satisfied with our work. We are an expert team which have many years of experience on OCR, Node.js Lets connect in chat so that We discuss further. Thank You
$4 USD em 40 dias
0,0
0,0

Custom OCR Engine Development I’m a full-stack software engineer with expertise in React, Node.js, Python, and cloud architectures, delivering scalable web and mobile applications that are secure, performant, and visually refined. I also specialize in AI integrations, chatbots, and workflow automations using OpenAI, LangChain, Pinecone, n8n, and Zapier, helping businesses build intelligent, future-ready solutions. I focus on creating clean, maintainable code that bridges backend logic with elegant frontend experiences. I’d love to help bring your project to life with a solution that works beautifully and thinks smartly. To review my samples and achievements, please visit:https://www.freelancer.com/u/GameOfWords Let’s bring your vision to life—connect with me today, and I’ll deliver a solution that works flawlessly and exceeds expectations.
$4 USD em 40 dias
0,0
0,0

⚠️You are not looking for a coder. You are looking for someone who can build this properly. That is exactly why your project stood out.⚠️ Your focus on creating a high-accuracy OCR component that integrates seamlessly with order-processing software to extract line-item tables and key fields from English-language JPEG scans demonstrates a deep commitment to reliable, structured data capture. This approach aligns with ensuring smooth operational flow and data integrity over quick, patchwork solutions. At DigitaSyndicate, a UK-based digital systems agency, we build precision-engineered automation and AI-driven systems designed for performance and long-term scalability. Our development process emphasizes intuitive integration, streamlined data extraction, and future-proof architectures that accommodate evolving vendor layouts without compromising accuracy. We recently delivered a comparable OCR module for a logistics client that transformed unstructured scans into actionable JSON data with sustained accuracy above 95 percent. Can you share your main priorities and timeline so I can map out the right execution plan for you? Casper M. Project Lead | DigitaSyndicate Precision-Built Digital Systems.
$4 USD em 14 dias
0,0
0,0

I’m giving The solution will reliably extract line items, dates, totals, and supplier IDs using a robust OCR and layout-analysis pipeline. Deliverables include documented source code, a demo script, training data with retraining guidelines, and an accuracy report.
$3,35 USD em 40 dias
0,0
0,0

Hi there, I understand the complexity of dealing with scanned supplier POs. They are never perfect—bad lighting, skewed angles, and noisy backgrounds make standard extraction a nightmare. It’s frustrating when software promises automation but still forces your team to manually double-check every single line item. The Risk: Most basic OCR implementations (like raw Tesseract) are dangerous for financial documents. They destroy table structures—merging columns or misreading a "0" as an "O" in a price total. If your module injects corrupt data into your database, it doesn't just fail; it creates "silent errors" in your accounting that can take weeks to unravel. You need structure, not just a bag of words. The Solution: As an AI Systems Architect specializing in Computer Vision, I build Structure-Aware OCR Pipelines. Instead of blindly reading text, my approach: 1. Pre-processing: Uses OpenCV to de-skew and clean the JPEGs (enhancing contrast for older scans). 2. Layout Analysis: Identifies the geometry of the table rows before reading the text, ensuring line items never get mixed up. 3. Validation: Returns strict, type-checked JSON that your database can trust immediately. I will deliver a module that turns messy scans into clean, actionable data, eliminating the need for manual oversight. Best, Sasanka AI Systems Architect | Computer Vision Expert
$30 USD em 30 dias
0,0
0,0

Brooklyn, United States
Membro desde jan. 24, 2026
$30-250 USD
₹100-400 INR / hora
$10-30 USD
$30-250 CAD
$3000-5000 USD
₹12500-37500 INR
$8-15 AUD / hora
$30-250 USD
$15-25 USD / hora
₹400-750 INR / hora
₹600-1500 INR
$250-750 USD
$15-25 USD / hora
$10-30 USD
₹1500-12500 INR
$15-25 AUD / hora
$10-30 USD
₹1500-12500 INR
€750-1500 EUR
$3000-5000 USD