
Fechado
Publicado
Pago na entrega
I have a Python 3.12.3 project that ingests video streams and image batches for face recognition (mtcnn 1.0.0 + insightface 0.7.3), OCR (paddleocr 2.10.0 on paddlepaddle 3.0.0 / paddlepaddle-gpu 2.6.2), and post-processing with scikit-learn 1.6.0. Although one GPU-ready wheel is present, all processing still executes on the CPU. The goal is full NVIDIA CUDA utilisation across the entire workflow, from frame decoding to final inference. I need you to: • Profile the current code, pinpoint CPU-bound sections, and migrate or rewrite them for GPU execution (CUDA, CuDNN, cuBLAS, or other relevant CUDA-based APIs). • Update or swap libraries where necessary—feel free to recommend faster CUDA-compatible alternatives if they will not break accuracy (e.g., CuPy, TensorRT, NVIDIA Video Codec SDK). • Modify the code so GUI-less batch processing and real-time video runs stay identical in behaviour and output. • Provide a concise “from-scratch” setup script or README covering driver versions, conda/pip commands, and any environment variables. • Deliver a short benchmark report showing the speed-up you achieved. I’m open to adding extra libraries or frameworks if they make a clear impact, so please include your suggestions in your bid. Intel Ice Lake with NVIDIA® Tesla® T4 Number of GPUs: 1 vCPU: 32 RAM: 128 GB Disk space: 200 GB CUDA Version: 12.9 Video Driver Version: 575.64.03 cudnn: - libcudnn9-cuda-12: 9.17.1.4-1 - libcudnn9-headers-cuda-12: 9.17.1.4-1 mtcnn 1.0.0 insightface 0.7.3 onnxruntime-gpu 1.23.2 scikit-learn 1.6.0 (DBSCAN) paddleocr 2.10.0 paddlepaddle 3.0.0 paddlepaddle-gpu 2.6.2 Python 3.12.3 boto3 1.38.0 If you have proven experience accelerating computer-vision workloads on CUDA, I’d love to see it in action here.
ID do Projeto: 40143394
60 propostas
Projeto remoto
Ativo há 21 dias
Defina seu orçamento e seu prazo
Seja pago pelo seu trabalho
Descreva sua proposta
É grátis para se inscrever e fazer ofertas em trabalhos
60 freelancers estão ofertando em média $139 AUD for esse trabalho

Hi there, I’ve carefully reviewed your project requirements, and with my extensive experience in developing Python scripts and applications, I’m confident that I can deliver a high-quality solution tailored to your needs. Whether it’s automation, data processing, or custom application development, I have the skills to ensure your project’s success. I’d love to discuss how I can contribute and help bring your vision to life. Feel free to check out my portfolio for more examples of my work: Portfolio: https://www.freelancer.com/u/webmasters486 Looking forward to hearing from you! Best regards, Muhammad Adil
$120 AUD em 3 dias
6,1
6,1

With exceptional problem-solving skills, a knack for efficiency, and years of experience in accelerating computer-vision workloads on CUDA, I can offer you a step beyond what you're currently expecting. I am delighted to be a top 0.03% ranked freelancer with an extensive background in C programming and data processing complemented by my understanding of Image Processing and Python proficiency. These skills will prove invaluable in migrating your current code executed on the CPU to full GPU utilization through the relevant CUDA-based APIs. I understand that maintaining the integrity of real-time video runs and GUI-less batch processing is critical. My expertise lies exactly in ensuring that behavioral consistency whilst transforming your present workflow into one that optimally makes use of NVIDIA CUDA, CuDNN, cuBLAS, and any other relevant APIs that could make this project more seamless. As someone keen on delivering excellence, you can count on me to not only complete this migration but also provide a comprehensive "from-scratch" setup script or README to facilitate easy adoption across your team.
$250 AUD em 7 dias
6,3
6,3

With my expertise in C Programming, Python, Data Processing, CUDA, and Image Processing, I am well-equipped to accelerate your vision code with GPU utilization. I am confident in pinpointing CPU-bound sections and migrating them to CUDA-based APIs for optimal performance. I am eager to discuss the full project scope to adjust the budget accordingly. Let's optimize your workflow for NVIDIA CUDA and achieve significant speed-up. Please go through my profile to see my 15 years of experience. Let's discuss the job details and get started right away.
$175 AUD em 7 dias
6,3
6,3

⭐Hi, I’m ready to assist you right away!⭐ I believe I’d be a great fit for your project since I have extensive experience in accelerating computer-vision workloads on CUDA and optimizing code for GPU execution. My background in C Programming and Python, along with my deep understanding of Video Streaming, Video Processing, Deep Learning, Computer Vision, CUDA, Data Processing, and Image Processing, positions me well to tackle the challenges of your project. I am skilled in profiling code to identify CPU-bound sections and migrating them for GPU execution using CUDA, CuDNN, cuBLAS, and other relevant CUDA-based APIs. I am proficient in updating libraries to ensure full NVIDIA CUDA utilization and maintaining GUI-less batch processing and real-time video runs' behavior and output consistency. This project aims to revolutionize your workflow by leveraging CUDA for faster and more efficient processing, ultimately enhancing accuracy and speed. If you are looking for a dedicated professional to enhance your project with cutting-edge technology, I am here to deliver exceptional results. If you have any questions, would like to discuss the project in more detail, or would like to know how I can help, we can schedule a meeting. Thank you. Maxim
$74 AUD em 3 dias
5,6
5,6

Hi client, I'm Denis Redzepovic, an experienced developer with expertise in Data Processing, Video Streaming, C Programming, Python, Computer Vision, Deep Learning, CUDA, Video Processing and Image Processing. I have worked extensively on diverse Python projects, ranging from backend development and automation to data processing and API integrations. My deep understanding of Python’s libraries and frameworks allows me to build efficient, scalable, and maintainable solutions. I pay close attention to code quality and performance to ensure your project runs flawlessly. With my solid experience, I’m confident I can deliver results that exceed your expectations. I focus on writing clean, maintainable, and scalable code because I know the difference between 99% and 100%. If you hire me, I’ll do my best until you’re completely satisfied with the result. Let’s discuss your project details so I can tailor the perfect Python solution for you. Thanks, Denis
$150 AUD em 3 dias
5,5
5,5

I can handle this end-to-end and make sure your entire pipeline truly runs on NVIDIA CUDA, not just on paper. I’m a Python expert with hands-on experience accelerating computer-vision workflows (face recognition, OCR, real-time video) using CUDA, cuDNN, TensorRT, CuPy, and GPU video decoding. I’ll quickly profile your current code, identify CPU bottlenecks, and migrate them to efficient GPU paths without changing output or behavior. What I’ll do: Ensure real GPU usage across decoding, inference, and post-processing Replace or optimize libraries where needed (TensorRT, CuPy, GPU video codecs) Keep batch and real-time pipelines identical in results Provide a clean from-scratch setup guide Deliver a clear benchmark showing real speed-up I work fast, clean, and focus on measurable performance gains. If CUDA acceleration is the goal, this is exactly my lane.
$180 AUD em 1 dia
5,7
5,7

As a seasoned Full-Stack Developer with impeccable project completion and timing, plus a collection of highly favorable reviews from satisfied clients, I am confident in not just delivering your desired GPU acceleration, but doing so proficiently and professionally. My AI skillsets align perfectly with the requirements of your project, having worked on numerous advanced Computer Vision tasks like object detection tracking and counting, image processing and recognition (OCR), and exploiting other related CUDA-based APIs such as OpenCV. Over the course of my career, I've developed a deep understanding and competence in optimizing algorithms for maximum computational efficiency. I guarantee to profile your current code meticulously, strategically rewriting or migrating CPU-bound sections onto the GPU for NVIDIA CUDA utilization. To further underline my suitability for your project, I will deliver a comprehensive "from-scratch" setup script or README that documents all the relevant driver versions, conda/pip commands, and environment variables to make future migrations smoother. Additionally, I will provide a detailed benchmark report showcasing the speed-up achieved through the GPU acceleration implementation. Rest assured, your project is in trustworthy hands - contact me now let's get started on supercharging your vision code!
$140 AUD em 2 dias
5,6
5,6

Hello, I can help optimize your Python project to fully utilize NVIDIA CUDA for faster processing. By profiling your existing code, I’ll identify bottlenecks and transition CPU-bound sections to GPU execution. I’ll ensure that both batch and real-time video processing work seamlessly, and provide a setup guide to ensure smooth deployment. Expect a clear improvement in performance with a benchmarking report showcasing the speed-up achieved. I look forward to discussing how we can make this project more efficient. Best regards, Juan
$140 AUD em 1 dia
5,6
5,6

Hello, I specialize in accelerating computer vision workflows using NVIDIA CUDA, and I'm excited to take on your project. I'll begin by profiling your code to identify CPU-bound sections and optimize them for full GPU utilization using CUDA, CuDNN, and cuBLAS. If necessary, I can recommend and integrate CUDA-compatible libraries like CuPy or TensorRT to enhance performance without compromising accuracy. I'll ensure the modified code maintains identical behavior for both GUI-less batch processing and real-time video tasks. Additionally, I'll provide a detailed setup script or README covering all required installations and configurations, ensuring seamless deployment. For transparency, I'll deliver a benchmark report showcasing the speed improvements achieved. If there are any specific libraries or frameworks you're considering, I'd be happy to discuss their potential impact. Questions: • Are there any specific GPU models you plan to use for this acceleration? • Do you have any preferred CUDA libraries or frameworks already in use? I look forward to optimizing your vision code for maximum performance. Thanks and best regards, Kamran
$90 AUD em 5 dias
5,1
5,1

Dear Client, Greetings!! I have gone through the project description, and found that all of the mentioned requirements fall over my expertise, as I have hands-on experience on python, AI/ML, Data Science, software building, etc.I can profile your Python pipeline and migrate CPU-bound sections to full NVIDIA CUDA acceleration, covering face recognition, OCR, and post-processing, while keeping output identical for batch and real-time video. I will recommend and integrate GPU-optimized libraries where appropriate, provide a setup script, and benchmark the speed-up. Which GPU will you be running on, and are you open to swapping any libraries for faster CUDA alternatives if accuracy is maintained? Also,I have been coding on Machine Learning and Data Science with python from past 7 years. I have the experience of working with 4 giant tech companies, including freelancing on upwork, fiverr and freelancer. Hope to hear from you soon!!. Regards, Rojan
$160 AUD em 7 dias
4,7
4,7

Hello, I specialize in optimizing computer vision workflows for NVIDIA CUDA. For your project, I will profile the current Python code to identify CPU-bound sections and migrate them for GPU execution using CUDA, CuDNN, and other relevant APIs. I'll explore CUDA-compatible libraries like CuPy or TensorRT to maintain accuracy while enhancing performance. I'll ensure both batch processing and real-time video processing remain consistent in output and behavior. A detailed setup script or README will be provided, outlining driver versions and installation commands. Additionally, I'll deliver a benchmark report to quantify the speed improvements achieved. Questions: • Are there specific sections where you've observed bottlenecks? • Do you have a preferred format for the benchmark report? Looking forward to accelerating your vision code efficiency and seeing substantial performance gains. Thanks and best regards, Faizan
$90 AUD em 5 dias
4,3
4,3

With a deep understanding and practical experience in both computer vision and CUDA programming, I am ideally positioned to boost the performance of your Python project. My proficiency in Python, C, and C++ perfectly aligns with the task at hand. Not only have I worked on similar face recognition projects requiring CUDA optimization but my background also includes image processing and motion detection – all crucial elements for accomplishing your goal. Over the years, I have sharpened my skills in CUDA programming and leveraged its power to deliver game-changing speed-ups for computer vision applications like yours. Pinpointing CPU-bound sections, converting them for GPU execution using relevant CUDA-based APIs like CUDA, CuDNN, cuBLAS (to name a few) won't be a problem as it's something I've done successfully before. I'll also suggest updating or swapping libraries wherever applicable to ensure seamless compatibility and maximum speed. My forte extends beyond just coding - I'm adept in documenting complex processes comprehensively. You can expect a concise "from-scratch" setup script or README that will cover all essential details ranging from driver versions to conda/pip commands and environment variables. And rest assured, just like you need it, I'll ensure batch processing and real-time video runs stay identical in behavior and output while exploiting CUDA fully!
$250 AUD em 7 dias
4,7
4,7

Hello! I understand your requirement for accelerating your Python-based face recognition and OCR project using NVIDIA CUDA. Your goal of achieving full GPU utilization across the entire workflow is clear, and I have extensive experience in optimizing similar computer vision workloads for significantly improved performance. In my previous projects, I successfully migrated CPU-bound processes to GPU execution, resulting in notable speed-ups of up to 70%. This included using CUDA and other APIs to enhance data processing capabilities. ✅My Plan - Profile the current code to identify CPU bottlenecks. - Migrate identified sections to GPU using CUDA and CuDNN. - Recommend and implement faster, CUDA-compatible libraries without sacrificing accuracy. - Ensure GUI-less batch processing maintains behavior and output fidelity. - Create a detailed setup script with required dependencies and environments. - Deliver a benchmark report showcasing speed improvements. Could you share any specific metrics you’re looking to improve with this acceleration? Also, are there any particular CUDA-compatible libraries you prefer? Best regards, Hongqiang Chen
$230 AUD em 2 dias
4,0
4,0

Hi! I have proven experience accelerating computer-vision pipelines on NVIDIA GPUs, including face recognition, OCR, and real-time video inference on CUDA-enabled systems like the Tesla T4. I will begin by profiling your existing code to identify CPU-bound stages such as decoding, preprocessing, inference, and post-processing. All compatible components (MTCNN, InsightFace, PaddleOCR, ONNX Runtime GPU) will be verified and reconfigured to run fully on CUDA, with CPU logic migrated to GPU-accelerated alternatives where appropriate. Where it delivers real gains without affecting accuracy, I will introduce optimized CUDA-based libraries such as CuPy, TensorRT, or GPU-accelerated video decoding. Both batch and real-time pipelines will remain functionally identical, with no changes to outputs or behavior. You will receive a clean setup guide covering drivers, CUDA/cuDNN, environment variables, and installation steps. Finally, I will provide a concise benchmark report demonstrating the performance improvements achieved.
$200 AUD em 7 dias
4,0
4,0

Hi, I hope you are doing well. Very happy to bid your on project because my skills are fitted in your project. I’ve accelerated end-to-end CV pipelines on NVIDIA GPUs (CUDA/cuDNN/TensorRT) by moving decode + pre/post processing to GPU (NVDEC/OpenCV CUDA/CuPy) and deploying ONNX/TensorRT engines for MTCNN/InsightFace and PaddleOCR to eliminate CPU bottlenecks. I will profile your current code to pinpoint CPU-bound stages (decode, resize/normalize, NMS, OCR post-proc, sklearn steps) and refactor them into a fully GPU pipeline using NVDEC + CUDA preprocessing + TensorRT/ONNXRuntime-CUDA, keeping outputs identical for batch and real-time modes. I will deliver an install README/setup script (drivers, CUDA, conda/pip, env vars) and a benchmark report showing FPS/latency gains before vs after. If you send the message , we can discuss the project more. Thanks.
$100 AUD em 3 dias
3,8
3,8

As a highly skilled and results-oriented Data Scientist, I have extensive experience with optimizing GPU computing for data processing tasks in various domains, including computer vision. My deep understanding of Python coupled with my proven expertise in GPU utilization with CUDA will be immensely valuable for your project that aims to accelerate vision code. To ensure you get the most out of your NVIDIA Tesla T4 and Intel Ice Lake for video streams and image batches processing, I will thoroughly profile your current code, identifying CPU-bound sections in the workflow. Then, I'll diligently migrate and rewrite those sections, utilizing relevant CUDA-based APIs like CUDA, CuDNN, and cuBLAS to achieve maximum GPU utilization. Additionally, I'm well-versed with libraries like CuPy, TensorRT and NVIDIA Video Codec SDK that can potentially help boost the speed without compromising accuracy. My commitment goes beyond delivering optimized code. I'm keen on providing a GUI-less batch processing and real-time video run identical in behavior and output as before - ensuring a seamless transition from CPU to GPU. Lastly, I promise to deliver a comprehensive setup script or README file explaining precise environment specifications such as driver versions, conda/pip commands, necessary environment variables which will further ensures hassle-free integration.
$30 AUD em 1 dia
3,4
3,4

⭐If you award me, your smile shows up.⭐ Hi Your project immediately stood out to me—it's very similar to one I successfully completed just recently. The core challenges, structure, and technical requirements feel highly familiar, with only a handful of unique elements that align perfectly with my established expertise. This close match is excellent news: it lets me bypass the usual ramp-up period, avoid trial and error, and deliver clean, high-quality work quickly and confidently. I bring deep, hands-on experience with Deep Learning, Computer Vision, C Programming, CUDA, Video Streaming, Data Processing, Python, Image Processing and Video Processing, along with proven strategies and best practices refined through multiple comparable projects. You can review a directly relevant example in my portfolio here: https://www.freelancer.com/u/thomasb726 I’d be happy to discuss your specific goals in more detail and share a few tailored ideas based on what has worked well in similar scenarios. Why clients consistently choose and return to me: • Clear, proactive, and timely communication—you’ll always know exactly where the project stands • I treat your deadlines, budget, and reputation with the same priority I give my own • Responsive, approachable, and committed to making the entire process smooth and stress-free • Strong ongoing support after delivery; many clients build long-term working relationships as a result If you're looking for precise execution, exceptional quality,
$150 AUD em 1 dia
3,1
3,1

As a Python and C programming expert, I have successfully developed and optimized numerous high-performance computing solutions that included GPU acceleration using CUDA, CuDNN, cuBLAS, and other related CUDA-based APIs - making me a perfect match for this project. With over eight years of experience crafting dynamic web and app solutions, I have a deep understanding of the impact optimized code can have on overall performance. Having honed my skills while collaborating on countless innovative projects, I can efficiently profile your current code to identify and rewrite the CPU-bound portions for GPU execution. Leveraging faster CUDA-compatible libraries like CuPy, TensorRT, and NVIDIA Video Codec SDK where necessary,en, will be as second nature to me as ensuring your GUI-less batch processing and real-time video runs demonstrate consistent behavior and outputs. In addition to delivering a significant speed-up, I will provide you with a from-scratch setup script or README covering all the details you need like driver versions, environment variables, conda/pip commands —possibly born out of my experience in using different cloud services such as GCP- combined with my DevOps background in Linux, Git, Docker. I will then complete the project with a benchmark report that clearly shows the improvements achieved. It will be a pleasure to put my skills to work for you and deliver results that exceed your expectations. As always
$140 AUD em 7 dias
2,9
2,9

Hi, I read your brief closely — Python 3.12.3, mtcnn 1.0.0 + insightface 0.7.3, paddleocr 2.10.0 (PaddlePaddle 3.0.0 / paddlepaddle-gpu 2.6.2) and scikit-learn 1.6.0 — and I’m confident I can push the whole pipeline onto the GPU so your “GPU-ready” wheel actually uses the GPU. I’ll start with a targeted profile to locate CPU-bound hotspots (frame decoding, pre/post-processing, model I/O), then migrate heavy ops to CUDA (NVIDIA Video Codec SDK for decoding, CuPy where NumPy is a bottleneck, TensorRT or cuDNN/cuBLAS for inference, and optional TensorRT conversion for insightface models). I’ll ensure paddlepaddle-gpu is correctly installed or recommend swapping to TensorRT/ONNX paths where it improves throughput without changing outputs. GUI-less batch and realtime behavior will be preserved and tested. I also have practical experience with LLMs, Transformers, chatbots, classification, and vector search — I know how to take a full pipeline from data to deployment and benchmark it sensibly. I can deliver a profiling report + prototype patches in ~5 days and a full migration with benchmarks and a from-scratch README/setup script in 7 days. Do you have representative sample streams and the target GPU model (vendor/model and driver/CUDA versions) I should use for profiling and benchmarks? Thanks, Larasati
$125 AUD em 3 dias
2,9
2,9

Hi! I specialize in GPU acceleration for Python computer-vision workflows. I’ll profile your current pipeline to identify CPU bottlenecks in mtcnn, insightface, and paddleocr, then migrate them to CUDA using GPU-enabled libraries (TensorRT, CuPy, paddlepaddle-gpu). I’ll ensure real-time video and batch processing behave identically, optimize inference across the Tesla T4, and provide a setup script/README with drivers, conda/pip commands, and environment variables. A benchmark report with speed-ups will be included.
$140 AUD em 3 dias
3,0
3,0

Alfredton, Australia
Método de pagamento verificado
Membro desde nov. 16, 2021
$30-250 USD
$1500-3000 USD
$250-750 AUD
$250-750 AUD
$250-750 USD
£10-20 GBP
$10-30 USD
$30-250 USD
₹1500-12500 INR
$100 CAD
₹1500-12500 INR
$10-30 USD
€30-250 EUR
$250-750 USD
$250-750 USD
₹250000-500000 INR
₹100-400 INR / hora
₹750-1250 INR / hora
$250-750 USD
$15-25 USD / hora
$5000-10000 USD
₹12500-37500 INR
€18-36 EUR / hora
₹1500-12500 INR