
Fechado
Publicado
Pago na entrega
I’m building a fully-automated voice bot that can place large volumes of cold sales calls in both English and Spanish. The heart of the system will be an LLM that drives the conversation, paired with streaming speech-to-text and text-to-speech so responses feel natural and arrive with almost no delay. Key expectations • Real-time audio streaming: the transcription engine must keep up with live dialogue, and the TTS layer should return audio quickly enough to avoid awkward pauses. • Clean LLM integration: the model must receive each user utterance in milliseconds, maintain short-term context, and craft persuasive sales replies that stay on script when required. • Scalable call handling: the architecture should support bursts of concurrent outbound calls without degradation in latency or voice quality. Deliverables 1. A deployable service (Docker or similar) that accepts a phone number list, initiates calls, and handles the full STT → LLM → TTS loop. 2. Configuration for both English and Spanish voices with seamless language switching. 3. Simple REST or gRPC hooks so I can feed new prompts or adjust the sales script on the fly. 4. Basic analytics—call duration, hang-up reason, and transcript export. 5. Instructions for running the stack on a cloud instance of my choice. Acceptance criteria • End-to-end response time under two seconds for 95 % of turns. • Minimum 90 % transcription accuracy on common phone-line audio. • Demonstrated parallel handling of at least 50 simultaneous calls in a controlled test. If you have experience combining SIP or Twilio dialing with Whisper-style STT, real-time TTS engines like Amazon Polly or ElevenLabs, and conversational LLMs, I’d love to see what you can bring to the project. Deadline: Within 10 days.
ID do Projeto: 40136870
71 propostas
Projeto remoto
Ativo há 20 dias
Defina seu orçamento e seu prazo
Seja pago pelo seu trabalho
Descreva sua proposta
É grátis para se inscrever e fazer ofertas em trabalhos
71 freelancers estão ofertando em média $14.996 USD for esse trabalho

Hello, This plan outlines a deployable, low-latency voice sales bot in English and Spanish. The approach uses streaming STT and fast TTS to keep conversation flow tight, with an LLM that receives each utterance in milliseconds, maintains short-term context, and can follow the script when needed. The system will be Dockerized, accepts a phone-number list, and runs the full STT → LLM → TTS loop with scalable session management to handle bursts of calls. Language switching and multiple voice configurations will be built in, while REST or gRPC hooks let you swap prompts on the fly. Basic analytics will cover call duration, hang-up reason, and transcript export, and clear instructions will be provided to run the stack on a cloud instance of your choice. If you want to see what a production-grade build looks like, I’ll deliver a clean, proven architecture with robust monitoring and error handling. Which telephony stack would you prefer (Twilio-based or pure SIP), and any regional constraints or compliance requirements to consider? Best regards,
$20.000 USD em 15 dias
8,0
8,0

⭐⭐⭐⭐⭐ Build an Automated Voice Bot for Cold Sales Calls in English & Spanish ❇️ Hi My Friend, I hope you're doing well. I've reviewed your project details and see you're looking for an automated voice bot for sales calls. You don't need to look any further; Zohaib is here to help you! My team has completed over 50 similar projects for voice automation. I will ensure the bot seamlessly integrates speech-to-text and text-to-speech, providing quick and natural responses while maintaining conversation flow. ➡️ Why Me? I can easily build your automated voice bot as I have 5 years of experience in voice technologies, LLM integration, and real-time audio processing. My expertise includes working with Twilio, SIP, speech recognition, and text-to-speech systems. Besides, I have a strong grip on cloud deployment and analytics, ensuring your project meets all requirements efficiently. ➡️ Let's have a quick chat to discuss your project in detail and let me show you samples of my previous work. I look forward to discussing this with you! ➡️ Skills & Experience: ✅ Voice Bot Development ✅ Speech-to-Text Integration ✅ Text-to-Speech Implementation ✅ LLM Integration ✅ Real-Time Audio Streaming ✅ Twilio & SIP Experience ✅ Docker Deployment ✅ REST & gRPC APIs ✅ Language Switching ✅ Data Analytics ✅ Multi-Call Handling ✅ Quick Response Optimization Waiting for your response! Best Regards, Zohaib
$12.000 USD em 2 dias
7,7
7,7

We have successfully completed similar projects, combining SIP/Twilio dialing with Whisper-style STT, real-time TTS engines like Amazon Polly, and conversational LLMs. Our expertise in AI-first product development aligns perfectly with your need for a low-latency voice bot. We understand the importance of real-time audio streaming and seamless LLM integration to maintain natural conversational flow. With over 8 years of experience, we excel in building scalable, intelligent systems. Our background in AI, LLMs, and automation ensures that your bot will handle concurrent calls without latency issues while maintaining high transcription accuracy. We are confident that our skills in Python, NLP, and cloud-native deployment will deliver a robust, scalable solution. Our portfolio includes advanced voice bots and AI systems, showcasing our capability to meet your expectations. We are eager to demonstrate our commitment to quality and transparent collaboration. Are there any specific features or integrations you envision for future scalability? Q: What cloud provider do you prefer for deployment? Q: Do you have any specific analytics requirements beyond those mentioned? Let's transform your vision into a scalable, impactful solution. Looking forward to your response. Kind Regards, Puru Gupta
$20.000 USD em 44 dias
7,6
7,6

With over a decade of experience in web and mobile development, I understand the need for a Low-Latency Voice Sales Call Bot that can handle large volumes of calls in real-time. Your project requires seamless integration of live dialogue, short response times, and scalable architecture, all of which align perfectly with my skill set. I have successfully delivered similar projects in the past, especially in the areas of AI/ML development and real-time communication systems. My expertise in building automated solutions for eCommerce and FinTech industries will ensure that your Low-Latency Voice Sales Call Bot meets all your expectations. I am confident in my ability to deliver the deployable service you require, with a focus on real-time audio streaming, clean LLM integration, and scalable call handling. Let's work together to bring your vision to life. I look forward to the opportunity to discuss your project in more detail. Please feel free to reach out to me so we can get started on creating a successful Low-Latency Voice Sales Call Bot for you.
$16.000 USD em 75 dias
6,3
6,3

As a highly skilled and dedicated software engineer, my combination of expertise in AI chatbot development, proficiency in Python, and long-standing experience with hovering around complex problems lines up perfectly with the demands of this intriguing voice bot project. In addition to my diverse skill set, I have also spent ample time perfecting my understanding of Linux, which could prove invaluable when setting up your preferred cloud instance for the system. So if you're looking to hit those two-second end-to-end response times for 95% of turns while maintaining 90% transcription accuracy - you've found the right person for the job. A project as cutting edge as this deserves a freelancer with deep-rooted knowledge of modern technologies, and that's exactly what I bring to the table. My familiarity with parallel handling and SIP/Twilio dialing fits seamlessly with whisper-style STT demands. Besides, my prior success in leveraging real-time TTS engines like Amazon Polly is directly applicable to your specific requirements here. Plus, rest assured knowing that not only can I handle the specified minimum of 50 simultaneous calls but I'm always eager to exceed expectations.
$15.000 USD em 60 dias
5,9
5,9

Dear , We carefully studied the description of your project and we can confirm that we understand your needs and are also interested in your project. Our team has the necessary resources to start your project as soon as possible and complete it in a very short time. We are 25 years in this business and our technical specialists have strong experience in Python, Audio Services, Testing / QA, Voice Talent, Twilio, Natural Language Processing, Large Language Model, AI Chatbot Development and other technologies relevant to your project. Please, review our profile https://www.freelancer.com/u/tangramua where you can find detailed information about our company, our portfolio, and the client's recent reviews. Please contact us via Freelancer Chat to discuss your project in details. Best regards, Sales department Tangram Canada Inc.
$18.888 USD em 100 dias
7,3
7,3

Hi! I can build your low-latency AI voice sales bot that handles high-volume outbound calls in English and Spanish with real-time speech-to-text, AI response generation, and text-to-speech. What I’ll deliver • A deployable voice service (Docker) that takes a phone list and runs full AI-driven sales calls • Real-time STT → LLM → TTS pipeline optimized for low latency (~1–2s) • Context-aware AI to follow sales scripts and handle objections • English & Spanish voices with natural speech • Call logging & transcripts for QA and analytics • API endpoints to update scripts and prompts • Cloud-ready architecture (Twilio/SIP compatible) Why me I’m a Full-Stack & AI engineer with strong experience in Python, real-time streaming systems, telephony APIs, and LLM integration. I’ve built production bots that handle concurrent calls with stable audio pipelines and reliable deployment. Timeline 4–7 weeks depending on call volume and telephony provider. To start Which telephony API will you use (Twilio or SIP)? Do you already have the sales scripts? Do you need call recordings? Ready to start immediately.
$15.000 USD em 7 dias
6,0
6,0

Hello, Would you be open to a working demo of a sub-2-second latency voice bot handling 50+ concurrent sales calls—built with Whisper, your choice of LLM, and Twilio—before we commit to anything? I've shipped low-latency voice systems combining streaming STT/TTS with conversational LLMs. My approach prioritizes millisecond-level transcription buffering and parallel call management through async I/O, ensuring your sales scripts run at human pace across languages without degradation. Let's discuss your audio infrastructure preferences and I'll outline how we architect this for your cloud platform—whether that's optimizing Polly/ElevenLabs throughput or exploring alternative TTS layers. Best, Smith
$15.000 USD em 7 dias
5,5
5,5

✅ Nice to meet you here ✅ I am very excited about the opportunity to build your fully-automated bilingual sales call bot. I have experience designing real-time audio systems that combine streaming speech-to-text, text-to-speech, and conversational LLMs to deliver natural, low-latency interactions. I can develop a deployable service that handles the full STT → LLM → TTS loop with an end-to-end response time under two seconds, supports both English and Spanish voices with seamless language switching, and scales to handle 50 or more concurrent calls without loss of quality. The system will include REST or gRPC hooks for updating scripts dynamically and exporting analytics such as transcripts, call durations, and hang-up reasons. I have hands-on experience integrating SIP and Twilio dialing with Whisper-style transcription and real-time TTS engines like Amazon Polly and ElevenLabs. I can meet your 10-day deadline and provide full instructions for deployment on your chosen cloud platform. I would love the opportunity to bring this system to life and ensure it performs reliably at scale. Best regards, Jiayin
$15.000 USD em 7 dias
4,8
4,8

Hi, I’m Karthik, a senior AI / voice systems engineer with 10+ years of experience building low-latency, real-time conversational systems across cloud, telephony, and LLM-driven automation. I’ve worked with Twilio/SIP calling, streaming STT, real-time TTS, and LLM orchestration, and I understand that for outbound sales bots the real challenge is latency, concurrency, and conversational flow—not just model choice. How I’ll deliver • Real-time STT → LLM → TTS streaming pipeline with sub-2s response targets • Twilio/SIP-based outbound calling with burst concurrency support • English + Spanish voices with seamless language switching • Script-aware LLM prompting with short-term memory control • Dockerised, cloud-ready service • REST/gRPC hooks to update prompts and scripts dynamically • Analytics: call duration, hang-up reason, transcripts Architecture focus • Async, event-driven design for 50+ concurrent calls • Low-latency STT (Whisper-style / streaming engines) • Fast TTS (Polly / ElevenLabs or equivalent) • Scalable deployment on your preferred cloud Why me • 10+ years in distributed systems & real-time pipelines • Proven experience with voice, AI, and production latency constraints • Delivery-focused and comfortable with tight 10-day timelines If you want a production-grade voice sales bot that sounds natural and scales cleanly, I’m ready to start immediately. Regards, Karthik AI & Voice Systems Engineer 10+ Years Experience
$19.990 USD em 7 dias
5,0
5,0

Nice to meet you ,The requirements of your project match my areas of work and skills, to introduce myself. My name is Anthony Muñoz and i am the lead engineer for DS Pro IT agency. I have worked for over 10 years as a Full-Stack and software development engineer and have successfully done multiple jobs. It will be a pleasure to work together to make your project. Feel free to discuss about the project with me, greetings.
$27.742 USD em 7 dias
3,8
3,8

Hi. I can build and deploy this system end-to-end within your 10-day deadline. I have hands-on experience combining Twilio outbound calling, real-time streaming STT, low-latency TTS, and LLM-driven conversational logic for scalable voice automation. I’ll deliver a Dockerized service that handles full STT → LLM → TTS loops, supports English & Spanish with seamless switching, exposes REST hooks for live script updates, and includes call analytics and transcript export. The architecture will be optimized for sub-2s response time and tested with 50+ concurrent calls.
$15.000 USD em 10 dias
3,6
3,6

Hi, I have reviewed the details of your project. We have solid experience working with real time audio systems, LLM driven conversations, and scalable calling infrastructure. we will design a service that initiates outbound calls using Twilio or SIP, streams live audio to a speech to text engine, and passes each user response to the LLM in milliseconds. The LLM will maintain short term context and follow your sales script while keeping replies natural. The system will be built to scale and handle many parallel calls without losing quality. We will deliver a deployable service using Docker, support both English and Spanish voices with smooth switching, provide simple API hooks to update prompts or scripts, and include basic analytics like call duration, hang up reason, and full transcripts. Clear instructions will be included to run everything on your chosen cloud. Let's have a detailed discussion, as it will help me give you a complete plan, including a timeline and estimated budget. I will share my portfolio in the chat. Best regards, Mughiraa
$15.000 USD em 7 dias
3,3
3,3

Hi there, I’m Kristopher Kramer from McKinney, Texas. I’ve worked on similar projects before, and as a senior full-stack and AI engineer, I’ve got the experience to get this done right. I’m available to start right away and happy to chat through the details whenever you’re ready. Looking forward to talking with you soon! Best, Kristopher Kramer
$10.000 USD em 10 dias
4,2
4,2

Yes, we’ve successfully completed the same projects before and are confident we can deliver this one with the highest quality and within your timeline. We specialize in this type of work and would love to discuss your requirements in detail to ensure a smooth and timely delivery.
$15.000 USD em 7 dias
4,8
4,8

Greetings! I’m a top-rated freelancer with 16+ years of experience and a portfolio of 750+ satisfied clients. I specialize in delivering high-quality, professional voice sales bot creating services tailored to your unique needs. Please feel free to message me to discuss your project and review my portfolio. I’d love to help bring your ideas to life! Looking forward to collaborating with you! Best regards, Revival
$10.000 USD em 30 dias
2,9
2,9

Hello, This is exactly the kind of system I build. I have hands-on experience delivering real-time, LLM-driven voice agents that combine SIP/Twilio calling, low-latency STT, fast TTS, and scalable backend orchestration—designed specifically for outbound sales use cases. How I’d implement this (within 10 days): Architecture Dialing: Twilio Programmable Voice (SIP-ready, outbound at scale) STT: Streaming Whisper (faster-whisper / WhisperX) or Deepgram for phone-line audio LLM: Low-latency GPT-style model with rolling context + scripted guardrails TTS: ElevenLabs or Amazon Polly (English & Spanish voices, instant switching) Backend: Python (async), WebSockets for audio streaming, Dockerized services Key Capabilities Sub-2s turn latency via parallel STT → LLM → TTS streaming Script-aware persuasion with fallback logic and objection handling Burst-safe concurrency (tested with 50+ parallel calls) REST/gRPC endpoints to update prompts, scripts, and call batches live Analytics: call duration, hang-up reason, transcripts, language used What you’ll receive Deployable Docker stack Load-tested demo with ≥50 concurrent calls Clear cloud deployment guide (AWS/GCP/Azure) Clean, documented codebase ready for iteration I can start immediately and deliver an end-to-end working system within your deadline. Best regards, Enock Isaboke
$11.449 USD em 7 dias
2,9
2,9

Hi, there. The hardest part of low-latency voice bots isn’t dialing or prompts—it’s keeping the entire audio loop streaming without stalls when concurrency spikes. I’ve seen systems hit targets in single-call demos and miss SLAs once parallel calls begin. I’d design this as a fully streaming pipeline end to end: SIP/Twilio audio streams into real-time STT, partial transcripts fed incrementally to the LLM, and TTS returning audio chunks immediately, not after full text generation. Context is kept deliberately small and structured so the model responds fast and stays on script. I’d sequence work by validating latency with one call, then 10, then 50 in controlled tests, adding analytics and language switching only after timing is stable. Everything ships Dockerized with clean runtime controls. Are scripts strictly deterministic, or should the bot be allowed limited improvisation per call?
$13.000 USD em 40 dias
1,6
1,6

Hello, how are you? I am an experienced AI engineer and I can help you with Low-Latency Voice Sales Call Bot Development. I have experience building real-time voice bots for outbound sales calls. I can develop a scalable system that handles many calls at the same time with low delay. The bot will support both English and Spanish with natural, human-like voices. I will integrate live speech-to-text, an LLM for conversation control, and fast text-to-speech. The solution will be fully deployable using Docker on your preferred cloud platform. You will be able to update scripts, prompts, and call logic through simple APIs. Hope do discuss more details. Thank you.
$15.000 USD em 7 dias
1,6
1,6

I am Sumit Joshi from Sacesta Technologies. I will build your real-time bilingual cold-calling voice bot with a low-latency STT → LLM → TTS loop, scalable outbound calling, and script control via APIs. Recommended stack • Calling: Twilio Voice (or SIP trunk) with WebSocket media streaming • STT: streaming engine tuned for phone audio, language auto-detect with EN/ES lock • LLM: OpenAI realtime or low-latency chat with strict system prompts and guardrails • TTS: ElevenLabs or Amazon Polly neural voices, cached intros and fillers to reduce pauses • Backend: Node.js or Python services, Redis queues, Postgres for analytics, Docker deploy Core build • Real-time pipeline with barge-in, partial transcripts, and fast turn-taking • Persistent per-call state, short context window, and “stay on script” controls • Language switching based on first turns and customer preference • Concurrency design for 50+ calls: worker pools, rate limits, backpressure, retries • REST hooks to update prompts, offer lists, and objection handling without redeploy Deliverables • Dockerized service that loads number lists, dials, streams, logs, and exports transcripts • EN/ES voice configs and per-call switching • Analytics: duration, outcomes, hangup reason tags, exportable transcripts Relevant work • Voice and LLM conversation flows with Twilio-style webhooks, persistent chat state, and integrations (email/webhooks) similar to your current server-side bot work
$15.000 USD em 7 dias
1,4
1,4

Palm Beach Gardens, United States
Membro desde jan. 10, 2026
$20000-50000 USD
₹12500-37500 INR
$15-25 USD / hora
₹12500-37500 INR
$250-750 USD
$750-1500 NZD
€750-1500 EUR
$20-30 SGD / hora
₹600-1500 INR
$250-750 USD
₹37500-75000 INR
$15-25 USD / hora
$250-750 USD
$30-250 USD
$20-50 USD
$30-250 USD
$30-250 USD
$250-750 USD
£250-750 GBP
₹1500-12500 INR
₹750-1250 INR / hora