
Fechado
Publicado
I’m building a new American-English text-to-speech engine and need one engaging, crystal-clear voice (male or female) to supply three hours of finished audio. Once I approve your short, unpaid sample, I’ll send the full script—six distinct narrative styles that will ultimately feed the model. The final sessions must meet the following capture specs so the data can pass straight into the pipeline without extra clean-up: • WAV, mono, 48 kHz, 16-bit • Peak no hotter than ‑3 dB FS, average loudness between ‑18 LUFS and ‑24 LUFS • Background noise below ‑65 dB RMS (device self-noise below ‑85 dB) • Reverb under 0.4 s and speech intelligibility D50 above 97 % • Absolutely no clipping Deliver exactly three hours of polished, style-labeled takes that reflect the six script categories I’ll provide. If you can record to spec from a treated room, deliver consistent file naming, and are comfortable signing a standard usage release for TTS, you’re the voice I want to hear first.
ID do Projeto: 40150368
4 propostas
Projeto remoto
Ativo há 16 dias
Defina seu orçamento e seu prazo
Seja pago pelo seu trabalho
Descreva sua proposta
É grátis para se inscrever e fazer ofertas em trabalhos
4 freelancers estão ofertando em média $54 USD/hora for esse trabalho

Looking for a voice that will resonate with your audience? My name is amar, and I am an experienced audio professional who specializes in audio production, sound design and have created numerous engaging videos with voice-overs. Your project aligns perfectly with my skills and interests. I fully understand the specifications required to ensure the clearest, high-quality audio as needed for your TTS engine. In my treated audio room, I can guarantee audio that meets the specified loudness, peak levels, reverb time and noise cancellation parameters. My consistent file-naming approach ensures seamless organization too. Furthermore, I believe in the power of good communication to achieve great results. Once we begin the project, you can expect regular updates on progress and open dialogue about any changes or adjustments. If you choose me for this project, rest assured of crystal-clear voices that accurately reflect your six script categories. Let's make your new American-English text-to-speech engine an extraordinary one together!
$50 USD em 40 dias
6,2
6,2

I’ve lived in the United States and speak American English naturally, with a clear and neutral accent that fits professional narration and TTS work. I’m very comfortable switching between tones and styles, so adapting to six different narrative voices won’t be an issue. I record from a controlled environment using a professional microphone and interface, and I’m used to working with strict technical specs (levels, noise floor, consistency, clean file naming). Delivering long-form, consistent audio without drift in tone or quality is something I take seriously. I’m happy to record a short sample first so you can check voice fit and technical quality before moving forward, and I’m comfortable signing a standard usage release for TTS once approved.
$50 USD em 40 dias
3,1
3,1

Not only do I have a clear voice with good annunciation, I also own a recording studio and would love to try something like this.
$65 USD em 40 dias
0,0
0,0

Hello, I’m building a new American-English text-to-speech engine and need one engaging, crystal-clear voice (male or female) to supply three hours of finished audio. Once I approve your short, unpaid sample, I’ll send the full script—six distinct narrative styles that will ultimately feed the model. The final sessions must meet the following capture specs so the data can pass straight into the pipeline without extra clean-up: • WAV, mono, 48 kHz, 16-bit • Peak no hotter than ‑3 dB FS, average loudness between ‑18 LUFS and ‑24 LUFS • Background noise below ‑65 dB RMS (device self-noise below ‑85 dB) • Reverb under 0.4 s and speech intelligibility D50 above 97 % • Absolutely no clipping Deliver exactly three hours of polished, style-labeled takes that reflect the six script categories I’ll provide. If you can record to spec from a treated room, deliver consistent file naming, and are comfortable signing a standard usage release for TTS, you’re the voice I want to hear first. Best regards, Sowod
$50 USD em 30 dias
0,0
0,0

Ummedabad, India
Método de pagamento verificado
Membro desde mar. 9, 2017
$10-30 USD
$10-30 USD
mín. $50 USD / hora
$10-30 USD
$8-15 USD / hora
₹600-1500 INR
$10-11 USD
£20-250 GBP
$10-30 USD
$250-750 AUD
₹600-1500 INR
$250-750 USD
$25-50 USD / hora
$750-1500 CAD
$15-25 USD / hora
$250-750 USD
₹600-1500 INR
₹600-1500 INR
$250-750 USD
₹100-400 INR / hora
₹100-400 INR / hora
₹600-1500 INR
₹1500-12500 INR
£10-20 GBP
₹1500-12500 INR