Newest 'text-to-speech' Questions

0 votes

0 answers

82 views

audio from ElevenLabs API isn't played on browser when called

I'm building a little "educative game". My girlfriend is a bachelor for pedagogy and wanted to try an English learning game for a college project. Here I tried to implement the ElevenLabs ...

Gabriel Braga

17

asked Mar 27 at 1:13

0 votes

0 answers

54 views

XTTS v2 produces hallucinations when running multiple inferences sequentially, but works fine individually

I'm using XTTS v2 fine-tuned for Vietnamese (vnTTS). Problem: - Running inference on a single sentence → perfect output - Running inference on multiple sentences in a loop → weird sounds/...

Duy Đỗ Đình

1

asked Mar 19 at 10:42

Advice

0 votes

0 replies

37 views

Best practices for preprocessing audio dataset for StyleTTS training, consistent vs adaptive processing?

I’m preparing a 20-hour single-speaker dataset for training StyleTTS (text-to-speech). I have ~13k recorded sentences with consistent recording conditions (same mic, room, gain settings). For ...

Chandan naidu

1

asked Jan 7 at 17:47

0 votes

0 answers

97 views

How to correctly parse a WAV file for use with Android TTS

I am trying to create a TTS engine for a TTS service i have running locally on my network. I used an AI to generate the basics and set up the engine with the android system. But i am not able to get ...

Kaki Master Of Time

1,674

asked Jan 3 at 16:14

1 vote

0 answers

101 views

TTS audio is looped back into microphone even with iOS AEC enabled (AVAudioSession.voiceChat)

I’m developing a voice-based app on iOS (Xcode), similar to ChatGPT’s voice mode. The app is currently being tested on an iPhone SE. Intended behavior While TTS is playing, if the user starts ...

gwonjiyoung권지용

1

asked Dec 14, 2025 at 13:34

0 votes

0 answers

39 views

Auto detect language in Azure Speech Services SpeechSynthesizer

In the microsoft-cognitiveservices-speech-sdk JavaScript SDK SpeechSynthesizer.FromConfig takes AutoDetectSourceLanguageConfig as input but what does it actually do? The documentation here Select ...

Magnuti

305

asked Nov 27, 2025 at 12:12

Advice

1 vote

0 replies

88 views

Text-to-Speech function in a 3D Unity game

I am doing a 3D game using a Unity, and I need my NPC can communicate with the players with voice I want my NPC can speak with a voice that response is generated by the LLM, so the response will ...

Alden Ling

15

asked Oct 30, 2025 at 13:30

0 votes

2 answers

383 views

Too slow audio chunk generation using XTTS-v2 inference_stream function

I'm currently trying to make a realtime audio generation using Coqui-tts XTTS-v2 but I've ran into a problem of a slow audio chunk generation of the model. I'm using GPU RTX 2070 super 8GB VRAM and I'...

Artem Melnyk

11

asked Oct 9, 2025 at 11:26

1 vote

1 answer

85 views

TTS onDone callback never fires on Samsung (Android 15) post-SpeechRecognizer, even with AUDIOFOCUS_REQUEST_GRANTED

I'm facing a very specific, reproducible bug and I've hit a wall after trying all the standard solutions. I would appreciate any insight. I am developing a voice assistant setup flow where the app ...

Andrei Babenko

21

asked Oct 4, 2025 at 12:20

0 votes

0 answers

102 views

Flutter piper_tts package: CMake/NDK build fails with ld.lld: error: unable to find library -lespeak-ng

I am trying to integrate the piper_tts package into my Flutter project on Windows (using Android Studio, NDK 27, and CMake 3.22.1). When I run my app, the native build step fails during linking. Here ...

Ritesh Patil

1

asked Sep 16, 2025 at 15:58

0 votes

0 answers

51 views

Azure Speech Service Speaker Diarization: How to Optimize Real-Time Transcription Latency (Node.js + Angular)

I'm using Azure Speech-to-Text with speaker diarization in a real-time transcription app. Backend: Node.js (v18), using microsoft-cognitiveservices-speech-sdk and WebSocket server. Frontend: Angular (...

SGR

2,375

asked Sep 5, 2025 at 10:30

1 vote

2 answers

303 views

One Signal Notification with background supported Speech to text (TTS Speak)

I am developing a Flutter app where I use OneSignal for push notifications and flutter_tts for text-to-speech. Currently: When the app is in the foreground, text-to-speech works perfectly upon ...

Ramji

70

asked Aug 28, 2025 at 11:05

0 votes

0 answers

626 views

Trying to use Piper TTS on top of Espeak-ng from C++ throwing runtime exception "failed to set eSpeak-ng voice"

I looked for an open source library that could read words in various languages and perform text to speech (TTS) with good quality. The goal is something that would work without cloud on relatively ...

Dov

8,664

asked Jul 25, 2025 at 21:22

0 votes

0 answers

105 views

Ducking not working with package flutter_tts

I’m developing a Flutter navigation app that uses the flutter_tts package to provide spoken directions. However, when a voice instruction is played, Spotify (or other background music) stops ...

Android_devNL

167

asked Jul 20, 2025 at 13:30

0 votes

0 answers

61 views

browser inbuilt TTS not working in MAC chrome browser, but working in window chrome

This code working perfectly in widows google chorme browser, but now working in MAC os chrome browser. Can anybody help me please... autotmatically or programatically clicking the play button so after ...

Ashish Rawat

1

asked Jul 1, 2025 at 12:17

Collectives™ on Stack Overflow

audio from ElevenLabs API isn't played on browser when called

XTTS v2 produces hallucinations when running multiple inferences sequentially, but works fine individually

Best practices for preprocessing audio dataset for StyleTTS training, consistent vs adaptive processing?

How to correctly parse a WAV file for use with Android TTS

TTS audio is looped back into microphone even with iOS AEC enabled (AVAudioSession.voiceChat)

Auto detect language in Azure Speech Services SpeechSynthesizer

Text-to-Speech function in a 3D Unity game

Too slow audio chunk generation using XTTS-v2 inference_stream function

TTS onDone callback never fires on Samsung (Android 15) post-SpeechRecognizer, even with AUDIOFOCUS_REQUEST_GRANTED

Flutter piper_tts package: CMake/NDK build fails with ld.lld: error: unable to find library -lespeak-ng

Azure Speech Service Speaker Diarization: How to Optimize Real-Time Transcription Latency (Node.js + Angular)

One Signal Notification with background supported Speech to text (TTS Speak)

Trying to use Piper TTS on top of Espeak-ng from C++ throwing runtime exception "failed to set eSpeak-ng voice"

Ducking not working with package flutter_tts

browser inbuilt TTS not working in MAC chrome browser, but working in window chrome

Hot Network Questions