3,750 questions
0
votes
0
answers
82
views
audio from ElevenLabs API isn't played on browser when called
I'm building a little "educative game". My girlfriend is a bachelor for pedagogy and wanted to try an English learning game for a college project.
Here I tried to implement the ElevenLabs ...
0
votes
0
answers
54
views
XTTS v2 produces hallucinations when running multiple inferences sequentially, but works fine individually
I'm using XTTS v2 fine-tuned for Vietnamese (vnTTS).
Problem:
- Running inference on a single sentence → perfect output
- Running inference on multiple sentences in a loop → weird sounds/...
Advice
0
votes
0
replies
37
views
Best practices for preprocessing audio dataset for StyleTTS training, consistent vs adaptive processing?
I’m preparing a 20-hour single-speaker dataset for training StyleTTS (text-to-speech). I have ~13k recorded sentences with consistent recording conditions (same mic, room, gain settings).
For ...
0
votes
0
answers
97
views
How to correctly parse a WAV file for use with Android TTS
I am trying to create a TTS engine for a TTS service i have running locally on my network.
I used an AI to generate the basics and set up the engine with the android system. But i am not able to get ...
1
vote
0
answers
101
views
TTS audio is looped back into microphone even with iOS AEC enabled (AVAudioSession.voiceChat)
I’m developing a voice-based app on iOS (Xcode), similar to ChatGPT’s voice mode.
The app is currently being tested on an iPhone SE.
Intended behavior
While TTS is playing, if the user starts ...
0
votes
0
answers
39
views
Auto detect language in Azure Speech Services SpeechSynthesizer
In the microsoft-cognitiveservices-speech-sdk JavaScript SDK SpeechSynthesizer.FromConfig takes AutoDetectSourceLanguageConfig as input but what does it actually do?
The documentation here Select ...
Advice
1
vote
0
replies
88
views
Text-to-Speech function in a 3D Unity game
I am doing a 3D game using a Unity, and I need my NPC can communicate with the players with voice
I want my NPC can speak with a voice that response is generated by the LLM, so the response will ...
0
votes
2
answers
383
views
Too slow audio chunk generation using XTTS-v2 inference_stream function
I'm currently trying to make a realtime audio generation using Coqui-tts XTTS-v2 but I've ran into a problem of a slow audio chunk generation of the model. I'm using GPU RTX 2070 super 8GB VRAM and I'...
1
vote
1
answer
85
views
TTS onDone callback never fires on Samsung (Android 15) post-SpeechRecognizer, even with AUDIOFOCUS_REQUEST_GRANTED
I'm facing a very specific, reproducible bug and I've hit a wall after trying all the standard solutions. I would appreciate any insight.
I am developing a voice assistant setup flow where the app ...
0
votes
0
answers
102
views
Flutter piper_tts package: CMake/NDK build fails with ld.lld: error: unable to find library -lespeak-ng
I am trying to integrate the piper_tts package into my Flutter project on Windows (using Android Studio, NDK 27, and CMake 3.22.1). When I run my app, the native build step fails during linking.
Here ...
0
votes
0
answers
51
views
Azure Speech Service Speaker Diarization: How to Optimize Real-Time Transcription Latency (Node.js + Angular)
I'm using Azure Speech-to-Text with speaker diarization in a real-time transcription app.
Backend: Node.js (v18), using microsoft-cognitiveservices-speech-sdk and WebSocket server.
Frontend: Angular (...
1
vote
2
answers
303
views
One Signal Notification with background supported Speech to text (TTS Speak)
I am developing a Flutter app where I use OneSignal for push notifications and flutter_tts for text-to-speech.
Currently:
When the app is in the foreground, text-to-speech works perfectly upon ...
0
votes
0
answers
626
views
Trying to use Piper TTS on top of Espeak-ng from C++ throwing runtime exception "failed to set eSpeak-ng voice"
I looked for an open source library that could read words in various languages and perform text to speech (TTS) with good quality. The goal is something that would work without cloud on relatively ...
0
votes
0
answers
105
views
Ducking not working with package flutter_tts
I’m developing a Flutter navigation app that uses the flutter_tts package to provide spoken directions. However, when a voice instruction is played, Spotify (or other background music) stops ...
0
votes
0
answers
61
views
browser inbuilt TTS not working in MAC chrome browser, but working in window chrome
This code working perfectly in widows google chorme browser, but now working in MAC os chrome browser. Can anybody help me please...
autotmatically or programatically clicking the play button so after ...