What does Piper have to do with espeak-ng? #390

Answered by synesthesiam
omega3 asked this question in Q&A
espeak-ng has a special mode in which it doesn't speak the text but outputs its phonemes instead. Phonemes are the fundamental sound units of human languages, and espeak-ng writes them using the International Phonetic Alphabet (IPA).
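To make the phoneme mode concrete, here is a minimal sketch that asks the `espeak-ng` command-line tool for IPA instead of audio. It assumes `espeak-ng` is installed and on your `PATH`; the helper names `espeak_ipa_command` and `text_to_ipa` are made up for this example. It only illustrates the mode itself, and is not necessarily how Piper calls espeak-ng internally.

```python
import shutil
import subprocess


def espeak_ipa_command(text, voice="en-us"):
    """Build an espeak-ng command line that prints IPA instead of speaking."""
    # -q: produce no audio; --ipa: write phonemes to stdout as IPA; -v: voice
    return ["espeak-ng", "-q", "--ipa", "-v", voice, text]


def text_to_ipa(text, voice="en-us"):
    """Return espeak-ng's IPA output, or None if espeak-ng is not installed."""
    if shutil.which("espeak-ng") is None:
        return None
    result = subprocess.run(
        espeak_ipa_command(text, voice),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.strip()


if __name__ == "__main__":
    # Prints an IPA transcription if espeak-ng is available, otherwise None.
    print(text_to_ipa("Hello world"))
```

The exact transcription depends on the espeak-ng version and the voice selected with `-v`.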

Turning text into phonemes is hard, so Piper outsources it to espeak-ng (for now). The rest of Piper is completely different from espeak-ng and Festival. Those programs paste together small snippets of sound to speak. Piper instead uses a machine learning model, a neural network, that generates audio from phonemes. These two approaches (pasting snippets and neural networks) are not compatible, even though they both use phonemes underneath.
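As a rough picture of the hand-off between the two halves: a neural network consumes numbers, so the phoneme string has to be turned into integer IDs before the model sees it. The sketch below is a toy illustration under that assumption. `TOY_PHONEME_IDS` and `phonemes_to_ids` are hypothetical names, the table is invented, and a real voice would ship its own mapping.

```python
# Hypothetical symbol table for illustration only; not taken from any real voice.
TOY_PHONEME_IDS = {"h": 1, "ə": 2, "l": 3, "ˈ": 4, "o": 5, "ʊ": 6}


def phonemes_to_ids(phonemes, id_map, unknown_id=0):
    """Map each Unicode code point of an IPA string to an integer ID.

    Symbols missing from id_map fall back to unknown_id. Iterating a Python
    string yields code points, so combining marks are treated as separate
    symbols in this sketch.
    """
    return [id_map.get(symbol, unknown_id) for symbol in phonemes]


if __name__ == "__main__":
    # An IPA transcription of "hello" (stress mark included).
    print(phonemes_to_ids("həlˈoʊ", TOY_PHONEME_IDS))  # [1, 2, 3, 4, 5, 6]
```

A snippet-pasting synthesizer would instead look up a stored piece of audio for each phoneme or phoneme pair, which is why the two back ends cannot be swapped for one another even though they start from the same phonemes.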

Answer selected by omega3