Skip to content

Commit

Permalink
Add sample TTS clips in README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
kaiidams committed Nov 3, 2023
1 parent 3b247c5 commit 0754471
Show file tree
Hide file tree
Showing 6 changed files with 25 additions and 2 deletions.
1 change: 0 additions & 1 deletion NeMoOnnxSharp.Example/Program.cs
Original file line number Diff line number Diff line change
Expand Up @@ -7,7 +7,6 @@
using System.Threading.Tasks;
using System.Runtime.InteropServices;
using NeMoOnnxSharp.Models;
using System.Reflection;

namespace NeMoOnnxSharp.Example
{
Expand Down
10 changes: 10 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -109,6 +109,16 @@ Name | Target | Predicted
61-70968-0061.wav|you are a worthy leech will presently whispered robin the wine has worked a marvel|you are a worthy leech will presently whispered robin the wine has worked a marvel
61-70968-0062.wav|ay and show you some pretty tricks|i enshow you some pretty tricks

## Text-to-speech samples

NeMoOnnxSharp supports text-to-speech with FastSpeech and HiFiGAN.

Generated | Target
--- | ---
[generated-61-70968-0000.wav](generated-61-70968-0000.wav)|he began a confused complaint against the wizard who had vanished behind the curtain on the left
[generated-61-70968-0001.wav](generated-61-70968-0001.wav)|give not so earnest a mind to these mummeries child
[generated-61-70968-0002.wav](generated-61-70968-0002.wav)|a golden fortune and a happy life

## Exporting ONNX

Exported ONNX file is included in this repository. But if you want to do it yourself,
Expand Down
3 changes: 3 additions & 0 deletions generated-61-70968-0000.wav
Git LFS file not shown
3 changes: 3 additions & 0 deletions generated-61-70968-0001.wav
Git LFS file not shown
3 changes: 3 additions & 0 deletions generated-61-70968-0002.wav
Git LFS file not shown
7 changes: 6 additions & 1 deletion test_data/README.md
Original file line number Diff line number Diff line change
@@ -1,10 +1,15 @@
# Test data

These file are from [LibriSpeech](http://www.openslr.org/12)
These files are from [LibriSpeech](http://www.openslr.org/12)

- `transcript.txt`
- `61-70968-0052.wav`
- ...

This file is from [NVIDIA NeMo tutorial](https://github.com/NVIDIA/NeMo/blob/main/tutorials/asr/Online_Offline_Speech_Commands_Demo.ipynb)

- `SpeechCommands_demo.wav`

This file is from [Thrsten Voice](https://github.com/thorstenMueller/Thorsten-Voice)

- `samples_thorsten-21.06-emotional_neutral.wav`

0 comments on commit 0754471

Please sign in to comment.