Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

use reference voice or random generate voice, the result voice is deep and dull #624

Open
5 tasks done
xhjcxxl opened this issue Oct 18, 2024 · 3 comments
Open
5 tasks done
Labels
enhancement New feature or request

Comments

@xhjcxxl
Copy link

xhjcxxl commented Oct 18, 2024

Self Checks

  • I have searched for existing issues search for existing issues, including closed ones.
  • I confirm that I am using English to submit this report (我已阅读并同意 Language Policy).
  • [FOR CHINESE USERS] 请务必使用英文提交 Issue,否则会被关闭。谢谢!:)
  • Please do not modify this template :) and fill in all the required fields.

1. Is this request related to a challenge you're experiencing? Tell me about your story.

hello,I try use reference voice or random voice to build my voice, but the result voice is deep and dull.Deep and resonant(生成的声音低沉沉闷,不够洪亮,不如官网给出的参考声音)
the female4:
female4.wav.zip
the result1:
audio (47).wav.zip
the female5:
female5.wav.zip
the result2:
audio (48).wav.zip

the websit is good,like this:
image

1.so, i want to konw how to set params to fix it, i use reference api params:

{
  "text": "请问您期望的安装日期是几月几日,您可以说8月3日,温馨提示",
  "chunk_length": 200,
  "format": "wav",
  "mp3_bitrate": 64,
  "references": [],
  "reference_id": "female5",
  "normalize": true,
  "opus_bitrate": -1000,
  "latency": "normal",
  "streaming": false,
  "emotion": null,
  "max_new_tokens": 1024,
  "top_p": 0.7,
  "repetition_penalty": 1.2,
  "temperature": 0.7
}
  1. i try use random voice timbre to generate voice,then i get a good voice;i want to fixed voice, and want to generate voice timbre always, it is has param to fixed it?(我想得到一个比较好的随机音色后,想固定这个音色,固定下来)
  • I am interested in contributing to this feature.
@xhjcxxl xhjcxxl added the enhancement New feature or request label Oct 18, 2024
@AnyaCoder
Copy link
Collaborator

还可以用--reference_id(仅能用一个)来代替--reference_audio和--reference_text, 前提是在项目根目录下创建references/<your reference_id>文件夹, 里面放上任意个音频与对应的标注文本。 目前支持的参考音频最多加起来总时长90s。
Did you follow it?

@xhjcxxl
Copy link
Author

xhjcxxl commented Oct 19, 2024

还可以用--reference_id(仅能用一个)来代替--reference_audio和--reference_text, 前提是在项目根目录下创建references/<your reference_id>文件夹, 里面放上任意个音频与对应的标注文本。 目前支持的参考音频最多加起来总时长90s。 Did you follow it?

yes,i try build dir in references,then i use reference_id,it's work ok! but the result voice is deep and dull
image
image

@AnyaCoder
Copy link
Collaborator

AnyaCoder commented Oct 19, 2024

i try use random voice timbre to generate voice,then i get a good voice;i want to fixed voice, and want to generate voice timbre always, it is has param to fixed it?(我想得到一个比较好的随机音色后,想固定这个音色,固定下来)

see #627
it seems not possible to fix a voice timbre because there is not speaker embedding used in the model . You can just only make the output deterministic, given some reference audio.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants