use reference voice or random generate voice, the result voice is deep and dull #624

xhjcxxl · 2024-10-18T13:40:48Z

Self Checks

I have searched for existing issues search for existing issues, including closed ones.
I confirm that I am using English to submit this report (我已阅读并同意 Language Policy).
[FOR CHINESE USERS] 请务必使用英文提交 Issue，否则会被关闭。谢谢！:）
Please do not modify this template :) and fill in all the required fields.

1. Is this request related to a challenge you're experiencing? Tell me about your story.

hello，I try use reference voice or random voice to build my voice, but the result voice is deep and dull.Deep and resonant（生成的声音低沉沉闷，不够洪亮，不如官网给出的参考声音）
the female4:
female4.wav.zip
the result1:
audio (47).wav.zip
the female5:
female5.wav.zip
the result2:
audio (48).wav.zip

the websit is good,like this:

1.so, i want to konw how to set params to fix it, i use reference api params:

{
  "text": "请问您期望的安装日期是几月几日，您可以说8月3日，温馨提示",
  "chunk_length": 200,
  "format": "wav",
  "mp3_bitrate": 64,
  "references": [],
  "reference_id": "female5",
  "normalize": true,
  "opus_bitrate": -1000,
  "latency": "normal",
  "streaming": false,
  "emotion": null,
  "max_new_tokens": 1024,
  "top_p": 0.7,
  "repetition_penalty": 1.2,
  "temperature": 0.7
}

i try use random voice timbre to generate voice，then i get a good voice；i want to fixed voice, and want to generate voice timbre always, it is has param to fixed it?（我想得到一个比较好的随机音色后，想固定这个音色，固定下来）

I am interested in contributing to this feature.

The text was updated successfully, but these errors were encountered:

AnyaCoder · 2024-10-19T03:08:52Z

还可以用--reference_id(仅能用一个)来代替--reference_audio和--reference_text, 前提是在项目根目录下创建references/<your reference_id>文件夹，里面放上任意个音频与对应的标注文本。目前支持的参考音频最多加起来总时长90s。
Did you follow it?

xhjcxxl · 2024-10-19T06:11:34Z

还可以用--reference_id(仅能用一个)来代替--reference_audio和--reference_text, 前提是在项目根目录下创建references/<your reference_id>文件夹，里面放上任意个音频与对应的标注文本。目前支持的参考音频最多加起来总时长90s。 Did you follow it?

yes，i try build dir in references,then i use reference_id,it's work ok! but the result voice is deep and dull

AnyaCoder · 2024-10-19T13:15:46Z

i try use random voice timbre to generate voice，then i get a good voice；i want to fixed voice, and want to generate voice timbre always, it is has param to fixed it?（我想得到一个比较好的随机音色后，想固定这个音色，固定下来）

see #627
it seems not possible to fix a voice timbre because there is not speaker embedding used in the model . You can just only make the output deterministic, given some reference audio.

xhjcxxl added the enhancement New feature or request label Oct 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

use reference voice or random generate voice, the result voice is deep and dull #624

use reference voice or random generate voice, the result voice is deep and dull #624

xhjcxxl commented Oct 18, 2024 •

edited

Loading

AnyaCoder commented Oct 19, 2024

xhjcxxl commented Oct 19, 2024

AnyaCoder commented Oct 19, 2024 •

edited

Loading

use reference voice or random generate voice, the result voice is deep and dull #624

use reference voice or random generate voice, the result voice is deep and dull #624

Comments

xhjcxxl commented Oct 18, 2024 • edited Loading

Self Checks

1. Is this request related to a challenge you're experiencing? Tell me about your story.

AnyaCoder commented Oct 19, 2024

xhjcxxl commented Oct 19, 2024

AnyaCoder commented Oct 19, 2024 • edited Loading

xhjcxxl commented Oct 18, 2024 •

edited

Loading

AnyaCoder commented Oct 19, 2024 •

edited

Loading