Skip to content

更细粒度的流式推理模式#2671

Merged
RVC-Boss merged 10 commits into
RVC-Boss:mainfrom
ChasonJiang:better_steaming_mode_merge
Nov 28, 2025
Merged

更细粒度的流式推理模式#2671
RVC-Boss merged 10 commits into
RVC-Boss:mainfrom
ChasonJiang:better_steaming_mode_merge

Conversation

@ChasonJiang

@ChasonJiang ChasonJiang commented Nov 24, 2025

Copy link
Copy Markdown
Contributor

支持更细粒度的流式推理模式。

三种模式:
1.return_fragment:之前版本的streaming_mode,chunk长度就是分句的token长度,质量最好(和baseline一致),但响应速度最慢。
2.streaming_mode:相比return_fragment的chunk更细,但是首包长度不定(动态chunk长度),质量较好(比baseline咬字可能稍弱),响应速度一般。
3.streaming_mode+fixed_length_chunk:在streaming_mode的基础上开启fixed_length_chunk,chunk长度相对固定(可能小于设定的chunk长度),首包长度固定,质量一般(比baseline咬字可能稍弱,且当chunk长度设置过短时可能出现基音断裂不连贯的现象),响应速度最快。

总结:
1.质量:return_fragment > streaming_mode > streaming_mode+fixed_length_chunk
2.响应速度: streaming_mode+fixed_length_chunk >> streaming_mode > return_fragment

	modified:   GPT_SoVITS/TTS_infer_pack/TTS.py
	modified:   GPT_SoVITS/module/models.py
	modified:   GPT_SoVITS/AR/models/t2s_model.py
	modified:   GPT_SoVITS/TTS_infer_pack/TTS.py
	modified:   GPT_SoVITS/module/models.py
	modified:   GPT_SoVITS/TTS_infer_pack/TTS.py
	modified:   GPT_SoVITS/module/models.py
	modified:   api_v2.py
@nanhui69

Copy link
Copy Markdown

最新发布的么?

@RVC-Boss

Copy link
Copy Markdown
Owner

正常的推理的话,(steam是False?),效果是不影响的吧?@ChasonJiang

@ChasonJiang

Copy link
Copy Markdown
Contributor Author

正常的推理的话,(steam是False?),效果是不影响的吧?@ChasonJiang

@RVC-Boss 正常推理就不是流模式了(return_fragment也是False),不影响效果。开启流模式就根据上述的三种模式来。

@RVC-Boss RVC-Boss merged commit 92ab59c into RVC-Boss:main Nov 28, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants