Add more engdict #395

Merged
RVC-Boss merged 1 commit into RVC-Boss:main from KamioRinn:Add-more-engdict
Feb 5, 2024

@KamioRinn

Contributor

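Update the English dictionary
1. Add cmudict-fast. Last updated in 2021, it includes Internet-era words such as spotify, github, and ipad.
2. Add engdict-hot to supplement words that cmudict-fast does not cover; still to be completed.
3. Regenerate the dictionary cache from cmudict, cmudict-fast, and cmudict-hot.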
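
The cache regeneration in step 3 can be pictured roughly as follows. This is a minimal sketch, not the code merged in this PR: the function names (`parse_entries`, `merge_dicts`, `build_cache`) are hypothetical, the input is assumed to be CMUdict-style text (one `WORD PH1 PH2 ...` entry per line, `WORD(1)` for alternate pronunciations, `;;;` comment lines, optional trailing ` #` comments), and later sources are assumed to override earlier ones.

```python
import pickle
import re
from pathlib import Path

# Matches the "(1)", "(2)" suffix that marks an alternate pronunciation.
_VARIANT = re.compile(r"\(\d+\)$")


def parse_entries(text):
    """Parse CMUdict-style text into {word: [pronunciation, ...]}."""
    entries = {}
    for raw in text.splitlines():
        # Drop trailing " # comment" parts and surrounding whitespace.
        line = raw.split(" #", 1)[0].strip()
        if not line or line.startswith(";;;"):
            continue
        word, *phones = line.split()
        if not phones:
            continue
        key = _VARIANT.sub("", word).lower()
        entries.setdefault(key, []).append(phones)
    return entries


def merge_dicts(*sources):
    """Merge parsed dictionaries; later sources win (an assumption)."""
    merged = {}
    for entries in sources:
        merged.update(entries)
    return merged


def build_cache(paths, cache_path):
    """Parse every source file, merge them, and pickle the result."""
    parsed = [parse_entries(Path(p).read_text(encoding="utf-8")) for p in paths]
    merged = merge_dicts(*parsed)
    with open(cache_path, "wb") as f:
        pickle.dump(merged, f)
    return merged
```

Calling `build_cache` with the three source files in priority order would let the hot-word list correct or extend the larger dictionaries without editing them.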

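Compatibility
When language splitting is set to auto, LangSegment is called to segment the text. This triggers LangSegment's English optimization (# English abbreviations such as "LCD" are pronounced "L-C-D" in standard English, # whereas TTS projects usually separate the letters with spaces so each one is pronounced individually: "L C D"). As a result, the input IPAD is rewritten as I P A D and read letter by letter, so it can no longer match the correct pronunciation in the dictionary.
When language splitting is set to english, all-uppercase words are not rewritten by LangSegment; if a word has no dictionary match, it is handled by g2p_en.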
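
The mismatch can be reproduced with a small stand-in. The sketch below does not call LangSegment; `space_out_acronyms` is a hypothetical function that only imitates the behaviour described above (spelling out all-uppercase words), and `lookup` is a hypothetical case-insensitive dictionary lookup.

```python
import re


def space_out_acronyms(text):
    """Imitate the described optimisation: "LCD" -> "L C D".

    Stand-in for illustration only, not LangSegment's actual code.
    """
    return re.sub(r"\b[A-Z]{2,}\b", lambda m: " ".join(m.group(0)), text)


def lookup(text, lexicon):
    """Return (token, pronunciation-or-None) for each whitespace token."""
    return [(tok, lexicon.get(tok.lower())) for tok in text.split()]


# Placeholder lexicon with a single entry; the value is illustrative.
lexicon = {"ipad": ["<phonemes for ipad>"]}

direct = lookup("IPAD", lexicon)                       # whole word hits "ipad"
spelled = lookup(space_out_acronyms("IPAD"), lexicon)  # "I", "P", "A", "D" all miss
```

In the auto path the tokens reaching the dictionary are single letters, so the `ipad` entry is never consulted; in the english path the whole word is looked up first.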

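Proposed compatibility fix
Consider further standardizing the input and adding hints so that the LangSegment optimization does not backfire, and also applying LangSegment processing in english-only mode so that both modes behave consistently.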
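
One possible reading of "standardizing the input" is to lowercase all-uppercase words that the dictionary already knows before the text reaches segmentation, so that only genuine acronyms are spelled out. The sketch below is an assumption about how that could look, not something this PR implements; `protect_known_words` is a hypothetical helper.

```python
import re


def protect_known_words(text, lexicon):
    """Lowercase all-caps words found in the lexicon; leave the rest alone.

    Hypothetical pre-processing step, run before language segmentation.
    """
    def repl(match):
        word = match.group(0)
        return word.lower() if word.lower() in lexicon else word

    return re.sub(r"\b[A-Z]{2,}\b", repl, text)


known = {"ipad", "github"}
normalized = protect_known_words("IPAD with LCD", known)
```

Here `IPAD` would be lowercased because it is in the dictionary, while `LCD` stays uppercase and can still be read letter by letter. Running the same step in both auto and english modes would keep their behaviour aligned.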

@RVC-Boss RVC-Boss merged commit 210eb5f into RVC-Boss:main Feb 5, 2024
@KamioRinn KamioRinn deleted the Add-more-engdict branch February 5, 2024 10:10
2 participants