Spaces:

joytou
/

my-speech

Running

App Files Files Community

my-speech / docs /zh /samples.md

joytou

init project

882ea5e 4 months ago

preview code

raw

history blame contribute delete

7.93 kB

	# 例子

	v1.2 的样本可以在 [Bilibili](https://www.bilibili.com/video/BV1wz421B71D/) 观看。

	以下样本来自 v1.1 版本的模型。

	## 中文句子 1
	```
	人间灯火倒映湖中，她的渴望让静水泛起涟漪。若代价只是孤独，那就让这份愿望肆意流淌。
	流入她所注视的世间，也流入她如湖水般澄澈的目光。
	```

	<table>
	<thead>
	<tr>
	<th>说话人</th>
	<th>输入音频</th>
	<th>合成音频</th>
	</tr>
	</thead>
	<tbody>
	<tr>
	<td>纳西妲 (原神)</td>
	<td><audio controls preload="auto" src="https://demo-r2.speech.fish.audio/v1.1-sft-large/zh/0_input.wav" /></td>
	<td><audio controls preload="auto" src="https://demo-r2.speech.fish.audio/v1.1-sft-large/zh/0_output.wav" /></td>
	</tr>
	<tr>
	<td>钟离 (原神)</td>
	<td><audio controls preload="auto" src="https://demo-r2.speech.fish.audio/v1.1-sft-large/zh/1_input.wav" /></td>
	<td><audio controls preload="auto" src="https://demo-r2.speech.fish.audio/v1.1-sft-large/zh/1_output.wav" /></td>
	</tr>
	<tr>
	<td>芙宁娜 (原神)</td>
	<td><audio controls preload="auto" src="https://demo-r2.speech.fish.audio/v1.1-sft-large/zh/2_input.wav" /></td>
	<td><audio controls preload="auto" src="https://demo-r2.speech.fish.audio/v1.1-sft-large/zh/2_output.wav" /></td>
	</tr>
	<tr>
	<td>随机说话人 1</td>
	<td> - </td>
	<td><audio controls preload="auto" src="https://demo-r2.speech.fish.audio/v1.1-sft-large/zh/4_output.wav" /></td>
	</tr>
	<tr>
	<td>随机说话人 2</td>
	<td> - </td>
	<td><audio controls preload="auto" src="https://demo-r2.speech.fish.audio/v1.1-sft-large/zh/5_output.wav" /></td>
	</tr>
	</tbody>
	</table>


	## 中文句子 2
	```
	你们这个是什么群啊，你们这是害人不浅啊你们这个群！谁是群主，出来！真的太过分了。你们搞这个群干什么？
	我儿子每一科的成绩都不过那个平均分呐，他现在初二，你叫我儿子怎么办啊？他现在还不到高中啊？
	你们害死我儿子了！快点出来你这个群主！再这样我去报警了啊！我跟你们说你们这一帮人啊，一天到晚啊，
	搞这些什么游戏啊，动漫啊，会害死你们的，你们没有前途我跟你说。你们这九百多个人，好好学习不好吗？
	一天到晚在上网。有什么意思啊？麻烦你重视一下你们的生活的目标啊？有一点学习目标行不行？一天到晚上网是不是人啊？
	```

	<table>
	<thead>
	<tr>
	<th>说话人</th>
	<th>输入音频</th>
	<th>合成音频</th>
	</tr>
	</thead>
	<tbody>
	<tr>
	<td>纳西妲 (原神)</td>
	<td><audio controls preload="auto" src="https://demo-r2.speech.fish.audio/v1.1-sft-large/zh/0_input.wav" /></td>
	<td><audio controls preload="auto" src="https://demo-r2.speech.fish.audio/v1.1-sft-large/zh/6_output.wav" /></td>
	</tr>
	<tr>
	<td>随机说话人</td>
	<td> - </td>
	<td><audio controls preload="auto" src="https://demo-r2.speech.fish.audio/v1.1-sft-large/zh/7_output.wav" /></td>
	</tr>
	</tbody>
	</table>


	## 中文句子 3
	```
	大家好，我是 Fish Audio 开发的开源文本转语音模型。经过十五万小时的数据训练，
	我已经能够熟练掌握中文、日语和英语，我的语言处理能力接近人类水平，声音表现形式丰富多变。
	作为一个仅有亿级参数的模型，我相信社区成员能够在个人设备上轻松运行和微调，让我成为您的私人语音助手。
	```


	<table>
	<thead>
	<tr>
	<th>说话人</th>
	<th>输入音频</th>
	<th>合成音频</th>
	</tr>
	</thead>
	<tbody>
	<tr>
	<td>随机说话人</td>
	<td> - </td>
	<td><audio controls preload="auto" src="https://demo-r2.speech.fish.audio/v1.1-sft-large/zh/8_output.wav" /></td>
	</tr>
	</tbody>
	</table>

	## 英文句子 1

	```
	In the realm of advanced technology, the evolution of artificial intelligence stands as a
	monumental achievement. This dynamic field, constantly pushing the boundaries of what
	machines can do, has seen rapid growth and innovation. From deciphering complex data
	patterns to driving cars autonomously, AI's applications are vast and diverse.
	```

	<table>
	<thead>
	<tr>
	<th>说话人</th>
	<th>输入音频</th>
	<th>合成音频</th>
	</tr>
	</thead>
	<tbody>
	<tr>
	<td>随机说话人 1</td>
	<td> - </td>
	<td><audio controls preload="auto" src="https://demo-r2.speech.fish.audio/v1.1-sft-large/en/0_output.wav" /></td>
	</tr>
	<tr>
	<td>随机说话人 2</td>
	<td> - </td>
	<td><audio controls preload="auto" src="https://demo-r2.speech.fish.audio/v1.1-sft-large/en/1_output.wav" /></td>
	</tr>
	</tbody>
	</table>

	## 英文句子 2
	```
	Hello everyone, I am an open-source text-to-speech model developed by
	Fish Audio. After training with 150,000 hours of data, I have become proficient
	in Chinese, Japanese, and English, and my language processing abilities
	are close to human level. My voice is capable of a wide range of expressions.
	As a model with only hundreds of millions of parameters, I believe community
	members can easily run and fine-tune me on their personal devices, allowing
	me to serve as your personal voice assistant.
	```

	<table>
	<thead>
	<tr>
	<th>说话人</th>
	<th>输入音频</th>
	<th>合成音频</th>
	</tr>
	</thead>
	<tbody>
	<tr>
	<td>随机说话人</td>
	<td> - </td>
	<td><audio controls preload="auto" src="https://demo-r2.speech.fish.audio/v1.1-sft-large/en/2_output.wav" /></td>
	</tr>
	</tbody>
	</table>

	## 日文句子 1

	```
	先進技術の領域において、人工知能の進化は画期的な成果として立っています。常に機械ができることの限界を
	押し広げているこのダイナミックな分野は、急速な成長と革新を見せています。複雑なデータパターンの解読か
	ら自動運転車の操縦まで、AIの応用は広範囲に及びます。
	```


	<table>
	<thead>
	<tr>
	<th>说话人</th>
	<th>输入音频</th>
	<th>合成音频</th>
	</tr>
	</thead>
	<tbody>
	<tr>
	<td>随机说话人 1</td>
	<td> - </td>
	<td><audio controls preload="auto" src="https://demo-r2.speech.fish.audio/v1.1-sft-large/ja/0_output.wav" /></td>
	</tr>
	<tr>
	<td>随机说话人 2</td>
	<td> - </td>
	<td><audio controls preload="auto" src="https://demo-r2.speech.fish.audio/v1.1-sft-large/ja/1_output.wav" /></td>
	</tr>
	</tbody>
	</table>

	## 日文句子 2
	```
	皆さん、こんにちは。私はフィッシュオーディオによって開発されたオープンソースのテ
	キストから音声への変換モデルです。15万時間のデータトレーニングを経て、
	中国語、日本語、英語を熟知しており、言語処理能力は人間に近いレベルです。
	声の表現も多彩で豊かです。数億のパラメータを持つこのモデルは、コミュニティ
	のメンバーが個人のデバイスで簡単に実行し、微調整することができると
	信じています。これにより、私を個人の音声アシスタントとして活用できます。
	```

	<table>
	<thead>
	<tr>
	<th>说话人</th>
	<th>输入音频</th>
	<th>合成音频</th>
	</tr>
	</thead>
	<tbody>
	<tr>
	<td>随机说话人</td>
	<td> - </td>
	<td><audio controls preload="auto" src="https://demo-r2.speech.fish.audio/v1.1-sft-large/ja/2_output.wav" /></td>
	</tr>
	</tbody>
	</table>