Commit
·
88af147
verified
·
0
Parent(s):
Duplicate from HeartMuLa/HeartCodec-oss
Browse filesCo-authored-by: HeartMuLa <HeartMuLa@users.noreply.huggingface.co>
- .gitattributes +35 -0
- README.md +60 -0
- config.json +60 -0
- model.safetensors +3 -0
.gitattributes
ADDED
|
@@ -0,0 +1,35 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
*.7z filter=lfs diff=lfs merge=lfs -text
|
| 2 |
+
*.arrow filter=lfs diff=lfs merge=lfs -text
|
| 3 |
+
*.bin filter=lfs diff=lfs merge=lfs -text
|
| 4 |
+
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
| 5 |
+
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
| 6 |
+
*.ftz filter=lfs diff=lfs merge=lfs -text
|
| 7 |
+
*.gz filter=lfs diff=lfs merge=lfs -text
|
| 8 |
+
*.h5 filter=lfs diff=lfs merge=lfs -text
|
| 9 |
+
*.joblib filter=lfs diff=lfs merge=lfs -text
|
| 10 |
+
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
| 11 |
+
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
| 12 |
+
*.model filter=lfs diff=lfs merge=lfs -text
|
| 13 |
+
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
| 14 |
+
*.npy filter=lfs diff=lfs merge=lfs -text
|
| 15 |
+
*.npz filter=lfs diff=lfs merge=lfs -text
|
| 16 |
+
*.onnx filter=lfs diff=lfs merge=lfs -text
|
| 17 |
+
*.ot filter=lfs diff=lfs merge=lfs -text
|
| 18 |
+
*.parquet filter=lfs diff=lfs merge=lfs -text
|
| 19 |
+
*.pb filter=lfs diff=lfs merge=lfs -text
|
| 20 |
+
*.pickle filter=lfs diff=lfs merge=lfs -text
|
| 21 |
+
*.pkl filter=lfs diff=lfs merge=lfs -text
|
| 22 |
+
*.pt filter=lfs diff=lfs merge=lfs -text
|
| 23 |
+
*.pth filter=lfs diff=lfs merge=lfs -text
|
| 24 |
+
*.rar filter=lfs diff=lfs merge=lfs -text
|
| 25 |
+
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
| 26 |
+
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
| 27 |
+
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
| 28 |
+
*.tar filter=lfs diff=lfs merge=lfs -text
|
| 29 |
+
*.tflite filter=lfs diff=lfs merge=lfs -text
|
| 30 |
+
*.tgz filter=lfs diff=lfs merge=lfs -text
|
| 31 |
+
*.wasm filter=lfs diff=lfs merge=lfs -text
|
| 32 |
+
*.xz filter=lfs diff=lfs merge=lfs -text
|
| 33 |
+
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
+
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
+
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
README.md
ADDED
|
@@ -0,0 +1,60 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
+
language:
|
| 4 |
+
- zh
|
| 5 |
+
- en
|
| 6 |
+
- ja
|
| 7 |
+
- ko
|
| 8 |
+
- es
|
| 9 |
+
pipeline_tag: text-to-audio
|
| 10 |
+
tags:
|
| 11 |
+
- music
|
| 12 |
+
- art
|
| 13 |
+
---
|
| 14 |
+
|
| 15 |
+
|
| 16 |
+
## Model Details
|
| 17 |
+
|
| 18 |
+
### Model Description
|
| 19 |
+
|
| 20 |
+
<!-- Provide a longer summary of what this model is. -->
|
| 21 |
+
|
| 22 |
+
- **Developed by:** [HeartMuLa Team]
|
| 23 |
+
- **License:** [Apache 2.0]
|
| 24 |
+
|
| 25 |
+
### Model Sources
|
| 26 |
+
|
| 27 |
+
<!-- Provide the basic links for the model. -->
|
| 28 |
+
|
| 29 |
+
- **Github Repo:** https://github.com/HeartMuLa/heartlib
|
| 30 |
+
- **Paper:** https://arxiv.org/abs/2601.10547
|
| 31 |
+
- **Demo:** https://heartmula.github.io/
|
| 32 |
+
- **HeartMuLa-oss-3B:** https://huggingface.co/HeartMuLa/HeartMuLa-oss-3B
|
| 33 |
+
- **HeartCodec-oss:** https://huggingface.co/HeartMuLa/HeartCodec-oss
|
| 34 |
+
- **HeartTranscriptor-oss:** https://huggingface.co/HeartMuLa/HeartTranscriptor-oss
|
| 35 |
+
|
| 36 |
+
|
| 37 |
+
## Get Started
|
| 38 |
+
|
| 39 |
+
Check our github repo https://github.com/HeartMuLa/heartlib for a quickstart and local deployment of HeartMuLa.
|
| 40 |
+
|
| 41 |
+
## Citation
|
| 42 |
+
|
| 43 |
+
<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
|
| 44 |
+
If you find HeartMuLa useful, please cite:
|
| 45 |
+
|
| 46 |
+
```bibtex
|
| 47 |
+
@misc{yang2026heartmulafamilyopensourced,
|
| 48 |
+
title={HeartMuLa: A Family of Open Sourced Music Foundation Models},
|
| 49 |
+
author={Dongchao Yang and Yuxin Xie and Yuguo Yin and Zheyu Wang and Xiaoyu Yi and Gongxi Zhu and Xiaolong Weng and Zihan Xiong and Yingzhe Ma and Dading Cong and Jingliang Liu and Zihang Huang and Jinghan Ru and Rongjie Huang and Haoran Wan and Peixu Wang and Kuoxi Yu and Helin Wang and Liming Liang and Xianwei Zhuang and Yuanyuan Wang and Haohan Guo and Junjie Cao and Zeqian Ju and Songxiang Liu and Yuewen Cao and Heming Weng and Yuexian Zou},
|
| 50 |
+
year={2026},
|
| 51 |
+
eprint={2601.10547},
|
| 52 |
+
archivePrefix={arXiv},
|
| 53 |
+
primaryClass={cs.SD},
|
| 54 |
+
url={https://arxiv.org/abs/2601.10547},
|
| 55 |
+
}
|
| 56 |
+
```
|
| 57 |
+
|
| 58 |
+
|
| 59 |
+
## Contact
|
| 60 |
+
If you are interested in HeartMuLa, feel free to reach us at heartmula.ai@gmail.com
|
config.json
ADDED
|
@@ -0,0 +1,60 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"architectures": [
|
| 3 |
+
"HeartCodec"
|
| 4 |
+
],
|
| 5 |
+
"attention_head_dim": 64,
|
| 6 |
+
"causal": true,
|
| 7 |
+
"codebook_dim": 32,
|
| 8 |
+
"codebook_size": 8192,
|
| 9 |
+
"commitment_weight": 1.0,
|
| 10 |
+
"decay": 0.9,
|
| 11 |
+
"default_kernel_size": 7,
|
| 12 |
+
"delay_kernel_size": 5,
|
| 13 |
+
"dim": 512,
|
| 14 |
+
"downsample_factors": [
|
| 15 |
+
3,
|
| 16 |
+
4,
|
| 17 |
+
4,
|
| 18 |
+
4,
|
| 19 |
+
5
|
| 20 |
+
],
|
| 21 |
+
"downsample_kernel_sizes": [
|
| 22 |
+
6,
|
| 23 |
+
8,
|
| 24 |
+
8,
|
| 25 |
+
8,
|
| 26 |
+
10
|
| 27 |
+
],
|
| 28 |
+
"in_channels": 1024,
|
| 29 |
+
"init_channel": 64,
|
| 30 |
+
"latent_hidden_dim": 128,
|
| 31 |
+
"model_type": "heartcodec",
|
| 32 |
+
"norm_type": "ada_norm_single",
|
| 33 |
+
"num_attention_heads": 24,
|
| 34 |
+
"num_bands": 1,
|
| 35 |
+
"num_layers": 24,
|
| 36 |
+
"num_layers_2": 6,
|
| 37 |
+
"num_quantizers": 8,
|
| 38 |
+
"num_samples": 2,
|
| 39 |
+
"out_channels": 256,
|
| 40 |
+
"res_kernel_size": 7,
|
| 41 |
+
"sample_rate": 48000,
|
| 42 |
+
"threshold_ema_dead_code": 2,
|
| 43 |
+
"torch_dtype": "float32",
|
| 44 |
+
"transformers_version": "4.51.3",
|
| 45 |
+
"upsample_factors": [
|
| 46 |
+
5,
|
| 47 |
+
4,
|
| 48 |
+
4,
|
| 49 |
+
4,
|
| 50 |
+
3
|
| 51 |
+
],
|
| 52 |
+
"upsample_kernel_sizes": [
|
| 53 |
+
10,
|
| 54 |
+
8,
|
| 55 |
+
8,
|
| 56 |
+
8,
|
| 57 |
+
6
|
| 58 |
+
],
|
| 59 |
+
"use_cosine_sim": false
|
| 60 |
+
}
|
model.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3ee7adb88916644d6e639d7ecb9c42cd328556ec8aacc58f0ee4551aee5057d5
|
| 3 |
+
size 6638384012
|