JoshuaW1997 committed · Commit a5df656 · verified · 1 Parent(s): 0cce61f

Update README.md

Files changed (1)
  1. README.md +24 -2
README.md CHANGED
@@ -1,5 +1,17 @@
  ---
  license: apache-2.0
+ datasets:
+ - JoshuaW1997/FUTGA
+ language:
+ - en
+ metrics:
+ - bertscore
+ - bleu
+ - rouge
+ base_model:
+ - lmsys/vicuna-7b-v1.5
+ tags:
+ - music
  ---

  # FUTGA: Fine-grained Music Understanding through Temporally-enhanced Generative Augmentation
@@ -8,6 +20,7 @@ license: apache-2.0

  ## News

+ - [10/02] We released [**FUTGA-Dataset**](https://huggingface.co/datasets/JoshuaW1997/FUTGA) including MusicCaps, SongDescriber, HarmonixSet, and AudioSet (train and eval).
  - [07/28] We released [**FUTGA-7B**](https://huggingface.co/JoshuaW1997/FUTGA) and **training/inference code** based on [SALMONN-7B](https://huggingface.co/tsinghua-ee/SALMONN) backbone!

  ## Overview
@@ -29,5 +42,14 @@ We build **FUTGA-7B** based on SALMONN. Follow the instructions from [SALMONN](h


  ## Datasets
- We generate dense captions for full-length songs in [MusicCaps](https://huggingface.co/JoshuaW1997/FUTGA/tree/main/Data-MusicCaps) and [SongDescriber](https://huggingface.co/JoshuaW1997/FUTGA/tree/main/Data-SongDescriber),
- where ```raw captions``` are directly generated from FUTGA-7B and ```seg_captions_features``` contain automatically segmented captions with structures and textual-audio features for each segment.
+ Please visit our dataset repo [FUTGA-Dataset](https://huggingface.co/datasets/JoshuaW1997/FUTGA). We currently include MusicCaps, SongDescriber, HarmonixSet, and AudioSet (train and eval).
+
+ ## Citation
+ If you use our models or datasets in your research, please cite it as follows:
+ ```bibtex
+ @article{wu2024futga,
+ title={Futga: Towards Fine-grained Music Understanding through Temporally-enhanced Generative Augmentation},
+ author={Wu, Junda and Novack, Zachary and Namburi, Amit and Dai, Jiaheng and Dong, Hao-Wen and Xie, Zhouhang and Chen, Carol and McAuley, Julian},
+ journal={arXiv preprint arXiv:2407.20445},
+ year={2024}
+ }