---

tags:
- gpt2
- text-generation
- music-modeling
- music-generation

widget:
 - text: "PIECE_START"
 - text: "PIECE_START STYLE=JSFAKES GENRE=JSFAKES TRACK_START INST=48 BAR_START NOTE_ON=60"
 - text: "PIECE_START STYLE=JSFAKES GENRE=JSFAKES TRACK_START INST=48 BAR_START NOTE_ON=58"

---


# GPT-2 for Music

Language models such as GPT-2 can be used for music generation. The idea is to represent pieces of music as text, effectively reducing the task to language generation.

This model is a rather small instance of GPT-2 trained on [TristanBehrens/js-fakes-4bars](https://huggingface.co/datasets/TristanBehrens/js-fakes-4bars). It generates Bach-like chorales with four voices (soprano, alto, tenor, bass), four bars at a time.
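
To give an idea of the representation, a piece is a flat stream of tokens like the widget prompts above. The snippet below is an illustrative sketch; the tokens beyond those shown in the widget prompts (such as TIME_DELTA, NOTE_OFF, BAR_END, and TRACK_END) are assumed from the dataset's encoding:

```
PIECE_START STYLE=JSFAKES GENRE=JSFAKES TRACK_START INST=48 BAR_START
NOTE_ON=60 TIME_DELTA=4 NOTE_OFF=60 NOTE_ON=62 TIME_DELTA=4 NOTE_OFF=62
... BAR_END TRACK_END
```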

If you want to contribute, if you want to say hello, or if you want to know more, find me on [LinkedIn](https://www.linkedin.com/in/dr-tristan-behrens-734967a2/).

## Model description

The model is a GPT-2 with 6 decoder blocks and 8 attention heads each. The context length is 512, and the embedding dimension is 512 as well. The vocabulary size is 119.
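
For illustration, this architecture roughly corresponds to the following configuration; a minimal sketch assuming the standard Hugging Face `GPT2Config` fields, not the exact training setup:

```python
# Sketch of the described architecture using Hugging Face Transformers.
from transformers import GPT2Config, GPT2LMHeadModel

config = GPT2Config(
    vocab_size=119,   # size of the token vocabulary
    n_positions=512,  # context length
    n_embd=512,       # embedding dimension
    n_layer=6,        # decoder blocks
    n_head=8,         # attention heads per block
)
model = GPT2LMHeadModel(config)
print(model.num_parameters())  # rough parameter count of this instance
```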

## Intended uses & limitations

This model is just a proof of concept. It shows that the Hugging Face ecosystem can be used to compose music.

### How to use

There is a notebook in the repo that you can run on Google Colab.
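
Alternatively, something along the following lines should work; a minimal sketch, assuming the tokenizer files are included in this repository and with `MODEL_ID` as a placeholder for this model's path on the Hub:

```python
# Minimal generation sketch. MODEL_ID is a placeholder, not the
# confirmed repository path of this model.
from transformers import AutoTokenizer, AutoModelForCausalLM

MODEL_ID = "user/model"  # replace with this repository's Hub path

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

# Prompt with the start-of-piece token and let the model continue.
inputs = tokenizer("PIECE_START", return_tensors="pt")
output = model.generate(
    **inputs,
    max_length=512,  # up to the model's context length
    do_sample=True,  # sampling gives more varied pieces
)
print(tokenizer.decode(output[0]))
```

The decoded output is a token stream in the format shown above, which can then be converted back to notes and rendered as MIDI.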

### Limitations and bias

Since this model has been trained on a very small corpus of music, it overfits heavily.