bigjoedata commited on
Commit
43ed3b5
β€’
1 Parent(s): e506e05

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +54 -0
README.md ADDED
@@ -0,0 +1,54 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # 🎸 πŸ₯ Rockbot 🎀 🎧
2
+ A [GPT-2](https://huggingface.co/blog/how-to-generate) based lyrics generator fine-tuned on the writing styles of 16000 songs by 270 artists across MANY genres (not just rock).
3
+
4
+ **Instructions:** Type in a fake song title, pick an artist, click "Generate".
5
+
6
+ Note: Due to the nature of language models, lyrics bleed across artists and you may see NSFW lyrics unexpectedly (e.g., from The Beatles), especially if you change the configuration to allow more entropy. I have made no attempt to censor lyrics whatsoever.
7
+
8
+ Finally, these lyrics are computer generated. Not all of these will be non-repetitive and/or coherent. Just have fun.
9
+
10
+ [Repository](https://github.com/bigjoedata/rockbot)
11
+
12
+ [Model page on Hugging Face](https://huggingface.co/bigjoedata/rockbot)
13
+
14
+ 🎹 πŸͺ˜ 🎷 🎺 πŸͺ— πŸͺ• 🎻
15
+ ## Background
16
+ With the shutdown of [Google Play Music](https://en.wikipedia.org/wiki/Google_Play_Music) I used Google's takeout function to gather the metadata from artists I've listened to over the past several years. I wanted to take advantage of this bounty to build something fun. I scraped the top 50 lyrics for artists I'd listened to at least once from [Genius](https://genius.com/), then fine tuned [GPT-2's](https://openai.com/blog/better-language-models/) 124M token model using the [AITextGen](https://github.com/minimaxir/aitextgen) framework after considerable post-processing.
17
+
18
+ ### Full Tech Stack
19
+ [Google Play Music](https://en.wikipedia.org/wiki/Google_Play_Music) (R.I.P.).
20
+ [Python](https://www.python.org/).
21
+ [Streamlit](https://www.streamlit.io/).
22
+ [GPT-2](https://openai.com/blog/better-language-models/).
23
+ [AITextGen](https://github.com/minimaxir/aitextgen).
24
+ [Pandas](https://pandas.pydata.org/).
25
+ [LyricsGenius](https://lyricsgenius.readthedocs.io/en/master/).
26
+ [Google Colab](https://colab.research.google.com/) (GPU based Training).
27
+ [Knime](https://www.knime.com/) (data cleaning).
28
+
29
+
30
+ ## How to Use The Model
31
+ Please refer to [AITextGen](https://github.com/minimaxir/aitextgen) for much better documentation.
32
+
33
+ ### Training Parameters Used
34
+
35
+ ai.train("lyrics.txt",
36
+ line_by_line=False,
37
+ from_cache=False,
38
+ num_steps=10000,
39
+ generate_every=2000,
40
+ save_every=2000,
41
+ save_gdrive=False,
42
+ learning_rate=1e-3,
43
+ batch_size=3,
44
+ eos_token="<|endoftext|>",
45
+ #fp16=True
46
+ )
47
+ ### To Use
48
+
49
+
50
+ Generate With Prompt (Use Title Case):
51
+ Song Name
52
+ BY
53
+ Artist Name
54
+