teticio commited on
Commit
d1fdbae
1 Parent(s): 80c2c54

remove mel import

Browse files
Files changed (2) hide show
  1. README.md +0 -1
  2. notebooks/audio_encoder.ipynb +0 -1
README.md CHANGED
@@ -209,7 +209,6 @@ accelerate launch ...
209
  We can generate audio conditional on a text prompt - or indeed anything which can be encoded into a bunch of numbers - much like DALL-E2, Midjourney and Stable Diffusion. It is generally harder to find good quality datasets of audios together with descriptions, although the people behind the dataset used to train Stable Diffusion are making some very interesting progress [here](https://github.com/LAION-AI/audio-dataset). I have chosen to encode the audio directly instead based on "how it sounds", using a [model which I trained on hundreds of thousands of Spotify playlists](https://github.com/teticio/Deej-AI). To encode an audio into a 100 dimensional vector
210
 
211
  ```python
212
- from diffusers import Mel
213
  from audiodiffusion.audio_encoder import AudioEncoder
214
 
215
  audio_encoder = AudioEncoder.from_pretrained("teticio/audio-encoder")
 
209
  We can generate audio conditional on a text prompt - or indeed anything which can be encoded into a bunch of numbers - much like DALL-E2, Midjourney and Stable Diffusion. It is generally harder to find good quality datasets of audios together with descriptions, although the people behind the dataset used to train Stable Diffusion are making some very interesting progress [here](https://github.com/LAION-AI/audio-dataset). I have chosen to encode the audio directly instead based on "how it sounds", using a [model which I trained on hundreds of thousands of Spotify playlists](https://github.com/teticio/Deej-AI). To encode an audio into a 100 dimensional vector
210
 
211
  ```python
 
212
  from audiodiffusion.audio_encoder import AudioEncoder
213
 
214
  audio_encoder = AudioEncoder.from_pretrained("teticio/audio-encoder")
notebooks/audio_encoder.ipynb CHANGED
@@ -7,7 +7,6 @@
7
  "metadata": {},
8
  "outputs": [],
9
  "source": [
10
- "from diffusers import Mel\n",
11
  "from audiodiffusion.audio_encoder import AudioEncoder"
12
  ]
13
  },
 
7
  "metadata": {},
8
  "outputs": [],
9
  "source": [
 
10
  "from audiodiffusion.audio_encoder import AudioEncoder"
11
  ]
12
  },