JanBabela commited on
Commit
88bdd0f
1 Parent(s): f74dc5e

Update index.html

Browse files
Files changed (1) hide show
  1. index.html +17 -3
index.html CHANGED
@@ -9,10 +9,24 @@
9
  <body>
10
  <div class="card">
11
  <h1>Riffusion-Melodiff-v1</h1>
12
- <p>You can modify this app directly by editing <i>index.html</i> in the Files and versions tab.</p>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
13
  <p>
14
- Also don't forget to check the
15
- <a href="https://huggingface.co/docs/hub/spaces" target="_blank">Spaces documentation</a>.
16
  </p>
17
  </div>
18
  </body>
 
9
  <body>
10
  <div class="card">
11
  <h1>Riffusion-Melodiff-v1</h1>
12
+ <p>Riffusion-Melodiff is simple, but interesting idea, (that I have not seen anywhere else) how to modify your music.</p>
13
+ <p>Riffusion-Melodiff is built on a top of Riffusion model, which is fine-tuned Stable Diffusion model to generate Mel Spectrogram. (Spectrogram is kind of
14
+ visual representation of music by dividing waveforms into frequencies.) Riffusion-Melodiff does not contain new model, there was no new training, nor fine-tunig.
15
+ It uses the same model as Riffusion only in a different way.</p>
16
+ <p>Riffusion-Melodiff uses Img2Img pipeline from Diffusers library to modify images of Mel Spectrograms to produce cover versions of music. Just upload your audio
17
+ in wav format (if you have audio in a different format, transfer it first to wav by online converter). Then you may use Img2img pipeline from the Diffusers library
18
+ with your prompr, seed and strength. Stregth parameter decides, how much will modified audio relate to initial audio and how much it will relate to the prompt.
19
+ When strength is too low the spectrogram is too similar with original one and we do not receive new modification. When strength is too high, then spectrogram is too
20
+ close to the new promopt, which may cause loss of melody and/or tempo from the base image. Good values of strength are usually about 0,4-0,5.</p>
21
+ <p>Good modifications are possible for proper prompt, seed and strength values. Those modifications will keep the tempo and melody from the initial audio, but
22
+ they will change eg. instrument, playing that melody. Also with this pipeline longer than 5s music modifications are possible. If you cut your audio into 5s pieces
23
+ and use the same prompt, seed and strength for each modification, generated smaples will be somewhat consistent. So if you concatenate them together, you will have
24
+ longer audio modified.</p>
25
+ <p>Quality of the generated music is not amazing, (mediocre, I would say) and it needs a bit of prompt and seed engineering. But it shows one way, how to do it
26
+ in the future.</p>
27
  <p>
28
+ Colab notebook is included, where you can find step by step, how to do it.
29
+ <a href="https://huggingface.co/spaces/JanBabela/Riffusion-Melodiff-v1/blob/main/melodiff_v1.ipynb" target="_blank">Melodiff_v1</a>.
30
  </p>
31
  </div>
32
  </body>