Jeremy Hummel committed d87960f
1 Parent(s): f5a1e6e

Updates markdown
app.py
CHANGED
@@ -35,33 +35,36 @@ network_choices = [
 description = \
 """
 Generate visualizations on an input audio file using [StyleGAN3](https://nvlabs.github.io/stylegan3/) (Karras, Tero, et al. "Alias-free generative adversarial networks." Advances in Neural Information Processing Systems 34 (2021): 852-863.).
+
 Inspired by [Deep Music Visualizer](https://github.com/msieg/deep-music-visualizer), which used BigGAN (Brock et al., 2018)
+
 Developed by Jeremy Hummel at [Lambda](https://lambdalabs.com/)
 """

 article = \
 """
 ## How does this work?
-The audio is transformed to a spectral representation by using Short-time Fourier transform (STFT)
+The audio is transformed to a spectral representation by using Short-time Fourier transform (STFT) with [librosa](https://librosa.org/doc/latest/index.html).
+
 Starting with an initial noise vector, we perform a random walk, adjusting the length of each step with the power gradient.
 This pushes the noise vector to move around more when the sound changes.

 ## Parameter info:
-
+**Network**: various pre-trained models from NVIDIA, "afhqv2" is animals, "ffhq" is faces, "metfaces" is artwork.

-
+**Truncation**: controls how far the noise vector can be from the origin. `0.7` will generate more realistic, but less diverse samples,
 while `1.2` will can yield more interesting but less realistic images.

-
+**Tempo Sensitivity**: controls the how the size of each step scales with the audio features

-
+**Jitter**: prevents the same exact noise vectors from cycling repetitively, if set to `0`, the images will repeat during
 repetitive parts of the audio

-
+**Frame Length**: controls the number of audio frames per video frame in the output.
 If you want a higher frame rate for visualizing very rapid music, lower the frame length.
 If you want a lower frame rate (which will complete the job faster), raise the frame length

-
+**Max Duration**: controls the max length of the visualization, in seconds. Use a shorter value here to get output
 more quickly, especially for testing different combinations of parameters.
 """
 # Media sources:
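For readers who want to see how the "How does this work?" text above could translate into code, here is a minimal sketch, not the actual implementation in this Space's app.py (which the diff does not show). The librosa calls (`librosa.load`, `librosa.stft`) are real APIs; the function name `make_latent_walk`, the way `truncation`, `jitter`, and `frame_length` are wired in, and all default values are illustrative assumptions.

```python
# Hypothetical sketch (assumed names and defaults), not the Space's app.py.
import numpy as np
import librosa


def make_latent_walk(audio_path, z_dim=512, tempo_sensitivity=0.25,
                     truncation=0.7, jitter=0.5, frame_length=512, seed=0):
    """Map an audio file to a sequence of latent vectors via a random walk
    whose step size follows the gradient of the STFT power."""
    y, sr = librosa.load(audio_path, sr=None)

    # Spectral representation: magnitude STFT, one column per audio frame.
    spec = np.abs(librosa.stft(y, n_fft=2048, hop_length=frame_length))

    # Per-frame power and its gradient; a large |gradient| means the sound is changing.
    power = spec.sum(axis=0)
    power = power / (power.max() + 1e-8)
    grad = np.abs(np.gradient(power))

    rng = np.random.default_rng(seed)
    z = rng.standard_normal(z_dim)
    latents = []
    for g in grad:
        # Step length scales with the power gradient (via tempo sensitivity);
        # jitter perturbs the direction so repetitive audio does not revisit
        # exactly the same latent vectors.
        step = tempo_sensitivity * g * rng.standard_normal(z_dim)
        step += jitter * 0.01 * rng.standard_normal(z_dim)
        z = z + step
        # Crude stand-in for StyleGAN's truncation trick (normally applied in
        # w space with truncation_psi): keep z from drifting far from the origin.
        z = np.clip(z, -2.0 * truncation, 2.0 * truncation)
        latents.append(z.copy())

    return np.stack(latents)  # shape: (num_video_frames, z_dim)
```

Each latent in the returned sequence would then be mapped through the StyleGAN3 generator to render one video frame, so quiet, steady passages produce smaller visual changes than sharp transients.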
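The `description` and `article` strings edited in this commit are presumably passed straight to Gradio, which renders both as Markdown (hence the commit message "Updates markdown"). The wiring below is a guess at that glue code: `gr.Interface` and its `description`/`article` keyword arguments are real Gradio API, but the `visualize` signature, the widget choices, and the slider ranges are invented for illustration, and the snippet assumes `network_choices`, `description`, and `article` are defined earlier in app.py as shown in the diff.

```python
# Assumed glue code, not copied from this Space's app.py.
import gradio as gr


def visualize(audio, network, truncation, tempo_sensitivity,
              jitter, frame_length, max_duration):
    ...  # placeholder: load the chosen StyleGAN3 network and render the video


demo = gr.Interface(
    fn=visualize,
    inputs=[
        gr.Audio(type="filepath", label="Audio"),
        gr.Dropdown(choices=network_choices, label="Network"),
        gr.Slider(0.5, 1.5, value=0.7, label="Truncation"),
        gr.Slider(0.1, 1.0, value=0.25, label="Tempo Sensitivity"),
        gr.Slider(0.0, 1.0, value=0.5, label="Jitter"),
        gr.Slider(256, 2048, value=512, step=64, label="Frame Length"),
        gr.Slider(5, 60, value=30, label="Max Duration (s)"),
    ],
    outputs=gr.Video(label="Visualization"),
    description=description,  # Markdown string reworked in this commit
    article=article,          # Markdown string reworked in this commit
)

if __name__ == "__main__":
    demo.launch()
```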