Martin Viewegger

Viewegger

AI & ML interests

None yet

Organizations

None yet

Viewegger's activity

liked 2 models 12 days ago

ostris/Flex.1-alpha

Text-to-Image • Updated 12 days ago • 15.4k • 321

drewThomasson/fineTunedTTSModels

Updated 10 days ago • 4

reacted to alibabasglab's post with 👍 14 days ago

Post

1178

Introducing the open-source ClearerVoice-Studio, a powerful speech processing AI tool to dramatically improve your speech quality. Check out the demo pages: https://huggingface.co/spaces/alibabasglab/ClearVoice and https://modelscope.cn/studios/iic/ClearerVoice-Studio. Give us a star on GitHub at https://github.com/modelscope/ClearerVoice-Studio

reacted to alibabasglab's post with 👍 18 days ago

Post

5268

🎉 ClearerVoice-Studio New Feature: Speech Super-Resolution with MossFormer2! 🚀
We’re excited to announce that ClearerVoice-Studio now supports speech super-resolution, powered by our latest MossFormer2-based model!
What’s New?

🔊 Convert Low-Resolution to High-Resolution Audio:
Transform low-resolution audio (effective sampling rate ≥ 16 kHz) into crystal-clear, high-resolution audio at 48 kHz.

🤖 Cutting-Edge Technology:
Leverages the MossFormer2 model plus HiFi-GAN, optimised for generating high-quality audio with enhanced perceptual clarity.

🎧 Enhanced Listening Experience:
Perfect for speech enhancement, content restoration, and high-fidelity audio applications.

🌟 Try It Out!
Upgrade to the latest version of ClearerVoice-Studio (https://github.com/modelscope/ClearerVoice-Studio) to experience this powerful feature. Check out the updated documentation and examples in our repository.

Let us know your thoughts, feedback, or feature requests in the Issues section.

liked a model 26 days ago

hexgrad/Kokoro-82M

Text-to-Speech • Updated 1 day ago • 55.4k • 2.59k

liked 2 models about 1 month ago

strangerzonehf/Flux-C7-Sketch-LoRA

Text-to-Image • Updated Dec 20, 2024 • 66 • 12

strangerzonehf/Flux-Sketch-Smudge-LoRA

Text-to-Image • Updated Dec 20, 2024 • 9.31k • 35

New activity in tensorart/stable-diffusion-3.5-medium-turbo about 1 month ago

Benefit using Lora + finetuned model

#1 opened about 1 month ago by Viewegger

liked 2 models 2 months ago

unsloth/Qwen2.5-Coder-32B-Instruct-128K-GGUF

Updated Nov 15, 2024 • 12.8k • 58

NexaAIDev/OmniVLM-968M

Updated Dec 17, 2024 • 1.55k • 499

New activity in jpgallegoar/F5-Spanish 2 months ago

Model quality on circa 200 hours of audio

#8 opened 2 months ago by Viewegger

New activity in PetrosStav/F5-TTS-Greek 2 months ago

Dataset size and output quality

#2 opened 2 months ago by Viewegger

New activity in marduk-ra/F5-TTS-German 2 months ago

Training process details

#2 opened 2 months ago by Nils11

liked a Space 3 months ago

Qwen2.5 Coder Demo

Running • 411

reacted to m-ric's post with 🔥 3 months ago

Post

790

Are scaling laws over? A report from The Information announced that OpenAI is seeing diminishing returns from scaling up the next GPT models.

📊 What are scaling laws? These are empirical laws that say "Every time you increase the compute spent in training 10-fold, your LLM's performance will go up by a predictable tick". Of course, they apply only if you train your model with the right methods.

The image below illustrates this: it comes from a paper by Google, "Scaling Autoregressive Models for Content-Rich Text-to-Image Generation", and shows how the quality and instruction following of models improve when you scale the model up (which is equivalent to scaling up the compute spent in training).

➡️ These scaling laws have immense impact: they triggered the largest gold rush ever, with companies pouring billions into scaling up their training. Microsoft and OpenAI reportedly plan to spend $100B on their "Stargate" mega training cluster, due to start running in 2028.

🤔 So, what about these reports of scaling laws slowing down?

If they are true, they would mean a gigantic paradigm shift, as the hundreds of billions poured by AI companies into scaling could be a dead-end. ⛔️

But I doubt it: until the most recent publications, scaling laws showed no signs of weakness, and the researchers at the higher end of the scale-up seem to imply that the scaling up continues.

Wait and see!
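The "predictable tick" described in the post above is what a power-law curve looks like: if loss follows A * C^(-B), every 10-fold increase in compute C multiplies the loss by the same constant factor 10^(-B). The sketch below only illustrates that arithmetic. The functional form is a common way to write scaling laws, and the constants A and B are hypothetical values chosen for illustration, not fitted to any real model or taken from the post.

```python
# Hypothetical power-law scaling curve: loss(C) = A * C ** (-B).
# A and B are made-up illustrative constants, not fitted values.
A = 10.0
B = 0.05


def loss(compute: float) -> float:
    """Predicted loss for a training compute budget (arbitrary units)."""
    return A * compute ** (-B)


# Compute budgets spaced one decade apart: 1e20, 1e21, ..., 1e25.
budgets = [10.0 ** k for k in range(20, 26)]
losses = [loss(c) for c in budgets]

# Ratio of the loss after each 10x step to the loss before it.
ratios = [later / earlier for earlier, later in zip(losses, losses[1:])]

for c, value in zip(budgets, losses):
    print(f"compute={c:.0e}  loss={value:.4f}")
print("loss ratio per 10x compute:", [round(r, 4) for r in ratios])
```

Every entry in `ratios` equals 10 ** (-B), which is why the curve is a straight line on a log-log plot. Reports of "diminishing returns" amount to claiming that measured points start to fall off that line.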

liked a model 3 months ago

prithivMLmods/SD3.5-Large-Photorealistic-LoRA

Text-to-Image • Updated Nov 16, 2024 • 15.3k • 56

upvoted a paper 3 months ago

NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks

Paper • 2410.20650 • Published Oct 28, 2024 • 16

reacted to yongchanghao's post with 🔥 3 months ago

Post

3760

We just released a paper (NeuZip) that compresses VRAM in a lossless manner to run larger models. This should be particularly useful when VRAM is insufficient during training/inference. Specifically, we look inside each floating-point number and find that the exponents are highly compressible (as shown in the figure below).

Read more about the work at NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks (2410.20650)
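The observation that exponents are highly compressible can be checked on toy data. The sketch below is not the NeuZip algorithm. It assumes synthetic weights drawn from a Gaussian with standard deviation 0.02 (a hypothetical stand-in for real model weights) stored as IEEE 754 float32, and compares the empirical entropy of the 8 exponent bits with that of the lowest 8 mantissa bits.

```python
import math
import random
import struct
from collections import Counter


def entropy_bits(symbols):
    """Empirical Shannon entropy of a sequence of symbols, in bits."""
    counts = Counter(symbols)
    total = len(symbols)
    return -sum(n / total * math.log2(n / total) for n in counts.values())


random.seed(0)
# Assumption: synthetic Gaussian "weights", not taken from a real model.
weights = [random.gauss(0.0, 0.02) for _ in range(20000)]

exponents = []
low_mantissa = []
for w in weights:
    # Reinterpret the float32 encoding of w as a 32-bit unsigned integer.
    bits = struct.unpack("<I", struct.pack("<f", w))[0]
    exponents.append((bits >> 23) & 0xFF)  # 8 exponent bits
    low_mantissa.append(bits & 0xFF)       # lowest 8 of the 23 mantissa bits

exp_entropy = entropy_bits(exponents)
man_entropy = entropy_bits(low_mantissa)
print(f"exponent entropy:     {exp_entropy:.2f} of 8 bits")
print(f"low-mantissa entropy: {man_entropy:.2f} of 8 bits")
```

On this toy data the exponent field carries far fewer than 8 bits of information, because the values cluster in a narrow range of magnitudes, while the low mantissa bits look close to uniformly random. A lossless entropy coder can therefore shrink the exponents but gains little on the mantissa.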

liked a dataset 3 months ago

ai4b-hf/GLOBE-annotated

Viewer • Updated Oct 31, 2024 • 582k • 584 • 4

New activity in gokaygokay/Flux-Seamless-Texture-LoRA 3 months ago

Size of the dataset?

#1 opened 3 months ago by Viewegger