Yatharth  Sharma's picture

Yatharth Sharma

YaTharThShaRma999

AI & ML interests

None yet

Recent Activity

Organizations

None yet

YaTharThShaRma999's activity

reacted to its5Q's post with 🀯 about 15 hours ago
view post
Post
980
Am I missing something, or there is still no way to filter by model size while searching for models? It has been a requested feature since 2022, but I haven't seen any updates since! With the amount of different models coming out, I think the size filter would be a great extension of the search functionality, especially when looking for smaller models, which are a lot less prevalent.
reacted to hexgrad's post with πŸ”₯ about 22 hours ago
view post
Post
1142
Happy New Year! πŸŒƒ af_sky landed in Kokoro, along with an article: hexgrad/Kokoro-82M
  • 1 reply
Β·
reacted to csabakecskemeti's post with 🀯 3 days ago
replied to csabakecskemeti's post 3 days ago
view reply

Yep quality is amazing too, imo comparable to claude sonnet 3.5 and even better then gpt4o at certain tasks while being 50x cheaper then sonnet 3.5.

reacted to cfahlgren1's post with πŸš€ 3 days ago
reacted to hexgrad's post with πŸ”₯ 4 days ago
view post
Post
2400
πŸ‡¬πŸ‡§ Four British voices have joined hexgrad/Kokoro-82M (Apache TTS model): bf_emma, bf_isabella, bm_george, bm_lewis
liked a Space 5 days ago
reacted to hexgrad's post with πŸ€— 6 days ago
view post
Post
3036
Tonight, Adam & Michael join the 82M Apache TTS model in hexgrad/Kokoro-82M
reacted to hexgrad's post with πŸ”₯ 7 days ago
view post
Post
3859
Merry Christmas! πŸŽ„ Open sourced a small TTS model at hexgrad/Kokoro-82M
  • 2 replies
Β·
reacted to merve's post with πŸ‘€ 9 days ago
reacted to AdinaY's post with πŸ‘€πŸ”₯ 9 days ago
view post
Post
2843
QvQ-72B-PreviewπŸŽ„ an open weight model for visual reasoning just released by Alibaba_Qwen team
Qwen/qvq-676448c820912236342b9888
✨ Combines visual understanding & language reasoning.
✨ Scores 70.3 on MMMU
✨ Outperforms Qwen2-VL-72B-Instruct in complex problem-solving
reacted to jbilcke-hf's post with πŸš€ 14 days ago
view post
Post
2747
Doing some testing with HunyuanVideo on the Hugging Face Inference Endpoints πŸ€—

prompt: "a Shiba Inu is acting as a DJ, he wears sunglasses and is mixing and scratching with vinyl discs at a Ibiza sunny sand beach party"

1280x720, 22 steps, 121 frames

There are still some things to iron out regarding speed and memory usage, right now it takes 20min on an A100 (see attached charts)

but you can check it out here:

https://huggingface.co/jbilcke-hf/HunyuanVideo-for-InferenceEndpoints

There are various things I want to try like the 100% diffusers version and other models (LTX-Video..)