314 350 577

Yatharth Sharma

YaTharThShaRma999

AI & ML interests

None yet

Recent Activity

reacted to AtAndDev's post with 🔥 about 15 hours ago

Gemma 3 seems to be really good at human preference. Just waiting for ppl to see it.

updated a model 1 day ago

YaTharThShaRma999/voices

updated a model 7 days ago

YaTharThShaRma999/SparkTTS-LLM

View all activity

Organizations

None yet

YaTharThShaRma999's activity

upvoted a paper 8 days ago

A Multimodal Symphony: Integrating Taste and Sound through Generative AI

Paper • 2503.02823 • Published 9 days ago • 2

upvoted a paper 9 days ago

DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion

Paper • 2503.01183 • Published 10 days ago • 26

upvoted a paper 10 days ago

LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation

Paper • 2502.20583 • Published 14 days ago • 11

upvoted a paper 21 days ago

SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation

Paper • 2502.13128 • Published 23 days ago • 37

upvoted a paper 24 days ago

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model

Paper • 2502.10248 • Published 27 days ago • 51

upvoted a paper 29 days ago

Magic 1-For-1: Generating One Minute Video Clips within One Minute

Paper • 2502.07701 • Published 30 days ago • 34

upvoted 2 papers about 1 month ago

People who frequently use ChatGPT for writing tasks are accurate and robust detectors of AI-generated text

Paper • 2501.15654 • Published Jan 26 • 13

RL + Transformer = A General-Purpose Problem Solver

Paper • 2501.14176 • Published Jan 24 • 25

upvoted 2 papers about 2 months ago

RepVideo: Rethinking Cross-Layer Representation for Video Generation

Paper • 2501.08994 • Published Jan 15 • 15

MinMo: A Multimodal Large Language Model for Seamless Voice Interaction

Paper • 2501.06282 • Published Jan 10 • 48

upvoted 8 papers 2 months ago

HUNYUANPROVER: A Scalable Data Synthesis Framework and Guided Tree Search for Automated Theorem Proving

Paper • 2412.20735 • Published Dec 30, 2024 • 11

Xmodel-2 Technical Report

Paper • 2412.19638 • Published Dec 27, 2024 • 26

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published Dec 30, 2024 • 40

Slow Perception: Let's Perceive Geometric Figures Step-by-step

Paper • 2412.20631 • Published Dec 30, 2024 • 15

TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization

Paper • 2412.21037 • Published Dec 30, 2024 • 24

upvoted 2 papers 3 months ago

IDOL: Instant Photorealistic 3D Human Creation from a Single Image

Paper • 2412.14963 • Published Dec 19, 2024 • 6

Flowing from Words to Pixels: A Framework for Cross-Modality Evolution

Paper • 2412.15213 • Published Dec 19, 2024 • 26