SVRL

community

SVRL's activity

SivilTaram

updated a model 2 days ago

SVRL/verl-scalable-0402_batch768_clipratio0.3_Qwen2.5-14B_scale-reasoning-data-v2-nos-fixmc-fil0-fil8

Updated 2 days ago

MrLight

authored a paper 8 days ago

ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations

Paper • 2504.00824 • Published 10 days ago • 38

wenhu

authored a paper 8 days ago

ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations

Paper • 2504.00824 • Published 10 days ago • 38

wenhu

authored a paper 9 days ago

Towards Trustworthy GUI Agents: A Survey

Paper • 2503.23434 • Published 12 days ago • 20

wenhu

authored a paper 10 days ago

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published 12 days ago • 115

SivilTaram

authored 5 papers 15 days ago

Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

Paper • 2411.07763 • Published Nov 12, 2024

When Attention Sink Emerges in Language Models: An Empirical View

Paper • 2410.10781 • Published Oct 14, 2024

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Paper • 2502.12982 • Published Feb 18 • 16

Predictive Data Selection: The Data That Predicts Is the Data That Teaches

Paper • 2503.00808 • Published Mar 2 • 57

Scaling up Masked Diffusion Models on Text

Paper • 2410.18514 • Published Oct 24, 2024

SivilTaram

authored a paper 17 days ago

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Paper • 2503.18892 • Published 18 days ago • 29

SivilTaram

authored a paper 21 days ago

SkyLadder: Better and Faster Pretraining via Context Window Scheduling

Paper • 2503.15450 • Published 22 days ago • 11

wenhu

authored a paper 25 days ago

Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers

Paper • 2503.11579 • Published 28 days ago • 18

wenhu

authored a paper 30 days ago

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published about 1 month ago • 62

wenhu

authored a paper about 1 month ago

ABC: Achieving Better Control of Multimodal Embeddings using VLMs

Paper • 2503.00329 • Published Mar 1 • 18

MrLight

authored 5 papers about 1 month ago

Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models

Paper • 2310.07712 • Published Oct 11, 2023

PixelWorld: Towards Perceiving Everything as Pixels

Paper • 2501.19339 • Published Jan 31 • 17

DRAMA: Diverse Augmentation from Large Language Models to Smaller Dense Retrievers

Paper • 2502.18460 • Published Feb 25 • 2

Document Screenshot Retrievers are Vulnerable to Pixel Poisoning Attacks

Paper • 2501.16902 • Published Jan 28

VISA: Retrieval Augmented Generation with Visual Source Attribution

Paper • 2412.14457 • Published Dec 19, 2024