Jay P's picture

Jay P

jayomb

·

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

BlinkDL/rwkv7-g1

liked a dataset 4 days ago

ylacombe/expresso

liked a model 5 days ago

LGAI-EXAONE/EXAONE-Deep-2.4B

View all activity

Organizations

jayomb's activity

upvoted a collection 8 days ago

🪿 RWKV7

RWKV7 models • 12 items • Updated 3 days ago • 7

upvoted a paper 8 days ago

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published 9 days ago • 131

upvoted 2 articles 13 days ago

Article

LeRobot goes to driving school: World’s largest open-source self-driving dataset

16 days ago

• 68

Article

FastRTC: The Real-Time Communication Library for Python

about 1 month ago

• 147

upvoted a collection 20 days ago

cool datasets

158 items • Updated 8 days ago • 15

upvoted a collection about 1 month ago

Synthetic Data and Self-Improvement

76 items • Updated 4 days ago • 7

upvoted 5 papers about 1 month ago

Non-instructional Fine-tuning: Enabling Instruction-Following Capabilities in Pre-trained Language Models without Instruction-Following Data

Paper • 2409.00096 • Published Aug 27, 2024 • 1

RNR: Teaching Large Language Models to Follow Roles and Rules

Paper • 2409.13733 • Published Sep 10, 2024 • 1

Response Tuning: Aligning Large Language Models without Instruction

Paper • 2410.02465 • Published Oct 3, 2024 • 13

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 214

Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning

Paper • 2502.06060 • Published Feb 9 • 34

upvoted a paper 3 months ago

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2, 2024 • 51

upvoted a collection 10 months ago

abliterated-v3

Latest gen of the abliterated models I've produced • 17 items • Updated Jun 3, 2024 • 116

upvoted a paper 10 months ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 88

upvoted 2 papers 11 months ago

Piccolo2: General Text Embedding with Multi-task Hybrid Loss Training

Paper • 2405.06932 • Published May 11, 2024 • 20

Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck

Paper • 2404.07647 • Published Apr 11, 2024 • 4

upvoted a paper 12 months ago

Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

Paper • 2404.05892 • Published Apr 8, 2024 • 38

upvoted a collection about 1 year ago

OpenCulture

A multilingual dataset of public domain books and newspapers. • 27 items • Updated Nov 6, 2024 • 124