

Ashiq Rahman

TangoDJ


AI & ML interests

None yet

Recent Activity

updated a collection 1 day ago

Paper - Application

updated a collection 11 days ago


TangoDJ's activity

upvoted a paper 11 days ago

Transformers without Normalization

Paper • 2503.10622 • Published 23 days ago • 152

upvoted a paper about 2 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 373

upvoted 6 papers 4 months ago

The Unbearable Slowness of Being: Why do we live at 10 bits/s?

Paper • 2408.10234 • Published Aug 3, 2024 • 1

An Evolved Universal Transformer Memory

Paper • 2410.13166 • Published Oct 17, 2024 • 3

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 81

Flow Matching Guide and Code

Paper • 2412.06264 • Published Dec 9, 2024 • 1

Discovering Preference Optimization Algorithms with and for Large Language Models

Paper • 2406.08414 • Published Jun 12, 2024 • 17

Reinforcement Learning: An Overview

Paper • 2412.05265 • Published Dec 6, 2024 • 6

upvoted 6 papers 5 months ago

Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations

Paper • 2411.00640 • Published Nov 1, 2024 • 3

The Prompt Report: A Systematic Survey of Prompting Techniques

Paper • 2406.06608 • Published Jun 6, 2024 • 62

Training language models to follow instructions with human feedback

Paper • 2203.02155 • Published Mar 4, 2022 • 17

Scalable MatMul-free Language Modeling

Paper • 2406.02528 • Published Jun 4, 2024 • 11

RULER: What's the Real Context Size of Your Long-Context Language Models?

Paper • 2404.06654 • Published Apr 9, 2024 • 36

Scaling Laws for Precision

Paper • 2411.04330 • Published Nov 7, 2024 • 8

upvoted a collection 5 months ago

Cosmos Tokenizer

A suite of image and video tokenizers • 13 items • Updated 2 days ago • 40

upvoted a collection 7 months ago

LLM Reasoning Papers

Papers to improve reasoning capabilities of LLMs • 20 items • Updated Jan 15 • 121

upvoted a paper 12 months ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 258

upvoted 3 papers about 1 year ago

Jamba: A Hybrid Transformer-Mamba Language Model

Paper • 2403.19887 • Published Mar 28, 2024 • 110

Watermarking Makes Language Models Radioactive

Paper • 2402.14904 • Published Feb 22, 2024 • 24

Humanoid Locomotion as Next Token Prediction

Paper • 2402.19469 • Published Feb 29, 2024 • 28