Junjie Chen

coderchen01

https://junjie-chen.info

AI & ML interests

Efficient AI, Multimodal AI, Generative AI

Recent Activity

upvoted a paper about 1 month ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

liked a dataset about 1 month ago

microsoft/SCBench

liked a Space about 1 month ago

HuggingFaceH4/blogpost-scaling-test-time-compute

Organizations

None yet

coderchen01's activity

upvoted a paper about 1 month ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 125

liked a dataset about 1 month ago

microsoft/SCBench

Viewer • Updated 28 days ago • 922 • 2.78k • 6

liked 2 Spaces about 1 month ago

📈 Scaling test-time compute

Space • Running • 478

📈 Number Tokenization Blog

Space • Running

upvoted a paper about 2 months ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 124

liked a model about 2 months ago

nvidia/Hymba-1.5B-Instruct

Text Generation • Updated 19 days ago • 4.59k • 220

liked a Space about 2 months ago

🌎 Open VLM Leaderboard

VLMEvalKit Evaluation Results Collection

Space • Running on CPU Upgrade • 572

liked a model about 2 months ago

HuggingFaceTB/SmolVLM-Instruct

Image-Text-to-Text • Updated Dec 2, 2024 • 45.2k • 330

upvoted a paper about 2 months ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 48

upvoted an article 2 months ago

Decoding GPT-4'o': In-Depth Exploration of Its Mechanisms and Creating Similar AI.

Article • May 21, 2024 • 35

upvoted a paper 2 months ago

SlimLM: An Efficient Small Language Model for On-Device Document Assistance

Paper • 2411.09944 • Published Nov 15, 2024 • 12

updated a dataset 2 months ago

coderchen01/HarmfulGeneration-HarmBench

Viewer • Updated Nov 20, 2024 • 9.61k • 41 • 2

liked a dataset 2 months ago

Babelscape/ALERT

Viewer • Updated Jun 20, 2024 • 45.7k • 129 • 10

upvoted a paper 2 months ago

Top-nσ: Not All Logits Are You Need

Paper • 2411.07641 • Published Nov 12, 2024 • 20

liked a Space 2 months ago

🔥 Hallucinations Leaderboard

Space • Running on CPU Upgrade • 128

upvoted a paper 3 months ago

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published Nov 4, 2024 • 47

upvoted an article 3 months ago

🕳️ Attention Sinks in LLMs for endless fluency

Article • Oct 9, 2023 • 7

upvoted a paper 3 months ago

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1, 2024 • 145

upvoted 2 articles 3 months ago

Scaling AI-based Data Processing with Hugging Face + Dask

Article • Oct 9, 2024 • 27

How 🤗 Accelerate runs very large models thanks to PyTorch

Article • Sep 27, 2022 • 10