3 2 1

Weilin Zhao

Achazwl

https://weilin-zhao.com

Achazwl

AI & ML interests

None yet

Recent Activity

authored a paper 9 days ago

Tool Learning with Foundation Models

authored a paper 9 days ago

Unlock Predictable Scaling from Emergent Abilities

authored a paper 9 days ago

OpenDelta: A Plug-and-play Library for Parameter-efficient Adaptation of Pre-trained Models

View all activity

Organizations

Achazwl's activity

authored 7 papers 9 days ago

Tool Learning with Foundation Models

Paper • 2304.08354 • Published Apr 17, 2023 • 3

Unlock Predictable Scaling from Emergent Abilities

Paper • 2310.03262 • Published Oct 5, 2023 • 3

OpenDelta: A Plug-and-play Library for Parameter-efficient Adaptation of Pre-trained Models

Paper • 2307.03084 • Published Jul 5, 2023 • 1

OpenPrompt: An Open-source Framework for Prompt-learning

Paper • 2111.01998 • Published Nov 3, 2021 • 1

Ouroboros: Speculative Decoding with Large Model Enhanced Drafting

Paper • 2402.13720 • Published Feb 21, 2024 • 7

FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling

Paper • 2502.14856 • Published 21 days ago • 7

APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs

Paper • 2502.12085 • Published 24 days ago • 2

upvoted a paper 9 days ago

APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs

Paper • 2502.12085 • Published 24 days ago • 2

commented a paper 9 days ago

FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling

Paper • 2502.14856 • Published 21 days ago • 7 •

upvoted a paper 9 days ago

FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling

Paper • 2502.14856 • Published 21 days ago • 7

authored a paper 3 months ago

Densing Law of LLMs

Paper • 2412.04315 • Published Dec 5, 2024 • 19

authored a paper 6 months ago

Configurable Foundation Models: Building LLMs from a Modular Perspective

Paper • 2409.02877 • Published Sep 4, 2024 • 29

authored a paper 7 months ago

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3, 2024 • 82

authored a paper 9 months ago

Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models

Paper • 2406.15718 • Published Jun 22, 2024 • 14

New activity in openbmb/MiniCPM-V-2 11 months ago

GGUF file

#6 opened 11 months ago by

BB8-dev

authored a paper 11 months ago

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

Paper • 2404.06395 • Published Apr 9, 2024 • 22

authored a paper 12 months ago

BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences

Paper • 2403.09347 • Published Mar 14, 2024 • 22

New activity in openbmb/MiniCPM-2B-sft-fp32 about 1 year ago

Whether you will provide a raw base model without STF and DPO

#1 opened about 1 year ago by

Benyou

liked a model about 1 year ago

openbmb/MiniCPM-2B-sft-bf16

Text Generation • Updated Sep 7, 2024 • 8.05k • 118