2 11 14

cheng

zhoujun

BlankCheng

AI & ML interests

None yet

Recent Activity

updated a collection 8 days ago

Reasoning

updated a collection 8 days ago

Reasoning

updated a collection 8 days ago

Reasoning

View all activity

Organizations

zhoujun's activity

liked a Space 2 months ago

2.5k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a dataset 2 months ago

agentica-org/DeepScaleR-Preview-Dataset

Viewer • Updated Feb 10 • 40.3k • 3.17k • 108

liked a model 3 months ago

Qwen/Qwen2.5-Math-7B-Instruct

Text Generation • Updated Sep 23, 2024 • 51.6k • 71

liked a Space 6 months ago

Decentralized Arena Leaderboard

🥇

Display model leaderboard evaluations

liked a dataset 6 months ago

LLM360/TxT360

Updated 8 days ago • 100k • 229

liked a Space 6 months ago

110

TxT360: Trillion Extracted Text

📖

Create a large, deduplicated dataset for LLM pre-training

liked 2 datasets about 1 year ago

minimario/FOLIO

Viewer • Updated Jan 2, 2024 • 1.21k • 245 • 1

bigcode/the-stack-v2

Viewer • Updated Apr 23, 2024 • 5.45B • 2.61k • 354

liked 2 models about 1 year ago

deepseek-ai/deepseek-coder-7b-instruct-v1.5

Text Generation • Updated Feb 5, 2024 • 5.65k • 133

deepseek-ai/deepseek-coder-1.3b-instruct

Text Generation • Updated Mar 7, 2024 • 130k • 126

liked a model over 1 year ago

meta-llama/Llama-2-7b-chat-hf

Text Generation • Updated Apr 17, 2024 • 1.19M • 4.38k

liked a dataset almost 2 years ago

bigcode/ta-prompt

Viewer • Updated May 4, 2023 • 650 • 192 • 198

liked 2 Spaces over 2 years ago

Binder

🔗

245

Code generation with 🤗

✨

Generate code snippets using language models