Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting context lengths of up to 1M tokens • 2 items • Updated 2 days ago • 80
AI4Privacy_v2 Collection AI4Privacy Version 2 models, trained on PII200k • 6 items • Updated Sep 25, 2024 • 4
Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models Paper • 2501.11873 • Published 8 days ago • 61
HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning Paper • 2501.02625 • Published 23 days ago • 1
QTIP Quantized Models Collection See https://github.com/Cornell-RelaxML/qtip • 30 items • Updated Dec 9, 2024 • 11
Dolphin 3.0 Collection Dolphin 3.0 is the next generation of the Dolphin series of instruct-tuned models, designed to be the ultimate general-purpose local model. • 7 items • Updated 23 days ago • 60
Falcon3 Collection The Falcon3 family of open foundation models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 21 days ago • 80
FP8 LLMs for vLLM Collection Accurate FP8-quantized models by Neural Magic, ready for use with vLLM! • 44 items • Updated Oct 17, 2024 • 62