27 561

Gynt

celeski

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

bartowski/agentica-org_DeepCoder-14B-Preview-GGUF

liked a model 1 day ago

agentica-org/DeepCoder-14B-Preview

Organizations

None yet

celeski's activity

upvoted a collection 10 days ago

TxGemma Release

Collection of open models to accelerate the development of therapeutics. • 5 items • Updated 8 days ago • 44

upvoted a collection 29 days ago

Gemma 3 Release

17 items • Updated 8 days ago • 322

upvoted a collection 2 months ago

DeepSeek R1 (All Versions)

DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated 5 days ago • 216

upvoted a paper 4 months ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 152

upvoted 2 collections 4 months ago

EXAONE-3.5

EXAONE 3.5 language model series including instruction-tuned models of 2.4B, 7.8B, and 32B • 10 items • Updated 24 days ago • 109

OLMo 2

Artifacts for the second set of OLMo models. • 27 items • Updated 21 days ago • 108

upvoted 2 collections 6 months ago

Granite 3.0 Language Models

A series of language models trained by IBM and released under the Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated Feb 24 • 96

v4

18 items • Updated Oct 20, 2024 • 31

upvoted an article 6 months ago

Releasing Swift Transformers: Run On-Device LLMs in Apple Devices

Article • Aug 8, 2023 • 32

upvoted a collection 6 months ago

Emu3

Emu3: Next-Token Prediction is All You Need • 7 items • Updated Feb 13 • 70

upvoted 2 collections 7 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 590

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 28 days ago • 300

upvoted an article 10 months ago

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Article • Jun 24, 2024 • 191

upvoted 4 collections 10 months ago

SSMs

A collection of Mamba-2-based research models with 8B parameters trained on 3.5T tokens for comparison with Transformers. • 5 items • Updated about 4 hours ago • 27

Core ML Gallery Models

7 items • Updated Oct 4, 2024 • 35

Nemotron 4 340B

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated about 4 hours ago • 162

RecurrentGemma Release

8 items • Updated 8 days ago • 40

upvoted a paper 10 months ago

ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Paper • 2406.04325 • Published Jun 6, 2024 • 76