Cosmo

cosmojg

https://cosmo.red

AI & ML interests

Machine learning and computational neuroscience

Recent Activity

upvoted an article 1 day ago

Fine-Tune Whisper with 🤗 Transformers

liked a model 3 days ago

microsoft/MAI-DS-R1

liked a model 3 days ago

soob3123/amoral-gemma3-1B-v2

Organizations

None yet

cosmojg's activity

upvoted an article 1 day ago

Article

Fine-Tune Whisper with 🤗 Transformers

Nov 3, 2022

• 218

upvoted a collection 3 days ago

Amoral Collection

Unaligned LLMs • 4 items • Updated 1 day ago • 5

upvoted a collection 5 days ago

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The models preserve quality similar to half precision while using 3x less memory • 15 items • Updated 5 days ago • 161

upvoted a collection 15 days ago

Llama 4

Llama 4 release • 10 items • Updated 18 days ago • 447

upvoted 2 collections 28 days ago

TxGemma Release

Collection of open models to accelerate the development of therapeutics. • 5 items • Updated 20 days ago • 49

Orpheus-FASTAPI

Collection of quants for Canopy's Orpheus • 14 items • Updated 4 days ago • 3

upvoted 3 articles about 1 month ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 357

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Mar 12

• 398

Article

Controlling Language Model Generation with NVIDIA's LogitsProcessorZoo

Dec 23, 2024

• 43

upvoted a collection about 1 month ago

👩‍💻 OlympicCoder

Reasoning datasets and models for competitive coding • 4 items • Updated Mar 11 • 16

upvoted an article about 1 month ago

Article

Open R1: How to use OlympicCoder locally for coding?

Mar 20

• 56

upvoted a collection about 1 month ago

Jamba 1.6

The AI21 Jamba family of models are hybrid SSM-Transformer foundation models, outperforming open model competitors on quality and speed. • 2 items • Updated Mar 6 • 13

upvoted a collection about 2 months ago

Cohere Labs Aya Vision

Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated 8 days ago • 68

upvoted an article 2 months ago

Article

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

Feb 19

• 69

upvoted a collection 2 months ago

PaliGemma 2 Mix

13 items • Updated 20 days ago • 60

upvoted an article 2 months ago

Article

Welcome the Falcon 3 Family of Open Models!

Dec 17, 2024

• 126

upvoted a paper 3 months ago

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Paper • 2412.10302 • Published Dec 13, 2024 • 18

upvoted a collection 3 months ago

DeepSeek-VL2

5 items • Updated Feb 9 • 72

upvoted an article 3 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.22k