Gökdeniz Gülmez (Isaak Carter Augustus)

Goekdeniz-Guelmez

AI & ML interests

Transformers / NLP / Multimodal / Realtime M2M / J.O.S.I.E.v4o

Recent Activity

New activity about 13 hours ago

mlx-community/Mamba-Codestral-7B-v0.1-4bit: [WIP] Upload folder using huggingface_hub (multi-commit 0bb930c7d8029239c283b898a046127f24d8e8c88ee1fd7426528349cbbd9b30)

New activity about 13 hours ago

mlx-community/Mamba-Codestral-7B-v0.1-8bit: Update README.md

Goekdeniz-Guelmez's activity

upvoted a collection 1 day ago

Molmo

4 items • Updated 1 day ago • 1

upvoted 2 papers 18 days ago

Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination

Paper • 2411.03823 • Published 19 days ago • 43

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published 19 days ago • 60

upvoted a collection 20 days ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 10 items • Updated 4 days ago • 177

upvoted a paper 25 days ago

LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding

Paper • 2404.16710 • Published Apr 25 • 74

upvoted an article 28 days ago

Visually Multilingual: Introducing mcdse-2b

Article • 29 days ago • 37

upvoted 2 collections about 1 month ago

Papers - MoE

45 items • Updated Aug 23 • 3

Josiefied and Abliterated

Abliterated, and further fine-tuned to be the most uncensored models available. • 11 items • Updated 7 days ago • 3

upvoted 3 collections about 2 months ago

🍷 FineWeb datasets

5 items • Updated Jun 26 • 20

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 10 days ago • 274

Llama 3.2

Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions. • 20 items • Updated 4 days ago • 40

upvoted a paper 2 months ago

Kolmogorov-Arnold Transformer

Paper • 2409.10594 • Published Sep 16 • 38

upvoted 2 collections 2 months ago

Mamba

Mamba is a new LLM architecture that integrates the Structured State Space sequence model to manage lengthy data sequences. • 11 items • Updated Oct 12 • 1

Transformers compatible Mamba

This release includes the `mamba` repositories compatible with the `transformers` library • 5 items • Updated Mar 6 • 36

upvoted 2 articles 2 months ago

Uncensor any LLM with abliteration

Article • Jun 13 • 371

Recreating o1 at Home with Role-Play LLMs

Article • Sep 20 • 20

upvoted 3 collections 2 months ago

Qwen2.5

The Qwen 2.5 models are a series of AI models trained on 18 trillion tokens, supporting 29 languages and offering advanced features such as instructio • 33 items • Updated Oct 12 • 4

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Sep 18 • 383

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18 • 218

upvoted an article 3 months ago

What is Retrieval-based Voice Conversion WebUI?

Article • Aug 18 • 9