2 59 224

Blanc Swan

blancsw

https://swan-blanc.fr/

AI & ML interests

ChatBot

Recent Activity

upvoted a paper 7 days ago

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

liked a model 13 days ago

Unbabel/TowerInstruct-Mistral-7B-v0.2

liked a dataset 13 days ago

Unbabel/TowerBlocks-v0.1

View all activity

Organizations

blancsw's activity

upvoted a paper 7 days ago

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published 13 days ago • 141

liked a model 13 days ago

Unbabel/TowerInstruct-Mistral-7B-v0.2

Translation • Updated Sep 4, 2024 • 1.52k • 16

liked a dataset 13 days ago

Unbabel/TowerBlocks-v0.1

Viewer • Updated Mar 4, 2024 • 637k • 114 • 28

liked a model 13 days ago

LLaMAX/LLaMAX3-8B

Text Generation • Updated Dec 6, 2024 • 168 • 35

liked a Space 13 days ago

575

Open Deep-Research

🏆

OpenAI's Deep Research, but open

upvoted an article 13 days ago

Article

Open-source DeepResearch – Freeing our search agents

23 days ago

• 1.1k

upvoted 2 papers 13 days ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published 16 days ago • 137

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published 15 days ago • 44

updated a model 15 days ago

Infomaniak-AI/smolLM2-135M-Instruct-movie-reco

Updated 15 days ago

published a model 16 days ago

Infomaniak-AI/smolLM2-135M-Instruct-movie-reco

Updated 15 days ago

updated a model 16 days ago

Infomaniak-AI/smolLM2-135M-Instruct-structure-output

Text Generation • Updated 16 days ago • 38

published 2 models 16 days ago

blancsw/SmolLM2-135M-Instruct-structure-output

Updated 16 days ago

Infomaniak-AI/smolLM2-135M-Instruct-structure-output

Text Generation • Updated 16 days ago • 38

upvoted an article 17 days ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 322

liked a dataset 17 days ago

ChristianAzinn/json-training

Viewer • Updated Aug 23, 2024 • 20.6k • 399 • 16

liked a model 18 days ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • Updated 2 days ago • 1.17M • • 1.18k

liked a dataset 19 days ago

IJUN/FakeNews

Viewer • Updated Jan 13 • 362 • 79 • 2

upvoted an article 22 days ago

Article

SmolVLM - small yet mighty Vision Language Model

Nov 26, 2024

• 202

liked a model 25 days ago

ibm-granite/granite-embedding-278m-multilingual

upvoted a collection 29 days ago

SmolVLM 256M & 500M

Collection

Collection for models & demos for even smoller SmolVLM release • 12 items • Updated 6 days ago • 69