Omar Sanseviero

osanseviero

https://osanseviero.github.io/hackerllama/

AI & ML interests

Llamas, model merging, massive ASR for data collection, 3D ML, on-device ML, quantization, model judging, ML in browser, healthcare applications, education, intersection of art and ML.🦙

Recent Activity

new activity 14 days ago

google/gemma-3-1b-it:Update README.md

updated a model 14 days ago

google/gemma-3-27b-pt-qat-q4_0-gguf

Posts 19

Post

13025

Diaries of Open Source. Part 15 🤗

🕵️‍♀️Idefics 2 is out, a multimodal open-source model with very nice capabilities
Models, demo, and datasets: HuggingFaceM4/idefics2-661d1971b7c50831dd3ce0fe
Blog: https://hf.co/blog/idefics2

💾Snowflake released snowflake-arctic-embed, a family of powerful small embedding models
Model: Snowflake/snowflake-arctic-embed-m
Blog: https://www.snowflake.com/blog/introducing-snowflake-arctic-embed-snowflakes-state-of-the-art-text-embedding-family-of-models/

✨Pile-T5, EleutherAI's T5 model trained on 2T tokens
Blog: https://blog.eleuther.ai/pile-t5/
Models: EleutherAI/pile-t5-65a76a0d0022dd270b385a66
GitHub: https://github.com/EleutherAI/improved-t5

🤖CodeQwen1.5-7B base and chat models, trained on 3T tokens, with strong benchmark results for code generation, editing, and SQL
Blog post: https://qwenlm.github.io/blog/codeqwen1.5/
Demo: Qwen/CodeQwen1.5-7b-Chat-demo
Models: Qwen/CodeQwen1.5-7B and Qwen/CodeQwen1.5-7B-Chat

Misc
🦉 DocOwl1.5: Unified Structure Learning for OCR-free Document Understanding mPLUG/DocOwl
👀Cerule - a tiny vision LM Tensoic/Cerule-v0.1
ChemLLM - an LLM for chemistry and molecule science ⚗️ https://hf.co/AI4Chem/ChemLLM-7B-Chat-1.5-DPO
Distil Whisper Large
📝New PDF/OCR datasets with 19 samples pixparse/pdf-document-ocr-datasets-660701430b0346f97c4bc628
🔥Gretel AI high-quality text-to-SQL synthetic dataset gretelai/synthetic_text_to_sql

Articles 26

Article

188

Llama can now see and run on your device - welcome Llama 3.2

Collections 13

Papers 5

arxiv:2503.19786

arxiv:2310.16944

arxiv:2303.12582

arxiv:2211.05100

Spaces 180

InstantCoder

Generate app code from ideas

Co2 Estimator

Estimate CO2 activities from an image

How Much Do I Cost

Distilabel Dataset Generator

Create datasets with FAQs and SFT prompts

Mistral Super Fast

Non Streaming Example

Models 301

osanseviero/qwen2.5_0.5b-instruct-q2_K_test

Updated Oct 11, 2024

osanseviero/qwen2.5-0.5b-instruct-q2_K

Updated Oct 10, 2024 • 36 • 1

osanseviero/o-blob-3.2

Updated Oct 10, 2024 • 5

osanseviero/test-in-go7

Updated Oct 8, 2024

osanseviero/test-in-go6

Updated Oct 8, 2024

osanseviero/test-in-go5

Updated Oct 8, 2024

osanseviero/Reflection-Llama-3.1-70B-GGUF

Text Generation • Updated Sep 16, 2024 • 92

osanseviero/test-in-go4

Updated Sep 13, 2024

osanseviero/test-in-go3

Updated Sep 13, 2024

osanseviero/test-in-go

Updated Sep 12, 2024

Datasets 38

osanseviero/super-fun-llamas

Viewer • Updated Sep 13, 2024 • 10 • 38 • 1

osanseviero/fun_llamas

Viewer • Updated Sep 12, 2024 • 50 • 40

osanseviero/my-llamas

Viewer • Updated Sep 11, 2024 • 100 • 18

osanseviero/bill_summary_us_chunks-similarity

Viewer • Updated Jul 12, 2024 • 2k • 31

osanseviero/bill_summary_us_chunks

Viewer • Updated Jul 12, 2024 • 3.45M • 38

osanseviero/testing_geospatial

Updated Jul 8, 2024 • 35

osanseviero/ag_misclassifications

Viewer • Updated Oct 8, 2023 • 200 • 26

osanseviero/test_hacks

Updated Apr 28, 2023 • 11

osanseviero/example_ola

Viewer • Updated Mar 24, 2023 • 2 • 8

osanseviero/langchain_hub_test

Viewer • Updated Jan 30, 2023 • 1 • 23