s3nh's picture

s3nh

s3nh

AI & ML interests

Quantization, LLMs, Deep Learning for good. Follow me if you like my work. Patreon.com/s3nh

Recent Activity

new activity 1 day ago
SmolTuners/README:Gh organization
new activity 1 day ago
SmolTuners/README:Optimizers
liked a dataset 1 day ago
fluently-sets/ultraset
View all activity

Organizations

ESPnet's profile picture Gradio-Blocks-Party's profile picture Lajonbot's profile picture The Waifu Research Department's profile picture AblateIt's profile picture Blog-explorers's profile picture BangumiBase's profile picture CyberHarem's profile picture HydraLM's profile picture GOAT.AI's profile picture That Time I got Reincarnated as a Hugging Face Organization's profile picture ZeroGPU Explorers's profile picture Social Post Explorers's profile picture Spinner-GPT-4's profile picture Hugging Face Discord Community's profile picture open/ acc's profile picture Smol Community's profile picture

s3nh's activity

New activity in SmolTuners/README 1 day ago

Gh organization

1
#3 opened 1 day ago by
s3nh

Optimizers

#2 opened 1 day ago by
s3nh
New activity in SmolTuners/README 3 days ago

Datasets

3
#1 opened 5 days ago by
s3nh
reacted to merve's post with ๐Ÿง  5 days ago
view post
Post
1664
A complete RAG pipeline includes a reranker, which ranks the documents to find the best document ๐Ÿ““
Same goes for multimodal RAG, multimodal rerankers which we can integrate to multimodal RAG pipelines!
Learn how to build a complete multimodal RAG pipeline with vidore/colqwen2-v1.0 as retriever, lightonai/MonoQwen2-VL-v0.1 as reranker, Qwen/Qwen2-VL-7B-Instruct as VLM in this notebook that runs on a GPU as small as L4 ๐Ÿ”ฅ https://huggingface.co/learn/cookbook/multimodal_rag_using_document_retrieval_and_reranker_and_vlms
reacted to fdaudens's post with ๐Ÿค— 5 days ago
view post
Post
1130
๐Ÿค Want to share your AI models while protecting your work? Licenses are key!

Fascinating to see that nearly 60% of models on the Hub use Apache & MIT licenses.

Explore the viz here: huggingface/open-source-ai-year-in-review-2024
reacted to Lewdiculous's post with โž• 5 days ago
view post
Post
1654
Hello fellow LLMers, just a quick notice that some of my activity will be moved into the AetherArchitectural Commuity and split with @Aetherarchio .

[here] https://huggingface.co/AetherArchitectural

All activity should be visible in the left side of my profile.
  • 1 reply
ยท
reacted to fdaudens's post with ๐Ÿ‘ 5 days ago
view post
Post
1129
๐Ÿ” From instruction-following to creative storytelling, dive into 2024's most impactful AI datasets! These gems are shaping everything from scientific research to video understanding.

Check it out: huggingface/open-source-ai-year-in-review-2024
replied to louisbrulenaudet's post 5 days ago
reacted to louisbrulenaudet's post with ๐Ÿค— 5 days ago
view post
Post
1728
Iโ€™ve published a new dataset to simplify model merging ๐Ÿค—

This dataset facilitates the search for compatible architectures for model merging with @arcee_aiโ€™s mergekit, streamlining the automation of high-performance merge searches ๐Ÿ“–

Dataset : louisbrulenaudet/mergekit-configs
  • 1 reply
ยท
reacted to nyuuzyou's post with ๐Ÿ‘ 5 days ago
view post
Post
1503
โœˆ๏ธ Aircraft Dataset & Generation Model nyuuzyou/aircraft-images & nyuuzyou/AircraftFLUX-LoRA

Dataset Features:
โ€ข 165,340 high-res aircraft images with metadata
โ€ข Machine-generated English captions
โ€ข Detailed aircraft specs, registration & flight info
โ€ข Environmental context descriptions

LoRA model specializes in:
โ€ข Realistic aircraft generation
โ€ข Accurate technical details for unpopular airplanes compared to black-forest-labs/FLUX.1-schnell
โ€ข Proper airline liveries
โ€ข Contextual aviation scenes
updated a Space 5 days ago
replied to danielhanchen's post 5 days ago
reacted to danielhanchen's post with ๐Ÿค—๐Ÿ‘ 5 days ago
reacted to stefan-it's post with โค๏ธ 5 days ago
view post
Post
1127
My latest project is the outcome of the last 2+ years working with TPUs from the amazing TPU Research Cloud (TRC) program and training Encoder-only LMs with the TensorFlow Model Garden library.

๐Ÿ‘‰ Link: https://github.com/stefan-it/model-garden-lms

An overview of some features:

- Cheatsheet for setting-up a TPU VM Pod (with all necessary dependencies) to pretrain LMs with TF Model Garden
- Conversion scripts that convert TF Model Garden weights to Hugging Face Transformers-compatible models
- Supported architectures include BERT, BERT with Token Dropping and TEAMS

I also released BERT-based models pretrained on the great Hugging Face FineWeb and FineWeb-Edu datasets (10BT subset). With more to come!

๐Ÿ‘‰ Model Hub Link: https://huggingface.co/model-garden-lms

If you find these resources useful, please give them a like!

Made from Bavarian Oberland with โค๏ธ and ๐Ÿฅจ.
reacted to lucifertrj's post with ๐Ÿ‘€ 5 days ago
view post
Post
483
Image Prompt Engineering Guide:
โžก๏ธ Artistic styling for Image generation
โžก๏ธ Prompt weighting using the parentheses method to generate realistic images.
โžก๏ธ Advanced features like style and positioning control[experimental].
โžก๏ธ Image placement on the generated AI image using Recraft V3 Mockup.

Watch: https://www.youtube.com/watch?v=d3nUG28-jIc
replied to AtAndDev's post 5 days ago