Victor Nogueira's picture

Victor Nogueira

Felladrin

·

https://felladrin.com

felladrin

AI & ML interests

Models to run in the web browser

Recent Activity

updated a collection about 16 hours ago

Foundation Text-Generation Models Below 360M Parameters

published a model about 17 hours ago

Felladrin/Qwen2-96M

updated a model about 17 hours ago

Felladrin/Qwen2-96M

View all activity

Organizations

Felladrin's activity

upvoted an article about 2 months ago

Article

Gradio spaces are the perfect agent tools\!

By

•

Jan 17

• 15

upvoted 4 articles 4 months ago

Article

Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK

By

and 1 other •

Nov 21, 2024

• 35

Article

The Beginners Guide to Cleaning a Dataset

By

•

Nov 18, 2024

• 24

Article

Releasing the largest multilingual open pretraining dataset

By

and 2 others •

Nov 13, 2024

• 100

Article

Introducing GGUF-my-LoRA

By

•

Nov 1, 2024

• 15

upvoted a collection 5 months ago

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated Nov 27, 2024 • 111

upvoted 2 articles 8 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 333

Article

How to run Gemini Nano locally in your browser

By

•

Jul 11, 2024

• 44

upvoted a collection 9 months ago

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Nov 28, 2024 • 359

upvoted an article 10 months ago

Article

License to Call: Introducing Transformers Agents 2.0

May 13, 2024

• 130

upvoted a collection 11 months ago

📀 Dataset comparison models

1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated Jun 12, 2024 • 37

upvoted 2 articles 11 months ago

Article

Fine-tune Llama 3 with ORPO

By

•

Apr 22, 2024

• 235

Article

Welcome Llama 3 - Meta's new open LLM

Apr 18, 2024

• 284

upvoted a paper 11 months ago

Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck

Paper • 2404.07647 • Published Apr 11, 2024 • 4

upvoted a collection 12 months ago

OpenCulture

A multilingual dataset of public domain books and newspapers. • 27 items • Updated Nov 6, 2024 • 124

upvoted 2 papers about 1 year ago

TeenyTinyLlama: open-source tiny language models trained in Brazilian Portuguese

Paper • 2401.16640 • Published Jan 30, 2024 • 8

TinyLlama: An Open-Source Small Language Model

Paper • 2401.02385 • Published Jan 4, 2024 • 93

upvoted 2 collections about 1 year ago

Model Merging

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 235

Apple MLX-compatible 7B LLMs on the 🤗 Hub

This collection contains the model weights for 7B LLMs for Apple's MLX framework. Find more information at https://github.com/ml-explore/mlx • 8 items • Updated Sep 2, 2024 • 8