Victor Nogueira's picture

Victor Nogueira

Felladrin

·

https://felladrin.com

felladrin

AI & ML interests

Models to run in the web browser

Organizations

Felladrin's activity

upvoted a collection 12 days ago

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 29 items • Updated 12 days ago • 188

upvoted an article 18 days ago

Article

License to Call: Introducing Transformers Agents 2.0

May 13

• 96

upvoted a collection about 2 months ago

📀 Dataset comparison models

1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated 6 days ago • 24

upvoted 2 articles 2 months ago

Article

Fine-tune Llama 3 with ORPO

By

•

Apr 22

• 200

Article

Welcome Llama 3 - Meta's new open LLM

Apr 18

• 250

upvoted a paper 2 months ago

Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck

Paper • 2404.07647 • Published Apr 11 • 4

upvoted a collection 3 months ago

Common Corpus

The largest public domain dataset for training LLMs. • 27 items • Updated 1 day ago • 105

upvoted a paper 5 months ago

TeenyTinyLlama: open-source tiny language models trained in Brazilian Portuguese

Paper • 2401.16640 • Published Jan 30 • 4

upvoted a paper 6 months ago

TinyLlama: An Open-Source Small Language Model

Paper • 2401.02385 • Published Jan 4 • 81

upvoted 2 collections 6 months ago

Model Merging

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated 6 days ago • 185

Apple MLX-compatible 7B LLMs on the 🤗 Hub

This collection contains the model weights for 7B LLMs for Apple's MLX framework. Find more information at https://github.com/ml-explore/mlx • 8 items • Updated May 7 • 9