Santiago Garcia's picture

Santiago Garcia

santyzenith

·

AI & ML interests

Large language models, Natural Language Processing, Computer Vision, Spanish Large language models.

Recent Activity

liked a model 7 days ago

deepcogito/cogito-v1-preview-llama-8B

liked a model 15 days ago

rasbt/gpt2-from-scratch-pytorch

liked a model 15 days ago

rasbt/llama-3.2-from-scratch

View all activity

Organizations

santyzenith's activity

upvoted a collection 2 months ago

DeepSeek-VL2

5 items • Updated Feb 9 • 72

upvoted a collection 3 months ago

BGE

23 items • Updated Feb 13 • 106

upvoted a collection 4 months ago

RLHF

A collection of models trained with Reinforcement Learning from Human Feedback (RLHF). • 4 items • Updated 2 days ago • 5

upvoted a collection 7 months ago

LLM2Vec

16 items • Updated Oct 8, 2024 • 45

upvoted 2 articles 7 months ago

Article

Vision Language Models Explained

Apr 11, 2024

• 308

Article

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Mar 9, 2023

• 49

upvoted 2 papers 8 months ago

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19, 2024 • 40

MoLE : Mixture of Language Experts for Multi-Lingual Automatic Speech Recognition

Paper • 2302.13750 • Published Feb 27, 2023 • 2

upvoted 3 articles 8 months ago

Article

Introduction to Graph Machine Learning

Jan 3, 2023

• 29

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23, 2024

• 232

Article

Welcome Gemma 2 - Google's new open LLM

Jun 27, 2024

• 129

upvoted a paper 9 months ago

DataComp-LM: In search of the next generation of training sets for language models

Paper • 2406.11794 • Published Jun 17, 2024 • 53

upvoted an article 9 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 354

upvoted an article 10 months ago

Article

From PyTorch DDP to 🤗 Accelerate to 🤗 Trainer, mastery of distributed training with ease

Oct 21, 2022

• 27

upvoted a paper 10 months ago

Tuna: Instruction Tuning using Feedback from Large Language Models

Paper • 2310.13385 • Published Oct 20, 2023 • 11

upvoted a collection 10 months ago

Knowledge distillation

88 items • Updated Feb 7, 2024 • 7

upvoted 2 articles 10 months ago

Article

Putting RL back in RLHF

Jun 12, 2024

• 86

Article

Fine-Tune Whisper with 🤗 Transformers

Nov 3, 2022

• 214

upvoted a paper 10 months ago

Datasets: A Community Library for Natural Language Processing

Paper • 2109.02846 • Published Sep 7, 2021 • 14