Blog, Articles, and discussions

Introducing the Open Leaderboard for Hebrew LLMs!

By May 5, 2024 guest • 9

Community Articles

view all

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

•

about 10 hours ago

• 10

Can we create pedagogically valuable multi-turn synthetic datasets from Cosmopedia?

•

about 13 hours ago

• 2

Understanding IPOs: A Comprehensive Guide

•

about 14 hours ago

• 1

Evalverse: Revolutionizing Large Language Model Evaluation with a Unified, User-Friendly Framework

•

about 14 hours ago

A Guide to Designing New Functional Proteins and Improving Protein Function, Stability, and Diversity with Generative AI

•

6 days ago

• 12

seemore: Implement a Vision Language Model from Scratch

•

6 days ago

• 40

Google Search with LLM

•

7 days ago

• 4

Token Merging for fast LLM inference : Background and first trials with Mistral

•

7 days ago

• 1

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

•

9 days ago

• 24

Expanding Model Context and Creating Chat Models with a Single Click

•

10 days ago

• 25

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

•

11 days ago

• 47

Estimating Memory Consumption of LLMs for Inference and Fine-Tuning for Cohere Command-R+

•

12 days ago

• 4

Post-OCR-Correction: 1 billion words dataset of automated OCR correction by LLM

•

12 days ago

• 9

Can We Train Chat Models with Raw Data?

•

13 days ago

• 17

RealWorldQA, What's New?

•

13 days ago

• 6

Efficient Table Pre-training without Real Data: An Introduction to TAPEX

By May 23, 2022 guest • 1

Putting ethical principles at the core of research lifecycle

By May 19, 2022

Supercharged Customer Service with Machine Learning

By April 25, 2022

Accelerate BERT inference with Hugging Face Transformers and AWS inferentia

By March 16, 2022

Guiding Text Generation with Constrained Beam Search in 🤗 Transformers

By March 11, 2022 guest • 1

BERT 101 🤗 State Of The Art NLP Model Explained

By March 2, 2022

Getting Started with Sentiment Analysis using Python

By February 2, 2022 • 6

Deploy GPT-J 6B for inference using Hugging Face Transformers and Amazon SageMaker

By January 11, 2022

Active Learning with AutoNLP and Prodigy

By December 23, 2021

Perceiver IO: a scalable, fully-attentional model that works on any modality

By December 15, 2021

Training CodeParrot 🦜 from Scratch

By December 8, 2021 • 3

Scaling up BERT-like model Inference on modern CPU - Part 2

By November 4, 2021

Course Launch Community Event

By October 26, 2021

Large Language Models: A New Moore's Law?

By October 26, 2021

Community Articles

view all

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

•

about 10 hours ago

• 10

Can we create pedagogically valuable multi-turn synthetic datasets from Cosmopedia?

•

about 13 hours ago

• 2

Understanding IPOs: A Comprehensive Guide

•

about 14 hours ago

• 1

Evalverse: Revolutionizing Large Language Model Evaluation with a Unified, User-Friendly Framework

•

about 14 hours ago

Mergoo: Efficiently Build Your Own MoE LLM

•

about 22 hours ago

• 29

SeeMoE: Implementing a MoE Vision Language Model from Scratch

•

1 day ago

• 17

Top 5 Webflow Agencies Focused On Building Brands For The Future

•

2 days ago

• 1

🧑‍⚖️ "Replacing Judges with Juries" using distilabel

•

5 days ago

• 14

Fish Speech V1 - New Multilingual Open Source TTS Model

•

5 days ago

• 3

A Guide to Designing New Functional Proteins and Improving Protein Function, Stability, and Diversity with Generative AI

•

6 days ago

• 12

seemore: Implement a Vision Language Model from Scratch

•

6 days ago

• 40

Google Search with LLM

•

7 days ago

• 4

Token Merging for fast LLM inference : Background and first trials with Mistral

•

7 days ago

• 1

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

•

9 days ago

• 24

Expanding Model Context and Creating Chat Models with a Single Click

•

10 days ago

• 25

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

•

11 days ago

• 47

Estimating Memory Consumption of LLMs for Inference and Fine-Tuning for Cohere Command-R+

•

12 days ago

• 4

Post-OCR-Correction: 1 billion words dataset of automated OCR correction by LLM

•

12 days ago

• 9

Can We Train Chat Models with Raw Data?

•

13 days ago

• 17

RealWorldQA, What's New?

•

13 days ago

• 6