MiniLLM

community

https://github.com/microsoft/LMOps/tree/main/minillm

t1101675

Activity Feed

AI & ML interests

Training efficient language models (MiniLLM, MiniPLM)

Recent Activity

t1101675 new activity about 1 month ago

MiniLLM/SFT-OPT-1.3B:Difference between SFT and init models

t1101675 authored a paper 2 months ago

NVILA: Efficient Frontier Visual Language Models

t1101675 updated a dataset 3 months ago

MiniLLM/pile-tokenized

View all activity

MiniLLM's activity

t1101675

in MiniLLM/SFT-OPT-1.3B about 1 month ago

Difference between SFT and init models

#1 opened about 1 month ago by

HyeongSoo

t1101675

authored a paper 2 months ago

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published Dec 5, 2024 • 57

t1101675

updated a dataset 3 months ago

MiniLLM/pile-tokenized

Updated Nov 14, 2024 • 75 • 1

t1101675

in MiniLLM/init-gpt2-120M 3 months ago

Adding `safetensors` variant of this model

#1 opened 3 months ago by

SFconvertbot

t1101675

updated 2 models 3 months ago

MiniLLM/teacher-Llama-13B

Text Generation • Updated Oct 30, 2024 • 3

MiniLLM/MiniLLM-Llama-7B

Text Generation • Updated Oct 30, 2024 • 6 • 1

t1101675

updated a model 4 months ago

MiniLLM/Ref-Pretrain-Qwen-104M

Text Generation • Updated Oct 27, 2024 • 878 • 1

t1101675

updated a Space 4 months ago

README

🏃

t1101675

updated 11 models 4 months ago

t1101675

authored a paper 4 months ago

MiniPLM: Knowledge Distillation for Pre-Training Language Models

Paper • 2410.17215 • Published Oct 22, 2024 • 14

AI & ML interests

Recent Activity

Team members 1

MiniLLM's activity

Difference between SFT and init models

Adding `safetensors` variant of this model

README