Matthew Mendoza

mattm1005

AI & ML interests

None yet

Recent Activity

liked a Space about 2 months ago

nanotron/ultrascale-playbook

new activity 7 months ago

mattshumer/ref_70_e3:Blank

new activity 10 months ago

Qwen/CodeQwen1.5-7B:Regarding adopting an apache 2.0 license?

Organizations

None yet

mattm1005's activity

liked a Space about 2 months ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

New activity in mattshumer/ref_70_e3 7 months ago

Blank

#1 opened 7 months ago by

New activity in Qwen/CodeQwen1.5-7B 10 months ago

Regarding adopting an apache 2.0 license?

#9 opened 10 months ago by

upvoted a paper 12 months ago

Learn Your Reference Model for Real Good Alignment

Paper • 2404.09656 • Published Apr 15, 2024 • 86

upvoted a paper about 1 year ago

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 612

liked a model about 1 year ago

mistralai/Mixtral-8x7B-v0.1

Text Generation • Updated Jul 24, 2024 • 45k • 1.7k

New activity in Deci/DeciLM-7B-instruct about 1 year ago

Regarding Source Code For Tokenizer

#11 opened about 1 year ago by

New activity in Deci/DeciLM-7B-instruct over 1 year ago

Safety and ethical guardrails

#10 opened over 1 year ago by