Minh Tran's picture

6 190

Minh Tran

tminh

·

minhtcai

AI & ML interests

None yet

Recent Activity

liked a model 8 days ago

unsloth/Llama-3.2-1B-Instruct-GGUF

liked a model 8 days ago

unsloth/Llama-3.2-3B-Instruct-GGUF

liked a model 12 days ago

5CD-AI/Vintern-3B-beta

View all activity

Organizations

tminh's activity

upvoted a collection 7 months ago

ViHateT5 - Vietnamese Hate Speech Detection with T5

5 items • Updated Jul 16 • 2

upvoted a paper 8 months ago

Deep Bidirectional Language-Knowledge Graph Pretraining

Paper • 2210.09338 • Published Oct 17, 2022 • 1

upvoted 3 collections 8 months ago

CodeGemma Release

18 items • Updated 10 days ago • 78

Switch-Transformers release

This release included various MoE (Mixture of expert) models, based on the T5 architecture . The base models use from 8 to 256 experts. • 9 items • Updated 10 days ago • 15

Mixtral HQQ Quantized Models

4-bit and 2-bit Mixtral models quantized using https://github.com/mobiusml/hqq • 9 items • Updated Mar 29 • 14

upvoted a paper 9 months ago

MambaByte: Token-free Selective State Space Model

Paper • 2401.13660 • Published Jan 24 • 52