mit-han-lab/Yi-34B-QServe-g128
MIT HAN Lab
Tags: Text Generation, Transformers, PyTorch, Safetensors, llama, text-generation-inference, Inference Endpoints
refs/pr/1
2 contributors
History: 4 commits
Latest commit: nielsr (HF staff), "Add model card for LServe", dd9b5c1 (verified), 22 days ago
File                          Size       Scan  LFS  Last commit                Updated
.gitattributes                1.52 kB    Safe  -    initial commit             11 months ago
LICENSE                       17.4 kB    Safe  -    Create LICENSE             11 months ago
README.md                     760 Bytes  -     -    Add model card for LServe  22 days ago
config.json                   605 Bytes  Safe  -    upload                     11 months ago
generation_config.json        132 Bytes  Safe  -    upload                     11 months ago
model.safetensors.index.json  44.8 kB    Safe  -    upload                     11 months ago
pytorch_model.bin             19.1 GB    -     LFS  upload                     11 months ago
tokenizer.json                3.56 MB    Safe  -    upload                     11 months ago
tokenizer.model               1.03 MB    Safe  LFS  upload                     11 months ago
tokenizer_config.json         320 Bytes  Safe  -    upload                     11 months ago