vilm
/

Text Generation
Transformers
Safetensors
MLX
English
qwen2
conversational
Inference Endpoints
text-generation-inference
Edit model card

vilm/Quyen-v0.1-mlx

This model was converted to MLX format from vilm/Quyen-v0.1. Refer to the original model card for more details on the model.

Use with mlx

pip install mlx-lm
from mlx_lm import load, generate

model, tokenizer = load("vilm/Quyen-v0.1-mlx")
response = generate(model, tokenizer, prompt="hello", verbose=True)
Downloads last month
2
Safetensors
Model size
3.95B params
Tensor type
FP16
·

Datasets used to train vilm/Quyen-v0.1-mlx