Open to Work

Perfect Makuwerere

nakue

AI & ML interests

Full LLM lifecycle: fine-tuning (QLoRA), compression (quantization, W8A8/W4A16), and serving (vLLM, disaggregated prefill/decode). Interested in inference economics, GPU memory architecture, and deploying efficient models in resource-constrained settings.

Recent Activity

updated a model about 19 hours ago

nakue/SmolLM2-1.7B-LoRA-adapter

published a model about 19 hours ago

nakue/SmolLM2-1.7B-LoRA-adapter

updated a model about 19 hours ago

nakue/SmolLM2-1.7B-W8A8-instruct

View all activity

Organizations

nakue 's models 7

Perfect Makuwerere

AI & ML interests

Recent Activity

Organizations

nakue 's models 7 Sort: Recently updated

nakue 's models 7