Collection of models from the third LLM course homework. It containes three LLMs fine-tuned using LoRA, QLoRA, and DoRA.
Sergey Pankevich
spankevich
AI & ML interests
None yet
Recent Activity
updated
a model
25 days ago
spankevich/output
published
a model
25 days ago
spankevich/output
updated
a model
26 days ago
spankevich/llm-course-hw3-tinyllamma-qlora
Organizations
None yet
Collections
2
models
9
spankevich/output
Updated
•
3
spankevich/llm-course-hw3-tinyllamma-qlora
Updated
spankevich/llm-course-hw3-dora
Text Generation
•
Updated
•
7
spankevich/llm-course-hw3-lora
Text Generation
•
Updated
•
9
spankevich/llm-hw-2-ppo
Text Generation
•
Updated
•
2
spankevich/trainer_output
Text Classification
•
Updated
•
2
spankevich/llm-hw-2-dpo
Text Generation
•
Updated
•
2
spankevich/llm-hw-2
Updated
spankevich/llm-course-hw1
Text Generation
•
Updated
•
1
datasets
None public yet