Andriy Burkov
Andriy's activity
Where is the model? 0 downloads means nobody can use it. Please fix.
9
#1 opened 6 days ago
by
Andriy
How does v0.2 manage to support a 32k-token context without Sliding Window Attention?
3
#85 opened 21 days ago
by
Andriy
What is the max. context length of Mistral-7B-Instruct-v0.2?
16
#43 opened 3 months ago
by
hanshupe
Longer inference time
2
#4 opened about 1 month ago
by
dittops
Instruct-finetuning dataset
#4 opened about 1 month ago
by
Andriy
Finetuning dataset
#2 opened about 1 month ago
by
Andriy
Instruct-finetuning dataset
#1 opened about 1 month ago
by
Andriy
Instruct-finetuning dataset
#3 opened about 1 month ago
by
Andriy
Instruct-finetuning dataset
1
#2 opened about 1 month ago
by
Andriy
Instruct-finetuning dataset
#2 opened about 1 month ago
by
Andriy
Instruct-finetuning dataset
#5 opened about 1 month ago
by
Andriy
Instruct-finetuning dataset
#3 opened about 1 month ago
by
Andriy
Instruct-finetuning dataset
#1 opened about 1 month ago
by
Andriy
Instruct-finetuning dataset
#2 opened about 1 month ago
by
Andriy
Instruct-finetuning dataset
1
#1 opened about 1 month ago
by
Andriy
Instruct-finetuning dataset
#4 opened about 1 month ago
by
Andriy
Instruct-finetuning dataset
1
#3 opened about 1 month ago
by
Andriy
Instruct-finetuning dataset
1
#6 opened about 1 month ago
by
Andriy
Instruct-finetuning dataset
#6 opened about 1 month ago
by
Andriy
Instruct-finetuning dataset
1
#1 opened about 1 month ago
by
Andriy
Instruct-finetuning dataset
1
#3 opened about 1 month ago
by
Andriy
Datasets for function calling and JSON
5
#13 opened about 1 month ago
by
Andriy
Instruct-finetuning dataset
#9 opened about 1 month ago
by
Andriy
What is the SFT data?
4
#7 opened 5 months ago
by
Ede-CH
Instruct-finetuning dataset
#189 opened about 1 month ago
by
Andriy
Instruct-finetuning dataset
1
#8 opened about 1 month ago
by
Andriy
Instruct-finetuning dataset
4
#43 opened about 1 month ago
by
Andriy
Instruct-finetuning data
#12 opened about 1 month ago
by
Andriy
Instruct dataset
#5 opened about 1 month ago
by
Andriy
DPO dataset
#11 opened about 1 month ago
by
Andriy
Instruct dataset
#23 opened about 1 month ago
by
Andriy
The license
2
#8 opened about 1 month ago
by
Andriy
Is it QLoRA or a full finetune?
1
#5 opened about 2 months ago
by
Andriy
Is it QLoRA or a full finetune?
1
#5 opened about 2 months ago
by
Andriy
DeepSpeed ZeRO-3 and full finetune
2
#5 opened about 2 months ago
by
Andriy
Dataset?
5
#1 opened 2 months ago
by
0xbitches
What is the context size of this model?
1
#11 opened 2 months ago
by
Andriy
Questions about architecture (+ LoRA)
2
#16 opened 2 months ago
by
alex0dd
Finetuning setup
#4 opened 2 months ago
by
Andriy
Can you tell us the original models that you merged to create this model?
1
#3 opened 4 months ago
by
Bruce001
What was the dataset used to pretrain Mistral-7B?
1
#38 opened 7 months ago
by
Andriy