---
license: apache-2.0
datasets:
- cerebras/SlimPajama-627B
- bigcode/starcoderdata
- HuggingFaceH4/ultrachat_200k
- HuggingFaceH4/ultrafeedback_binarized
- dumb-dev/cpp-10k
- dumb-dev/Encoding-Detection-w-cChardet-DB
- Neloy262/rust_instruction_dataset
- m-a-p/CodeFeedback-Filtered-Instruction
- sahil2801/CodeAlpaca-20k
- vicgalle/alpaca-gpt4
language:
- en
---
I fine-tuned TinyLlama/TinyLlama-1.1B-Chat-v1.0 on the following datasets:
- dumb-dev/cpp-10k
- dumb-dev/Encoding-Detection-w-cChardet-DB
- Neloy262/rust_instruction_dataset
- m-a-p/CodeFeedback-Filtered-Instruction
- sahil2801/CodeAlpaca-20k
- vicgalle/alpaca-gpt4