--- license: cc-by-4.0 datasets: - zirui3/TSSB-3M-instructions - conceptofmind/FLAN_2022 - zirui3/zhihu_qa - zirui3/cMedQA2-instructions tags: - code --- # summary This model is bigcode/starcoder fine-tuned on codegen dataset & natural language dataset(chinese/english instruction dataset) # dataset * codegen-instruct * [zirui3/TSSB-3M-instructions](https://huggingface.co/datasets/zirui3/TSSB-3M-instructions)(python code bugfix) * FLAN(english) * [OIG](https://huggingface.co/datasets/laion/OIG) (Open-Assistant,engliesh) * [zirui3/zhihu_qa](https://huggingface.co/datasets/zirui3/zhihu_qa)(chinese) * [COIG](https://huggingface.co/datasets/BAAI/COIG) (chinese) * pCLUE(chinese) * [zirui3/cMedQA2-instructions](https://huggingface.co/datasets/zirui3/TSSB-3M-instructions) (chinese medical domain)