OpenPlatypus_Falcon_7b / README.md

	---
	library_name: peft
	tags:
	- tiiuae/falcon-7b
	- code
	- instruct
	- instruct-code
	- logical-reasoning
	- Platypus2
	datasets:
	- garage-bAInd/Open-Platypus
	base_model: tiiuae/falcon-7b
	---

	We finetuned tiiuae/falcon-7b on the Open-Platypus dataset (garage-bAInd/Open-Platypus) for 3 epochs using [MonsterAPI](https://monsterapi.ai)'s no-code [LLM finetuner](https://docs.monsterapi.ai/fine-tune-a-large-language-model-llm).

	#### About OpenPlatypus Dataset
	OpenPlatypus focuses on improving the logical reasoning skills of LLMs and was used to train the Platypus2 models. The dataset combines several sub-datasets, including PRM800K, ScienceQA, SciBench, ReClor, and TheoremQA. Questions were filtered using keyword search and Sentence Transformers, and any question with a similarity above 80% to another was removed. The contributing sub-datasets are released under various licenses, such as MIT, Creative Commons, and Apache 2.0.
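
The similarity filter described above can be illustrated with a minimal sketch. It is not the dataset authors' pipeline: the `embed` function below is a toy bag-of-words stand-in for a Sentence Transformers embedding, and the function names and sample questions are hypothetical. Only the 80% threshold comes from the description above.

```python
import math
from collections import Counter


def embed(text):
    """Toy bag-of-words vector; an assumed stand-in for a sentence-embedding model."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def drop_near_duplicates(questions, threshold=0.8):
    """Keep a question only if its similarity to every kept question is <= threshold."""
    kept, kept_vecs = [], []
    for question in questions:
        vec = embed(question)
        if all(cosine(vec, other) <= threshold for other in kept_vecs):
            kept.append(question)
            kept_vecs.append(vec)
    return kept


# Hypothetical sample questions, for illustration only.
questions = [
    "What is the derivative of x squared?",
    "what is the derivative of X squared?",  # same question, different casing
    "Name the largest planet in the solar system.",
]
unique = drop_near_duplicates(questions)
```

This greedy pass compares each question against everything already kept, which is quadratic in the number of questions. A real pipeline over a large dataset would likely use batched embedding comparisons instead.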

	The finetuning run completed in about 3 hours and cost only `$14` in total.

	#### Hyperparameters & Run details:
	- Model Path: tiiuae/falcon-7b
	- Dataset: garage-bAInd/Open-Platypus
	- Learning rate: 0.0003
	- Number of epochs: 3
	- Data split: Training: 90% / Validation: 10%
	- Gradient accumulation steps: 1
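
To show how the settings above interact, here is a small sketch that performs a 90/10 split and counts optimizer steps for 3 epochs with gradient accumulation of 1. The dataset size (1000) and batch size (4) are placeholder assumptions, since neither is stated in this card, and the helper names are hypothetical rather than part of the MonsterAPI finetuner.

```python
import math
import random


def split_indices(num_examples, val_fraction=0.10, seed=0):
    """Shuffle example indices and split them into (train, validation) lists."""
    indices = list(range(num_examples))
    random.Random(seed).shuffle(indices)
    num_val = int(round(num_examples * val_fraction))
    return indices[num_val:], indices[:num_val]


def optimizer_steps(num_train, batch_size, grad_accum_steps=1, epochs=3):
    """Number of optimizer updates: one update per `grad_accum_steps` micro-batches."""
    micro_batches = math.ceil(num_train / batch_size)
    return math.ceil(micro_batches / grad_accum_steps) * epochs


# Placeholder values: the real dataset size and batch size are not given in this card.
train_idx, val_idx = split_indices(num_examples=1000)
steps = optimizer_steps(len(train_idx), batch_size=4)
```

With gradient accumulation set to 1, every micro-batch triggers an optimizer update, so the step count depends only on the training-set size, the batch size, and the number of epochs.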


	---
	license: apache-2.0
	---