shuyuej's picture
Update README.md
7c5ba6e verified
|
raw
history blame contribute delete
No virus
977 Bytes
metadata
model-index:
  - name: MetaMath-LoRA-LLaMA-7B
    results:
      - task:
          type: text-generation
        dataset:
          name: meta-math/MetaMathQA
          type: meta-math/MetaMathQA
        metrics:
          - name: Accuracy (zero-shot)
            type: Accuracy (zero-shot)
            value: 0.58
            verified: true
        source:
          name: Arithmetic Reasoning on GSM8K
          url: https://paperswithcode.com/sota/arithmetic-reasoning-on-gsm8k
license: apache-2.0

Fine-tune LLaMA 2 (7B) with LoRA on meta-math/MetaMathQA

Fine-tune for one epoch

Result:

After the pre-training: Invalid output length: 7, Testing length: 1319 , Accuracy: 0.580

Comparison

The official report accuracy is 0.665 by fine-tuning the whole LLaMA 2 7B model for 3 epochs.

Note: The LoRA adapter is being used for future research purposes.

πŸš€ Adapter Usage

# Load the Pre-trained LoRA Adapter
model.load_adapter("shuyuej/metamath_lora_qkv_llama2_7b")
model.enable_adapters()