Update README.md
README.md
CHANGED
@@ -58,14 +58,12 @@ for o in outputs:
 | Contents                       | Spec                                |
 |--------------------------------|-------------------------------------|
 | Base model                     | Qwen2.5-7B-Instruct                 |
-| Machine                        | A100 SXM 80GB × 2                   |
 | dtype                          | bfloat16                            |
 | PEFT                           | LoRA (r=8, alpha=64)                |
 | Learning Rate                  | 1e-5 (varies by further training)   |
 | LRScheduler                    | Cosine (warm-up: 0.05%)             |
 | Optimizer                      | AdamW                               |
 | Distributed / Efficient Tuning | DeepSpeed v3, Flash Attention       |
-| Global Batch Size              | 128                                 |

 # Dataset Card
 Part of the Reference dataset is provided as links due to copyright restrictions.
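For context, the spec table above describes a LoRA fine-tune of Qwen2.5-7B-Instruct. A minimal sketch of how that configuration might be expressed with the Hugging Face `transformers` and `peft` libraries is shown below; the library choice and the DeepSpeed config path are assumptions for illustration, not part of this commit.

```python
# Hypothetical sketch of the training setup described in the spec table above.
# transformers/peft are assumed libraries; the DeepSpeed JSON path is a placeholder.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM, AutoTokenizer, TrainingArguments

model_name = "Qwen/Qwen2.5-7B-Instruct"        # Base model

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="bfloat16",                    # dtype: bfloat16
    attn_implementation="flash_attention_2",   # Flash Attention
)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# PEFT: LoRA with r=8, alpha=64
peft_config = LoraConfig(r=8, lora_alpha=64, task_type="CAUSAL_LM")
model = get_peft_model(model, peft_config)

training_args = TrainingArguments(
    output_dir="outputs",
    bf16=True,
    learning_rate=1e-5,                # varies by further training
    lr_scheduler_type="cosine",        # Cosine schedule
    warmup_ratio=0.0005,               # warm-up: 0.05%
    optim="adamw_torch",               # AdamW
    deepspeed="ds_config_zero3.json",  # DeepSpeed ("v3" assumed to mean ZeRO stage 3)
)
```

A `Trainer` (or TRL's `SFTTrainer`) would then consume `model`, `tokenizer`, and `training_args` together with the Reference dataset mentioned in the Dataset Card.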