SubhrajitSain commited on
Commit
4055d3b
·
verified ·
1 Parent(s): 3ac7be1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +25 -16
README.md CHANGED
@@ -2,32 +2,41 @@
2
  library_name: transformers
3
  base_model: SubhrajitSain/anwgpt2-345m
4
  tags:
5
- - generated_from_trainer
 
6
  model-index:
7
  - name: anwgpt2-345m
8
  results: []
 
 
 
 
 
 
9
  ---
10
 
11
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
12
- should probably proofread and complete it, then remove this comment. -->
13
 
14
- # anwgpt2-345m
15
 
16
- This model is a fine-tuned version of [SubhrajitSain/anwgpt2-345m](https://huggingface.co/SubhrajitSain/anwgpt2-345m) on an unknown dataset.
 
 
 
 
 
17
 
18
- ## Model description
 
19
 
20
- More information needed
 
21
 
22
- ## Intended uses & limitations
 
23
 
24
- More information needed
25
-
26
- ## Training and evaluation data
27
-
28
- More information needed
29
-
30
- ## Training procedure
31
 
32
  ### Training hyperparameters
33
 
@@ -45,4 +54,4 @@ The following hyperparameters were used during training:
45
  - Transformers 4.56.0
46
  - Pytorch 2.8.0+cu126
47
  - Datasets 2.14.6
48
- - Tokenizers 0.22.0
 
2
  library_name: transformers
3
  base_model: SubhrajitSain/anwgpt2-345m
4
  tags:
5
+ - gpt2
6
+ - gpt2-medium
7
  model-index:
8
  - name: anwgpt2-345m
9
  results: []
10
+ license: mit
11
+ datasets:
12
+ - Elriggs/openwebtext-100k
13
+ language:
14
+ - en
15
+ pipeline_tag: text-generation
16
  ---
17
 
18
+ # anwgpt2-355m
 
19
 
20
+ My second attempt at a LLM.
21
 
22
+ ## Model Details
23
+ * **Model Type:** GPT-2
24
+ * **Model Size:** 354,823,168 parameters
25
+ * **Base Model:** `gpt2-medium`
26
+ * **Dataset:** Elriggs/openwebtext-100k
27
+ * **Training Framework:** Hugging Face Transformers
28
 
29
+ ### Intended Use
30
+ This model is intended for text generation tasks.
31
 
32
+ ### Training
33
+ The model was fine-tuned on the `Elriggs/openwebtext-100k` dataset.
34
 
35
+ ### Evaluation
36
+ Evaluation was not recorded.
37
 
38
+ ### Limitations
39
+ May perform repetitions, but very not likely.
 
 
 
 
 
40
 
41
  ### Training hyperparameters
42
 
 
54
  - Transformers 4.56.0
55
  - Pytorch 2.8.0+cu126
56
  - Datasets 2.14.6
57
+ - Tokenizers 0.22.0