Lodo97 commited on
Commit
7a2e714
1 Parent(s): 62fb394

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -13
README.md CHANGED
@@ -1,11 +1,15 @@
1
  ---
2
  library_name: transformers
3
- tags: []
 
 
4
  ---
5
 
6
  # Model Card for Model ID
7
 
8
- <!-- Provide a quick summary of what the model is/does. -->
 
 
9
 
10
 
11
 
@@ -13,29 +17,26 @@ tags: []
13
 
14
  ### Model Description
15
 
16
- <!-- Provide a longer summary of what this model is. -->
17
 
18
- This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
19
 
20
- - **Developed by:** [More Information Needed]
21
- - **Funded by [optional]:** [More Information Needed]
22
- - **Shared by [optional]:** [More Information Needed]
23
- - **Model type:** [More Information Needed]
24
- - **Language(s) (NLP):** [More Information Needed]
25
- - **License:** [More Information Needed]
26
- - **Finetuned from model [optional]:** [More Information Needed]
27
 
28
  ### Model Sources [optional]
29
 
30
  <!-- Provide the basic links for the model. -->
31
 
32
- - **Repository:** [More Information Needed]
33
  - **Paper [optional]:** [More Information Needed]
34
  - **Demo [optional]:** [More Information Needed]
35
 
36
  ## Uses
37
 
38
- <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
 
 
39
 
40
  ### Direct Use
41
 
 
1
  ---
2
  library_name: transformers
3
+ language:
4
+ - en
5
+ pipeline_tag: text-generation
6
  ---
7
 
8
  # Model Card for Model ID
9
 
10
+ Coder-2b is a phi-2 fine tuned model trained on jondurbin/py-dpo-v0.1 using Reinforcement Learning from Human Feedback with DPO.
11
+ it is an instruct model capable of generating code starting from an instruction given by the user.
12
+ It is intended for those people who have few hardware resources available and want to speed up the process of writing Python code.
13
 
14
 
15
 
 
17
 
18
  ### Model Description
19
 
20
+ with the idea of creating a model that works on limited hardware, starting from a phi-2 model, coder-2b was fine-tuned with the Vezora/Tested-22k-Python-Alpaca dataset to make it capable of creating python code starting from from a user-written prompt. With further fine tuning, using the jondurbin/py-dpo-v0.1 dataset and leveraging the RLHF DPO technique, the model was further improved to produce more accurate outputs.
21
 
 
22
 
23
+ - **Developed by:** Lodo97
24
+ - **Language(s) (NLP):** English
25
+ - **Finetuned from model Lodo97/Test1:**
 
 
 
 
26
 
27
  ### Model Sources [optional]
28
 
29
  <!-- Provide the basic links for the model. -->
30
 
31
+ - **Repository:** Lodo97/coder-2b-v0.1-hfrl
32
  - **Paper [optional]:** [More Information Needed]
33
  - **Demo [optional]:** [More Information Needed]
34
 
35
  ## Uses
36
 
37
+ - Generate python code from an instruction provided by the user
38
+ - Find errors and bugs
39
+ - Rewrite code
40
 
41
  ### Direct Use
42