aao331 committed on
Commit fdce146 · 1 Parent(s): 29fb3dd

Create README.md

Files changed (1)
  1. README.md +65 -0
README.md ADDED
@@ -0,0 +1,65 @@
+ ---
+ language:
+ - en
+ - es
+ ---
+
+ # Model Card for Carpincho-13B
+
+ <!-- Provide a quick summary of what the model is/does. -->
+
+ This is Carpincho-13B, an instruction-tuned LLM based on LLaMA-13B. It is trained to answer in colloquial Argentine Spanish.
+ It is based on LLaMA-13B (https://huggingface.co/decapoda-research/llama-13b-hf).
+
+ ## Model Details
+
+ The model is provided in two formats: a low-rank adaptation (LoRA) model that can be applied directly to LLaMA-13B-HF,
+ and a complete merged model quantized to 4 bits that requires only 8 GB of VRAM. Both can be used directly in software such as
+ text-generation-webui (https://github.com/oobabooga/text-generation-webui).
+ Additionally, a test chatbot based on this model runs on the Twitter account http://twitter.com/arggpt
+
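As a rough sanity check on the 8 GB figure, the memory taken by the weights alone can be estimated from the parameter count and the bit width. The sketch below is only an approximation: it assumes a nominal 13e9 parameters, ignores quantization metadata (scales, zero points), activations, and the KV cache, and the function name `weight_footprint_gib` is hypothetical.

```python
def weight_footprint_gib(num_params: float, bits_per_weight: int) -> float:
    """Estimate the memory needed for the weights alone, in GiB."""
    total_bytes = num_params * bits_per_weight / 8
    return total_bytes / 2**30


# Assumption: 13e9 parameters (the nominal size, not an exact count).
fp16 = weight_footprint_gib(13e9, 16)
int4 = weight_footprint_gib(13e9, 4)
print(f"fp16: {fp16:.1f} GiB, 4-bit: {int4:.1f} GiB")
```

Under these assumptions the 4-bit weights come to roughly 6 GiB, a quarter of the 16-bit size, which is consistent with the model fitting on an 8 GB card with some headroom for runtime overhead.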
+ ### Model Description
+
+ <!-- Provide a longer summary of what this model is. -->
+
+
+
+ - **Developed by:** Alfredo Ortega (@ortegaalfredo)
+ - **Model type:** 13B-parameter LLM
+ - **Language(s) (NLP):** English and colloquial Argentine Spanish
+ - **License:** Free for non-commercial use
+ - **Finetuned from model:** https://huggingface.co/decapoda-research/llama-13b-hf
+
+ ### Model Sources
+
+ <!-- Provide the basic links for the model. -->
+
+ - **Repository:** https://huggingface.co/decapoda-research/llama-13b-hf
+ - **Paper:** https://arxiv.org/abs/2302.13971
+
+ ## Uses
+
+ <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+ This is a general-purpose LLM chatbot intended for direct interaction with humans.
+
+ ## Bias, Risks, and Limitations
+
+ <!-- This section is meant to convey both technical and sociotechnical limitations. -->
+ This bot is uncensored and may produce shocking answers. It also carries the biases present in its training material.
+
+ [More Information Needed]
+
+ ### Recommendations
+
+ <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
+
+ Users (both direct and downstream) should be made aware of the risks, biases, and limitations of the model.
+
+ ## How to Get Started with the Model
+
+ The easiest way is to download the text-generation-webui application and place the model inside its 'models' directory.
+ Then launch the web interface and run the model as a regular LLaMA-13B model.
+
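The "place the model inside the 'models' directory" step can be sketched in a few lines of standard-library Python. This is only an illustration: the helper `install_model` and both example paths are hypothetical, and it assumes the model files have already been downloaded into a single folder.

```python
import shutil
from pathlib import Path


def install_model(model_dir: Path, webui_root: Path) -> Path:
    """Copy a downloaded model folder into <webui_root>/models and return the new path."""
    models_dir = webui_root / "models"
    models_dir.mkdir(parents=True, exist_ok=True)
    target = models_dir / model_dir.name
    # dirs_exist_ok lets the copy be re-run over an existing folder (Python 3.8+).
    shutil.copytree(model_dir, target, dirs_exist_ok=True)
    return target


if __name__ == "__main__":
    # Hypothetical locations; adjust to wherever the files actually live.
    downloaded = Path.home() / "Downloads" / "carpincho-13b"
    webui = Path.home() / "text-generation-webui"
    if downloaded.is_dir():
        print("Model installed at", install_model(downloaded, webui))
    else:
        print("Nothing to copy:", downloaded, "does not exist")
```

Copying rather than moving keeps the original download intact; once the folder is in place, the model should be selectable from the web interface.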
+ ## Model Card Contact
+
+ Contact the creator at @ortegaalfredo on Twitter or GitHub.