doberst commited on
Commit
e4c8c29
1 Parent(s): 5403262

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +18 -46
README.md CHANGED
@@ -42,7 +42,8 @@ The intended use of BLING models is two-fold:
42
  1. Provide a high-quality Instruct models that can run on a laptop for local testing. We have found it extremely useful when building a
43
  proof-of-concept, or working with sensitive enterprise data that must be closely guarded, especially in RAG use cases.
44
 
45
- 2. Push the state of the art for smaller Instruct-following models in the 1B - 7B range.
 
46
 
47
  ### Direct Use
48
 
@@ -56,6 +57,8 @@ on a narrower set of Instructions more suitable to a ~1B parameter GPT model.
56
  BLING is ideal for rapid prototyping, testing, and the ability to perform an end-to-end workflow locally on a laptop without
57
  having to send sensitive information over an Internet-based API.
58
 
 
 
59
 
60
  [More Information Needed]
61
 
@@ -69,7 +72,7 @@ having to send sensitive information over an Internet-based API.
69
 
70
  <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
71
 
72
- 1. BLING is not designed for 'chat-bot' or 'consumer-oriented' applications.
73
 
74
  2. BLING is not optimal for most production applications, other than simple and highly specific use cases.
75
 
@@ -85,68 +88,37 @@ mitigate potential bias and safety. We would strongly discourage any use of B
85
 
86
  [More Information Needed]
87
 
88
- ### Recommendations
89
-
90
- <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
91
-
92
- Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
93
 
94
  ## How to Get Started with the Model
95
 
96
- Use the code below to get started with the model.
97
 
98
- [More Information Needed]
99
-
100
- ## Training Details
101
 
102
- ### Training Data
103
 
104
- <!-- This should link to a Data Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
105
 
106
- [More Information Needed]
107
 
108
- ### Training Procedure
 
109
 
110
- <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
111
 
112
- #### Preprocessing [optional]
113
-
114
- [More Information Needed]
115
-
116
-
117
- #### Training Hyperparameters
118
-
119
- - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
120
 
121
 
122
  ## Citation [optional]
123
 
124
  <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
125
 
126
- **BibTeX:**
127
-
128
- [More Information Needed]
129
-
130
- **APA:**
131
-
132
- [More Information Needed]
133
-
134
- ## Glossary [optional]
135
-
136
- <!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
137
-
138
- [More Information Needed]
139
-
140
- ## More Information [optional]
141
-
142
- [More Information Needed]
143
 
144
- ## Model Card Authors [optional]
145
 
146
- [More Information Needed]
147
 
148
- ## Model Card Contact
149
-
150
- [More Information Needed]
151
 
152
 
 
42
  1. Provide a high-quality Instruct models that can run on a laptop for local testing. We have found it extremely useful when building a
43
  proof-of-concept, or working with sensitive enterprise data that must be closely guarded, especially in RAG use cases.
44
 
45
+ 2. Push the state of the art for smaller Instruct-following models in the 1B - 7B range through improved fine-tuning datasets and targeted "instruction" tasks.
46
+
47
 
48
  ### Direct Use
49
 
 
57
  BLING is ideal for rapid prototyping, testing, and the ability to perform an end-to-end workflow locally on a laptop without
58
  having to send sensitive information over an Internet-based API.
59
 
60
+ The first BLING models have been trained on question-answering, key-value extraction, and basic summarization as the core instruction types.
61
+
62
 
63
  [More Information Needed]
64
 
 
72
 
73
  <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
74
 
75
+ 1. BLING is not designed for 'chat-bot' or 'consumer-oriented' applications.
76
 
77
  2. BLING is not optimal for most production applications, other than simple and highly specific use cases.
78
 
 
88
 
89
  [More Information Needed]
90
 
 
 
 
 
 
91
 
92
  ## How to Get Started with the Model
93
 
94
+ The fastest way to get started with BLING is through direct import in transformers:
95
 
96
+ model = AutoModelForCausalLM.from_pretrained("llmware/bling-1b-0.1")
97
+ tokenizer = AutoTokenizer.from_pretrained("llmware/bling-1b-0.1")
 
98
 
99
+ The BLING model was fine-tuned with a simple "<human> and <bot> wrapper", so to get the best results, wrap inference entries as:
100
 
101
+ full_prompt = "<human>: " + my_prompt + "\n" + "<bot>: "
102
 
103
+ The BLING model was fine-tuned with closed-context samples, which assume generally that the prompt consists of sub-parts:
104
 
105
+ 1. Text Passage Context, and
106
+ 2. Specific question or instruction based on the text passage
107
 
108
+ To get the best results, package "my_prompt" as follows:
109
 
110
+ my_prompt = {{text_passage}} + "\n" + {{question/instruction}}
 
 
 
 
 
 
 
111
 
112
 
113
  ## Citation [optional]
114
 
115
  <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
116
 
117
+ ## Model Card Contact
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
118
 
119
+ Darren Oberst & llmware team
120
 
121
+ Please reach out anytime if you are interested in this research program and would like to participate and work with us!
122
 
 
 
 
123
 
124