pszemraj committed on
Commit
57642d1
1 Parent(s): 8785f3c

Update README.md

Files changed (1)
  1. README.md +7 -4
README.md CHANGED
@@ -10,16 +10,16 @@ model-index:
  results: []
  widget:
  - text: >-
- Is this review positive or negative? Review: Best cast iron skillet you will
- ever buy. <answer>
+ Review: Best cast iron skillet you will
+ ever buy. Is this review positive or negative? <answer>
  example_title: Sentiment analysis
  - text: >-
  Barack Obama nominated Hilary Clinton as his secretary of state on Monday.
- He chose her because she had <answer>
+ He chose her because <answer>
  example_title: Coreference resolution
  - text: >-
  On a shelf, there are five books: a gray book, a red book, a purple book, a
- blue book, and a black book <answer>
+ blue book, and a black book. Here's the puzzle, <answer>
  example_title: Logic puzzles
  - text: >-
  The two men running to become New York City's next mayor will face off in
@@ -29,6 +29,9 @@ widget:
  Is it true that if I have five 5-hour energy drinks in a single 24-hour
  period, I get 25 hours of energy and spontaneously explode? <answer>
  example_title: 5 hour energy
+ - text: <-
+ what happens if you train a smaller model on a dataset of reinforcement-learning optimized model responses? <answer>
+ example_title: deep learning advice
  inference:
  parameters:
  temperature: 0.6