pszemraj
/

t5-base-askscience

Text2Text Generation

information retrieval

Inference Endpoints

text-generation-inference

Model card Files Files and versions Community

pszemraj commited on Feb 12, 2022

Commit

e9c7fbc

•

1 Parent(s): 3f4d24f

Update README.md

Files changed (1) hide show

README.md +4 -1

README.md CHANGED Viewed

@@ -19,6 +19,9 @@ widget:
   example_title: "probability distribution"
 - text: "question: how does exercise help us lose weight? context: I started working out two weeks ago and already feel a lot better, and started to think about it and became deeply confused."
   example_title: "pumpen"
 inference:
   parameters:
     max_length: 64
@@ -41,4 +44,4 @@ inference:
 - for inputs, the model was presented with the post title and the post selftext encoded as: `question: <post title> context: <post selftext>`. You may see better results if queries are posed in this fashion.
 - The top two replies were aggregated and presented to the model as the output text.
-- Training for longer will be explored, but given that the dataset has 127k examples and the loss flatlines at 0.5 epochs this should be fairly viable.

   example_title: "probability distribution"
 - text: "question: how does exercise help us lose weight? context: I started working out two weeks ago and already feel a lot better, and started to think about it and became deeply confused."
   example_title: "pumpen"
+- text: "what is a neural network?"
+  example_title: "deep learning"
 inference:
   parameters:
     max_length: 64
 - for inputs, the model was presented with the post title and the post selftext encoded as: `question: <post title> context: <post selftext>`. You may see better results if queries are posed in this fashion.
 - The top two replies were aggregated and presented to the model as the output text.
+- Training for longer will be explored, but given that the dataset has 127k examples and the loss flatlines at 0.5 epochs so this model should be fairly viable.