What is SamSum Bot?

This is a model fine-tuned on the SamSum dataset. Instead of being trained to summarize conversations, it is trained to predict a conversation given a summary. The prompt must have the following form:

A partial summary of the conversation is:
{summary}

With the dialogue being:
{dialogue}

where {summary} is a short text such as

John went out to buy groceries. He meets Jane on the way and they talk about the weather.

and {dialogue} must be structured so that each line is preceded by the name of the character speaking it:

John: Oh hi Jane.
Jane: Nice to see you?
John: The weather looks nice today
Jane: [PREDICTION]
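
The prompt format above can be assembled programmatically. The following is a minimal sketch, not part of the repo: the names PROMPT_TEMPLATE and build_prompt are hypothetical, and it assumes that the prompt should end with the next speaker's name followed by a colon, at the point marked [PREDICTION], so that the model completes that line.

```python
# Template mirroring the prompt layout described in this card.
PROMPT_TEMPLATE = (
    "A partial summary of the conversation is:\n"
    "{summary}\n"
    "\n"
    "With the dialogue being:\n"
    "{dialogue}"
)


def build_prompt(summary, turns, next_speaker):
    """Assemble a prompt from a summary and the dialogue so far.

    turns is a list of (speaker, utterance) pairs; next_speaker is the
    character whose line the model is asked to predict.
    (Hypothetical helper, written for illustration only.)
    """
    lines = [f"{speaker}: {utterance}" for speaker, utterance in turns]
    # Assumption: the prompt stops right after the next speaker's name,
    # leaving the rest of the line for the model to generate.
    lines.append(f"{next_speaker}:")
    return PROMPT_TEMPLATE.format(
        summary=summary.strip(),
        dialogue="\n".join(lines),
    )


prompt = build_prompt(
    "John went out to buy groceries. He meets Jane on the way "
    "and they talk about the weather.",
    [
        ("John", "Oh hi Jane."),
        ("Jane", "Nice to see you?"),
        ("John", "The weather looks nice today"),
    ],
    "Jane",
)
print(prompt)
```

The resulting string can then be passed to the model as the text to continue. Whether a trailing space is expected after the final colon is not stated here, so that detail may need adjusting.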

The system is based on GPT-J-6B by EleutherAI, quantized by Hivemind, and has been fine-tuned using the LoRA method. A simple back-end is available in this repo, where the model is served using TorchServe. A terminal-like front-end interface is available here. This is the interface used on my website, http://fractalego.io.
