Update README.md

README.md CHANGED

@@ -1,11 +1,7 @@
 ---
 license: mit
 ---
-
-
-Self-RAG is trained on our instruction-following corpora with interleaving passages and reflection tokens using the standard next-token prediction objective, enabling efficient and stable learning with fine-grained feedback.
-At inference, we leverage reflection tokens covering diverse aspects of generations to sample the best output aligning with users' preferences.
-See full descriptions in [our paper](https://arxiv.org/abs/2310.11511).
+4 bit quantization of: https://huggingface.co/selfrag/selfrag_llama2_7b

 ## Usage
 Here, we show an easy way to quickly download our model from HuggingFace and run with `vllm` with pre-given passages. Make sure to install dependencies listed at [self-rag/requirements.txt](https://github.com/AkariAsai/self-rag/blob/main/requirements.txt).

@@ -53,11 +49,6 @@ or
 If you have additional input.
 You can insert paragraphs anywhere after `### Response:\n"`, but make sure to mark paragraphs as paragraph tokens (i.e., `<paragraph>{0}</paragraph>`).

-## Training details
-Our training data is available at the HuggingFace dataset [selfrag_train_data](https://huggingface.co/datasets/selfrag/selfrag_train_data).
-See our official repository for the training details.
-We used 8 A100 40GB GPUs for training on the Stability HPC server.
-
 ## Citation and contact
 If you use this model, please cite our work:
 ```
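The paragraph-token convention kept in the Usage hunk above (passages inserted after `### Response:\n` and wrapped as `<paragraph>{0}</paragraph>`) can be sketched as a small prompt-building helper. This is a minimal sketch, not code from the diff: the `format_prompt` name, the `### Instruction:` prefix, and the `[Retrieval]` token are assumptions based on the upstream self-rag repository's prompt template, which this diff does not show.

```python
def format_prompt(instruction, paragraph=None):
    """Build a Self-RAG style prompt string.

    Assumption: the "### Instruction:" prefix and the "[Retrieval]"
    token follow the upstream self-rag template; this diff only shows
    the "### Response:\n" and <paragraph> conventions.
    """
    prompt = "### Instruction:\n{0}\n\n### Response:\n".format(instruction)
    if paragraph is not None:
        # Mark the retrieved passage with paragraph tokens, as the
        # README requires: <paragraph>{0}</paragraph>
        prompt += "[Retrieval]<paragraph>{0}</paragraph>".format(paragraph)
    return prompt
```

The resulting string would then be passed to `vllm` for generation, per the Usage section's dependency note.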