task1442_doqa_movies_isanswerable
README.md
---
language: en
license: mit
library_name: pytorch
---

# Model Card for Mistral-7B-Instruct-v0.2-4b-r16-task1442

<!-- Provide a quick summary of what the model is/does. -->

## Model Details

### Model Description

<!-- Provide a longer summary of what this model is. -->

A LoRA adapter for mistralai/Mistral-7B-Instruct-v0.2, trained on task1442_doqa_movies_isanswerable (judging whether a movie-domain question is answerable from its accompanying passage).

- **Developed by:** bruel
- **Funded by [optional]:** [More Information Needed]
- **Shared by [optional]:** [More Information Needed]
- **Model type:** LoRA
- **Language(s) (NLP):** en
- **License:** mit
- **Finetuned from model [optional]:** mistralai/Mistral-7B-Instruct-v0.2
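
A minimal sketch of running this adapter, assuming it is published as a PEFT adapter on the Hugging Face Hub. The repo id `Lots-of-LoRAs/Mistral-7B-Instruct-v0.2-4b-r16-task1442` and the prompt text are assumptions based on this card's title and task name, not documented values; `transformers`, `peft`, and `accelerate` are required.

```python
# Load the base model, then attach the LoRA adapter with PEFT.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

BASE_ID = "mistralai/Mistral-7B-Instruct-v0.2"  # base model named in this card
# ASSUMPTION: adapter repo id inferred from the card title; replace if it differs.
ADAPTER_ID = "Lots-of-LoRAs/Mistral-7B-Instruct-v0.2-4b-r16-task1442"

tokenizer = AutoTokenizer.from_pretrained(BASE_ID)
base = AutoModelForCausalLM.from_pretrained(
    BASE_ID, torch_dtype=torch.bfloat16, device_map="auto"
)
model = PeftModel.from_pretrained(base, ADAPTER_ID)  # wraps base with LoRA weights

# ASSUMPTION: the prompt format used in training is not documented on this card.
prompt = "[INST] Is the following question answerable from the passage? ... [/INST]"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=16)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```

If the adapter is to be served without PEFT overhead, `model.merge_and_unload()` returns the base model with the LoRA weights folded in.
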
### Model Sources [optional]

<!-- Provide the basic links for the model. -->

- **Repository:** https://github.com/bruel-gabrielsson
- **Paper [optional]:** "Compress then Serve: Serving Thousands of LoRA Adapters with Little Overhead" (2024), Rickard Brüel Gabrielsson, Jiacheng Zhu, Onkar Bhardwaj, Leshem Choshen, Kristjan Greenewald, Mikhail Yurochkin and Justin Solomon
- **Demo [optional]:** [More Information Needed]
## Uses

[...]

### Training Data

<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->

Training data: https://huggingface.co/datasets/Lots-of-LoRAs/task1442_doqa_movies_isanswerable, sourced from https://github.com/allenai/natural-instructions.
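
A short sketch of inspecting the training data with the `datasets` library; split and column names are not documented on this card, so the snippet only prints whatever schema the published dataset exposes.

```python
# Peek at the task1442 training data from the Hugging Face Hub.
from datasets import load_dataset

ds = load_dataset("Lots-of-LoRAs/task1442_doqa_movies_isanswerable")
print(ds)                     # available splits and column names
first_split = next(iter(ds))  # whichever split is listed first
print(ds[first_split][0])     # one raw example, schema as published
```
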
### Training Procedure

[...]

**BibTeX:**

@misc{brüelgabrielsson2024compressserveservingthousands,
      title={Compress then Serve: Serving Thousands of LoRA Adapters with Little Overhead},
      author={Rickard Brüel-Gabrielsson and Jiacheng Zhu and Onkar Bhardwaj and Leshem Choshen and Kristjan Greenewald and Mikhail Yurochkin and Justin Solomon},
      year={2024},
      eprint={2407.00066},
      archivePrefix={arXiv},
      primaryClass={cs.DC},
      url={https://arxiv.org/abs/2407.00066},
}

**APA:**

Brüel-Gabrielsson, R., Zhu, J., Bhardwaj, O., Choshen, L., Greenewald, K., Yurochkin, M., & Solomon, J. (2024). Compress then Serve: Serving Thousands of LoRA Adapters with Little Overhead. arXiv preprint arXiv:2407.00066. https://arxiv.org/abs/2407.00066