Update README.md
Browse files
README.md
CHANGED
@@ -13,11 +13,11 @@ inference:
|
|
13 |
|
14 |
[<img src="https://i.ibb.co/5Lbwyr1/dicta-logo.jpg" width="300px"/>](https://dicta.org.il)
|
15 |
|
16 |
-
#
|
17 |
|
18 |
The DictaLM-2.0 Large Language Model (LLM) is a pretrained generative text model with 7 billion parameters trained to specialize in Hebrew text.
|
19 |
|
20 |
-
For full details of this model please read our [release blog post](https://dicta.org.il/dicta-lm).
|
21 |
|
22 |
This is the base model designed for completion (not for chat!) in the GGUF format for use with llama.cpp.
|
23 |
|
@@ -40,5 +40,13 @@ DictaLM 2.0 is a pretrained base model and therefore does not have any moderatio
|
|
40 |
If you use this model, please cite:
|
41 |
|
42 |
```bibtex
|
43 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
44 |
```
|
|
|
13 |
|
14 |
[<img src="https://i.ibb.co/5Lbwyr1/dicta-logo.jpg" width="300px"/>](https://dicta.org.il)
|
15 |
|
16 |
+
# Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities
|
17 |
|
18 |
The DictaLM-2.0 Large Language Model (LLM) is a pretrained generative text model with 7 billion parameters trained to specialize in Hebrew text.
|
19 |
|
20 |
+
For full details of this model please read our [release blog post](https://dicta.org.il/dicta-lm) or the [technical report](https://arxiv.org/abs/2407.07080).
|
21 |
|
22 |
This is the base model designed for completion (not for chat!) in the GGUF format for use with llama.cpp.
|
23 |
|
|
|
40 |
If you use this model, please cite:
|
41 |
|
42 |
```bibtex
|
43 |
+
@misc{shmidman2024adaptingllmshebrewunveiling,
|
44 |
+
title={Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities},
|
45 |
+
author={Shaltiel Shmidman and Avi Shmidman and Amir DN Cohen and Moshe Koppel},
|
46 |
+
year={2024},
|
47 |
+
eprint={2407.07080},
|
48 |
+
archivePrefix={arXiv},
|
49 |
+
primaryClass={cs.CL},
|
50 |
+
url={https://arxiv.org/abs/2407.07080},
|
51 |
+
}
|
52 |
```
|