vpj committed
Commit c170726
1 Parent(s): b8a84ce

Update README.md

Files changed (1): README.md (+21 -15)
README.md CHANGED
@@ -8,11 +8,11 @@ license: bigscience-openrail-m
 ---
 
 
-[GeoV](https://huggingface.co/docs/transformers/model_doc/geov)-9B is a 9 billion parameter causal language model.
+[GeoV](https://github.com/geov-ai/geov)-9B is a 9 billion parameter causal language model.
 
 The GeoV model was designed by Georges Harik and uses
 [Rotary Positional Embeddings with Relative distances (RoPER)](http://research.labml.ai/RoPER.html)
-by [Georges Hark](https://twitter.com/ghark) and [Varuna Jayasiri](https://twitter.com/vpj).
+by [Georges Harik](https://twitter.com/gharik) and [Varuna Jayasiri](https://twitter.com/vpj).
 
 [RoPER](http://research.labml.ai/RoPER.html),
 in addition to using relative positions in the attention score calculation by RoPE embeddings,
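The linked RoPER page describes carrying the rotation trick into the value pathway as well: values are rotated by their source position, attention-weighted, and the sum is rotated back by the target position, so each contribution is net-rotated by relative distance only. A minimal sketch of that idea, assuming pairwise feature rotation and the usual 10000 frequency base; this is illustrative, not the GeoV implementation:

```python
import torch

def rope_rotate(x: torch.Tensor, positions: torch.Tensor) -> torch.Tensor:
    """Rotate consecutive feature pairs of x by angles proportional to position."""
    d = x.shape[-1]
    inv_freq = 1.0 / (10000 ** (torch.arange(0, d, 2).float() / d))
    angles = positions[:, None] * inv_freq[None, :]  # (seq, d/2)
    cos, sin = angles.cos(), angles.sin()
    x1, x2 = x[..., 0::2], x[..., 1::2]
    out = torch.empty_like(x)
    out[..., 0::2] = x1 * cos - x2 * sin
    out[..., 1::2] = x1 * sin + x2 * cos
    return out

seq, d = 8, 16
q, k, v = torch.randn(3, seq, d).unbind(0)
pos = torch.arange(seq).float()

# RoPE: rotating q and k by absolute position makes the score q_i . k_j
# depend only on the relative distance j - i.
attn = torch.softmax(rope_rotate(q, pos) @ rope_rotate(k, pos).T / d ** 0.5, dim=-1)

# RoPER (as sketched here): rotate values by source position, take the
# attention-weighted sum, then un-rotate by target position, leaving
# relative-distance information in the output.
out = rope_rotate(attn @ rope_rotate(v, pos), -pos)
```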
@@ -43,25 +43,31 @@ The released weights were trained on ~70 billion tokens.
 We plan to continue training up to 300 billion tokens and update the weights every 20b tokens.
 This training run is monolingual and uses the c4en and English Wikipedia datasets.
 
+## Installation
+
+```shell
+pip install geov
+```
+
 ## Generation
 
-The `generate()` method can be used to generate text using GeoV model.
+[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/geov-ai/geov/blob/master/notebooks/generate.ipynb)
 
 ```python
->>> from transformers import GeoVForCausalLM, GeoVTokenizer
+from geov import GeoVForCausalLM, GeoVTokenizer
 
->>> model = GeoVForCausalLM.from_pretrained("GeoV/GeoV-9b")
->>> tokenizer = GeoVTokenizer.from_pretrained("GeoV/GeoV-9b")
+model = GeoVForCausalLM.from_pretrained("GeoV/GeoV-9b")
+tokenizer = GeoVTokenizer.from_pretrained("GeoV/GeoV-9b")
 
->>> prompt = "In mathematics, topology is the study of"
+prompt = "In mathematics, topology is the study of"
 
->>> input_ids = tokenizer(prompt, return_tensors="pt").input_ids
+input_ids = tokenizer(prompt, return_tensors="pt").input_ids
 
->>> gen_tokens = model.generate(
-...     input_ids,
-...     do_sample=True,
-...     temperature=0.9,
-...     max_length=100,
-... )
->>> gen_text = tokenizer.batch_decode(gen_tokens)[0]
+gen_tokens = model.generate(
+    input_ids,
+    do_sample=True,
+    temperature=0.9,
+    max_length=100,
+)
+gen_text = tokenizer.batch_decode(gen_tokens)[0]
 ```
 
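At 9 billion parameters the checkpoint is large, so on a GPU it is usually worth loading in half precision. A hedged sketch of the same snippet run end to end, assuming the class forwards the standard `transformers` `torch_dtype` keyword (not confirmed by this commit):

```python
import torch
from geov import GeoVForCausalLM, GeoVTokenizer

# Assumption: from_pretrained accepts the standard transformers
# torch_dtype keyword; float16 roughly halves memory for the 9B weights.
model = GeoVForCausalLM.from_pretrained(
    "GeoV/GeoV-9b", torch_dtype=torch.float16
).to("cuda")
tokenizer = GeoVTokenizer.from_pretrained("GeoV/GeoV-9b")

input_ids = tokenizer(
    "In mathematics, topology is the study of", return_tensors="pt"
).input_ids.to("cuda")

# Sample a completion and print the decoded text.
gen_tokens = model.generate(
    input_ids, do_sample=True, temperature=0.9, max_length=100
)
print(tokenizer.batch_decode(gen_tokens)[0])
```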