qnguyen3 committed on
Commit 0389e88
1 parent: a639381

Upload tokenizer

Files changed (2)
  1. README.md +28 -34
  2. tokenizer.json +0 -0
README.md CHANGED
@@ -1,42 +1,36 @@
 ---
 license: apache-2.0
 widget:
-  - text: My name is El Microondas the Wise, and
-    example_title: El Microondas
-  - text: Kennesaw State University is a public
-    example_title: Kennesaw State University
-  - text: >-
-      Bungie Studios is an American video game developer. They are most famous
-      for developing the award winning Halo series of video games. They also
-      made Destiny. The studio was founded
-    example_title: Bungie
-  - text: The Mona Lisa is a world-renowned painting created by
-    example_title: Mona Lisa
-  - text: >-
-      The Harry Potter series, written by J.K. Rowling, begins with the book
-      titled
-    example_title: Harry Potter Series
-  - text: >-
-      Question: I have cities, but no houses. I have mountains, but no trees. I
-      have water, but no fish. What am I?
+- text: My name is El Microondas the Wise, and
+  example_title: El Microondas
+- text: Kennesaw State University is a public
+  example_title: Kennesaw State University
+- text: Bungie Studios is an American video game developer. They are most famous for
+    developing the award winning Halo series of video games. They also made Destiny.
+    The studio was founded
+  example_title: Bungie
+- text: The Mona Lisa is a world-renowned painting created by
+  example_title: Mona Lisa
+- text: The Harry Potter series, written by J.K. Rowling, begins with the book titled
+  example_title: Harry Potter Series
+- text: 'Question: I have cities, but no houses. I have mountains, but no trees. I
+    have water, but no fish. What am I?
 
-      Answer:
-    example_title: Riddle
-  - text: The process of photosynthesis involves the conversion of
-    example_title: Photosynthesis
-  - text: >-
-      Jane went to the store to buy some groceries. She picked up apples,
-      oranges, and a loaf of bread. When she got home, she realized she forgot
-    example_title: Story Continuation
-  - text: >-
-      Problem 2: If a train leaves Station A at 9:00 AM and travels at 60 mph,
-      and another train leaves Station B at 10:00 AM and travels at 80 mph, when
-      will they meet if the distance between the stations is 300 miles?
+    Answer:'
+  example_title: Riddle
+- text: The process of photosynthesis involves the conversion of
+  example_title: Photosynthesis
+- text: Jane went to the store to buy some groceries. She picked up apples, oranges,
+    and a loaf of bread. When she got home, she realized she forgot
+  example_title: Story Continuation
+- text: 'Problem 2: If a train leaves Station A at 9:00 AM and travels at 60 mph,
+    and another train leaves Station B at 10:00 AM and travels at 80 mph, when will
+    they meet if the distance between the stations is 300 miles?
 
-      To determine
-    example_title: Math Problem
-  - text: In the context of computer programming, an algorithm is
-    example_title: Algorithm Definition
+    To determine'
+  example_title: Math Problem
+- text: In the context of computer programming, an algorithm is
+  example_title: Algorithm Definition
 ---
 # Mixsmol-4x400M-v0.1 by Ontocord
 This is the first checkpoint (Epoch 1) of Mixsmol-4x400M-v0.1
tokenizer.json ADDED
The diff for this file is too large to render.
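
The README.md hunk above appears to change only how the widget strings are wrapped and quoted (folded `>-` block scalars on the old side, plain or single-quoted scalars on the new side), not the strings themselves. The sketch below illustrates this for the "Math Problem" entry. The `fold` helper is a hypothetical, simplified stand-in for YAML line folding (a single line break becomes a space, each blank line between text lines becomes one newline); it is not a YAML parser and ignores quoting, escapes, and indentation rules.

```python
def fold(lines):
    """Simplified YAML line folding (assumption: equally indented lines only).

    A single line break between text lines becomes a space; each blank
    line between text lines becomes one newline.
    """
    out = ""
    blanks = 0
    for raw in lines:
        line = raw.strip()
        if not line:
            blanks += 1
            continue
        if out:
            out += "\n" * blanks if blanks else " "
        out += line
        blanks = 0
    return out


# Line wrapping on the old side of the diff (folded block scalar, `>-`).
old_lines = [
    "Problem 2: If a train leaves Station A at 9:00 AM and travels at 60 mph,",
    "and another train leaves Station B at 10:00 AM and travels at 80 mph, when",
    "will they meet if the distance between the stations is 300 miles?",
    "",
    "To determine",
]

# Line wrapping on the new side (single-quoted scalar, quotes omitted here).
new_lines = [
    "Problem 2: If a train leaves Station A at 9:00 AM and travels at 60 mph,",
    "and another train leaves Station B at 10:00 AM and travels at 80 mph, when will",
    "they meet if the distance between the stations is 300 miles?",
    "",
    "To determine",
]

old_text = fold(old_lines)
new_text = fold(new_lines)
print(old_text == new_text)  # prints True
```

For a real check, parsing both versions of the front matter with a full YAML library and comparing the resulting `widget` lists would be more reliable than this hand-rolled folding.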