adrianeboyd committed on
Commit
0a47720
1 Parent(s): c08e92a

Add fi_udv25_finnishtdt_trf-0.0.1

.gitattributes CHANGED
@@ -25,3 +25,8 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
25
  *.zip filter=lfs diff=lfs merge=lfs -text
26
  *.zstandard filter=lfs diff=lfs merge=lfs -text
27
  *tfevents* filter=lfs diff=lfs merge=lfs -text
28
+ *.whl filter=lfs diff=lfs merge=lfs -text
29
+ *.npz filter=lfs diff=lfs merge=lfs -text
30
+ *strings.json filter=lfs diff=lfs merge=lfs -text
31
+ vectors filter=lfs diff=lfs merge=lfs -text
32
+ model filter=lfs diff=lfs merge=lfs -text
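The five patterns added above contain no slash, so under the documented gitattributes rules they are matched against a file's basename at any directory depth. The sketch below approximates that rule with Python's `fnmatchcase`; it is an illustration, not a reimplementation of git's matcher, and the example paths are hypothetical rather than taken from this repository.

```python
from fnmatch import fnmatchcase
from pathlib import PurePosixPath

# Patterns added to .gitattributes in this commit.
NEW_LFS_PATTERNS = ["*.whl", "*.npz", "*strings.json", "vectors", "model"]


def is_lfs_tracked(path: str, patterns=NEW_LFS_PATTERNS) -> bool:
    """Approximate check for slash-free gitattributes patterns.

    Patterns without a slash are compared against the basename only,
    so "model" matches "parser/model" but not "model.txt".
    """
    name = PurePosixPath(path).name
    return any(fnmatchcase(name, pattern) for pattern in patterns)


if __name__ == "__main__":
    # Hypothetical paths, chosen only to exercise each pattern.
    for candidate in ["pkg/example.whl", "vocab/strings.json",
                      "vocab/vectors", "parser/model", "config.cfg"]:
        print(candidate, is_lfs_tracked(candidate))
```

Running `git check-attr filter -- <path>` inside a checkout is the authoritative way to confirm how git itself resolves a given path.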
LICENSE.txt ADDED
@@ -0,0 +1,502 @@
1
+ This treebank is based on the Turku Dependency Treebank release
2
+ 2013-07-18 available from http://bionlp.utu.fi/fintreebank.html
3
+
4
+ All parts of the Turku Dependency Treebank are distributed under the
5
+ Creative Commons Attribution-ShareAlike 4.0 International license (see
6
+ below for the legal code of the license)
7
+
8
+ Acknowledgments (all names in alphabetic order):
9
+
10
+ * Linguistic annotation: Copyright © 2009-2013, Filip Ginter, Katri
11
+ Haverinen, Samuel Kohonen, Veronika Laippala, Timo Viljanen, Jenna
12
+ Kanerva (Nyblom), Stina Ojala, Anna Missilä - BioNLP group,
13
+ Department of IT, University of Turku
14
+
15
+ * Conversion to the Universal Dependencies scheme: Copyright © 2014,
16
+ Filip Ginter, Jenna Kanerva, Veronika Laippala, Anna Missilä, Stina
17
+ Ojala, Sampo Pyysalo - BioNLP group, Department of IT, University of Turku
18
+
19
+ * Morphological analysis utilizes the output of the OMorFi analyzer
20
+ http://omorfi.googlecode.com
21
+
22
+ * Text (file names from the official release of TDT and CoNLL-U sentence metadata)
23
+
24
+ o Files b101-b116 Copyright © 2010 http://ranneliike.net/blogi.php?nick=Aboa
25
+ o Files b201-b204 Copyright © 2010 http://moumou.indiedays.com/
26
+ o Files b301-b309 Copyright © 2010 http://verna.helmiblogit.mtv3.fi/
27
+ o Files b401-b403 Copyright © 2010 http://femme.helmiblogit.mtv3.fi/
28
+ o Files b501-b511 Copyright © 2011 http://unknownpixels.com/blogi.php
29
+ o Files b601-b607 Copyright © 2011 http://riilivingit.blogspot.fi/
30
+ o Files b701-b723 Copyright © 2011 http://www.mimminkeittio.com/
31
+ o Files b801-b804 Copyright © 2011 http://blogit.mtv3.fi/sopivassasuhteessa/
32
+ o Files e1001-e1080 EuroParl, http://www.statmt.org/europarl/
33
+ o Files f101-f102 © 2011 valkoinenruutu, http://valkoinenruutu.blogspot.com/
34
+ o Files f201-f213 © 2011 http://kaupunkimerenrannalla.blogspot.com/2011_07_01_archive.html
35
+ o Files f301-f303 © 2011 http://sanatlopussa.wordpress.com/2010/09/17/palasin-kaipasitko-minua/
36
+ o Files f401-f404 © 2011 http://neko-kirsikankukka.blogspot.com/2011/01/jatkis.html
37
+ o Files f501-f503 © 2011 http://sanojaperakkain.blogspot.com/2010/12/malmi-malm.html
38
+ o Files f601-f607 © 2011 http://keasin.blogspot.com/2011/09/ikava-uutinen.html
39
+ o Files f701-f703 © 2011 http://putki.livejournal.com/20795.html
40
+ o Files f801-f803 © 2011 http://urs.fi/pdf/URS_tarinoita_02.pdf
41
+ o File f901 © 2011 http://merten.kapsi.fi/paratiisi/?p=16
42
+ o Files fA01-fA04 © 2011 http://ruokamulta.deviantart.com/gallery/7173465#/d4dzjhx
43
+ o Files fB01-fB04 © 2011 http://lusikkainen.kapina.net/2005/05/22/eri-uni/
44
+ o Files fC01-fC05 © 2011 http://isopeikko.vuodatus.net/blog/3043570/lusikka/
45
+ o Files fD01-fD03 © 2011 http://www.katajala.net/jussi/nereidiensaleista/Nereidien_saleista_pohjoisille_rannoille.txt
46
+ o Files fE01-fE06 © 2011 http://elegioitasudenkorennoilta.blogspot.com/2011/04/leppakerttu.html
47
+ o Files fF01-fF04 © 2011 http://ajatusmatkaaja.blogspot.com/2011/10/aamupala-keski-suomessa.html
48
+ o Files h1001-h1080 FinnTreeBank, http://www.ling.helsinki.fi/kieliteknologia/tutkimus/treebank/
49
+ o Files j001-j030 JRC-ACQUIS, http://optima.jrc.it/Acquis/index_2.2.html
50
+ o Files s101-s102 Copyright © 2008 Markus Auvo, File-lehti (http://www.asteriski.fi/toiminta/file_arkisto.php)
51
+ o Files s103-s104 Copyright © 2008 Pekka Salmela, File-lehti (http://www.asteriski.fi/toiminta/file_arkisto.php)
52
+ o File s105 Copyright © 2008 Katarina Stubbe, File-lehti (http://www.asteriski.fi/toiminta/file_arkisto.php)
53
+ o File s106 Copyright © 2008 Juuso Haapanen, File-lehti (http://www.asteriski.fi/toiminta/file_arkisto.php)
54
+ o File s107 Copyright © 2008 Miikka Ryökäs, File-lehti (http://www.asteriski.fi/toiminta/file_arkisto.php)
55
+ o File s108-s109 Copyright © 2008 Antti Levo, File-lehti (http://www.asteriski.fi/toiminta/file_arkisto.php)
56
+ o File s110 Copyright © 2008 Jyrki Pulliainen, File-lehti (http://www.asteriski.fi/toiminta/file_arkisto.php)
57
+ o File s201 Copyright © 2010 Jani Moilanen, Indeksi-lehti (http://index.utu.fi/?id=arkisto)
58
+ o File s203 Copyright © 2010 Katriina Kerttula, Indeksi-lehti (http://index.utu.fi/?id=arkisto)
59
+ o File s202 Copyright © 2010 Niina Niemi, Indeksi-lehti (http://index.utu.fi/?id=arkisto)
60
+ o File s204 Copyright © 2010 Milla Ikonen, Indeksi-lehti (http://index.utu.fi/?id=arkisto)
61
+ o File s205 Copyright © 2010 Lauri Miikkulainen, Indeksi-lehti (http://index.utu.fi/?id=arkisto)
62
+ o File s206 Copyright © 2010 Teemu Alanen, Indeksi-lehti (http://index.utu.fi/?id=arkisto)
63
+ o File s301 Copyright © 2009 Juha Rajajärvi, Potenssi-lehti (http://www.delta.utu.fi/potenssi/arkisto/)
64
+ o File s302 Copyright © 2009 Simo Kauhala & Juha Rajajärvi, Potenssi-lehti (http://www.delta.utu.fi/potenssi/arkisto/)
65
+ o File s303 Copyright © 2009 Kennet Pynninen & Joonas Saario, Potenssi-lehti (http://www.delta.utu.fi/potenssi/
66
+ arkisto/)
67
+ o File s304 Copyright © 2009 Hanna Lehtinen, Potenssi-lehti (http://www.delta.utu.fi/potenssi/arkisto/)
68
+ o File s305 Copyright © 2009 Kennet Pynninen, Potenssi-lehti (http://www.delta.utu.fi/potenssi/arkisto/)
69
+ o File s306 Copyright © 2009 Jussi Harmanen, Potenssi-lehti (http://www.delta.utu.fi/potenssi/arkisto/)
70
+ o File s307 Copyright © 2009 Essi Vuorinen & Joonas Saario, Potenssi-lehti (http://www.delta.utu.fi/potenssi/arkisto/)
71
+ o Files t001-t050 © 2011 Talous Sanomat http://www.taloussanomat.fi/
72
+ o Files u001-u050 Copyright © 2010 University of Turku, http://www.utuonline.fi/
73
+ o Files w001-w200 Wikipedia, http://www.wikipedia.fi/
74
+ o Files wn001-wn100 Wikinews, http://fi.wikinews.org/
75
+
76
+ Legal code of the license:
77
+
78
+ Attribution-ShareAlike 4.0 International
79
+
80
+ =======================================================================
81
+
82
+ Creative Commons Corporation ("Creative Commons") is not a law firm and
83
+ does not provide legal services or legal advice. Distribution of
84
+ Creative Commons public licenses does not create a lawyer-client or
85
+ other relationship. Creative Commons makes its licenses and related
86
+ information available on an "as-is" basis. Creative Commons gives no
87
+ warranties regarding its licenses, any material licensed under their
88
+ terms and conditions, or any related information. Creative Commons
89
+ disclaims all liability for damages resulting from their use to the
90
+ fullest extent possible.
91
+
92
+ Using Creative Commons Public Licenses
93
+
94
+ Creative Commons public licenses provide a standard set of terms and
95
+ conditions that creators and other rights holders may use to share
96
+ original works of authorship and other material subject to copyright
97
+ and certain other rights specified in the public license below. The
98
+ following considerations are for informational purposes only, are not
99
+ exhaustive, and do not form part of our licenses.
100
+
101
+ Considerations for licensors: Our public licenses are
102
+ intended for use by those authorized to give the public
103
+ permission to use material in ways otherwise restricted by
104
+ copyright and certain other rights. Our licenses are
105
+ irrevocable. Licensors should read and understand the terms
106
+ and conditions of the license they choose before applying it.
107
+ Licensors should also secure all rights necessary before
108
+ applying our licenses so that the public can reuse the
109
+ material as expected. Licensors should clearly mark any
110
+ material not subject to the license. This includes other CC-
111
+ licensed material, or material used under an exception or
112
+ limitation to copyright. More considerations for licensors:
113
+ wiki.creativecommons.org/Considerations_for_licensors
114
+
115
+ Considerations for the public: By using one of our public
116
+ licenses, a licensor grants the public permission to use the
117
+ licensed material under specified terms and conditions. If
118
+ the licensor's permission is not necessary for any reason--for
119
+ example, because of any applicable exception or limitation to
120
+ copyright--then that use is not regulated by the license. Our
121
+ licenses grant only permissions under copyright and certain
122
+ other rights that a licensor has authority to grant. Use of
123
+ the licensed material may still be restricted for other
124
+ reasons, including because others have copyright or other
125
+ rights in the material. A licensor may make special requests,
126
+ such as asking that all changes be marked or described.
127
+ Although not required by our licenses, you are encouraged to
128
+ respect those requests where reasonable. More_considerations
129
+ for the public:
130
+ wiki.creativecommons.org/Considerations_for_licensees
131
+
132
+ =======================================================================
133
+
134
+ Creative Commons Attribution-ShareAlike 4.0 International Public
135
+ License
136
+
137
+ By exercising the Licensed Rights (defined below), You accept and agree
138
+ to be bound by the terms and conditions of this Creative Commons
139
+ Attribution-ShareAlike 4.0 International Public License ("Public
140
+ License"). To the extent this Public License may be interpreted as a
141
+ contract, You are granted the Licensed Rights in consideration of Your
142
+ acceptance of these terms and conditions, and the Licensor grants You
143
+ such rights in consideration of benefits the Licensor receives from
144
+ making the Licensed Material available under these terms and
145
+ conditions.
146
+
147
+
148
+ Section 1 -- Definitions.
149
+
150
+ a. Adapted Material means material subject to Copyright and Similar
151
+ Rights that is derived from or based upon the Licensed Material
152
+ and in which the Licensed Material is translated, altered,
153
+ arranged, transformed, or otherwise modified in a manner requiring
154
+ permission under the Copyright and Similar Rights held by the
155
+ Licensor. For purposes of this Public License, where the Licensed
156
+ Material is a musical work, performance, or sound recording,
157
+ Adapted Material is always produced where the Licensed Material is
158
+ synched in timed relation with a moving image.
159
+
160
+ b. Adapter's License means the license You apply to Your Copyright
161
+ and Similar Rights in Your contributions to Adapted Material in
162
+ accordance with the terms and conditions of this Public License.
163
+
164
+ c. BY-SA Compatible License means a license listed at
165
+ creativecommons.org/compatiblelicenses, approved by Creative
166
+ Commons as essentially the equivalent of this Public License.
167
+
168
+ d. Copyright and Similar Rights means copyright and/or similar rights
169
+ closely related to copyright including, without limitation,
170
+ performance, broadcast, sound recording, and Sui Generis Database
171
+ Rights, without regard to how the rights are labeled or
172
+ categorized. For purposes of this Public License, the rights
173
+ specified in Section 2(b)(1)-(2) are not Copyright and Similar
174
+ Rights.
175
+
176
+ e. Effective Technological Measures means those measures that, in the
177
+ absence of proper authority, may not be circumvented under laws
178
+ fulfilling obligations under Article 11 of the WIPO Copyright
179
+ Treaty adopted on December 20, 1996, and/or similar international
180
+ agreements.
181
+
182
+ f. Exceptions and Limitations means fair use, fair dealing, and/or
183
+ any other exception or limitation to Copyright and Similar Rights
184
+ that applies to Your use of the Licensed Material.
185
+
186
+ g. License Elements means the license attributes listed in the name
187
+ of a Creative Commons Public License. The License Elements of this
188
+ Public License are Attribution and ShareAlike.
189
+
190
+ h. Licensed Material means the artistic or literary work, database,
191
+ or other material to which the Licensor applied this Public
192
+ License.
193
+
194
+ i. Licensed Rights means the rights granted to You subject to the
195
+ terms and conditions of this Public License, which are limited to
196
+ all Copyright and Similar Rights that apply to Your use of the
197
+ Licensed Material and that the Licensor has authority to license.
198
+
199
+ j. Licensor means the individual(s) or entity(ies) granting rights
200
+ under this Public License.
201
+
202
+ k. Share means to provide material to the public by any means or
203
+ process that requires permission under the Licensed Rights, such
204
+ as reproduction, public display, public performance, distribution,
205
+ dissemination, communication, or importation, and to make material
206
+ available to the public including in ways that members of the
207
+ public may access the material from a place and at a time
208
+ individually chosen by them.
209
+
210
+ l. Sui Generis Database Rights means rights other than copyright
211
+ resulting from Directive 96/9/EC of the European Parliament and of
212
+ the Council of 11 March 1996 on the legal protection of databases,
213
+ as amended and/or succeeded, as well as other essentially
214
+ equivalent rights anywhere in the world.
215
+
216
+ m. You means the individual or entity exercising the Licensed Rights
217
+ under this Public License. Your has a corresponding meaning.
218
+
219
+
220
+ Section 2 -- Scope.
221
+
222
+ a. License grant.
223
+
224
+ 1. Subject to the terms and conditions of this Public License,
225
+ the Licensor hereby grants You a worldwide, royalty-free,
226
+ non-sublicensable, non-exclusive, irrevocable license to
227
+ exercise the Licensed Rights in the Licensed Material to:
228
+
229
+ a. reproduce and Share the Licensed Material, in whole or
230
+ in part; and
231
+
232
+ b. produce, reproduce, and Share Adapted Material.
233
+
234
+ 2. Exceptions and Limitations. For the avoidance of doubt, where
235
+ Exceptions and Limitations apply to Your use, this Public
236
+ License does not apply, and You do not need to comply with
237
+ its terms and conditions.
238
+
239
+ 3. Term. The term of this Public License is specified in Section
240
+ 6(a).
241
+
242
+ 4. Media and formats; technical modifications allowed. The
243
+ Licensor authorizes You to exercise the Licensed Rights in
244
+ all media and formats whether now known or hereafter created,
245
+ and to make technical modifications necessary to do so. The
246
+ Licensor waives and/or agrees not to assert any right or
247
+ authority to forbid You from making technical modifications
248
+ necessary to exercise the Licensed Rights, including
249
+ technical modifications necessary to circumvent Effective
250
+ Technological Measures. For purposes of this Public License,
251
+ simply making modifications authorized by this Section 2(a)
252
+ (4) never produces Adapted Material.
253
+
254
+ 5. Downstream recipients.
255
+
256
+ a. Offer from the Licensor -- Licensed Material. Every
257
+ recipient of the Licensed Material automatically
258
+ receives an offer from the Licensor to exercise the
259
+ Licensed Rights under the terms and conditions of this
260
+ Public License.
261
+
262
+ b. Additional offer from the Licensor -- Adapted Material.
263
+ Every recipient of Adapted Material from You
264
+ automatically receives an offer from the Licensor to
265
+ exercise the Licensed Rights in the Adapted Material
266
+ under the conditions of the Adapter's License You apply.
267
+
268
+ c. No downstream restrictions. You may not offer or impose
269
+ any additional or different terms or conditions on, or
270
+ apply any Effective Technological Measures to, the
271
+ Licensed Material if doing so restricts exercise of the
272
+ Licensed Rights by any recipient of the Licensed
273
+ Material.
274
+
275
+ 6. No endorsement. Nothing in this Public License constitutes or
276
+ may be construed as permission to assert or imply that You
277
+ are, or that Your use of the Licensed Material is, connected
278
+ with, or sponsored, endorsed, or granted official status by,
279
+ the Licensor or others designated to receive attribution as
280
+ provided in Section 3(a)(1)(A)(i).
281
+
282
+ b. Other rights.
283
+
284
+ 1. Moral rights, such as the right of integrity, are not
285
+ licensed under this Public License, nor are publicity,
286
+ privacy, and/or other similar personality rights; however, to
287
+ the extent possible, the Licensor waives and/or agrees not to
288
+ assert any such rights held by the Licensor to the limited
289
+ extent necessary to allow You to exercise the Licensed
290
+ Rights, but not otherwise.
291
+
292
+ 2. Patent and trademark rights are not licensed under this
293
+ Public License.
294
+
295
+ 3. To the extent possible, the Licensor waives any right to
296
+ collect royalties from You for the exercise of the Licensed
297
+ Rights, whether directly or through a collecting society
298
+ under any voluntary or waivable statutory or compulsory
299
+ licensing scheme. In all other cases the Licensor expressly
300
+ reserves any right to collect such royalties.
301
+
302
+
303
+ Section 3 -- License Conditions.
304
+
305
+ Your exercise of the Licensed Rights is expressly made subject to the
306
+ following conditions.
307
+
308
+ a. Attribution.
309
+
310
+ 1. If You Share the Licensed Material (including in modified
311
+ form), You must:
312
+
313
+ a. retain the following if it is supplied by the Licensor
314
+ with the Licensed Material:
315
+
316
+ i. identification of the creator(s) of the Licensed
317
+ Material and any others designated to receive
318
+ attribution, in any reasonable manner requested by
319
+ the Licensor (including by pseudonym if
320
+ designated);
321
+
322
+ ii. a copyright notice;
323
+
324
+ iii. a notice that refers to this Public License;
325
+
326
+ iv. a notice that refers to the disclaimer of
327
+ warranties;
328
+
329
+ v. a URI or hyperlink to the Licensed Material to the
330
+ extent reasonably practicable;
331
+
332
+ b. indicate if You modified the Licensed Material and
333
+ retain an indication of any previous modifications; and
334
+
335
+ c. indicate the Licensed Material is licensed under this
336
+ Public License, and include the text of, or the URI or
337
+ hyperlink to, this Public License.
338
+
339
+ 2. You may satisfy the conditions in Section 3(a)(1) in any
340
+ reasonable manner based on the medium, means, and context in
341
+ which You Share the Licensed Material. For example, it may be
342
+ reasonable to satisfy the conditions by providing a URI or
343
+ hyperlink to a resource that includes the required
344
+ information.
345
+
346
+ 3. If requested by the Licensor, You must remove any of the
347
+ information required by Section 3(a)(1)(A) to the extent
348
+ reasonably practicable.
349
+
350
+ b. ShareAlike.
351
+
352
+ In addition to the conditions in Section 3(a), if You Share
353
+ Adapted Material You produce, the following conditions also apply.
354
+
355
+ 1. The Adapter's License You apply must be a Creative Commons
356
+ license with the same License Elements, this version or
357
+ later, or a BY-SA Compatible License.
358
+
359
+ 2. You must include the text of, or the URI or hyperlink to, the
360
+ Adapter's License You apply. You may satisfy this condition
361
+ in any reasonable manner based on the medium, means, and
362
+ context in which You Share Adapted Material.
363
+
364
+ 3. You may not offer or impose any additional or different terms
365
+ or conditions on, or apply any Effective Technological
366
+ Measures to, Adapted Material that restrict exercise of the
367
+ rights granted under the Adapter's License You apply.
368
+
369
+
370
+ Section 4 -- Sui Generis Database Rights.
371
+
372
+ Where the Licensed Rights include Sui Generis Database Rights that
373
+ apply to Your use of the Licensed Material:
374
+
375
+ a. for the avoidance of doubt, Section 2(a)(1) grants You the right
376
+ to extract, reuse, reproduce, and Share all or a substantial
377
+ portion of the contents of the database;
378
+
379
+ b. if You include all or a substantial portion of the database
380
+ contents in a database in which You have Sui Generis Database
381
+ Rights, then the database in which You have Sui Generis Database
382
+ Rights (but not its individual contents) is Adapted Material,
383
+
384
+ including for purposes of Section 3(b); and
385
+ c. You must comply with the conditions in Section 3(a) if You Share
386
+ all or a substantial portion of the contents of the database.
387
+
388
+ For the avoidance of doubt, this Section 4 supplements and does not
389
+ replace Your obligations under this Public License where the Licensed
390
+ Rights include other Copyright and Similar Rights.
391
+
392
+
393
+ Section 5 -- Disclaimer of Warranties and Limitation of Liability.
394
+
395
+ a. UNLESS OTHERWISE SEPARATELY UNDERTAKEN BY THE LICENSOR, TO THE
396
+ EXTENT POSSIBLE, THE LICENSOR OFFERS THE LICENSED MATERIAL AS-IS
397
+ AND AS-AVAILABLE, AND MAKES NO REPRESENTATIONS OR WARRANTIES OF
398
+ ANY KIND CONCERNING THE LICENSED MATERIAL, WHETHER EXPRESS,
399
+ IMPLIED, STATUTORY, OR OTHER. THIS INCLUDES, WITHOUT LIMITATION,
400
+ WARRANTIES OF TITLE, MERCHANTABILITY, FITNESS FOR A PARTICULAR
401
+ PURPOSE, NON-INFRINGEMENT, ABSENCE OF LATENT OR OTHER DEFECTS,
402
+ ACCURACY, OR THE PRESENCE OR ABSENCE OF ERRORS, WHETHER OR NOT
403
+ KNOWN OR DISCOVERABLE. WHERE DISCLAIMERS OF WARRANTIES ARE NOT
404
+ ALLOWED IN FULL OR IN PART, THIS DISCLAIMER MAY NOT APPLY TO YOU.
405
+
406
+ b. TO THE EXTENT POSSIBLE, IN NO EVENT WILL THE LICENSOR BE LIABLE
407
+ TO YOU ON ANY LEGAL THEORY (INCLUDING, WITHOUT LIMITATION,
408
+ NEGLIGENCE) OR OTHERWISE FOR ANY DIRECT, SPECIAL, INDIRECT,
409
+ INCIDENTAL, CONSEQUENTIAL, PUNITIVE, EXEMPLARY, OR OTHER LOSSES,
410
+ COSTS, EXPENSES, OR DAMAGES ARISING OUT OF THIS PUBLIC LICENSE OR
411
+ USE OF THE LICENSED MATERIAL, EVEN IF THE LICENSOR HAS BEEN
412
+ ADVISED OF THE POSSIBILITY OF SUCH LOSSES, COSTS, EXPENSES, OR
413
+ DAMAGES. WHERE A LIMITATION OF LIABILITY IS NOT ALLOWED IN FULL OR
414
+ IN PART, THIS LIMITATION MAY NOT APPLY TO YOU.
415
+
416
+ c. The disclaimer of warranties and limitation of liability provided
417
+ above shall be interpreted in a manner that, to the extent
418
+ possible, most closely approximates an absolute disclaimer and
419
+ waiver of all liability.
420
+
421
+
422
+ Section 6 -- Term and Termination.
423
+
424
+ a. This Public License applies for the term of the Copyright and
425
+ Similar Rights licensed here. However, if You fail to comply with
426
+ this Public License, then Your rights under this Public License
427
+ terminate automatically.
428
+
429
+ b. Where Your right to use the Licensed Material has terminated under
430
+ Section 6(a), it reinstates:
431
+
432
+ 1. automatically as of the date the violation is cured, provided
433
+ it is cured within 30 days of Your discovery of the
434
+ violation; or
435
+
436
+ 2. upon express reinstatement by the Licensor.
437
+
438
+ For the avoidance of doubt, this Section 6(b) does not affect any
439
+ right the Licensor may have to seek remedies for Your violations
440
+ of this Public License.
441
+
442
+ c. For the avoidance of doubt, the Licensor may also offer the
443
+ Licensed Material under separate terms or conditions or stop
444
+ distributing the Licensed Material at any time; however, doing so
445
+ will not terminate this Public License.
446
+
447
+ d. Sections 1, 5, 6, 7, and 8 survive termination of this Public
448
+ License.
449
+
450
+
451
+ Section 7 -- Other Terms and Conditions.
452
+
453
+ a. The Licensor shall not be bound by any additional or different
454
+ terms or conditions communicated by You unless expressly agreed.
455
+
456
+ b. Any arrangements, understandings, or agreements regarding the
457
+ Licensed Material not stated herein are separate from and
458
+ independent of the terms and conditions of this Public License.
459
+
460
+
461
+ Section 8 -- Interpretation.
462
+
463
+ a. For the avoidance of doubt, this Public License does not, and
464
+ shall not be interpreted to, reduce, limit, restrict, or impose
465
+ conditions on any use of the Licensed Material that could lawfully
466
+ be made without permission under this Public License.
467
+
468
+ b. To the extent possible, if any provision of this Public License is
469
+ deemed unenforceable, it shall be automatically reformed to the
470
+ minimum extent necessary to make it enforceable. If the provision
471
+ cannot be reformed, it shall be severed from this Public License
472
+ without affecting the enforceability of the remaining terms and
473
+ conditions.
474
+
475
+ c. No term or condition of this Public License will be waived and no
476
+ failure to comply consented to unless expressly agreed to by the
477
+ Licensor.
478
+
479
+ d. Nothing in this Public License constitutes or may be interpreted
480
+ as a limitation upon, or waiver of, any privileges and immunities
481
+ that apply to the Licensor or You, including from the legal
482
+ processes of any jurisdiction or authority.
483
+
484
+
485
+ =======================================================================
486
+
487
+ Creative Commons is not a party to its public licenses.
488
+ Notwithstanding, Creative Commons may elect to apply one of its public
489
+ licenses to material it publishes and in those instances will be
490
+ considered the "Licensor." Except for the limited purpose of indicating
491
+ that material is shared under a Creative Commons public license or as
492
+ otherwise permitted by the Creative Commons policies published at
493
+ creativecommons.org/policies, Creative Commons does not authorize the
494
+ use of the trademark "Creative Commons" or any other trademark or logo
495
+ of Creative Commons without its prior written consent including,
496
+ without limitation, in connection with any unauthorized modifications
497
+ to any of its public licenses or any other arrangements,
498
+ understandings, or agreements concerning use of licensed material. For
499
+ the avoidance of doubt, this paragraph does not form part of the public
500
+ licenses.
501
+
502
+ Creative Commons may be contacted at creativecommons.org.
README.md ADDED
The diff for this file is too large to render.
config.cfg ADDED
@@ -0,0 +1,254 @@
1
+ [paths]
2
+ train = "corpus/UD_Finnish-TDT/train.spacy"
3
+ dev = "corpus/UD_Finnish-TDT/dev.spacy"
4
+ vectors = null
5
+ init_tok2vec = null
6
+ tokenizer_source = "training/UD_Finnish-TDT/tokenizer/model-best"
7
+ transformer_source = "training/UD_Finnish-TDT/transformer/model-best"
8
+
9
+ [system]
10
+ gpu_allocator = "pytorch"
11
+ seed = 0
12
+
13
+ [nlp]
14
+ lang = "fi"
15
+ pipeline = ["experimental_char_ner_tokenizer","transformer","senter","tagger","morphologizer","parser","experimental_edit_tree_lemmatizer"]
16
+ batch_size = 64
17
+ disabled = ["senter"]
18
+ before_creation = null
19
+ after_creation = null
20
+ after_pipeline_creation = null
21
+ tokenizer = {"@tokenizers":"spacy-experimental.char_pretokenizer.v1"}
22
+
23
+ [components]
24
+
25
+ [components.experimental_char_ner_tokenizer]
26
+ factory = "experimental_char_ner_tokenizer"
27
+ scorer = {"@scorers":"spacy-experimental.tokenizer_scorer.v1"}
28
+
29
+ [components.experimental_char_ner_tokenizer.model]
30
+ @architectures = "spacy.TransitionBasedParser.v2"
31
+ state_type = "ner"
32
+ extra_state_tokens = false
33
+ hidden_width = 64
34
+ maxout_pieces = 2
35
+ use_upper = true
36
+ nO = null
37
+
38
+ [components.experimental_char_ner_tokenizer.model.tok2vec]
39
+ @architectures = "spacy.Tok2Vec.v2"
40
+
41
+ [components.experimental_char_ner_tokenizer.model.tok2vec.embed]
42
+ @architectures = "spacy.MultiHashEmbed.v2"
43
+ width = 128
44
+ attrs = ["ORTH","LOWER","IS_DIGIT","IS_ALPHA","IS_SPACE","IS_PUNCT"]
45
+ rows = [1000,500,50,50,50,50]
46
+ include_static_vectors = false
47
+
48
+ [components.experimental_char_ner_tokenizer.model.tok2vec.encode]
49
+ @architectures = "spacy.MaxoutWindowEncoder.v2"
50
+ width = 128
51
+ depth = 4
52
+ window_size = 4
53
+ maxout_pieces = 2
54
+
55
+ [components.experimental_edit_tree_lemmatizer]
56
+ factory = "experimental_edit_tree_lemmatizer"
57
+ backoff = "orth"
58
+ min_tree_freq = 1
59
+ overwrite = false
60
+ scorer = {"@scorers":"spacy.lemmatizer_scorer.v1"}
61
+ top_k = 1
62
+
63
+ [components.experimental_edit_tree_lemmatizer.model]
64
+ @architectures = "spacy.Tagger.v1"
65
+ nO = null
66
+
67
+ [components.experimental_edit_tree_lemmatizer.model.tok2vec]
68
+ @architectures = "spacy-transformers.TransformerListener.v1"
69
+ grad_factor = 1.0
70
+ upstream = "transformer"
71
+ pooling = {"@layers":"reduce_mean.v1"}
72
+
73
+ [components.morphologizer]
74
+ factory = "morphologizer"
75
+ extend = false
76
+ overwrite = false
77
+ scorer = {"@scorers":"spacy.morphologizer_scorer.v1"}
78
+
79
+ [components.morphologizer.model]
80
+ @architectures = "spacy.Tagger.v1"
81
+ nO = null
82
+
83
+ [components.morphologizer.model.tok2vec]
84
+ @architectures = "spacy-transformers.TransformerListener.v1"
85
+ grad_factor = 1.0
86
+ upstream = "transformer"
87
+ pooling = {"@layers":"reduce_mean.v1"}
88
+
+ [components.parser]
+ factory = "parser"
+ learn_tokens = false
+ min_action_freq = 5
+ moves = null
+ scorer = {"@scorers":"spacy.parser_scorer.v1"}
+ update_with_oracle_cut_size = 100
+
+ [components.parser.model]
+ @architectures = "spacy.TransitionBasedParser.v2"
+ state_type = "parser"
+ extra_state_tokens = false
+ hidden_width = 64
+ maxout_pieces = 3
+ use_upper = false
+ nO = null
+
+ [components.parser.model.tok2vec]
+ @architectures = "spacy-transformers.TransformerListener.v1"
+ grad_factor = 1.0
+ upstream = "transformer"
+ pooling = {"@layers":"reduce_mean.v1"}
+
+ [components.senter]
+ factory = "senter"
+ overwrite = false
+ scorer = {"@scorers":"spacy.senter_scorer.v1"}
+
+ [components.senter.model]
+ @architectures = "spacy.Tagger.v1"
+ nO = null
+
+ [components.senter.model.tok2vec]
+ @architectures = "spacy-transformers.TransformerListener.v1"
+ grad_factor = 1.0
+ upstream = "transformer"
+ pooling = {"@layers":"reduce_mean.v1"}
+
+ [components.tagger]
+ factory = "tagger"
+ neg_prefix = "!!!"
+ overwrite = false
+ scorer = {"@scorers":"spacy.tagger_scorer.v1"}
+
+ [components.tagger.model]
+ @architectures = "spacy.Tagger.v1"
+ nO = null
+
+ [components.tagger.model.tok2vec]
+ @architectures = "spacy-transformers.TransformerListener.v1"
+ grad_factor = 1.0
+ upstream = "transformer"
+ pooling = {"@layers":"reduce_mean.v1"}
+
+ [components.transformer]
+ factory = "transformer"
+ max_batch_items = 4096
+ set_extra_annotations = {"@annotation_setters":"spacy-transformers.null_annotation_setter.v1"}
+
+ [components.transformer.model]
+ @architectures = "spacy-transformers.TransformerModel.v3"
+ name = "xlm-roberta-base"
+ mixed_precision = true
+
+ [components.transformer.model.get_spans]
+ @span_getters = "spacy-transformers.strided_spans.v1"
+ window = 128
+ stride = 96
+
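The `get_spans` block above sets `window = 128` and `stride = 96`, so consecutive windows share 32 tokens. The following is a minimal sketch of how such overlapping windows could be laid out over a document. The function name `strided_windows` is hypothetical and this is a simplified illustration, not the spacy-transformers implementation.

```python
from typing import List, Tuple


def strided_windows(n_tokens: int, window: int = 128, stride: int = 96) -> List[Tuple[int, int]]:
    """Return (start, end) token offsets of overlapping windows covering n_tokens.

    Hypothetical helper for illustration only; the defaults mirror the
    `window` and `stride` values in the config above.
    """
    if window <= 0 or stride <= 0:
        raise ValueError("window and stride must be positive")
    spans = []
    start = 0
    while start < n_tokens:
        # Clip the final window to the document length.
        end = min(start + window, n_tokens)
        spans.append((start, end))
        if end == n_tokens:
            break
        start += stride
    return spans


# A 300-token document is covered by three windows.
print(strided_windows(300))
```

With a stride smaller than the window, every token away from the document edges is seen in more than one window, which gives the transformer context on both sides of a window boundary.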
+ [components.transformer.model.grad_scaler_config]
+
+ [components.transformer.model.tokenizer_config]
+ use_fast = true
+
+ [components.transformer.model.transformer_config]
+
+ [corpora]
+
+ [corpora.dev]
+ @readers = "spacy.Corpus.v1"
+ path = ${paths.dev}
+ max_length = 0
+ gold_preproc = false
+ limit = 0
+ augmenter = null
+
+ [corpora.train]
+ @readers = "spacy.Corpus.v1"
+ path = ${paths.train}
+ max_length = 0
+ gold_preproc = false
+ limit = 0
+ augmenter = null
+
+ [training]
+ train_corpus = "corpora.train"
+ dev_corpus = "corpora.dev"
+ seed = ${system:seed}
+ gpu_allocator = ${system:gpu_allocator}
+ dropout = 0.1
+ accumulate_gradient = 3
+ patience = 5000
+ max_epochs = 0
+ max_steps = 20000
+ eval_frequency = 200
+ frozen_components = []
+ before_to_disk = null
+ annotating_components = []
+
+ [training.batcher]
+ @batchers = "spacy.batch_by_padded.v1"
+ discard_oversize = true
+ get_length = null
+ size = 2000
+ buffer = 256
+
+ [training.logger]
+ @loggers = "spacy.ConsoleLogger.v1"
+ progress_bar = false
+
+ [training.optimizer]
+ @optimizers = "Adam.v1"
+ beta1 = 0.9
+ beta2 = 0.999
+ L2_is_weight_decay = true
+ L2 = 0.01
+ grad_clip = 1.0
+ use_averages = true
+ eps = 0.00000001
+
+ [training.optimizer.learn_rate]
+ @schedules = "warmup_linear.v1"
+ warmup_steps = 250
+ total_steps = 20000
+ initial_rate = 0.00005
+
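The `learn_rate` block above names a `warmup_linear.v1` schedule with `warmup_steps = 250`, `total_steps = 20000` and `initial_rate = 0.00005`. The sketch below shows the general shape such a schedule is assumed to have: a linear ramp up to the peak rate, then a linear decay to zero. The function `rate_at` is a hypothetical name, and the exact formula used by the registered schedule may differ in detail.

```python
def rate_at(step: int,
            initial_rate: float = 0.00005,
            warmup_steps: int = 250,
            total_steps: int = 20000) -> float:
    """Learning rate at a given step under linear warmup then linear decay.

    Hypothetical helper; the defaults are copied from the config above.
    """
    if step < warmup_steps:
        # Ramp from 0 up to initial_rate over the warmup period.
        factor = step / max(1, warmup_steps)
    else:
        # Decay from initial_rate down to 0 at total_steps.
        factor = max(0.0, (total_steps - step) / max(1, total_steps - warmup_steps))
    return factor * initial_rate


for step in (0, 125, 250, 10125, 20000):
    print(step, rate_at(step))
```

Under these assumptions the rate peaks at step 250 and reaches zero at step 20000, which matches `max_steps = 20000` in the `[training]` block.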
+ [training.score_weights]
+ token_f = 0.0
+ token_p = null
+ token_r = null
+ token_acc = null
+ sents_f = 0.05
+ sents_p = 0.0
+ sents_r = 0.0
+ tag_acc = 0.11
+ pos_acc = 0.05
+ morph_acc = 0.05
+ morph_per_feat = null
+ dep_uas = 0.11
+ dep_las = 0.11
+ dep_las_per_type = null
+ lemma_acc = 0.52
+
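The `score_weights` block above assigns a weight to each evaluation metric; the non-null weights sum to 1.0, with `lemma_acc` carrying just over half of the total. The sketch below shows how a single model-selection score could be formed as a weighted sum of per-metric scores. The names `SCORE_WEIGHTS` and `weighted_score` are hypothetical, and treating `null` weights as "not counted" is an assumption about how the training loop combines them.

```python
from typing import Dict, Optional

# Weights copied from the [training.score_weights] block; None stands for null.
SCORE_WEIGHTS: Dict[str, Optional[float]] = {
    "token_f": 0.0, "token_p": None, "token_r": None, "token_acc": None,
    "sents_f": 0.05, "sents_p": 0.0, "sents_r": 0.0,
    "tag_acc": 0.11, "pos_acc": 0.05, "morph_acc": 0.05, "morph_per_feat": None,
    "dep_uas": 0.11, "dep_las": 0.11, "dep_las_per_type": None,
    "lemma_acc": 0.52,
}


def weighted_score(scores: Dict[str, float],
                   weights: Dict[str, Optional[float]] = SCORE_WEIGHTS) -> float:
    """Combine per-metric scores into one number using the given weights.

    Metrics with a None weight, or missing from `scores`, contribute nothing.
    """
    total = 0.0
    for name, weight in weights.items():
        if weight is None:
            continue
        total += weight * (scores.get(name) or 0.0)
    return total


# The non-null weights sum to 1.0, so a perfect model scores 1.0.
print(round(sum(w for w in SCORE_WEIGHTS.values() if w is not None), 10))
```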
+ [pretraining]
+
+ [initialize]
+ vectors = ${paths.vectors}
+ init_tok2vec = ${paths.init_tok2vec}
+ vocab_data = null
+ lookups = null
+ before_init = null
+ after_init = null
+
+ [initialize.components]
+
+ [initialize.tokenizer]
experimental_char_ner_tokenizer/cfg ADDED
@@ -0,0 +1,13 @@
+ {
+ "moves":null,
+ "update_with_oracle_cut_size":100,
+ "multitasks":[
+
+ ],
+ "min_action_freq":1,
+ "learn_tokens":false,
+ "beam_width":1,
+ "beam_density":0.0,
+ "beam_update_prob":0.0,
+ "incorrect_spans_key":null
+ }
experimental_char_ner_tokenizer/model ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:4730e75767ad4297e6f4ba5e164d4194872ca227fa51ed45bdc627f41748aee1
+ size 6922248
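The `model` files in this commit are stored as Git LFS pointers: three `key value` lines giving the spec version, the SHA-256 object id, and the size in bytes. A minimal sketch of reading such a pointer follows. The function name `parse_lfs_pointer` is hypothetical, and the parser only handles the three keys that appear in this diff.

```python
from typing import Dict, Union


def parse_lfs_pointer(text: str) -> Dict[str, Union[str, int]]:
    """Parse the `version`, `oid` and `size` lines of a Git LFS pointer file.

    Hypothetical helper for illustration; it assumes one `key value` pair per line.
    """
    fields: Dict[str, Union[str, int]] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid is written as "<hash-algorithm>:<hex digest>".
    algorithm, _, digest = str(fields["oid"]).partition(":")
    return {
        "version": fields["version"],
        "hash_algorithm": algorithm,
        "digest": digest,
        "size": int(fields["size"]),
    }


pointer = """\
version https://git-lfs.github.com/spec/v1
oid sha256:4730e75767ad4297e6f4ba5e164d4194872ca227fa51ed45bdc627f41748aee1
size 6922248
"""
info = parse_lfs_pointer(pointer)
print(info["hash_algorithm"], info["size"])
```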
experimental_char_ner_tokenizer/moves ADDED
@@ -0,0 +1 @@
+ moves {"0":{},"1":{"TOKEN":1055628},"2":{"TOKEN":1055628},"3":{"TOKEN":1055628},"4":{"TOKEN":1055628,"":1},"5":{"":1}} cfg neg_key
experimental_edit_tree_lemmatizer/cfg ADDED
The diff for this file is too large to render.
experimental_edit_tree_lemmatizer/model ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:260e3e2a5f0f84c7803dddc3190f41a2e70f36a2a580f61f2a9fbab058ae98ae
+ size 33169109
experimental_edit_tree_lemmatizer/trees ADDED
Binary file (1.55 MB).
fi_udv25_finnishtdt_trf-any-py3-none-any.whl ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:27b170f543973b82263f895903572aac170036281d1e671de4561f2d50e8465b
+ size 892718880
meta.json ADDED
The diff for this file is too large to render.
morphologizer/cfg ADDED
The diff for this file is too large to render.
morphologizer/model ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:b469e185658a544ccbdfaaf031ea90f8591397ff106b68eba0139d2cd1343ee2
+ size 6352541
parser/cfg ADDED
@@ -0,0 +1,13 @@
+ {
+ "moves":null,
+ "update_with_oracle_cut_size":100,
+ "multitasks":[
+
+ ],
+ "min_action_freq":5,
+ "learn_tokens":false,
+ "beam_width":1,
+ "beam_density":0.0,
+ "beam_update_prob":0.0,
+ "incorrect_spans_key":null
+ }
parser/model ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:792c0e5c39fe7f19154e2a05fdb61c9794889afce80483535cfab2800f775af4
+ size 872957
parser/moves ADDED
@@ -0,0 +1 @@
+ moves {"0":{"":85817},"1":{"":64585},"2":{"punct":10027,"nmod:poss":8942,"amod":8394,"nsubj":8152,"advmod":7864,"cc":6516,"obl":5966,"aux":4304,"cop":3297,"mark":3098,"nsubj:cop":2791,"det":2770,"acl":2575,"obj":2535,"nummod":1979,"nmod:gobj":1191,"compound:nn":964,"nmod":937,"advcl":804,"aux:pass":595,"nmod:gsubj":457,"case":334,"compound":150,"cc:preconj":126,"obj||xcomp":118,"nsubj||xcomp:ds":116,"discourse":115,"orphan":107,"xcomp":105,"vocative":72,"obj||xcomp:ds":62,"xcomp:ds":52,"obl||xcomp:ds":50,"obl||xcomp":49,"compound:prt":41,"nsubj||ccomp":19,"cop:own":12,"advmod||xcomp":12,"nmod||nsubj":11,"advmod||xcomp:ds":9,"nsubj:cop||xcomp:ds":8,"nmod||obj":8,"nsubj:cop||ccomp":7,"csubj:cop":7,"obj||ccomp":6,"nmod||nsubj:cop":6,"dep":0},"3":{"punct":13844,"obl":8896,"conj":8457,"obj":6336,"advmod":3261,"nmod":3151,"flat:name":2317,"case":2100,"advcl":1961,"xcomp":1798,"acl:relcl":1633,"ccomp":1572,"nsubj":1131,"appos":1091,"nsubj:cop":976,"xcomp:ds":829,"cop":765,"nummod":708,"flat":705,"parataxis":661,"fixed":420,"acl":332,"cop:own":263,"compound:prt":200,"aux":167,"csubj:cop":157,"orphan":140,"discourse":127,"flat:foreign":74,"amod":56,"csubj":44,"det":39,"acl:relcl||nmod:poss":28,"goeswith":27,"advcl||det":24,"advcl||amod":22,"vocative":17,"advcl||advmod":16,"nsubj||xcomp:ds":12,"compound:nn":12,"mark":10,"appos||obj":9,"ccomp||obj":8,"appos||nsubj":8,"acl:relcl||obj":8,"conj||obj":7,"nsubj:cop||xcomp:ds":6,"cop||xcomp":6,"cc:preconj":6,"acl:relcl||nsubj":6,"acl:relcl||nmod:gobj":6,"nmod||obj":5,"cop||xcomp:ds":5,"conj||nsubj:cop":5,"conj||nsubj":5,"cc":5,"acl:relcl||obl":5,"dep":0},"4":{"ROOT":12217}} cfg neg_key
senter/cfg ADDED
@@ -0,0 +1,3 @@
+ {
+ "overwrite":false
+ }
senter/model ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:adf06b61b4590b63bfb566e4acc0513d80a2851e9642001ed506b6dcbd212f86
+ size 6740
tagger/cfg ADDED
@@ -0,0 +1,22 @@
+ {
+ "labels":[
+ "A",
+ "Adj",
+ "Adp",
+ "Adv",
+ "Adv_V",
+ "C",
+ "C_V",
+ "Foreign",
+ "Interj",
+ "N",
+ "Num",
+ "Pron",
+ "Punct",
+ "Symb",
+ "V",
+ "V_Pron"
+ ],
+ "neg_prefix":"!!!",
+ "overwrite":false
+ }
tagger/model ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:dc6e659dba538a10a0130f39fce65b5ed439c743759b7607a1ed99f8efe8ca7b
+ size 49804
transformer/cfg ADDED
@@ -0,0 +1,3 @@
+ {
+ "max_batch_items":4096
+ }
transformer/model ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:1b95ad4d081edd522f7b316e3aa69ba873bf42f3cb818dc3b3d7a537caa660aa
+ size 1126406104
vocab/key2row ADDED
@@ -0,0 +1 @@
+
vocab/lookups.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:76be8b528d0075f7aae98d6fa57a6d3c83ae480a8469e668d7b0af968995ac71
+ size 1
vocab/strings.json ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:64651e68c2f3185e19c068a498a3bdb9d0bec2373f1d01eea8473bef53e87e1f
+ size 3061227
vocab/vectors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:14772b683e726436d5948ad3fff2b43d036ef2ebbe3458aafed6004e05a40706
+ size 128
vocab/vectors.cfg ADDED
@@ -0,0 +1,3 @@
+ {
+ "mode":"default"
+ }