Commit History

Update dependencies
53bc5fb

Tymec committed on

Add min-df option
8b10b79

Tymec committed on

Improved models
7f29122

Tymec committed on

Change df
cc21abf

Tymec committed on

Add new model trained on sentiment140
419453c

Tymec committed on

Cache label data along with tokenized text data
af84d9b

Tymec committed on

Add imdb50k model
e3095cd

Tymec committed on

Remove all pre-trained models
0fce9f0

Tymec committed on

Fix broken tokenization
447f97e

Tymec committed on

Add slang map
e1645d7

Tymec committed on

Update README architecture
d29d6fe

Tymec committed on

Add progress bar to serialize
632adc4

Tymec committed on

Add emoji dependency
ac221ce

Tymec committed on

More pre-trained models
421ea0c

Tymec committed on

Add more vectorizers, classifiers and CLI options
b0ade1a

Tymec committed on

Refactor typing and update tokenization rules
228859a

Tymec committed on

Update documentation
71069d7

Tymec committed on

Add document frequency threshold
c5ed75e

Tymec committed on

Ignore amazonreviews test
d09d1f6

Tymec committed on

Chunked serialization
afaacd1

Tymec committed on

Update options, force GC, tweak parameters and add flags
18cc46a

Tymec committed on

Ability to change number of parallel jobs for search
8471e78

Tymec committed on

Create model in train_model
3854a1f

Tymec committed on

New model trained on imdb50k dataset
23e75e7

Tymec committed on

Tokenization rework
2c1f9dd

Tymec committed on

Add dataset for testing
e50b20c

Tymec committed on

Add prototyping notebook
a5c3a23

Tymec committed on

Rename app command
db8f6b2

Tymec committed on

Slight optimizations
0ca5366

Tymec committed on

Remove prev entry point
63ffb6b

Tymec committed on

Change HF entry point and add examples
b42b884

Tymec committed on

Handle missing spacy model
7ce074d

Tymec committed on

Remove check file size action
308dcf9

Tymec committed on

fix typo
3178817

Tymec committed on

Import spacy model as module
7b9e59d

Tymec committed on

Remove unused dependencies
16e15df

Tymec committed on

Update HF config
bf1042d

Tymec committed on

Add github actions
edfb539

Tymec committed on

Entry point for HF space
9e32ffe

Tymec committed on

Add evaluate command
cdf1241

Tymec committed on

Use spacy instead of nltk and move data functions to separate module
a092d54

Tymec committed on

Remove unused constants
9a96b6b

Tymec committed on

First iteration of the model
8450b4f

Tymec committed on

Downgrade to python 3.11
5ae1418

Tymec committed on

Update README.md
1036898
unverified

Tymec committed on

Add cross validation
5a2db0a

Tymec committed on

Hyperparameter tuning in notebook done
88f3204

Tymec committed on

Add notebook for data exploration and hyperparameter tuning
9534cfa

Tymec committed on

Use stopwords from NLTK and download NLTK data
204391c

Tymec committed on

Fix merge
0993d5e

Tymec committed on