Commit History

Add more vectorizers, classifiers and CLI options
b0ade1a

Tymec commited on

Add document frequency threshold
c5ed75e

Tymec commited on

Update options, force GC, tweak parameters and add flags
18cc46a

Tymec commited on

Ability to change number of parallel jobs for search
8471e78

Tymec commited on

Create model in train_model
3854a1f

Tymec commited on

Tokenization rework
2c1f9dd

Tymec commited on

Slight optimizations
0ca5366

Tymec commited on

Handle missing spacy model
7ce074d

Tymec commited on

Import spacy model as module
7b9e59d

Tymec commited on

Use spacy instead of nltk and move data functions to separate module
a092d54

Tymec commited on

Add cross validation
5a2db0a

Tymec commited on

Use stopwords from NLTK and download NLTK data
204391c

Tymec commited on

Completely change the structure of the project
85ac990

Tymec commited on

Restructure project into package structure
667fe9d

Tymec commited on