Tymec commited on
Commit
e3095cd
β€’
1 Parent(s): 0fce9f0

Add imdb50k model

Browse files
Files changed (2) hide show
  1. README.md +5 -10
  2. models/imdb50k_tfidf_ft20000.pkl +3 -0
README.md CHANGED
@@ -209,16 +209,11 @@ graph LR
209
 
210
  ### Pre-trained Models
211
  The following pre-trained models are available for use:
212
- | Dataset | Vectorizer | Features | Classifier | Accuracy | Model |
213
- | --- | --- | --- | --- | --- | --- |
214
- | `sentiment140` | `tfidf` | `LinearRegression` | 20 000 | ❌ | [Here](#) |
215
- | `imdb50k` | `tfidf` | `LinearRegression` | 20 000 | ❌ | [Here](#) |
216
- | `amazonreviews` | `tfidf` | `LinearRegression` | 20 000 | ❌ | [Here](#) |
217
-
218
- The accuracy of the models is based on the cross-validation score using the `test` dataset and `5` folds.
219
-
220
- #### Note
221
- Due to the size of the `amazonreviews` dataset, it was not possible to train a model with a vectorizer other than `hashing`.
222
 
223
 
224
  ## License
 
209
 
210
  ### Pre-trained Models
211
  The following pre-trained models are available for use:
212
+ | Dataset | Vectorizer | Classifier | Features | Accuracy on test | Accuracy on self | Model |
213
+ | --- | --- | --- | --- | --- | --- | --- |
214
+ | `imdb50k` | `tfidf` | `LinearRegression` | 20 000 | 59.59% Β± 0.73% | 74.08% Β± 0.14% | [Here](models/imdb50k_tfidf_ft20000.pkl) |
215
+ | `sentiment140` | `tfidf` | `LinearRegression` | 20 000 | ❌ | ❌ | [Here](models/sentiment140_tfidf_ft20000.pkl) |
216
+ | `amazonreviews` | `hashing` | `LinearRegression` | 1 048 576 | ❌ | ❌ | [Here](models/amazonreviews_hashing_ft1048576.pkl) |
 
 
 
 
 
217
 
218
 
219
  ## License
models/imdb50k_tfidf_ft20000.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f06582370d3fe31c878c0feec254d2a3fcd7f6f6140de10d1bd11e528853d66e
3
+ size 3288