tabular-playground / README.md
rajistics's picture
pushing files to the repo from the example!
f909292
|
raw
history blame
56.4 kB
metadata
library_name: sklearn
tags:
  - sklearn
  - skops
  - tabular-classification
widget:
  structuredData:
    attribute_0:
      - material_7
      - material_7
      - material_7
    attribute_1:
      - material_8
      - material_6
      - material_8
    attribute_2:
      - 9
      - 6
      - 5
    attribute_3:
      - 5
      - 9
      - 8
    loading:
      - 119.49
      - 85.36
      - 73.71
    measurement_0:
      - 11
      - 10
      - 24
    measurement_1:
      - 2
      - 8
      - 7
    measurement_10:
      - 17.138
      - 15.632
      - 15.854
    measurement_11:
      - 19.954
      - 18.992
      - 20.405
    measurement_12:
      - 12.348
      - .nan
      - 13.638
    measurement_13:
      - 13.93
      - 15.148
      - .nan
    measurement_14:
      - 15.889
      - .nan
      - 15.854
    measurement_15:
      - 15.831
      - 15.849
      - 16.555
    measurement_16:
      - 16.102
      - 15.896
      - 17.145
    measurement_17:
      - 643.509
      - 722.585
      - 802.57
    measurement_2:
      - 3
      - 3
      - 7
    measurement_3:
      - 17.659
      - 19.679
      - 17.291
    measurement_4:
      - 11.578
      - 11.49
      - 11.691
    measurement_5:
      - 15.514
      - 18.267
      - 18.289
    measurement_6:
      - 15.99
      - 17.921
      - 17.396
    measurement_7:
      - 12.231
      - 11.978
      - 11.361
    measurement_8:
      - 19.92
      - 18.135
      - 19.67
    measurement_9:
      - 10.555
      - 11.113
      - 11.375
    product_code:
      - A
      - E
      - C

Model description

This is a DecisionTreeClassifier model built for Kaggle Tabular Playground Series August 2022, trained on supersoaker production failures dataset.

Intended uses & limitations

This model is not ready to be used in production.

Training Procedure

Hyperparameters

The model is trained with below hyperparameters.

Click to expand
Hyperparameter Value
memory
steps [('transformation', ColumnTransformer(transformers=[('loading_missing_value_imputer',
                             SimpleImputer(), ['loading']),
                            ('numerical_missing_value_imputer',
                             SimpleImputer(),
                             ['loading', 'measurement_3', 'measurement_4',
                              'measurement_5', 'measurement_6',
                              'measurement_7', 'measurement_8',
                              'measurement_9', 'measurement_10',
                              'measurement_11', 'measurement_12',
                              'measurement_13', 'measurement_14',
                              'measurement_15', 'measurement_16',
                              'measurement_17']),
                            ('attribute_0_encoder', OneHotEncoder(),
                             ['attribute_0']),
                            ('attribute_1_encoder', OneHotEncoder(),
                             ['attribute_1']),
                            ('product_code_encoder', OneHotEncoder(),
                             ['product_code'])])), ('model', DecisionTreeClassifier(max_depth=4))]                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |

| verbose | False | | transformation | ColumnTransformer(transformers=[('loading_missing_value_imputer', SimpleImputer(), ['loading']), ('numerical_missing_value_imputer', SimpleImputer(), ['loading', 'measurement_3', 'measurement_4', 'measurement_5', 'measurement_6', 'measurement_7', 'measurement_8', 'measurement_9', 'measurement_10', 'measurement_11', 'measurement_12', 'measurement_13', 'measurement_14', 'measurement_15', 'measurement_16', 'measurement_17']), ('attribute_0_encoder', OneHotEncoder(), ['attribute_0']), ('attribute_1_encoder', OneHotEncoder(), ['attribute_1']), ('product_code_encoder', OneHotEncoder(), ['product_code'])]) | | model | DecisionTreeClassifier(max_depth=4) | | transformation__n_jobs | | | transformation__remainder | drop | | transformation__sparse_threshold | 0.3 | | transformation__transformer_weights | | | transformation__transformers | [('loading_missing_value_imputer', SimpleImputer(), ['loading']), ('numerical_missing_value_imputer', SimpleImputer(), ['loading', 'measurement_3', 'measurement_4', 'measurement_5', 'measurement_6', 'measurement_7', 'measurement_8', 'measurement_9', 'measurement_10', 'measurement_11', 'measurement_12', 'measurement_13', 'measurement_14', 'measurement_15', 'measurement_16', 'measurement_17']), ('attribute_0_encoder', OneHotEncoder(), ['attribute_0']), ('attribute_1_encoder', OneHotEncoder(), ['attribute_1']), ('product_code_encoder', OneHotEncoder(), ['product_code'])] | | transformation__verbose | False | | transformation__verbose_feature_names_out | True | | transformation__loading_missing_value_imputer | SimpleImputer() | | transformation__numerical_missing_value_imputer | SimpleImputer() | | transformation__attribute_0_encoder | OneHotEncoder() | | transformation__attribute_1_encoder | OneHotEncoder() | | transformation__product_code_encoder | OneHotEncoder() | | transformation__loading_missing_value_imputer__add_indicator | False | | transformation__loading_missing_value_imputer__copy | True | | transformation__loading_missing_value_imputer__fill_value | | | transformation__loading_missing_value_imputer__missing_values | nan | | transformation__loading_missing_value_imputer__strategy | mean | | transformation__loading_missing_value_imputer__verbose | 0 | | transformation__numerical_missing_value_imputer__add_indicator | False | | transformation__numerical_missing_value_imputer__copy | True | | transformation__numerical_missing_value_imputer__fill_value | | | transformation__numerical_missing_value_imputer__missing_values | nan | | transformation__numerical_missing_value_imputer__strategy | mean | | transformation__numerical_missing_value_imputer__verbose | 0 | | transformation__attribute_0_encoder__categories | auto | | transformation__attribute_0_encoder__drop | | | transformation__attribute_0_encoder__dtype | <class 'numpy.float64'> | | transformation__attribute_0_encoder__handle_unknown | error | | transformation__attribute_0_encoder__sparse | True | | transformation__attribute_1_encoder__categories | auto | | transformation__attribute_1_encoder__drop | | | transformation__attribute_1_encoder__dtype | <class 'numpy.float64'> | | transformation__attribute_1_encoder__handle_unknown | error | | transformation__attribute_1_encoder__sparse | True | | transformation__product_code_encoder__categories | auto | | transformation__product_code_encoder__drop | | | transformation__product_code_encoder__dtype | <class 'numpy.float64'> | | transformation__product_code_encoder__handle_unknown | error | | transformation__product_code_encoder__sparse | True | | model__ccp_alpha | 0.0 | | model__class_weight | | | model__criterion | gini | | model__max_depth | 4 | | model__max_features | | | model__max_leaf_nodes | | | model__min_impurity_decrease | 0.0 | | model__min_samples_leaf | 1 | | model__min_samples_split | 2 | | model__min_weight_fraction_leaf | 0.0 | | model__random_state | | | model__splitter | best |

Model Plot

The model plot is below.

Pipeline(steps=[('transformation',ColumnTransformer(transformers=[('loading_missing_value_imputer',SimpleImputer(),['loading']),('numerical_missing_value_imputer',SimpleImputer(),['loading', 'measurement_3','measurement_4','measurement_5','measurement_6','measurement_7','measurement_8','measurement_9','measurement_10','measurement_11','measurement_12','measurement_13','measurement_14','measurement_15','measurement_16','measurement_17']),('attribute_0_encoder',OneHotEncoder(),['attribute_0']),('attribute_1_encoder',OneHotEncoder(),['attribute_1']),('product_code_encoder',OneHotEncoder(),['product_code'])])),('model', DecisionTreeClassifier(max_depth=4))])
Please rerun this cell to show the HTML repr or trust the notebook.

Evaluation Results

You can find the details about evaluation process and the evaluation results.

Metric Value
accuracy 0.786392
f1 score 0.786392

How to Get Started with the Model

Use the code below to get started with the model.

Click to expand
import pickle 
with open(decision-tree-playground-kaggle/model.pkl, 'rb') as file: 
    clf = pickle.load(file)

Model Card Authors

This model card is written by following authors:

huggingface

Model Card Contact

You can contact the model card authors through following channels: [More Information Needed]

Citation

Below you can find information related to citation.

BibTeX:

[More Information Needed]

Tree Plot Tree Plot

Confusion Matrix Confusion Matrix