sm-subgroup-classifier / fi_ID-NA /training_details.txt
erikhenriksson's picture
Upload folder using huggingface_hub
f71be9a verified
raw
history blame contribute delete
882 Bytes
Training Details for fi_ID-NA
========================================
Language: fi
Register: ID-NA
Training Date: 2025-09-26 14:06:57
Data Summary:
- Total samples: 1729
- Training samples: 1383
- Test samples: 346
- Embedding dimension: 1024
Classes:
- Number of classes: 2
- Class names: '', 'comments'
- Class distribution: {'': 389, 'comments': 1340}
Cross-Validation Results:
- CV folds: 5
- CV scores: [0.9855595667870036, 0.9927797833935018, 0.9927797833935018, 0.9927536231884058, 0.9818840579710145]
- CV mean: 0.9892
- CV std: 0.0046
- CV confidence interval: 0.9892 ± 0.0092
Final Performance:
- Test accuracy: 0.9913
Model Configuration:
- Algorithm: Logistic Regression
- Regularization (C): 1.0
- Feature scaling: StandardScaler
- Random state: 42
Files:
- Classifier: model.pkl
- Scaler: scaler.pkl
- Metadata: metadata.pkl
- This file: training_details.txt