---
license: mit
language:
- en
tags:
- intent-classification
- tinybert
- cnn
- education
- nlp
pipeline_tag: text-classification
---

# IntentClassifier — TinyBERT-CNN

A lightweight intent classification model for educational AI tutoring systems. Built on **TinyBERT** (huawei-noah/TinyBERT_General_4L_312D) with a **CNN classification head**, optimized for real-time student intent detection.

## Model Description

This model classifies student utterances into 5 pedagogical intent categories:

| Label ID | Intent | Description |
|----------|--------|-------------|
| 0 | **On-Topic Question** | Questions related to the current learning material |
| 1 | **Off-Topic Question** | Questions unrelated to the current topic |
| 2 | **Emotional-State** | Expressions of frustration, confusion, excitement, etc. |
| 3 | **Pace-Related** | Requests to speed up, slow down, or adjust pacing |
| 4 | **Repeat/Clarification** | Requests to repeat or clarify previous explanations |
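
For programmatic use, the table above can be expressed as a plain mapping (label names exactly as listed; whether the shipped weights store their own `id2label` map is not confirmed here):

```python
# Label IDs as listed in the table above.
ID2INTENT = {
    0: "On-Topic Question",
    1: "Off-Topic Question",
    2: "Emotional-State",
    3: "Pace-Related",
    4: "Repeat/Clarification",
}

print(ID2INTENT[4])  # Repeat/Clarification
```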

## Architecture

- **Backbone**: TinyBERT (4-layer, 312-dim) — compact BERT variant
- **Head**: Multi-kernel CNN (filter sizes 2, 3, 4) + BatchNorm + FC hidden layer
- **Parameters**: ~14.5M total
- **Inference**: <50ms on CPU

```
TinyBERT → CNN (multi-kernel) → BatchNorm → MaxPool → FC(768→128) → FC(128→5)
```
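
A minimal sketch of what such a head might look like in PyTorch. This is illustrative only: the channel count (256 per kernel, so 3 × 256 = 768 concatenated features, matching the FC(768→128) step above) and all names are assumptions, not the repository's actual code in `TinyBert.py`:

```python
import torch
import torch.nn as nn


class CNNIntentHead(nn.Module):
    """Illustrative multi-kernel CNN head over TinyBERT token embeddings.

    Filter sizes (2, 3, 4) follow the architecture summary above;
    channel counts and layer names are assumptions.
    """

    def __init__(self, hidden_size=312, num_filters=256,
                 kernel_sizes=(2, 3, 4), fc_hidden=128, num_classes=5):
        super().__init__()
        self.convs = nn.ModuleList(
            [nn.Conv1d(hidden_size, num_filters, k) for k in kernel_sizes]
        )
        self.bns = nn.ModuleList(
            [nn.BatchNorm1d(num_filters) for _ in kernel_sizes]
        )
        self.fc1 = nn.Linear(num_filters * len(kernel_sizes), fc_hidden)
        self.fc2 = nn.Linear(fc_hidden, num_classes)

    def forward(self, token_embeddings):          # (batch, seq_len, hidden)
        x = token_embeddings.transpose(1, 2)      # (batch, hidden, seq_len)
        pooled = [
            torch.relu(bn(conv(x))).max(dim=2).values  # max-pool over time
            for conv, bn in zip(self.convs, self.bns)
        ]
        feats = torch.cat(pooled, dim=1)          # (batch, 768)
        return self.fc2(torch.relu(self.fc1(feats)))


head = CNNIntentHead()
logits = head(torch.randn(2, 32, 312))  # batch of 2, seq length 32
print(logits.shape)  # torch.Size([2, 5])
```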

## Performance

| Metric | Score |
|--------|-------|
| **Accuracy** | 99.6% |
| **F1 Score** | 99.6% |
| **Precision** | 99.6% |
| **Recall** | 99.6% |
| **Test Loss** | 0.069 |

### Per-Class Performance

| Intent | Precision | Recall | F1 | Support |
|--------|-----------|--------|----|---------|
| On-Topic Question | 0.993 | 0.997 | 0.995 | 300 |
| Off-Topic Question | 0.997 | 0.993 | 0.995 | 300 |
| Emotional-State | 0.993 | 1.000 | 0.997 | 300 |
| Pace-Related | 0.997 | 0.993 | 0.995 | 300 |
| Repeat/Clarification | 1.000 | 0.997 | 0.998 | 300 |

## Training Details

- **Epochs**: 7 (early stopping with patience=5)
- **Batch Size**: 16
- **Optimizer**: AdamW with discriminative fine-tuning (BERT LR: 2e-5, Head LR: 1e-3)
- **Scheduler**: Warmup + Cosine decay
- **Label Smoothing**: 0.1
- **Training Duration**: ~212 seconds
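
The discriminative fine-tuning setup above (a small learning rate for the pretrained backbone, a larger one for the randomly initialized head) can be sketched with AdamW parameter groups. The `backbone`/`head` modules here are stand-ins, not the project's actual attribute names:

```python
import torch
import torch.nn as nn

# Stand-in modules; in practice these would be the TinyBERT backbone
# and the CNN classification head.
backbone = nn.Linear(312, 312)
head = nn.Linear(312, 5)

# Two parameter groups with the learning rates listed in the card.
optimizer = torch.optim.AdamW([
    {"params": backbone.parameters(), "lr": 2e-5},  # pretrained BERT layers
    {"params": head.parameters(), "lr": 1e-3},      # fresh classifier head
])

# Label smoothing of 0.1, as listed above.
criterion = nn.CrossEntropyLoss(label_smoothing=0.1)

print([g["lr"] for g in optimizer.param_groups])  # [2e-05, 0.001]
```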

## Usage

```python
from TinyBert import IntentClassifier

# Initialize
classifier = IntentClassifier(num_classes=5)
classifier.load_model("prod_tinybert.pt")

# Predict
texts = ["Can you explain that again?"]
contexts = ["topic:Python Loops"]
predictions, probabilities = classifier.predict(texts, contexts)

intent_names = ['On-Topic Question', 'Off-Topic Question', 'Emotional-State', 'Pace-Related', 'Repeat/Clarification']
print(f"Predicted: {intent_names[predictions[0]]}")
print(f"Confidence: {probabilities[0][predictions[0]]:.4f}")
```

### Compound Sentence Splitting

The model includes a `CompoundSentenceSplitter` that can detect and split compound questions:

```python
from TinyBert import CompoundSentenceSplitter

splitter = CompoundSentenceSplitter()
questions = splitter.split_compound_question("What is a loop and how do I use it?")
# Returns: ["What is a loop?", "how do I use it?"]
```

## Files

| File | Description |
|------|-------------|
| `TinyBert.py` | Model architecture, IntentClassifier wrapper, CompoundSentenceSplitter |
| `train.py` | Full training pipeline with early stopping and metrics |
| `auto_trainer.py` | Automated retraining pipeline |
| `dataset_generator.py` | Synthetic data generation |
| `test_suite.py` | Comprehensive test suite |
| `prod_tinybert.pt` | Production model weights |
| `best_tinybert.pt` | Best checkpoint from latest training |
| `training_results.json` | Detailed training metrics and history |
| `data/` | Training, validation, and test datasets |

## Requirements

```
torch
transformers
pandas
numpy
scikit-learn
tqdm
```

## Citation

Part of the **AI-Powered Personalized Learning Platform** graduation project.