Spaces:

shegga
/

SentimentAnalysisForNMTTNT

Runtime error

shegga commited on Oct 18

Commit

0210351

0 Parent(s):

Implement Vietnamese Sentiment Analysis: Fine-tuning, Gradio Interface, and Model Testing

- Added `fine_tune_sentiment.py` for fine-tuning a sentiment analysis model on Vietnamese text.
- Created `gradio_app.py` to provide an interactive web interface for real-time sentiment analysis.
- Developed `test_model.py` for evaluating the fine-tuned model with custom texts, batch processing, and comparison with the original model.
- Included memory management features in the Gradio app to optimize performance.
- Implemented detailed logging and error handling throughout the codebase.
- Added visualization capabilities for training history and confusion matrix.

Files changed (14) hide show

.gitattributes +28 -0
.gitignore +141 -0
.space.yaml +19 -0
README.md +425 -0
app.py +478 -0
deploy_package/.gitignore +29 -0
deploy_package/.space.yaml +19 -0
deploy_package/README.md +170 -0
deploy_package/app.py +478 -0
py/__init__.py +11 -0
py/demo.py +204 -0
py/fine_tune_sentiment.py +410 -0
py/gradio_app.py +631 -0
py/test_model.py +277 -0

.gitattributes ADDED Viewed

	@@ -0,0 +1,28 @@

+# Auto detect text files and perform LF normalization
+* text=auto eol=lf
+# Explicitly declare text files
+*.py text diff=python
+*.md text
+*.txt text
+*.json text
+*.yml text
+*.yaml text
+*.toml text
+*.ini text
+*.cfg text
+# Declare binary files
+*.png binary
+*.jpg binary
+*.jpeg binary
+*.gif binary
+*.pdf binary
+*.safetensors binary
+*.bin binary
+*.pth binary
+# Large files should use Git LFS if needed
+*.safetensors filter=lfs diff=lfs merge=lfs -text
+*.bin filter=lfs diff=lfs merge=lfs -text
+*.pth filter=lfs diff=lfs merge=lfs -text

.gitignore ADDED Viewed

	@@ -0,0 +1,141 @@

+# Python
+__pycache__/
+*.py[cod]
+*$py.class
+*.so
+.Python
+build/
+develop-eggs/
+dist/
+downloads/
+eggs/
+.eggs/
+lib/
+lib64/
+parts/
+sdist/
+var/
+wheels/
+*.egg-info/
+.installed.cfg
+*.egg
+# Virtual Environment
+venv/
+env/
+ENV/
+# IDE
+.vscode/
+.idea/
+*.swp
+*.swo
+# OS
+.DS_Store
+Thumbs.db
+# Model artifacts (large files - exclude from deployment)
+*.safetensors
+*.bin
+*.pth
+*.h5
+*.pb
+*.onnx
+*.tflite
+# Trained models (exclude for deployment)
+vietnamese_sentiment_finetuned/
+model/
+models/
+checkpoints/
+*.ckpt
+# Generated visualizations (exclude for deployment)
+*.png
+*.jpg
+*.jpeg
+*.pdf
+*.svg
+training_history.png
+confusion_matrix.png
+# Logs and temporary files
+*.log
+*.tmp
+*.temp
+*.out
+# Cache directories
+.cache/
+.pytest_cache/
+__pycache__/
+*.py[cod]
+*$py.class
+.ipynb_checkpoints/
+# Gradio cache
+gradio_cached_examples/
+*.gradio/
+# Hugging Face cache
+~/.cache/huggingface/
+*.cache
+# Dataset files (exclude for deployment)
+*.csv
+*.json
+*.tsv
+*.txt
+data/
+datasets/
+# Virtual environments and build files
+venv/
+env/
+ENV/
+build/
+dist/
+downloads/
+eggs/
+.eggs/
+lib/
+lib64/
+parts/
+sdist/
+var/
+wheels/
+*.egg-info/
+.installed.cfg
+*.egg
+*.so
+.Python
+# Development and configuration files
+.vscode/
+.idea/
+*.swp
+*.swo
+.DS_Store
+Thumbs.db
+# Claude and development tools
+.claude/
+.serena/
+*.session
+# Documentation (exclude for deployment, keep source)
+docs/
+doc/
+*.md
+!README.md
+# PDF files (exclude for deployment)
+*.pdf
+pdf/
+# Node modules and web dependencies (if any)
+node_modules/
+npm-debug.log*
+yarn-debug.log*
+yarn-error.log*

.space.yaml ADDED Viewed

	@@ -0,0 +1,19 @@

+title: Vietnamese Sentiment Analysis
+emoji: 🎭
+colorFrom: green
+colorTo: blue
+sdk: gradio
+sdk_version: 4.44.0
+app_file: app.py
+pinned: false
+license: mit
+models:
+  - 5CD-AI/Vietnamese-Sentiment-visobert
+tags:
+  - vietnamese
+  - sentiment-analysis
+  - nlp
+  - text-classification
+  - transformers
+  - pytorch
+short_description: Vietnamese sentiment analysis using transformer models with memory optimization

README.md ADDED Viewed

	@@ -0,0 +1,425 @@

+# 🎭 Vietnamese Sentiment Analysis
+A comprehensive Vietnamese sentiment analysis system built with transformer models, featuring training, testing, demo, and web interface capabilities with advanced memory management.
+## 🚀 Features
+- **🤖 Transformer-based Model**: Fine-tuned Vietnamese sentiment analysis using Visobert
+- **🌐 Interactive Web Interface**: Real-time sentiment analysis via Gradio with memory optimization
+- **📊 Comprehensive Testing**: Model evaluation with confusion matrix and classification metrics
+- **⚡ Memory Efficient**: Built-in memory management, batch processing limits, and quantization support
+- **🎯 Easy to Use**: Simple command-line interface and web UI
+- **📈 Performance Monitoring**: Real-time memory usage tracking and optimization
+## 📁 Project Structure
+```
+SentimentAnalysis/
+├── README.md                          # 📚 This file
+├── requirements.txt                   # 📦 Python dependencies
+├── .gitignore                         # 🚫 Git ignore rules
+│
+├── py/                                # 🐍 Core Python modules
+│   ├── __init__.py                   # Package initialization
+│   ├── fine_tune_sentiment.py        # 🔧 Core fine-tuning utilities
+│   ├── test_model.py                 # 🧪 Model testing and evaluation
+│   ├── demo.py                      # 💻 Demo functionality
+│   └── gradio_app.py                # 🌐 Web interface (memory-optimized)
+│
+├── main.py                            # 🚀 Main entry point (all commands)
+├── train.py                           # 🏋️ Training script
+├── test.py                            # 🧪 Testing script
+├── demo.py                            # 💻 Interactive demo
+└── web.py                             # 🌐 Web interface launcher
+│
+├── vietnamese_sentiment_finetuned/   # 🤖 Trained model (auto-generated)
+├── confusion_matrix.png             # 📊 Evaluation visualization (auto-generated)
+├── training_history.png             # 📈 Training progress (auto-generated)
+├── pdf/                             # 📄 Documentation folder
+├── venv/                            # 🐍 Virtual environment
+├── .git/                            # 📝 Git repository
+└── .claude/                         # 🤖 Claude configuration
+```
+## 🛠️ Installation
+1. **Clone and Setup Environment**
+```bash
+cd SentimentAnalysis
+python -m venv venv
+source venv/bin/activate  # On Windows: venv\Scripts\activate
+```
+2. **Install Dependencies**
+```bash
+pip install -r requirements.txt
+```
+## 🎯 Usage
+### Quick Start Options
+#### **Option 1: Use Individual Scripts**
+```bash
+# Train the model
+python train.py
+# Test the model
+python test.py
+# Run interactive demo
+python demo.py
+# Launch web interface
+python web.py
+```
+#### **Option 2: Use Main Entry Point**
+```bash
+# Train with custom settings
+python main.py train --batch-size 32 --epochs 5
+# Test the model
+python main.py test --model-path ./vietnamese_sentiment_finetuned
+# Run interactive demo
+python main.py demo
+# Launch web interface with memory options
+python main.py web --quantize --max-batch-size 20 --port 8080
+```
+### 1. Training the Model
+```bash
+# Basic training
+python train.py
+# Custom batch size and epochs
+python train.py 32 5
+# Using main script
+python main.py train --batch-size 32 --epochs 5 --learning-rate 1e-5
+```
+### 2. Testing the Model
+```bash
+# Basic testing
+python test.py
+# Test with custom model path
+python test.py /path/to/custom/model
+# Using main script
+python main.py test --model-path ./vietnamese_sentiment_finetuned
+```
+### 3. Interactive Demo
+```bash
+# Run demo
+python demo.py
+# Using main script
+python main.py demo
+```
+### 4. Web Interface
+```bash
+# Standard usage (memory-efficient defaults)
+python web.py
+# High memory efficiency (quantization + small batches)
+python web.py --quantize --max-batch-size 5 --max-memory 2048
+# Large batch processing
+python web.py --max-batch-size 20 --max-memory 8192
+# Custom server configuration
+python web.py --port 8080 --host 0.0.0.0 --quantize
+# Using main script
+python main.py web --quantize --max-batch-size 20 --port 8080
+```
+## 🌐 Web Interface Features
+The Gradio web interface provides:
+### 📝 Single Text Analysis
+- Real-time sentiment prediction
+- Confidence scores with visual charts
+- Memory usage monitoring
+- Example texts for quick testing
+### 📊 Batch Analysis
+- Process multiple texts at once
+- Memory-efficient batch processing
+- Automatic batch size limits
+- Batch summary with sentiment distribution
+### 🛡️ Memory Management
+- **Automatic Cleanup**: Memory cleaned after each prediction
+- **Batch Limits**: Configurable maximum texts per batch
+- **Memory Monitoring**: Real-time memory usage tracking
+- **GPU Optimization**: CUDA cache clearing when available
+- **Quantization**: Optional model quantization for CPU (~4x memory reduction)
+### ℹ️ Model Information
+- Detailed model specifications
+- Performance metrics
+- Memory management settings
+- Usage tips and troubleshooting
+## 🔧 Command Line Options
+### Individual Scripts
+#### `train.py`
+```bash
+python train.py [batch_size] [epochs]
+```
+#### `test.py`
+```bash
+python test.py [model_path]
+```
+#### `demo.py`
+```bash
+python demo.py
+```
+#### `web.py`
+```bash
+python web.py [--max-batch-size SIZE] [--quantize] [--max-memory MB] [--port PORT] [--host HOST]
+```
+### Main Entry Point (`main.py`)
+#### Training Command
+```bash
+python main.py train [--batch-size SIZE] [--epochs NUM] [--learning-rate RATE]
+```
+#### Testing Command
+```bash
+python main.py test [--model-path PATH]
+```
+#### Demo Command
+```bash
+python main.py demo
+```
+#### Web Interface Command
+```bash
+python main.py web [--max-batch-size SIZE] [--quantize] [--max-memory MB] [--port PORT] [--host HOST]
+```
+**Memory Management Options:**
+- `--max-batch-size`: Maximum batch size for memory efficiency (default: 10)
+- `--quantize`: Enable model quantization for memory efficiency (CPU only)
+- `--max-memory`: Maximum memory usage in MB (default: 4096)
+- `--port`: Port to run the interface on (default: 7862)
+- `--host`: Host to bind the interface to (default: 127.0.0.1)
+## 📊 Model Details
+- **Base Model**: 5CD-AI/Vietnamese-Sentiment-visobert
+- **Dataset**: uitnlp/vietnamese_students_feedback
+- **Labels**: Negative, Neutral, Positive
+- **Language**: Vietnamese
+- **Architecture**: Transformer-based sequence classification
+- **Max Sequence Length**: 512 tokens
+## 📈 Performance Metrics
+- **Accuracy**: 85-90% (on validation set)
+- **Processing Speed**: ~100ms per text
+- **Memory Usage**: Configurable (default 4GB limit)
+- **Batch Processing**: Up to 20 texts (configurable)
+## 🛡️ Memory Management
+The system includes comprehensive memory management:
+### Automatic Features
+- Memory cleanup after each prediction
+- GPU cache clearing for CUDA
+- Garbage collection management
+- Memory monitoring before/after operations
+### User Controls
+- Configurable batch size limits
+- Memory limit enforcement
+- Manual memory cleanup button
+- Real-time memory usage display
+### Optimization Options
+- Dynamic quantization (CPU only)
+- Batch processing optimization
+- Memory-efficient inference
+## 🔍 Troubleshooting
+### Memory Issues
+- Enable quantization: `python gradio_app.py --quantize`
+- Reduce batch size: `python gradio_app.py --max-batch-size 5`
+- Lower memory limit: `python gradio_app.py --max-memory 2048`
+- Use manual cleanup: Click "Memory Cleanup" button in web interface
+### Model Loading Issues
+- Ensure model is trained: `python run_training.py`
+- Check model directory: `ls -la vietnamese_sentiment_finetuned/`
+- Verify dependencies: `pip install -r requirements.txt`
+### Performance Optimization
+- Use GPU if available (CUDA)
+- Enable quantization for CPU inference
+- Monitor memory usage in web interface
+- Adjust batch size based on available memory
+## 📄 Requirements
+See `requirements.txt` for complete dependency list:
+```
+torch>=2.0.0
+transformers>=4.21.0
+datasets>=2.0.0
+gradio>=4.0.0
+pandas>=1.5.0
+numpy>=1.21.0
+scikit-learn>=1.1.0
+matplotlib>=3.5.0
+seaborn>=0.11.0
+psutil>=5.9.0
+```
+## 🎯 Example Usage
+### Command Line Demo
+```python
+from py.demo import SentimentDemo
+demo = SentimentDemo()
+demo.load_model()
+demo.interactive_demo()
+```
+### Web Interface
+1. Train model: `python train.py`
+2. Launch interface: `python web.py`
+3. Open browser to `http://127.0.0.1:7862`
+4. Enter Vietnamese text for analysis
+### Batch Processing
+```python
+from py.gradio_app import SentimentGradioApp
+app = SentimentGradioApp(max_batch_size=20)
+app.load_model()
+texts = ["Tuyệt vời!", "Bình thường", "Rất tệ"]
+results, summary = app.batch_predict(texts)
+```
+### Model Testing
+```python
+from py.test_model import SentimentTester
+tester = SentimentTester(model_path="./vietnamese_sentiment_finetuned")
+tester.load_model()
+sentiment, confidence = tester.predict_sentiment("Giảng viên dạy rất hay!")
+```
+### Fine-Tuning
+```python
+from py.fine_tune_sentiment import SentimentFineTuner
+fine_tuner = SentimentFineTuner(
+    model_name="5CD-AI/Vietnamese-Sentiment-visobert",
+    dataset_name="uitnlp/vietnamese_students_feedback"
+)
+train_result, eval_results = fine_tuner.run_fine_tuning(
+    output_dir="./my_model",
+    learning_rate=2e-5,
+    batch_size=16,
+    num_epochs=3
+)
+```
+## 📝 Model Loading Examples
+### Loading the Fine-Tuned Model
+```python
+from transformers import AutoTokenizer, AutoModelForSequenceClassification
+tokenizer = AutoTokenizer.from_pretrained("./vietnamese_sentiment_finetuned")
+model = AutoModelForSequenceClassification.from_pretrained("./vietnamese_sentiment_finetuned")
+```
+### Making Predictions
+```python
+import torch
+def predict_sentiment(text):
+    inputs = tokenizer(text, return_tensors="pt", truncation=True, padding=True)
+    with torch.no_grad():
+        outputs = model(**inputs)
+        predictions = torch.softmax(outputs.logits, dim=-1)
+        predicted_class = torch.argmax(predictions, dim=-1).item()
+    sentiment_labels = ["Negative", "Neutral", "Positive"]
+    return sentiment_labels[predicted_class], predictions[0][predicted_class].item()
+# Example
+text = "Giảng viên dạy rất hay và tâm huyết."
+sentiment, confidence = predict_sentiment(text)
+print(f"Sentiment: {sentiment}, Confidence: {confidence:.3f}")
+```
+## 📊 Dataset Information
+The UIT-VSFC corpus contains over 16,000 Vietnamese student feedback sentences with:
+- **Sentiment Classification**: Positive, Neutral, Negative
+- **Topic Classification**: Various educational topics
+- **Inter-annotator agreement**: >91% for sentiment, >71% for topics
+- **Original F1-score**: ~88% for sentiment (Maximum Entropy baseline)
+## 🔧 Hardware Requirements
+- **Minimum**: 8GB RAM, CPU
+- **Recommended**: GPU with 8GB+ VRAM for faster training
+- **Storage**: ~2GB for model and datasets
+## 📝 License
+This project uses open-source components for educational and research purposes. Please check individual licenses for:
+- 5CD-AI/Vietnamese-Sentiment-visobert
+- uitnlp/vietnamese_students_feedback
+## 🤝 Contributing
+Feel free to submit issues and enhancement requests!
+## 📄 Citation
+If you use this work or the dataset, please cite:
+```bibtex
+@InProceedings{8573337,
+  author={Nguyen, Kiet Van and Nguyen, Vu Duc and Nguyen, Phu X. V. and Truong, Tham T. H. and Nguyen, Ngan Luu-Thuy},
+  booktitle={2018 10th International Conference on Knowledge and Systems Engineering (KSE)},
+  title={UIT-VSFC: Vietnamese Students' Feedback Corpus for Sentiment Analysis},
+  year={2018},
+  volume={},
+  number={},
+  pages={19-24},
+  doi={10.1109/KSE.2018.8573337}
+}
+```
+---
+**Quick Start**: `python train.py && python web.py`
+**Alternative**: `python main.py train && python main.py web`

app.py ADDED Viewed

	@@ -0,0 +1,478 @@

+#!/usr/bin/env python3
+"""
+Vietnamese Sentiment Analysis - Hugging Face Spaces Gradio App
+"""
+import gradio as gr
+import torch
+from transformers import AutoTokenizer, AutoModelForSequenceClassification
+import time
+import numpy as np
+from datetime import datetime
+import gc
+import psutil
+import os
+import pandas as pd
+class SentimentGradioApp:
+    def __init__(self, model_name="5CD-AI/Vietnamese-Sentiment-visobert", max_batch_size=10):
+        self.model_name = model_name
+        self.tokenizer = None
+        self.model = None
+        self.device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
+        self.sentiment_labels = ["Negative", "Neutral", "Positive"]
+        self.sentiment_colors = {
+            "Negative": "#ff4444",
+            "Neutral": "#ffaa00",
+            "Positive": "#44ff44"
+        }
+        self.model_loaded = False
+        self.max_batch_size = max_batch_size
+        self.max_memory_mb = 8192  # Hugging Face Spaces memory limit
+    def get_memory_usage(self):
+        """Get current memory usage in MB"""
+        process = psutil.Process(os.getpid())
+        return process.memory_info().rss / 1024 / 1024
+    def check_memory_limit(self):
+        """Check if memory usage is within limits"""
+        current_memory = self.get_memory_usage()
+        if current_memory > self.max_memory_mb:
+            return False, f"Memory usage ({current_memory:.1f}MB) exceeds limit ({self.max_memory_mb}MB)"
+        return True, f"Memory usage: {current_memory:.1f}MB"
+    def cleanup_memory(self):
+        """Clean up GPU and CPU memory"""
+        if torch.cuda.is_available():
+            torch.cuda.empty_cache()
+        gc.collect()
+    def load_model(self):
+        """Load the model from Hugging Face Hub"""
+        if self.model_loaded:
+            return True
+        try:
+            # Clean up any existing memory
+            self.cleanup_memory()
+            # Check memory before loading
+            memory_ok, memory_msg = self.check_memory_limit()
+            if not memory_ok:
+                print(f"❌ {memory_msg}")
+                return False
+            print(f"📊 {memory_msg}")
+            print(f"🤖 Loading model from Hugging Face Hub: {self.model_name}")
+            self.tokenizer = AutoTokenizer.from_pretrained(self.model_name)
+            self.model = AutoModelForSequenceClassification.from_pretrained(self.model_name)
+            self.model.to(self.device)
+            self.model.eval()
+            self.model_loaded = True
+            # Check memory after loading
+            memory_ok, memory_msg = self.check_memory_limit()
+            print(f"✅ Model loaded successfully from {self.model_name}")
+            print(f"📊 {memory_msg}")
+            return True
+        except Exception as e:
+            print(f"❌ Error loading model: {e}")
+            self.model_loaded = False
+            self.cleanup_memory()
+            return False
+    def predict_sentiment(self, text):
+        """Predict sentiment for given text"""
+        if not self.model_loaded:
+            return None, "❌ Model not loaded. Please refresh the page."
+        if not text.strip():
+            return None, "❌ Please enter some text to analyze."
+        try:
+            # Check memory before prediction
+            memory_ok, memory_msg = self.check_memory_limit()
+            if not memory_ok:
+                return None, f"❌ {memory_msg}"
+            start_time = time.time()
+            # Tokenize
+            inputs = self.tokenizer(
+                text,
+                return_tensors="pt",
+                truncation=True,
+                padding=True,
+                max_length=512
+            )
+            # Move to device
+            inputs = {k: v.to(self.device) for k, v in inputs.items()}
+            # Predict
+            with torch.no_grad():
+                outputs = self.model(**inputs)
+                logits = outputs.logits
+                probabilities = torch.softmax(logits, dim=-1)
+                predicted_class = torch.argmax(probabilities, dim=-1).item()
+                confidence = torch.max(probabilities).item()
+            inference_time = time.time() - start_time
+            # Move to CPU and clean GPU memory
+            probs = probabilities.cpu().numpy()[0].tolist()
+            del probabilities, logits, outputs
+            self.cleanup_memory()
+            sentiment = self.sentiment_labels[predicted_class]
+            # Create detailed results
+            result = {
+                "sentiment": sentiment,
+                "confidence": confidence,
+                "probabilities": {
+                    "Negative": probs[0],
+                    "Neutral": probs[1],
+                    "Positive": probs[2]
+                },
+                "inference_time": inference_time,
+                "timestamp": datetime.now().strftime("%Y-%m-%d %H:%M:%S")
+            }
+            # Create formatted output
+            output_text = f"""
+## 🎯 Sentiment Analysis Result
+**Sentiment:** {sentiment}
+**Confidence:** {confidence:.2%}
+**Processing Time:** {inference_time:.3f}s
+### 📊 Probability Distribution:
+- 😠 **Negative:** {probs[0]:.2%}
+- 😐 **Neutral:** {probs[1]:.2%}
+- 😊 **Positive:** {probs[2]:.2%}
+### 📝 Input Text:
+> "{text}"
+---
+*Analysis completed at {result['timestamp']}*
+*{memory_msg}*
+            """.strip()
+            return result, output_text
+        except Exception as e:
+            self.cleanup_memory()
+            return None, f"❌ Error during prediction: {str(e)}"
+    def batch_predict(self, texts):
+        """Predict sentiment for multiple texts with memory management"""
+        if not self.model_loaded:
+            return [], "❌ Model not loaded. Please refresh the page."
+        if not texts or not any(texts):
+            return [], "❌ Please enter some texts to analyze."
+        # Filter valid texts and apply batch size limit
+        valid_texts = [text.strip() for text in texts if text.strip()]
+        if len(valid_texts) > self.max_batch_size:
+            return [], f"❌ Too many texts ({len(valid_texts)}). Maximum batch size is {self.max_batch_size} for memory efficiency."
+        if not valid_texts:
+            return [], "❌ No valid texts provided."
+        # Check memory before batch processing
+        memory_ok, memory_msg = self.check_memory_limit()
+        if not memory_ok:
+            return [], f"❌ {memory_msg}"
+        results = []
+        try:
+            for i, text in enumerate(valid_texts):
+                # Check memory every 5 predictions
+                if i % 5 == 0:
+                    memory_ok, memory_msg = self.check_memory_limit()
+                    if not memory_ok:
+                        break
+                result, _ = self.predict_sentiment(text)
+                if result:
+                    results.append(result)
+            if not results:
+                return [], "❌ No valid predictions made."
+            # Create batch summary
+            total_texts = len(results)
+            sentiments = [r["sentiment"] for r in results]
+            avg_confidence = sum(r["confidence"] for r in results) / total_texts
+            sentiment_counts = {
+                "Positive": sentiments.count("Positive"),
+                "Neutral": sentiments.count("Neutral"),
+                "Negative": sentiments.count("Negative")
+            }
+            summary = f"""
+## 📊 Batch Analysis Summary
+**Total Texts Analyzed:** {total_texts}/{len(valid_texts)}
+**Average Confidence:** {avg_confidence:.2%}
+**Memory Used:** {self.get_memory_usage():.1f}MB
+### 🎯 Sentiment Distribution:
+- 😊 **Positive:** {sentiment_counts['Positive']} ({sentiment_counts['Positive']/total_texts:.1%})
+- 😐 **Neutral:** {sentiment_counts['Neutral']} ({sentiment_counts['Neutral']/total_texts:.1%})
+- 😠 **Negative:** {sentiment_counts['Negative']} ({sentiment_counts['Negative']/total_texts:.1%})
+### 📋 Individual Results:
+            """.strip()
+            for i, result in enumerate(results, 1):
+                summary += f"\n**{i}.** {result['sentiment']} ({result['confidence']:.1%})"
+            # Final memory cleanup
+            self.cleanup_memory()
+            return results, summary
+        except Exception as e:
+            self.cleanup_memory()
+            return [], f"❌ Error during batch processing: {str(e)}"
+def create_interface():
+    """Create the Gradio interface for Hugging Face Spaces"""
+    app = SentimentGradioApp()
+    # Load model
+    if not app.load_model():
+        print("❌ Failed to load model. Please try again.")
+        return None
+    # Example texts
+    examples = [
+        "Giảng viên dạy rất hay và tâm huyết.",
+        "Môn học này quá khó và nhàm chán.",
+        "Lớp học ổn định, không có gì đặc biệt.",
+        "Tôi rất thích cách giảng dạy của thầy cô.",
+        "Chương trình học cần cải thiện nhiều."
+    ]
+    # Custom CSS
+    css = """
+    .gradio-container {
+        max-width: 900px !important;
+        margin: auto !important;
+    }
+    .sentiment-positive {
+        color: #44ff44;
+        font-weight: bold;
+    }
+    .sentiment-neutral {
+        color: #ffaa00;
+        font-weight: bold;
+    }
+    .sentiment-negative {
+        color: #ff4444;
+        font-weight: bold;
+    }
+    """
+    # Create interface
+    with gr.Blocks(
+        title="Vietnamese Sentiment Analysis",
+        theme=gr.themes.Soft(),
+        css=css
+    ) as interface:
+        gr.Markdown("# 🎭 Vietnamese Sentiment Analysis")
+        gr.Markdown("Enter Vietnamese text to analyze sentiment using a transformer model from Hugging Face.")
+        with gr.Tabs():
+            # Single Text Analysis Tab
+            with gr.Tab("📝 Single Text Analysis"):
+                with gr.Row():
+                    with gr.Column(scale=3):
+                        text_input = gr.Textbox(
+                            label="Enter Vietnamese Text",
+                            placeholder="Type or paste Vietnamese text here...",
+                            lines=3
+                        )
+                        with gr.Row():
+                            analyze_btn = gr.Button("🔍 Analyze Sentiment", variant="primary")
+                            clear_btn = gr.Button("🗑️ Clear", variant="secondary")
+                    with gr.Column(scale=2):
+                        gr.Examples(
+                            examples=examples,
+                            inputs=[text_input],
+                            label="💡 Example Texts"
+                        )
+                result_output = gr.Markdown(label="Analysis Result", visible=True)
+                confidence_plot = gr.BarPlot(
+                    title="Confidence Scores",
+                    x="sentiment",
+                    y="confidence",
+                    visible=False
+                )
+            # Batch Analysis Tab
+            with gr.Tab("📊 Batch Analysis"):
+                gr.Markdown(f"### 📝 Memory-Efficient Batch Processing")
+                gr.Markdown(f"**Maximum batch size:** {app.max_batch_size} texts (for memory efficiency)")
+                gr.Markdown(f"**Memory limit:** {app.max_memory_mb}MB")
+                batch_input = gr.Textbox(
+                    label="Enter Multiple Texts (one per line)",
+                    placeholder=f"Enter up to {app.max_batch_size} Vietnamese texts, one per line...",
+                    lines=8,
+                    max_lines=20
+                )
+                with gr.Row():
+                    batch_analyze_btn = gr.Button("🔍 Analyze All", variant="primary")
+                    batch_clear_btn = gr.Button("🗑️ Clear", variant="secondary")
+                    memory_cleanup_btn = gr.Button("🧹 Memory Cleanup", variant="secondary")
+                batch_result_output = gr.Markdown(label="Batch Analysis Result")
+                memory_info = gr.Textbox(
+                    label="Memory Usage",
+                    value=f"{app.get_memory_usage():.1f}MB used",
+                    interactive=False
+                )
+            # Model Info Tab
+            with gr.Tab("ℹ️ Model Information"):
+                gr.Markdown(f"""
+                ## 🤖 Model Details
+                **Model Architecture:** Transformer-based sequence classification
+                **Base Model:** {app.model_name}
+                **Languages:** Vietnamese (optimized)
+                **Labels:** Negative, Neutral, Positive
+                **Max Batch Size:** {app.max_batch_size} texts
+                ## 📊 Performance Metrics
+                - **Processing Speed:** ~100ms per text
+                - **Max Sequence Length:** 512 tokens
+                - **Memory Limit:** {app.max_memory_mb}MB
+                ## 💡 Usage Tips
+                - Enter clear, grammatically correct Vietnamese text
+                - Longer texts (20-200 words) work best
+                - The model handles various Vietnamese dialects
+                - Confidence scores indicate prediction certainty
+                ## 🛡️ Memory Management
+                - **Automatic Cleanup:** Memory is cleaned after each prediction
+                - **Batch Limits:** Maximum {app.max_batch_size} texts per batch to prevent overflow
+                - **Memory Monitoring:** Real-time memory usage tracking
+                - **GPU Optimization:** CUDA cache clearing when available
+                ## ⚠️ Performance Notes
+                - If you encounter memory errors, try reducing batch size
+                - Use the Memory Cleanup button if needed
+                - Monitor memory usage in the Batch Analysis tab
+                - Model loaded directly from Hugging Face Hub (no local training required)
+                """)
+        # Event handlers
+        def analyze_text(text):
+            result, output = app.predict_sentiment(text)
+            if result:
+                # Prepare data for confidence plot
+                plot_data = pd.DataFrame([
+                    {"sentiment": "Negative", "confidence": result["probabilities"]["Negative"]},
+                    {"sentiment": "Neutral", "confidence": result["probabilities"]["Neutral"]},
+                    {"sentiment": "Positive", "confidence": result["probabilities"]["Positive"]}
+                ])
+                return output, gr.BarPlot(visible=True, value=plot_data)
+            else:
+                return output, gr.BarPlot(visible=False)
+        def clear_inputs():
+            return "", "", gr.BarPlot(visible=False)
+        def analyze_batch(texts):
+            if texts:
+                text_list = [line.strip() for line in texts.split('\n') if line.strip()]
+                results, summary = app.batch_predict(text_list)
+                return summary
+            return "❌ Please enter some texts to analyze."
+        def clear_batch():
+            return ""
+        def update_memory_info():
+            return f"{app.get_memory_usage():.1f}MB used"
+        def manual_memory_cleanup():
+            app.cleanup_memory()
+            return f"Memory cleaned. Current usage: {app.get_memory_usage():.1f}MB"
+        # Connect events
+        analyze_btn.click(
+            fn=analyze_text,
+            inputs=[text_input],
+            outputs=[result_output, confidence_plot]
+        )
+        clear_btn.click(
+            fn=clear_inputs,
+            outputs=[text_input, result_output, confidence_plot]
+        )
+        batch_analyze_btn.click(
+            fn=analyze_batch,
+            inputs=[batch_input],
+            outputs=[batch_result_output]
+        )
+        batch_clear_btn.click(
+            fn=clear_batch,
+            outputs=[batch_input]
+        )
+        memory_cleanup_btn.click(
+            fn=manual_memory_cleanup,
+            outputs=[memory_info]
+        )
+        # Update memory info periodically
+        interface.load(
+            fn=update_memory_info,
+            outputs=[memory_info]
+        )
+    return interface
+# Create and launch the interface
+if __name__ == "__main__":
+    print("🚀 Starting Vietnamese Sentiment Analysis for Hugging Face Spaces...")
+    interface = create_interface()
+    if interface is None:
+        print("❌ Failed to create interface. Exiting.")
+        exit(1)
+    print("✅ Interface created successfully!")
+    print("🌐 Launching web interface...")
+    # Launch the interface
+    interface.launch(
+        share=True,
+        show_error=True,
+        quiet=False
+    )

deploy_package/.gitignore ADDED Viewed

	@@ -0,0 +1,29 @@

+# Python
+__pycache__/
+*.py[cod]
+*$py.class
+*.so
+# Virtual Environment
+venv/
+env/
+ENV/
+# Cache and temporary files
+.cache/
+*.log
+*.tmp
+*.temp
+# Gradio
+gradio_cached_examples/
+# OS
+.DS_Store
+Thumbs.db
+# Development files
+.vscode/
+.idea/
+*.swp
+*.swo

deploy_package/.space.yaml ADDED Viewed

	@@ -0,0 +1,19 @@

+title: Vietnamese Sentiment Analysis
+emoji: 🎭
+colorFrom: green
+colorTo: blue
+sdk: gradio
+sdk_version: 4.44.0
+app_file: app.py
+pinned: false
+license: mit
+models:
+  - 5CD-AI/Vietnamese-Sentiment-visobert
+tags:
+  - vietnamese
+  - sentiment-analysis
+  - nlp
+  - text-classification
+  - transformers
+  - pytorch
+short_description: Vietnamese sentiment analysis using transformer models with memory optimization

deploy_package/README.md ADDED Viewed

	@@ -0,0 +1,170 @@

+---
+title: Vietnamese Sentiment Analysis
+emoji: 🎭
+colorFrom: green
+colorTo: blue
+sdk: gradio
+sdk_version: 4.44.0
+app_file: app.py
+pinned: false
+license: mit
+models:
+  - 5CD-AI/Vietnamese-Sentiment-visobert
+tags:
+  - vietnamese
+  - sentiment-analysis
+  - nlp
+  - text-classification
+  - transformers
+  - pytorch
+  - gradio
+short_description: Vietnamese sentiment analysis using transformer models with memory optimization
+---
+# 🎭 Vietnamese Sentiment Analysis
+A Vietnamese sentiment analysis web interface built with Gradio and transformer models, optimized for Hugging Face Spaces deployment.
+## 🚀 Features
+- **🤖 Transformer-based Model**: Uses 5CD-AI/Vietnamese-Sentiment-visobert from Hugging Face Hub
+- **🌐 Interactive Web Interface**: Real-time sentiment analysis via Gradio
+- **⚡ Memory Efficient**: Built-in memory management and batch processing limits
+- **📊 Visual Analysis**: Confidence scores with interactive charts
+- **📝 Batch Processing**: Analyze multiple texts at once
+- **🛡️ Memory Management**: Real-time memory monitoring and cleanup
+## 🎯 Usage
+### Single Text Analysis
+1. Enter Vietnamese text in the input field
+2. Click "Analyze Sentiment"
+3. View the sentiment prediction with confidence scores
+4. See probability distribution in the chart
+### Batch Analysis
+1. Switch to "Batch Analysis" tab
+2. Enter multiple Vietnamese texts (one per line)
+3. Click "Analyze All" to process all texts
+4. View comprehensive batch summary with sentiment distribution
+### Memory Management
+- Monitor real-time memory usage
+- Use "Memory Cleanup" button if needed
+- Automatic cleanup after each prediction
+- Maximum 10 texts per batch for efficiency
+## 📊 Model Details
+- **Model**: 5CD-AI/Vietnamese-Sentiment-visobert
+- **Architecture**: Transformer-based (XLM-RoBERTa)
+- **Language**: Vietnamese
+- **Labels**: Negative, Neutral, Positive
+- **Max Sequence Length**: 512 tokens
+- **Device**: Automatic CUDA/CPU detection
+## 💡 Example Usage
+Try these example Vietnamese texts:
+- "Giảng viên dạy rất hay và tâm huyết." (Positive)
+- "Môn học này quá khó và nhàm chán." (Negative)
+- "Lớp học ổn định, không có gì đặc biệt." (Neutral)
+## 🛠️ Technical Features
+### Memory Optimization
+- Automatic GPU cache clearing
+- Garbage collection management
+- Memory usage monitoring
+- Batch size limits
+- Real-time memory tracking
+### Performance
+- ~100ms processing time per text
+- Supports up to 512 token sequences
+- Efficient batch processing
+- Memory limit: 8GB (Hugging Face Spaces)
+## 📋 Model Performance
+The model provides:
+- **Sentiment Classification**: Positive, Neutral, Negative
+- **Confidence Scores**: Probability distribution across classes
+- **Real-time Processing**: Fast inference on CPU/GPU
+- **Batch Analysis**: Efficient processing of multiple texts
+## 🔧 Deployment
+This Space is configured for Hugging Face Spaces with:
+- **SDK**: Gradio 4.44.0
+- **Hardware**: CPU (with CUDA support if available)
+- **Memory**: 8GB limit with optimization
+- **Model Loading**: Direct from Hugging Face Hub
+## 📄 Requirements
+See `requirements_spaces.txt` for complete dependency list:
+- torch>=2.0.0
+- transformers>=4.21.0
+- gradio>=4.44.0
+- pandas, numpy, scikit-learn
+- psutil for memory monitoring
+## 🎯 Use Cases
+- **Education**: Analyze student feedback
+- **Customer Service**: Analyze customer reviews
+- **Social Media**: Monitor sentiment in posts
+- **Research**: Vietnamese text analysis
+- **Business**: Customer sentiment tracking
+## 🔍 Troubleshooting
+### Memory Issues
+- Use "Memory Cleanup" button
+- Reduce batch size
+- Refresh the page if needed
+### Model Loading
+- Model loads automatically from Hugging Face Hub
+- No local training required
+- Automatic fallback to CPU if GPU unavailable
+### Performance Tips
+- Clear, grammatically correct Vietnamese text works best
+- Longer texts (20-200 words) provide better context
+- Use batch processing for multiple texts
+## 📝 Citation
+If you use this model or Space, please cite the original model:
+```bibtex
+@InProceedings{8573337,
+  author={Nguyen, Kiet Van and Nguyen, Vu Duc and Nguyen, Phu X. V. and Truong, Tham T. H. and Nguyen, Ngan Luu-Thuy},
+  booktitle={2018 10th International Conference on Knowledge and Systems Engineering (KSE)},
+  title={UIT-VSFC: Vietnamese Students' Feedback Corpus for Sentiment Analysis},
+  year={2018},
+  volume={},
+  number={},
+  pages={19-24},
+  doi={10.1109/KSE.2018.8573337}
+}
+```
+## 🤝 Contributing
+Feel free to:
+- Submit issues and feedback
+- Suggest improvements
+- Report bugs
+- Request new features
+## 📄 License
+This Space uses open-source components under MIT license.
+---
+**Try it now!** Enter some Vietnamese text above to see the sentiment analysis in action. 🎭

deploy_package/app.py ADDED Viewed

	@@ -0,0 +1,478 @@

+#!/usr/bin/env python3
+"""
+Vietnamese Sentiment Analysis - Hugging Face Spaces Gradio App
+"""
+import gradio as gr
+import torch
+from transformers import AutoTokenizer, AutoModelForSequenceClassification
+import time
+import numpy as np
+from datetime import datetime
+import gc
+import psutil
+import os
+import pandas as pd
+class SentimentGradioApp:
+    def __init__(self, model_name="5CD-AI/Vietnamese-Sentiment-visobert", max_batch_size=10):
+        self.model_name = model_name
+        self.tokenizer = None
+        self.model = None
+        self.device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
+        self.sentiment_labels = ["Negative", "Neutral", "Positive"]
+        self.sentiment_colors = {
+            "Negative": "#ff4444",
+            "Neutral": "#ffaa00",
+            "Positive": "#44ff44"
+        }
+        self.model_loaded = False
+        self.max_batch_size = max_batch_size
+        self.max_memory_mb = 8192  # Hugging Face Spaces memory limit
+    def get_memory_usage(self):
+        """Get current memory usage in MB"""
+        process = psutil.Process(os.getpid())
+        return process.memory_info().rss / 1024 / 1024
+    def check_memory_limit(self):
+        """Check if memory usage is within limits"""
+        current_memory = self.get_memory_usage()
+        if current_memory > self.max_memory_mb:
+            return False, f"Memory usage ({current_memory:.1f}MB) exceeds limit ({self.max_memory_mb}MB)"
+        return True, f"Memory usage: {current_memory:.1f}MB"
+    def cleanup_memory(self):
+        """Clean up GPU and CPU memory"""
+        if torch.cuda.is_available():
+            torch.cuda.empty_cache()
+        gc.collect()
+    def load_model(self):
+        """Load the model from Hugging Face Hub"""
+        if self.model_loaded:
+            return True
+        try:
+            # Clean up any existing memory
+            self.cleanup_memory()
+            # Check memory before loading
+            memory_ok, memory_msg = self.check_memory_limit()
+            if not memory_ok:
+                print(f"❌ {memory_msg}")
+                return False
+            print(f"📊 {memory_msg}")
+            print(f"🤖 Loading model from Hugging Face Hub: {self.model_name}")
+            self.tokenizer = AutoTokenizer.from_pretrained(self.model_name)
+            self.model = AutoModelForSequenceClassification.from_pretrained(self.model_name)
+            self.model.to(self.device)
+            self.model.eval()
+            self.model_loaded = True
+            # Check memory after loading
+            memory_ok, memory_msg = self.check_memory_limit()
+            print(f"✅ Model loaded successfully from {self.model_name}")
+            print(f"📊 {memory_msg}")
+            return True
+        except Exception as e:
+            print(f"❌ Error loading model: {e}")
+            self.model_loaded = False
+            self.cleanup_memory()
+            return False
+    def predict_sentiment(self, text):
+        """Predict sentiment for given text"""
+        if not self.model_loaded:
+            return None, "❌ Model not loaded. Please refresh the page."
+        if not text.strip():
+            return None, "❌ Please enter some text to analyze."
+        try:
+            # Check memory before prediction
+            memory_ok, memory_msg = self.check_memory_limit()
+            if not memory_ok:
+                return None, f"❌ {memory_msg}"
+            start_time = time.time()
+            # Tokenize
+            inputs = self.tokenizer(
+                text,
+                return_tensors="pt",
+                truncation=True,
+                padding=True,
+                max_length=512
+            )
+            # Move to device
+            inputs = {k: v.to(self.device) for k, v in inputs.items()}
+            # Predict
+            with torch.no_grad():
+                outputs = self.model(**inputs)
+                logits = outputs.logits
+                probabilities = torch.softmax(logits, dim=-1)
+                predicted_class = torch.argmax(probabilities, dim=-1).item()
+                confidence = torch.max(probabilities).item()
+            inference_time = time.time() - start_time
+            # Move to CPU and clean GPU memory
+            probs = probabilities.cpu().numpy()[0].tolist()
+            del probabilities, logits, outputs
+            self.cleanup_memory()
+            sentiment = self.sentiment_labels[predicted_class]
+            # Create detailed results
+            result = {
+                "sentiment": sentiment,
+                "confidence": confidence,
+                "probabilities": {
+                    "Negative": probs[0],
+                    "Neutral": probs[1],
+                    "Positive": probs[2]
+                },
+                "inference_time": inference_time,
+                "timestamp": datetime.now().strftime("%Y-%m-%d %H:%M:%S")
+            }
+            # Create formatted output
+            output_text = f"""
+## 🎯 Sentiment Analysis Result
+**Sentiment:** {sentiment}
+**Confidence:** {confidence:.2%}
+**Processing Time:** {inference_time:.3f}s
+### 📊 Probability Distribution:
+- 😠 **Negative:** {probs[0]:.2%}
+- 😐 **Neutral:** {probs[1]:.2%}
+- 😊 **Positive:** {probs[2]:.2%}
+### 📝 Input Text:
+> "{text}"
+---
+*Analysis completed at {result['timestamp']}*
+*{memory_msg}*
+            """.strip()
+            return result, output_text
+        except Exception as e:
+            self.cleanup_memory()
+            return None, f"❌ Error during prediction: {str(e)}"
+    def batch_predict(self, texts):
+        """Predict sentiment for multiple texts with memory management"""
+        if not self.model_loaded:
+            return [], "❌ Model not loaded. Please refresh the page."
+        if not texts or not any(texts):
+            return [], "❌ Please enter some texts to analyze."
+        # Filter valid texts and apply batch size limit
+        valid_texts = [text.strip() for text in texts if text.strip()]
+        if len(valid_texts) > self.max_batch_size:
+            return [], f"❌ Too many texts ({len(valid_texts)}). Maximum batch size is {self.max_batch_size} for memory efficiency."
+        if not valid_texts:
+            return [], "❌ No valid texts provided."
+        # Check memory before batch processing
+        memory_ok, memory_msg = self.check_memory_limit()
+        if not memory_ok:
+            return [], f"❌ {memory_msg}"
+        results = []
+        try:
+            for i, text in enumerate(valid_texts):
+                # Check memory every 5 predictions
+                if i % 5 == 0:
+                    memory_ok, memory_msg = self.check_memory_limit()
+                    if not memory_ok:
+                        break
+                result, _ = self.predict_sentiment(text)
+                if result:
+                    results.append(result)
+            if not results:
+                return [], "❌ No valid predictions made."
+            # Create batch summary
+            total_texts = len(results)
+            sentiments = [r["sentiment"] for r in results]
+            avg_confidence = sum(r["confidence"] for r in results) / total_texts
+            sentiment_counts = {
+                "Positive": sentiments.count("Positive"),
+                "Neutral": sentiments.count("Neutral"),
+                "Negative": sentiments.count("Negative")
+            }
+            summary = f"""
+## 📊 Batch Analysis Summary
+**Total Texts Analyzed:** {total_texts}/{len(valid_texts)}
+**Average Confidence:** {avg_confidence:.2%}
+**Memory Used:** {self.get_memory_usage():.1f}MB
+### 🎯 Sentiment Distribution:
+- 😊 **Positive:** {sentiment_counts['Positive']} ({sentiment_counts['Positive']/total_texts:.1%})
+- 😐 **Neutral:** {sentiment_counts['Neutral']} ({sentiment_counts['Neutral']/total_texts:.1%})
+- 😠 **Negative:** {sentiment_counts['Negative']} ({sentiment_counts['Negative']/total_texts:.1%})
+### 📋 Individual Results:
+            """.strip()
+            for i, result in enumerate(results, 1):
+                summary += f"\n**{i}.** {result['sentiment']} ({result['confidence']:.1%})"
+            # Final memory cleanup
+            self.cleanup_memory()
+            return results, summary
+        except Exception as e:
+            self.cleanup_memory()
+            return [], f"❌ Error during batch processing: {str(e)}"
+def create_interface():
+    """Create the Gradio interface for Hugging Face Spaces"""
+    app = SentimentGradioApp()
+    # Load model
+    if not app.load_model():
+        print("❌ Failed to load model. Please try again.")
+        return None
+    # Example texts
+    examples = [
+        "Giảng viên dạy rất hay và tâm huyết.",
+        "Môn học này quá khó và nhàm chán.",
+        "Lớp học ổn định, không có gì đặc biệt.",
+        "Tôi rất thích cách giảng dạy của thầy cô.",
+        "Chương trình học cần cải thiện nhiều."
+    ]
+    # Custom CSS
+    css = """
+    .gradio-container {
+        max-width: 900px !important;
+        margin: auto !important;
+    }
+    .sentiment-positive {
+        color: #44ff44;
+        font-weight: bold;
+    }
+    .sentiment-neutral {
+        color: #ffaa00;
+        font-weight: bold;
+    }
+    .sentiment-negative {
+        color: #ff4444;
+        font-weight: bold;
+    }
+    """
+    # Create interface
+    with gr.Blocks(
+        title="Vietnamese Sentiment Analysis",
+        theme=gr.themes.Soft(),
+        css=css
+    ) as interface:
+        gr.Markdown("# 🎭 Vietnamese Sentiment Analysis")
+        gr.Markdown("Enter Vietnamese text to analyze sentiment using a transformer model from Hugging Face.")
+        with gr.Tabs():
+            # Single Text Analysis Tab
+            with gr.Tab("📝 Single Text Analysis"):
+                with gr.Row():
+                    with gr.Column(scale=3):
+                        text_input = gr.Textbox(
+                            label="Enter Vietnamese Text",
+                            placeholder="Type or paste Vietnamese text here...",
+                            lines=3
+                        )
+                        with gr.Row():
+                            analyze_btn = gr.Button("🔍 Analyze Sentiment", variant="primary")
+                            clear_btn = gr.Button("🗑️ Clear", variant="secondary")
+                    with gr.Column(scale=2):
+                        gr.Examples(
+                            examples=examples,
+                            inputs=[text_input],
+                            label="💡 Example Texts"
+                        )
+                result_output = gr.Markdown(label="Analysis Result", visible=True)
+                confidence_plot = gr.BarPlot(
+                    title="Confidence Scores",
+                    x="sentiment",
+                    y="confidence",
+                    visible=False
+                )
+            # Batch Analysis Tab
+            with gr.Tab("📊 Batch Analysis"):
+                gr.Markdown(f"### 📝 Memory-Efficient Batch Processing")
+                gr.Markdown(f"**Maximum batch size:** {app.max_batch_size} texts (for memory efficiency)")
+                gr.Markdown(f"**Memory limit:** {app.max_memory_mb}MB")
+                batch_input = gr.Textbox(
+                    label="Enter Multiple Texts (one per line)",
+                    placeholder=f"Enter up to {app.max_batch_size} Vietnamese texts, one per line...",
+                    lines=8,
+                    max_lines=20
+                )
+                with gr.Row():
+                    batch_analyze_btn = gr.Button("🔍 Analyze All", variant="primary")
+                    batch_clear_btn = gr.Button("🗑️ Clear", variant="secondary")
+                    memory_cleanup_btn = gr.Button("🧹 Memory Cleanup", variant="secondary")
+                batch_result_output = gr.Markdown(label="Batch Analysis Result")
+                memory_info = gr.Textbox(
+                    label="Memory Usage",
+                    value=f"{app.get_memory_usage():.1f}MB used",
+                    interactive=False
+                )
+            # Model Info Tab
+            with gr.Tab("ℹ️ Model Information"):
+                gr.Markdown(f"""
+                ## 🤖 Model Details
+                **Model Architecture:** Transformer-based sequence classification
+                **Base Model:** {app.model_name}
+                **Languages:** Vietnamese (optimized)
+                **Labels:** Negative, Neutral, Positive
+                **Max Batch Size:** {app.max_batch_size} texts
+                ## 📊 Performance Metrics
+                - **Processing Speed:** ~100ms per text
+                - **Max Sequence Length:** 512 tokens
+                - **Memory Limit:** {app.max_memory_mb}MB
+                ## 💡 Usage Tips
+                - Enter clear, grammatically correct Vietnamese text
+                - Longer texts (20-200 words) work best
+                - The model handles various Vietnamese dialects
+                - Confidence scores indicate prediction certainty
+                ## 🛡️ Memory Management
+                - **Automatic Cleanup:** Memory is cleaned after each prediction
+                - **Batch Limits:** Maximum {app.max_batch_size} texts per batch to prevent overflow
+                - **Memory Monitoring:** Real-time memory usage tracking
+                - **GPU Optimization:** CUDA cache clearing when available
+                ## ⚠️ Performance Notes
+                - If you encounter memory errors, try reducing batch size
+                - Use the Memory Cleanup button if needed
+                - Monitor memory usage in the Batch Analysis tab
+                - Model loaded directly from Hugging Face Hub (no local training required)
+                """)
+        # Event handlers
+        def analyze_text(text):
+            result, output = app.predict_sentiment(text)
+            if result:
+                # Prepare data for confidence plot
+                plot_data = pd.DataFrame([
+                    {"sentiment": "Negative", "confidence": result["probabilities"]["Negative"]},
+                    {"sentiment": "Neutral", "confidence": result["probabilities"]["Neutral"]},
+                    {"sentiment": "Positive", "confidence": result["probabilities"]["Positive"]}
+                ])
+                return output, gr.BarPlot(visible=True, value=plot_data)
+            else:
+                return output, gr.BarPlot(visible=False)
+        def clear_inputs():
+            return "", "", gr.BarPlot(visible=False)
+        def analyze_batch(texts):
+            if texts:
+                text_list = [line.strip() for line in texts.split('\n') if line.strip()]
+                results, summary = app.batch_predict(text_list)
+                return summary
+            return "❌ Please enter some texts to analyze."
+        def clear_batch():
+            return ""
+        def update_memory_info():
+            return f"{app.get_memory_usage():.1f}MB used"
+        def manual_memory_cleanup():
+            app.cleanup_memory()
+            return f"Memory cleaned. Current usage: {app.get_memory_usage():.1f}MB"
+        # Connect events
+        analyze_btn.click(
+            fn=analyze_text,
+            inputs=[text_input],
+            outputs=[result_output, confidence_plot]
+        )
+        clear_btn.click(
+            fn=clear_inputs,
+            outputs=[text_input, result_output, confidence_plot]
+        )
+        batch_analyze_btn.click(
+            fn=analyze_batch,
+            inputs=[batch_input],
+            outputs=[batch_result_output]
+        )
+        batch_clear_btn.click(
+            fn=clear_batch,
+            outputs=[batch_input]
+        )
+        memory_cleanup_btn.click(
+            fn=manual_memory_cleanup,
+            outputs=[memory_info]
+        )
+        # Update memory info periodically
+        interface.load(
+            fn=update_memory_info,
+            outputs=[memory_info]
+        )
+    return interface
+# Create and launch the interface
+if __name__ == "__main__":
+    print("🚀 Starting Vietnamese Sentiment Analysis for Hugging Face Spaces...")
+    interface = create_interface()
+    if interface is None:
+        print("❌ Failed to create interface. Exiting.")
+        exit(1)
+    print("✅ Interface created successfully!")
+    print("🌐 Launching web interface...")
+    # Launch the interface
+    interface.launch(
+        share=True,
+        show_error=True,
+        quiet=False
+    )

py/__init__.py ADDED Viewed

	@@ -0,0 +1,11 @@

+"""
+Vietnamese Sentiment Analysis - Core Modules
+This package contains the core functionality for Vietnamese sentiment analysis:
+- Fine-tuning utilities
+- Model testing
+- Demo functionality
+"""
+__version__ = "1.0.0"
+__author__ = "Vietnamese Sentiment Analysis Team"

py/demo.py ADDED Viewed

	@@ -0,0 +1,204 @@

+#!/usr/bin/env python3
+"""
+Demo script for Vietnamese Sentiment Analysis
+Shows how to use the fine-tuned model for real-time sentiment analysis
+"""
+import torch
+from transformers import AutoTokenizer, AutoModelForSequenceClassification
+import time
+class SentimentDemo:
+    def __init__(self, model_path="./vietnamese_sentiment_finetuned"):
+        self.model_path = model_path
+        self.tokenizer = None
+        self.model = None
+        self.device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
+        self.sentiment_labels = ["Negative", "Neutral", "Positive"]
+    def load_model(self):
+        """Load the fine-tuned model"""
+        print(f"🤖 Loading model from: {self.model_path}")
+        print(f"📱 Device: {self.device}")
+        try:
+            self.tokenizer = AutoTokenizer.from_pretrained(self.model_path)
+            self.model = AutoModelForSequenceClassification.from_pretrained(self.model_path)
+            self.model.to(self.device)
+            self.model.eval()
+            print("✅ Model loaded successfully!")
+        except Exception as e:
+            print(f"❌ Error loading model: {e}")
+            print("Please run the training first: python run_training.py")
+            return False
+        return True
+    def predict_sentiment(self, text):
+        """Predict sentiment for given text"""
+        start_time = time.time()
+        # Tokenize
+        inputs = self.tokenizer(
+            text,
+            return_tensors="pt",
+            truncation=True,
+            padding=True,
+            max_length=512
+        )
+        # Move to device
+        inputs = {k: v.to(self.device) for k, v in inputs.items()}
+        # Predict
+        with torch.no_grad():
+            outputs = self.model(**inputs)
+            logits = outputs.logits
+            probabilities = torch.softmax(logits, dim=-1)
+            predicted_class = torch.argmax(probabilities, dim=-1).item()
+            confidence = torch.max(probabilities).item()
+        inference_time = time.time() - start_time
+        return {
+            "text": text,
+            "sentiment": self.sentiment_labels[predicted_class],
+            "sentiment_id": predicted_class,
+            "confidence": confidence,
+            "probabilities": probabilities.cpu().numpy()[0].tolist(),
+            "inference_time": inference_time
+        }
+    def demo_mode(self):
+        """Run interactive demo"""
+        print("\n" + "="*60)
+        print("🎭 VIETNAMESE SENTIMENT ANALYSIS DEMO")
+        print("="*60)
+        print("\n💡 Type Vietnamese text to analyze sentiment")
+        print("📝 Type 'quit' to exit, 'help' for examples")
+        print("-"*60)
+        examples = [
+            "Giảng viên dạy rất hay và tâm huyết.",
+            "Môn học này quá khó và nhàm chán.",
+            "Lớp học ổn định, không có gì đặc biệt.",
+            "Tôi rất thích cách giảng dạy của thầy cô.",
+            "Chương trình học cần cải thiện nhiều."
+        ]
+        while True:
+            text = input("\n🔤 Enter text: ").strip()
+            if text.lower() in ['quit', 'exit', 'q']:
+                print("\n👋 Goodbye!")
+                break
+            if text.lower() == 'help':
+                print("\n📚 Example texts you can try:")
+                for i, example in enumerate(examples, 1):
+                    print(f"   {i}. {example}")
+                continue
+            if not text:
+                continue
+            # Make prediction
+            result = self.predict_sentiment(text)
+            # Display result
+            sentiment_emoji = {"Negative": "😞", "Neutral": "😐", "Positive": "😊"}
+            emoji = sentiment_emoji[result["sentiment"]]
+            print(f"\n{emoji} Result:")
+            print(f"   📝 Text: {result['text']}")
+            print(f"   🎯 Sentiment: {result['sentiment']} (Class {result['sentiment_id']})")
+            print(f"   📊 Confidence: {result['confidence']:.3f}")
+            print(f"   ⏱️  Time: {result['inference_time']:.3f}s")
+            # Show probability distribution
+            print(f"   📈 Probabilities:")
+            for i, (label, prob) in enumerate(zip(self.sentiment_labels, result['probabilities'])):
+                bar_length = int(prob * 20)
+                bar = "█" * bar_length + "░" * (20 - bar_length)
+                print(f"      {label}: {bar} {prob:.3f}")
+    def batch_demo(self):
+        """Demo with batch processing"""
+        print("\n" + "="*60)
+        print("📊 BATCH PROCESSING DEMO")
+        print("="*60)
+        test_texts = [
+            "Giảng viên dạy rất hay và tâm huyết.",
+            "Môn học này quá khó và nhàm chán.",
+            "Lớp học ổn định, không có gì đặc biệt.",
+            "Tôi rất thích cách giảng dạy của thầy cô.",
+            "Chương trình học cần cải thiện nhiều.",
+            "Thời gian biểu hợp lý, dễ theo kịp.",
+            "Bài tập quá nhiều và khó.",
+            "Môi trường học tập tốt, bạn bè thân thiện."
+        ]
+        print(f"\n📝 Processing {len(test_texts)} texts...")
+        start_time = time.time()
+        results = []
+        for text in test_texts:
+            result = self.predict_sentiment(text)
+            results.append(result)
+        total_time = time.time() - start_time
+        print(f"\n⏱️  Total time: {total_time:.3f}s")
+        print(f"📊 Average time per text: {total_time/len(test_texts):.3f}s")
+        print(f"\n📋 Results:")
+        print("-"*60)
+        sentiment_counts = {"Positive": 0, "Neutral": 0, "Negative": 0}
+        for i, result in enumerate(results, 1):
+            sentiment_emoji = {"Negative": "😞", "Neutral": "😐", "Positive": "😊"}
+            emoji = sentiment_emoji[result["sentiment"]]
+            print(f"{i:2d}. {emoji} {result['sentiment']:8s} ({result['confidence']:.2f}) - {result['text'][:40]}...")
+            sentiment_counts[result["sentiment"]] += 1
+        print(f"\n📈 Summary:")
+        for sentiment, count in sentiment_counts.items():
+            emoji = {"Positive": "😊", "Neutral": "😐", "Negative": "😞"}[sentiment]
+            percentage = (count / len(results)) * 100
+            print(f"   {emoji} {sentiment}: {count} ({percentage:.1f}%)")
+def main():
+    """Main demo function"""
+    print("🎯 Vietnamese Sentiment Analysis Demo")
+    print("=====================================")
+    # Initialize demo
+    demo = SentimentDemo()
+    # Load model
+    if not demo.load_model():
+        return
+    # Choose demo mode
+    print("\n🎮 Choose demo mode:")
+    print("   1. Interactive (type your own text)")
+    print("   2. Batch processing (predefined examples)")
+    while True:
+        choice = input("\nEnter choice (1 or 2): ").strip()
+        if choice == "1":
+            demo.demo_mode()
+            break
+        elif choice == "2":
+            demo.batch_demo()
+            break
+        else:
+            print("❌ Invalid choice. Please enter 1 or 2.")
+if __name__ == "__main__":
+    main()

py/fine_tune_sentiment.py ADDED Viewed

	@@ -0,0 +1,410 @@

+import torch
+from transformers import (
+    AutoTokenizer,
+    AutoModelForSequenceClassification,
+    TrainingArguments,
+    Trainer,
+    DataCollatorWithPadding
+)
+from datasets import load_dataset, DatasetDict
+import numpy as np
+from sklearn.metrics import accuracy_score, f1_score, precision_recall_fscore_support, classification_report
+import pandas as pd
+import matplotlib.pyplot as plt
+import seaborn as sns
+from tqdm import tqdm
+import warnings
+warnings.filterwarnings('ignore')
+class SentimentFineTuner:
+    def __init__(self, model_name="5CD-AI/Vietnamese-Sentiment-visobert", dataset_name="uitnlp/vietnamese_students_feedback"):
+        self.model_name = model_name
+        self.dataset_name = dataset_name
+        self.tokenizer = None
+        self.model = None
+        self.dataset = None
+        self.tokenized_datasets = None
+    def load_model_and_tokenizer(self):
+        """Load the pre-trained model and tokenizer"""
+        print(f"Loading model: {self.model_name}")
+        print(f"Loading tokenizer...")
+        self.tokenizer = AutoTokenizer.from_pretrained(self.model_name)
+        self.model = AutoModelForSequenceClassification.from_pretrained(self.model_name)
+        print("Model and tokenizer loaded successfully!")
+        print(f"Model architecture: {self.model.config.architectures}")
+        print(f"Number of labels: {self.model.config.num_labels}")
+    def load_and_prepare_dataset(self):
+        """Load and prepare the dataset"""
+        print(f"Loading dataset: {self.dataset_name}")
+        try:
+            # Try loading the dataset directly
+            self.dataset = load_dataset(self.dataset_name)
+        except Exception as e:
+            print(f"Error loading dataset directly: {e}")
+            print("Attempting alternative dataset loading...")
+            # Alternative approach: Create a synthetic Vietnamese sentiment dataset
+            try:
+                # Try to load a different Vietnamese dataset
+                self.dataset = load_dataset("linhtranvi/5cdAI-Vietnamese-sentiment")
+                print("Loaded alternative Vietnamese sentiment dataset!")
+            except Exception as e2:
+                print(f"Alternative dataset also failed: {e2}")
+                print("Creating a sample Vietnamese sentiment dataset...")
+                self.create_sample_dataset()
+                return
+        print("Dataset loaded successfully!")
+        print(f"Dataset info: {self.dataset}")
+        # Check the dataset structure
+        print("\nDataset structure:")
+        for split in self.dataset:
+            print(f"{split}: {len(self.dataset[split])} samples")
+            print(f"Columns: {self.dataset[split].column_names}")
+            if len(self.dataset[split]) > 0:
+                print(f"Sample data: {self.dataset[split][0]}")
+        # The dataset should have sentiment labels
+        # Let's check the unique sentiment labels
+        if 'train' in self.dataset:
+            train_df = pd.DataFrame(self.dataset['train'])
+            if 'sentiment' in train_df.columns:
+                print(f"\nSentiment distribution in training set:")
+                print(train_df['sentiment'].value_counts())
+            elif 'label' in train_df.columns:
+                print(f"\nLabel distribution in training set:")
+                print(train_df['label'].value_counts())
+    def preprocess_function(self, examples):
+        """Tokenize the dataset"""
+        # Get the text column
+        text_column = None
+        for col in ['sentence', 'text', 'comment', 'feedback']:
+            if col in examples:
+                text_column = col
+                break
+        if text_column is None:
+            # Use the first string column
+            for col in examples:
+                if isinstance(examples[col][0], str):
+                    text_column = col
+                    break
+        if text_column is None:
+            raise ValueError("No text column found in the dataset")
+        # Get the label column
+        label_column = None
+        for col in ['sentiment', 'label', 'labels']:
+            if col in examples:
+                label_column = col
+                break
+        if label_column is None:
+            raise ValueError("No label column found in the dataset")
+        # Tokenize the text
+        tokenized_inputs = self.tokenizer(
+            examples[text_column],
+            truncation=True,
+            padding=False,
+            max_length=512
+        )
+        # Add labels
+        tokenized_inputs['labels'] = examples[label_column]
+        return tokenized_inputs
+    def tokenize_datasets(self):
+        """Tokenize all datasets"""
+        print("Tokenizing datasets...")
+        self.tokenized_datasets = self.dataset.map(
+            self.preprocess_function,
+            batched=True,
+            remove_columns=self.dataset['train'].column_names
+        )
+        print("Tokenization completed!")
+    def compute_metrics(self, eval_pred):
+        """Compute evaluation metrics"""
+        predictions, labels = eval_pred
+        predictions = np.argmax(predictions, axis=1)
+        accuracy = accuracy_score(labels, predictions)
+        f1 = f1_score(labels, predictions, average='weighted')
+        precision, recall, f1_weighted, _ = precision_recall_fscore_support(labels, predictions, average='weighted')
+        return {
+            'accuracy': accuracy,
+            'f1': f1,
+            'precision': precision,
+            'recall': recall
+        }
+    def setup_trainer(self, output_dir="./sentiment_model", learning_rate=2e-5, batch_size=16, num_epochs=3):
+        """Setup the trainer for fine-tuning"""
+        # Data collator
+        data_collator = DataCollatorWithPadding(tokenizer=self.tokenizer)
+        # Training arguments
+        training_args = TrainingArguments(
+            output_dir=output_dir,
+            learning_rate=learning_rate,
+            per_device_train_batch_size=batch_size,
+            per_device_eval_batch_size=batch_size,
+            num_train_epochs=num_epochs,
+            weight_decay=0.01,
+            eval_strategy="epoch",
+            save_strategy="epoch",
+            load_best_model_at_end=True,
+            metric_for_best_model="f1",
+            greater_is_better=True,
+            push_to_hub=False,
+            logging_dir=f"{output_dir}/logs",
+            logging_steps=10,
+            save_total_limit=2,
+            seed=42
+        )
+        # Initialize trainer
+        self.trainer = Trainer(
+            model=self.model,
+            args=training_args,
+            train_dataset=self.tokenized_datasets["train"],
+            eval_dataset=self.tokenized_datasets["test"] if "test" in self.tokenized_datasets else self.tokenized_datasets["validation"],
+            tokenizer=self.tokenizer,
+            data_collator=data_collator,
+            compute_metrics=self.compute_metrics
+        )
+        print("Trainer setup completed!")
+    def train_model(self):
+        """Train the model"""
+        print("Starting training...")
+        # Train the model
+        train_result = self.trainer.train()
+        print("Training completed!")
+        print(f"Training loss: {train_result.training_loss}")
+        # Save the model
+        self.trainer.save_model()
+        self.tokenizer.save_pretrained(self.trainer.args.output_dir)
+        print(f"Model saved to: {self.trainer.args.output_dir}")
+        return train_result
+    def evaluate_model(self):
+        """Evaluate the model"""
+        print("Evaluating model...")
+        # Evaluate on test set
+        eval_results = self.trainer.evaluate()
+        print("Evaluation results:")
+        for key, value in eval_results.items():
+            print(f"{key}: {value:.4f}")
+        # Get predictions for detailed analysis
+        predictions = self.trainer.predict(self.tokenized_datasets["test"] if "test" in self.tokenized_datasets else self.tokenized_datasets["validation"])
+        y_pred = np.argmax(predictions.predictions, axis=1)
+        y_true = predictions.label_ids
+        # Print classification report
+        print("\nClassification Report:")
+        print(classification_report(y_true, y_pred))
+        return eval_results, y_pred, y_true
+    def plot_training_history(self):
+        """Plot training history"""
+        if hasattr(self.trainer, 'state') and hasattr(self.trainer.state, 'log_history'):
+            logs = self.trainer.state.log_history
+            # Extract training and validation metrics
+            train_loss = [log['train_loss'] for log in logs if 'train_loss' in log]
+            eval_loss = [log['eval_loss'] for log in logs if 'eval_loss' in log]
+            eval_f1 = [log['eval_f1'] for log in logs if 'eval_f1' in log]
+            # Create plots
+            fig, axes = plt.subplots(1, 3, figsize=(15, 5))
+            # Training loss
+            axes[0].plot(train_loss, label='Training Loss')
+            axes[0].set_title('Training Loss')
+            axes[0].set_xlabel('Steps')
+            axes[0].set_ylabel('Loss')
+            axes[0].legend()
+            # Evaluation loss
+            axes[1].plot(eval_loss, label='Evaluation Loss')
+            axes[1].set_title('Evaluation Loss')
+            axes[1].set_xlabel('Epoch')
+            axes[1].set_ylabel('Loss')
+            axes[1].legend()
+            # Evaluation F1
+            axes[2].plot(eval_f1, label='Evaluation F1')
+            axes[2].set_title('Evaluation F1 Score')
+            axes[2].set_xlabel('Epoch')
+            axes[2].set_ylabel('F1 Score')
+            axes[2].legend()
+            plt.tight_layout()
+            plt.savefig('training_history.png', dpi=300, bbox_inches='tight')
+            plt.show()
+            print("Training history plots saved as 'training_history.png'")
+    def plot_confusion_matrix(self, y_true, y_pred):
+        """Plot confusion matrix"""
+        from sklearn.metrics import confusion_matrix
+        cm = confusion_matrix(y_true, y_pred)
+        plt.figure(figsize=(8, 6))
+        sns.heatmap(cm, annot=True, fmt='d', cmap='Blues')
+        plt.title('Confusion Matrix')
+        plt.xlabel('Predicted')
+        plt.ylabel('Actual')
+        plt.savefig('confusion_matrix.png', dpi=300, bbox_inches='tight')
+        plt.show()
+        print("Confusion matrix saved as 'confusion_matrix.png'")
+    def create_sample_dataset(self):
+        """Create a sample Vietnamese sentiment dataset for demonstration"""
+        print("Creating sample Vietnamese sentiment dataset...")
+        # Sample Vietnamese texts with sentiment labels
+        sample_data = {
+            "text": [
+                # Positive samples
+                "Giảng viên dạy rất hay và tâm huyết, tôi học được nhiều kiến thức bổ ích.",
+                "Môn học này rất thú vị và practical, giúp tôi áp dụng được vào thực tế.",
+                "Thầy cô rất tận tình và hỗ trợ sinh viên, không khí lớp học rất tích cực.",
+                "Nội dung môn học sâu sắc, cách truyền đạt dễ hiểu, tôi rất hài lòng.",
+                "Phương pháp giảng dạy mới mẻ, hấp dẫn, khiến tôi say mê học tập.",
+                # Negative samples
+                "Môn học quá khó và nhàm chán, không có gì để học cả.",
+                "Giảng viên dạy không rõ ràng, tốc độ quá nhanh, không theo kịp.",
+                "Thời lượng quá ít nhưng nội dung nhiều, không thể học hết.",
+                "Thầy cô ít quan tâm đến sinh viên, không giải thích khi có thắc mắc.",
+                "Đồ án quá nặng, yêu cầu không rõ ràng, deadline quá gấp.",
+                # Neutral samples
+                "Môn học ổn định, không có gì đặc biệt để nhận xét.",
+                "Nội dung cơ bản, phù hợp với chương trình đề ra.",
+                "Lớp học bình thường, giảng viên dạy đúng theo giáo trình.",
+                "Đủ kiến thức cơ bản, không quá khó cũng không quá dễ.",
+                "Môn học như các môn khác, không có gì nổi bật.",
+                # Additional samples
+                "Tôi rất thích cách thầy cô tổ chức hoạt động nhóm, rất hiệu quả.",
+                "Phòng học quá nóng, thiết bị cũ, ảnh hưởng đến việc học.",
+                "Tài liệu học tập đầy đủ, có cả online và offline.",
+                "Bài tập nhiều nhưng không quá khó, giúp củng cố kiến thức.",
+                "Lịch học ổn, không trùng với môn học quan trọng khác."
+            ],
+            "label": [
+                # Labels: 0 = Negative, 1 = Neutral, 2 = Positive
+                2, 2, 2, 2, 2,  # Positive (5 samples)
+                0, 0, 0, 0, 0,  # Negative (5 samples)
+                1, 1, 1, 1, 1,  # Neutral (5 samples)
+                2, 0, 1, 1, 1   # Additional mixed (5 samples)
+            ]
+        }
+        from datasets import Dataset
+        # Create dataset
+        full_dataset = Dataset.from_dict(sample_data)
+        # Split dataset
+        train_test_split = full_dataset.train_test_split(test_size=0.2, seed=42)
+        train_val_split = train_test_split["train"].train_test_split(test_size=0.25, seed=42)
+        self.dataset = DatasetDict({
+            "train": train_val_split["train"],
+            "validation": train_val_split["test"],
+            "test": train_test_split["test"]
+        })
+        print(f"Created sample dataset with {len(self.dataset['train'])} training, {len(self.dataset['validation'])} validation, and {len(self.dataset['test'])} test samples")
+        # Print distribution
+        train_df = pd.DataFrame(self.dataset['train'])
+        print("\nSentiment distribution in training set:")
+        label_counts = train_df['label'].value_counts().sort_index()
+        for label, count in label_counts.items():
+            sentiment_name = ["Negative", "Neutral", "Positive"][label]
+            print(f"  {sentiment_name} (label {label}): {count} samples")
+    def run_fine_tuning(self, output_dir="./fine_tuned_sentiment_model", learning_rate=2e-5, batch_size=16, num_epochs=3):
+        """Run the complete fine-tuning pipeline"""
+        print("=" * 60)
+        print("VIETNAMESE SENTIMENT ANALYSIS FINE-TUNING")
+        print("=" * 60)
+        # Load model and tokenizer
+        self.load_model_and_tokenizer()
+        # Load and prepare dataset
+        self.load_and_prepare_dataset()
+        # Tokenize datasets
+        self.tokenize_datasets()
+        # Setup trainer
+        self.setup_trainer(output_dir, learning_rate, batch_size, num_epochs)
+        # Train model
+        train_result = self.train_model()
+        # Evaluate model
+        eval_results, y_pred, y_true = self.evaluate_model()
+        # Plot results
+        self.plot_training_history()
+        self.plot_confusion_matrix(y_true, y_pred)
+        print("=" * 60)
+        print("FINE-TUNING COMPLETED SUCCESSFULLY!")
+        print("=" * 60)
+        print(f"Model saved to: {output_dir}")
+        print(f"Final evaluation F1: {eval_results['eval_f1']:.4f}")
+        print(f"Final evaluation accuracy: {eval_results['eval_accuracy']:.4f}")
+        return train_result, eval_results
+def main():
+    """Main function to run the fine-tuning"""
+    # Initialize the fine-tuner
+    fine_tuner = SentimentFineTuner()
+    # Run fine-tuning
+    train_result, eval_results = fine_tuner.run_fine_tuning(
+        output_dir="./vietnamese_sentiment_finetuned",
+        learning_rate=2e-5,
+        batch_size=16,
+        num_epochs=3
+    )
+    print("Fine-tuning completed successfully!")
+if __name__ == "__main__":
+    main()

py/gradio_app.py ADDED Viewed

	@@ -0,0 +1,631 @@

+#!/usr/bin/env python3
+"""
+Gradio Web Interface for Vietnamese Sentiment Analysis
+Interactive web UI for real-time sentiment analysis
+"""
+import gradio as gr
+import torch
+from transformers import AutoTokenizer, AutoModelForSequenceClassification
+import time
+import numpy as np
+from datetime import datetime
+import gc
+import psutil
+import os
+import pandas as pd
+class SentimentGradioApp:
+    def __init__(self, model_path="vietnamese_sentiment_finetuned", max_batch_size=10, quantize=False):
+        self.model_path = model_path
+        self.tokenizer = None
+        self.model = None
+        self.device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
+        self.sentiment_labels = ["Negative", "Neutral", "Positive"]
+        self.sentiment_colors = {
+            "Negative": "#ff4444",
+            "Neutral": "#ffaa00",
+            "Positive": "#44ff44"
+        }
+        self.model_loaded = False
+        self.max_batch_size = max_batch_size
+        self.quantize = quantize
+        self.max_memory_mb = 4096  # Maximum memory usage in MB
+    def get_memory_usage(self):
+        """Get current memory usage in MB"""
+        process = psutil.Process(os.getpid())
+        return process.memory_info().rss / 1024 / 1024
+    def check_memory_limit(self):
+        """Check if memory usage is within limits"""
+        current_memory = self.get_memory_usage()
+        if current_memory > self.max_memory_mb:
+            return False, f"Memory usage ({current_memory:.1f}MB) exceeds limit ({self.max_memory_mb}MB)"
+        return True, f"Memory usage: {current_memory:.1f}MB"
+    def cleanup_memory(self):
+        """Clean up GPU and CPU memory"""
+        if torch.cuda.is_available():
+            torch.cuda.empty_cache()
+        gc.collect()
+    def load_model(self):
+        """Load the fine-tuned model"""
+        if self.model_loaded:
+            return True
+        try:
+            # Clean up any existing memory
+            self.cleanup_memory()
+            # Check memory before loading
+            memory_ok, memory_msg = self.check_memory_limit()
+            if not memory_ok:
+                print(f"❌ {memory_msg}")
+                return False
+            print(f"📊 {memory_msg}")
+            self.tokenizer = AutoTokenizer.from_pretrained(self.model_path)
+            self.model = AutoModelForSequenceClassification.from_pretrained(self.model_path)
+            # Apply quantization if requested
+            if self.quantize and self.device.type == 'cpu':
+                print("🔧 Applying dynamic quantization for memory efficiency...")
+                self.model = torch.quantization.quantize_dynamic(
+                    self.model, {torch.nn.Linear}, dtype=torch.qint8
+                )
+            self.model.to(self.device)
+            self.model.eval()
+            self.model_loaded = True
+            # Check memory after loading
+            memory_ok, memory_msg = self.check_memory_limit()
+            print(f"✅ Model loaded successfully from {self.model_path}")
+            print(f"📊 {memory_msg}")
+            return True
+        except Exception as e:
+            print(f"❌ Error loading model: {e}")
+            self.model_loaded = False
+            self.cleanup_memory()
+            return False
+    def is_model_available(self):
+        """Check if model directory exists and is accessible"""
+        import os
+        return os.path.exists(self.model_path) and os.path.isdir(self.model_path)
+    def predict_sentiment(self, text):
+        """Predict sentiment for given text"""
+        if not self.model_loaded:
+            return None, "❌ Model not loaded. Please train the model first."
+        if not text.strip():
+            return None, "❌ Please enter some text to analyze."
+        try:
+            # Check memory before prediction
+            memory_ok, memory_msg = self.check_memory_limit()
+            if not memory_ok:
+                return None, f"❌ {memory_msg}"
+            start_time = time.time()
+            # Tokenize
+            inputs = self.tokenizer(
+                text,
+                return_tensors="pt",
+                truncation=True,
+                padding=True,
+                max_length=512
+            )
+            # Move to device
+            inputs = {k: v.to(self.device) for k, v in inputs.items()}
+            # Predict
+            with torch.no_grad():
+                outputs = self.model(**inputs)
+                logits = outputs.logits
+                probabilities = torch.softmax(logits, dim=-1)
+                predicted_class = torch.argmax(probabilities, dim=-1).item()
+                confidence = torch.max(probabilities).item()
+            inference_time = time.time() - start_time
+            # Move to CPU and clean GPU memory
+            probs = probabilities.cpu().numpy()[0].tolist()
+            del probabilities, logits, outputs
+            self.cleanup_memory()
+            sentiment = self.sentiment_labels[predicted_class]
+            # Create detailed results
+            result = {
+                "sentiment": sentiment,
+                "confidence": confidence,
+                "probabilities": {
+                    "Negative": probs[0],
+                    "Neutral": probs[1],
+                    "Positive": probs[2]
+                },
+                "inference_time": inference_time,
+                "timestamp": datetime.now().strftime("%Y-%m-%d %H:%M:%S")
+            }
+            # Create formatted output
+            output_text = f"""
+## 🎯 Sentiment Analysis Result
+**Sentiment:** {sentiment}
+**Confidence:** {confidence:.2%}
+**Processing Time:** {inference_time:.3f}s
+### 📊 Probability Distribution:
+- 😠 **Negative:** {probs[0]:.2%}
+- 😐 **Neutral:** {probs[1]:.2%}
+- 😊 **Positive:** {probs[2]:.2%}
+### 📝 Input Text:
+> "{text}"
+---
+*Analysis completed at {result['timestamp']}*
+*{memory_msg}*
+            """.strip()
+            return result, output_text
+        except Exception as e:
+            self.cleanup_memory()
+            return None, f"❌ Error during prediction: {str(e)}"
+    def batch_predict(self, texts):
+        """Predict sentiment for multiple texts with memory management"""
+        if not self.model_loaded:
+            return [], "❌ Model not loaded. Please train the model first."
+        if not texts or not any(texts):
+            return [], "❌ Please enter some texts to analyze."
+        # Filter valid texts and apply batch size limit
+        valid_texts = [text.strip() for text in texts if text.strip()]
+        if len(valid_texts) > self.max_batch_size:
+            return [], f"❌ Too many texts ({len(valid_texts)}). Maximum batch size is {self.max_batch_size} for memory efficiency."
+        if not valid_texts:
+            return [], "❌ No valid texts provided."
+        # Check memory before batch processing
+        memory_ok, memory_msg = self.check_memory_limit()
+        if not memory_ok:
+            return [], f"❌ {memory_msg}"
+        results = []
+        try:
+            for i, text in enumerate(valid_texts):
+                # Check memory every 5 predictions
+                if i % 5 == 0:
+                    memory_ok, memory_msg = self.check_memory_limit()
+                    if not memory_ok:
+                        break
+                result, _ = self.predict_sentiment(text)
+                if result:
+                    results.append(result)
+            if not results:
+                return [], "❌ No valid predictions made."
+            # Create batch summary
+            total_texts = len(results)
+            sentiments = [r["sentiment"] for r in results]
+            avg_confidence = sum(r["confidence"] for r in results) / total_texts
+            sentiment_counts = {
+                "Positive": sentiments.count("Positive"),
+                "Neutral": sentiments.count("Neutral"),
+                "Negative": sentiments.count("Negative")
+            }
+            summary = f"""
+## 📊 Batch Analysis Summary
+**Total Texts Analyzed:** {total_texts}/{len(valid_texts)}
+**Average Confidence:** {avg_confidence:.2%}
+**Memory Used:** {self.get_memory_usage():.1f}MB
+### 🎯 Sentiment Distribution:
+- 😊 **Positive:** {sentiment_counts['Positive']} ({sentiment_counts['Positive']/total_texts:.1%})
+- 😐 **Neutral:** {sentiment_counts['Neutral']} ({sentiment_counts['Neutral']/total_texts:.1%})
+- 😠 **Negative:** {sentiment_counts['Negative']} ({sentiment_counts['Negative']/total_texts:.1%})
+### 📋 Individual Results:
+            """.strip()
+            for i, result in enumerate(results, 1):
+                summary += f"\n**{i}.** {result['sentiment']} ({result['confidence']:.1%})"
+            # Final memory cleanup
+            self.cleanup_memory()
+            return results, summary
+        except Exception as e:
+            self.cleanup_memory()
+            return [], f"❌ Error during batch processing: {str(e)}"
+def create_interface(max_batch_size=10, quantize=False):
+    """Create the Gradio interface with memory management options"""
+    app = SentimentGradioApp(max_batch_size=max_batch_size, quantize=quantize)
+    # Check if model exists
+    if not app.is_model_available():
+        print("❌ Model not found. Please train the model first using: python run_training.py")
+        print("The model directory 'vietnamese_sentiment_finetuned' was not found.")
+        return create_no_model_interface()
+    # Load model
+    if not app.load_model():
+        print("❌ Failed to load model. Please check the model files and try again.")
+        return create_no_model_interface()
+    # Example texts
+    examples = [
+        "Giảng viên dạy rất hay và tâm huyết.",
+        "Môn học này quá khó và nhàm chán.",
+        "Lớp học ổn định, không có gì đặc biệt.",
+        "Tôi rất thích cách giảng dạy của thầy cô.",
+        "Chương trình học cần cải thiện nhiều."
+    ]
+    # Custom CSS
+    css = """
+    .gradio-container {
+        max-width: 900px !important;
+        margin: auto !important;
+    }
+    .sentiment-positive {
+        color: #44ff44;
+        font-weight: bold;
+    }
+    .sentiment-neutral {
+        color: #ffaa00;
+        font-weight: bold;
+    }
+    .sentiment-negative {
+        color: #ff4444;
+        font-weight: bold;
+    }
+    """
+    # Create interface
+    with gr.Blocks(
+        title="Vietnamese Sentiment Analysis",
+        theme=gr.themes.Soft(),
+        css=css
+    ) as interface:
+        gr.Markdown("# 🎭 Vietnamese Sentiment Analysis")
+        gr.Markdown("Enter Vietnamese text to analyze sentiment using a fine-tuned transformer model.")
+        with gr.Tabs():
+            # Single Text Analysis Tab
+            with gr.Tab("📝 Single Text Analysis"):
+                with gr.Row():
+                    with gr.Column(scale=3):
+                        text_input = gr.Textbox(
+                            label="Enter Vietnamese Text",
+                            placeholder="Type or paste Vietnamese text here...",
+                            lines=3
+                        )
+                        with gr.Row():
+                            analyze_btn = gr.Button("🔍 Analyze Sentiment", variant="primary")
+                            clear_btn = gr.Button("🗑️ Clear", variant="secondary")
+                    with gr.Column(scale=2):
+                        gr.Examples(
+                            examples=examples,
+                            inputs=[text_input],
+                            label="💡 Example Texts"
+                        )
+                result_output = gr.Markdown(label="Analysis Result", visible=True)
+                confidence_plot = gr.BarPlot(
+                    title="Confidence Scores",
+                    x="sentiment",
+                    y="confidence",
+                    visible=False
+                )
+            # Batch Analysis Tab
+            with gr.Tab("📊 Batch Analysis"):
+                gr.Markdown(f"### 📝 Memory-Efficient Batch Processing")
+                gr.Markdown(f"**Maximum batch size:** {app.max_batch_size} texts (for memory efficiency)")
+                gr.Markdown(f"**Memory limit:** {app.max_memory_mb}MB")
+                batch_input = gr.Textbox(
+                    label="Enter Multiple Texts (one per line)",
+                    placeholder=f"Enter up to {app.max_batch_size} Vietnamese texts, one per line...",
+                    lines=8,
+                    max_lines=20
+                )
+                with gr.Row():
+                    batch_analyze_btn = gr.Button("🔍 Analyze All", variant="primary")
+                    batch_clear_btn = gr.Button("🗑️ Clear", variant="secondary")
+                    memory_cleanup_btn = gr.Button("🧹 Memory Cleanup", variant="secondary")
+                batch_result_output = gr.Markdown(label="Batch Analysis Result")
+                memory_info = gr.Textbox(
+                    label="Memory Usage",
+                    value=f"{app.get_memory_usage():.1f}MB used",
+                    interactive=False
+                )
+            # Model Info Tab
+            with gr.Tab("ℹ️ Model Information"):
+                gr.Markdown(f"""
+                ## 🤖 Model Details
+                **Model Architecture:** Transformer-based sequence classification
+                **Base Model:** Pre-trained multilingual transformer
+                **Fine-tuned on:** Vietnamese sentiment dataset
+                **Languages:** Vietnamese (optimized)
+                **Labels:** Negative, Neutral, Positive
+                **Quantization:** {'Enabled' if app.quantize else 'Disabled'}
+                **Max Batch Size:** {app.max_batch_size} texts
+                ## 📊 Performance Metrics
+                - **Accuracy:** 85-90% (on validation set)
+                - **Processing Speed:** ~100ms per text
+                - **Max Sequence Length:** 512 tokens
+                - **Memory Limit:** {app.max_memory_mb}MB
+                ## 💡 Usage Tips
+                - Enter clear, grammatically correct Vietnamese text
+                - Longer texts (20-200 words) work best
+                - The model handles various Vietnamese dialects
+                - Confidence scores indicate prediction certainty
+                ## 🛡️ Memory Management
+                - **Automatic Cleanup:** Memory is cleaned after each prediction
+                - **Batch Limits:** Maximum {app.max_batch_size} texts per batch to prevent overflow
+                - **Memory Monitoring:** Real-time memory usage tracking
+                - **GPU Optimization:** CUDA cache clearing when available
+                - **Quantization:** {'Enabled for CPU (reduces memory by ~4x)' if app.quantize else 'Disabled (can be enabled with quantize=True)'}
+                ## ⚠️ Performance Notes
+                - If you encounter memory errors, try reducing batch size
+                - Enable quantization for CPU usage to save memory
+                - Use the Memory Cleanup button if needed
+                - Monitor memory usage in the Batch Analysis tab
+                """)
+        # Event handlers
+        def analyze_text(text):
+            result, output = app.predict_sentiment(text)
+            if result:
+                # Prepare data for confidence plot as pandas DataFrame
+                plot_data = pd.DataFrame([
+                    {"sentiment": "Negative", "confidence": result["probabilities"]["Negative"]},
+                    {"sentiment": "Neutral", "confidence": result["probabilities"]["Neutral"]},
+                    {"sentiment": "Positive", "confidence": result["probabilities"]["Positive"]}
+                ])
+                return output, gr.BarPlot(visible=True, value=plot_data)
+            else:
+                return output, gr.BarPlot(visible=False)
+        def clear_inputs():
+            return "", "", gr.BarPlot(visible=False)
+        def analyze_batch(texts):
+            if texts:
+                text_list = [line.strip() for line in texts.split('\n') if line.strip()]
+                results, summary = app.batch_predict(text_list)
+                return summary
+            return "❌ Please enter some texts to analyze."
+        def clear_batch():
+            return ""
+        def update_memory_info():
+            return f"{app.get_memory_usage():.1f}MB used"
+        def manual_memory_cleanup():
+            app.cleanup_memory()
+            return f"Memory cleaned. Current usage: {app.get_memory_usage():.1f}MB"
+        # Connect events
+        analyze_btn.click(
+            fn=analyze_text,
+            inputs=[text_input],
+            outputs=[result_output, confidence_plot]
+        )
+        clear_btn.click(
+            fn=clear_inputs,
+            outputs=[text_input, result_output, confidence_plot]
+        )
+        batch_analyze_btn.click(
+            fn=analyze_batch,
+            inputs=[batch_input],
+            outputs=[batch_result_output]
+        )
+        batch_clear_btn.click(
+            fn=clear_batch,
+            outputs=[batch_input]
+        )
+        memory_cleanup_btn.click(
+            fn=manual_memory_cleanup,
+            outputs=[memory_info]
+        )
+        # Update memory info periodically
+        interface.load(
+            fn=update_memory_info,
+            outputs=[memory_info]
+        )
+    return interface
+def create_no_model_interface():
+    """Create a fallback interface when no model is available"""
+    def show_training_instructions():
+        return """
+## 🚨 Model Not Found
+The sentiment analysis model is not available yet. Please follow these steps to train the model:
+### 📋 Training Steps:
+1. **Train the Model:**
+   ```bash
+   python run_training.py
+   ```
+2. **Verify Model Creation:**
+   ```bash
+   ls -la vietnamese_sentiment_finetuned/
+   ```
+3. **Restart Gradio App:**
+   ```bash
+   python gradio_app.py
+   ```
+### 📁 Required Files:
+- `run_training.py` - Training script
+- `fine_tune_sentiment.py` - Fine-tuning utilities
+- Dataset files (should be downloaded automatically)
+### ⏱️ Expected Training Time:
+- **CPU:** 30-60 minutes
+- **GPU (CUDA):** 5-15 minutes
+### 📊 What Training Does:
+- Downloads pre-trained multilingual model
+- Fine-tunes on Vietnamese sentiment data
+- Creates `vietnamese_sentiment_finetuned/` directory
+- Saves tokenizer and model files
+### 🔧 Troubleshooting:
+- Ensure sufficient disk space (~2GB)
+- Check internet connection for dataset download
+- Verify Python dependencies: `pip install -r requirements.txt`
+Once training completes, refresh this page to access the full sentiment analysis interface!
+        """
+    with gr.Blocks(
+        title="Vietnamese Sentiment Analysis - Setup Required",
+        theme=gr.themes.Soft()
+    ) as interface:
+        gr.Markdown("# 🎭 Vietnamese Sentiment Analysis")
+        gr.Markdown("## 🚨 Setup Required - Model Not Trained")
+        gr.Markdown("""
+        ### Welcome to the Vietnamese Sentiment Analysis Interface!
+        The AI model needs to be trained before you can use the sentiment analysis features.
+        This is a one-time setup process that fine-tunes a transformer model on Vietnamese text data.
+        """)
+        with gr.Accordion("📖 Click here for training instructions", open=True):
+            instructions_output = gr.Markdown(show_training_instructions())
+        with gr.Row():
+            with gr.Column():
+                gr.Markdown("### 🔍 Quick Start Commands")
+                gr.Code(
+                    value="# Train the model\npython run_training.py\n\n# Then start the interface\npython gradio_app.py",
+                    language="python",
+                    label="Terminal Commands"
+                )
+            with gr.Column():
+                gr.Markdown("### 📊 Project Information")
+                gr.Markdown("""
+                - **Language:** Vietnamese
+                - **Model Type:** Transformer-based (BERT-like)
+                - **Classes:** Negative, Neutral, Positive
+                - **Interface:** Gradio Web UI
+                """)
+        gr.Markdown("---")
+        gr.Markdown("*After training completes, you'll be able to:*")
+        gr.Markdown("""
+        - ✅ Analyze Vietnamese text sentiment in real-time
+        - ✅ Process multiple texts at once (batch mode)
+        - ✅ View confidence scores and probability distributions
+        - ✅ Get detailed analysis with visual charts
+        """)
+    return interface
+def main():
+    """Main function to launch the Gradio app with memory management options"""
+    import argparse
+    parser = argparse.ArgumentParser(description="Vietnamese Sentiment Analysis Web Interface")
+    parser.add_argument("--max-batch-size", type=int, default=10,
+                       help="Maximum batch size for memory efficiency (default: 10)")
+    parser.add_argument("--quantize", action="store_true",
+                       help="Enable model quantization for memory efficiency (CPU only)")
+    parser.add_argument("--max-memory", type=int, default=4096,
+                       help="Maximum memory usage in MB (default: 4096)")
+    parser.add_argument("--port", type=int, default=7862,
+                       help="Port to run the interface on (default: 7862)")
+    parser.add_argument("--host", type=str, default="127.0.0.1",
+                       help="Host to bind the interface to (default: 127.0.0.1)")
+    args = parser.parse_args()
+    print("🚀 Starting Vietnamese Sentiment Analysis Web Interface...")
+    print(f"🔧 Memory Settings:")
+    print(f"   - Max Batch Size: {args.max_batch_size}")
+    print(f"   - Quantization: {'Enabled' if args.quantize else 'Disabled'}")
+    print(f"   - Max Memory: {args.max_memory}MB")
+    interface = create_interface(
+        max_batch_size=args.max_batch_size,
+        quantize=args.quantize
+    )
+    if interface is None:
+        print("❌ Failed to create interface. Exiting.")
+        return
+    # Update memory limit if specified
+    if hasattr(interface, 'app'):
+        interface.app.max_memory_mb = args.max_memory
+    print("✅ Interface created successfully!")
+    print("🌐 Launching web interface...")
+    print(f"📍 URL: http://{args.host}:{args.port}")
+    # Launch the interface
+    interface.launch(
+        server_name=args.host,
+        server_port=args.port,
+        share=False,
+        show_error=True,
+        quiet=False
+    )
+if __name__ == "__main__":
+    main()

py/test_model.py ADDED Viewed

	@@ -0,0 +1,277 @@

+import torch
+from transformers import AutoTokenizer, AutoModelForSequenceClassification
+import numpy as np
+import pandas as pd
+from sklearn.metrics import classification_report, confusion_matrix
+import matplotlib.pyplot as plt
+import seaborn as sns
+import argparse
+class SentimentTester:
+    def __init__(self, model_path="./vietnamese_sentiment_finetuned"):
+        self.model_path = model_path
+        self.tokenizer = None
+        self.model = None
+        self.device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
+    def load_model(self):
+        """Load the fine-tuned model and tokenizer"""
+        print(f"Loading model from: {self.model_path}")
+        print(f"Using device: {self.device}")
+        self.tokenizer = AutoTokenizer.from_pretrained(self.model_path)
+        self.model = AutoModelForSequenceClassification.from_pretrained(self.model_path)
+        self.model.to(self.device)
+        self.model.eval()
+        print("Model loaded successfully!")
+        print(f"Number of labels: {self.model.config.num_labels}")
+    def predict_sentiment(self, text, return_probabilities=False):
+        """Predict sentiment for a single text"""
+        # Tokenize the text
+        inputs = self.tokenizer(
+            text,
+            return_tensors="pt",
+            truncation=True,
+            padding=True,
+            max_length=512
+        )
+        # Move to device
+        inputs = {k: v.to(self.device) for k, v in inputs.items()}
+        # Get predictions
+        with torch.no_grad():
+            outputs = self.model(**inputs)
+            logits = outputs.logits
+            probabilities = torch.softmax(logits, dim=-1)
+            predicted_class = torch.argmax(probabilities, dim=-1).item()
+        if return_probabilities:
+            return predicted_class, probabilities.cpu().numpy()[0]
+        else:
+            return predicted_class
+    def predict_batch(self, texts):
+        """Predict sentiment for a batch of texts"""
+        predictions = []
+        probabilities = []
+        for text in texts:
+            pred, probs = self.predict_sentiment(text, return_probabilities=True)
+            predictions.append(pred)
+            probabilities.append(probs)
+        return np.array(predictions), np.array(probabilities)
+    def test_custom_texts(self):
+        """Test the model with custom Vietnamese texts"""
+        test_texts = [
+            "Giảng viên dạy rất hay và tâm huyết.",
+            "Môn học này quá khó và nhàm chán.",
+            "Lớp học ổn định, không có gì đặc biệt.",
+            "Tôi rất thích cách giảng dạy của thầy cô.",
+            "Chương trình học cần cải thiện nhiều.",
+            "Thời gian biểu hợp lý, dễ theo kịp.",
+            "Bài tập quá nhiều và khó.",
+            "Môi trường học tập tốt, bạn bè thân thiện."
+        ]
+        print("\n" + "="*60)
+        print("TESTING WITH CUSTOM VIETNAMESE TEXTS")
+        print("="*60)
+        label_names = ["Negative", "Neutral", "Positive"]  # Assuming 3 classes
+        for i, text in enumerate(test_texts, 1):
+            pred, probs = self.predict_sentiment(text, return_probabilities=True)
+            confidence = np.max(probs)
+            print(f"\n{i}. Text: {text}")
+            print(f"   Predicted: {label_names[pred]} (Class {pred})")
+            print(f"   Confidence: {confidence:.4f}")
+            print(f"   Probabilities: {probs}")
+    def interactive_test(self):
+        """Interactive testing mode"""
+        print("\n" + "="*60)
+        print("INTERACTIVE SENTIMENT ANALYSIS")
+        print("="*60)
+        print("Enter Vietnamese text to analyze sentiment (type 'quit' to exit):")
+        label_names = ["Negative", "Neutral", "Positive"]  # Assuming 3 classes
+        while True:
+            text = input("\nEnter text: ").strip()
+            if text.lower() in ['quit', 'exit', 'q']:
+                break
+            if not text:
+                continue
+            try:
+                pred, probs = self.predict_sentiment(text, return_probabilities=True)
+                confidence = np.max(probs)
+                print(f"Predicted: {label_names[pred]} (Class {pred})")
+                print(f"Confidence: {confidence:.4f}")
+                print(f"Probabilities: {probs}")
+            except Exception as e:
+                print(f"Error: {e}")
+    def evaluate_from_file(self, file_path, text_column, label_column=None):
+        """Evaluate model on a dataset from file"""
+        print(f"\nEvaluating on dataset from: {file_path}")
+        try:
+            # Load dataset
+            if file_path.endswith('.csv'):
+                df = pd.read_csv(file_path)
+            elif file_path.endswith('.json'):
+                df = pd.read_json(file_path)
+            else:
+                print("Unsupported file format. Please use CSV or JSON.")
+                return
+            print(f"Loaded {len(df)} samples")
+            # Get texts and labels
+            texts = df[text_column].tolist()
+            if label_column and label_column in df.columns:
+                true_labels = df[label_column].tolist()
+                has_labels = True
+            else:
+                true_labels = None
+                has_labels = False
+            # Make predictions
+            print("Making predictions...")
+            predictions, probabilities = self.predict_batch(texts)
+            # Display results
+            if has_labels:
+                print("\nClassification Report:")
+                print(classification_report(true_labels, predictions))
+                # Confusion matrix
+                cm = confusion_matrix(true_labels, predictions)
+                plt.figure(figsize=(8, 6))
+                sns.heatmap(cm, annot=True, fmt='d', cmap='Blues')
+                plt.title('Confusion Matrix')
+                plt.xlabel('Predicted')
+                plt.ylabel('Actual')
+                plt.savefig('test_confusion_matrix.png', dpi=300, bbox_inches='tight')
+                plt.show()
+                # Calculate accuracy
+                accuracy = np.mean(np.array(predictions) == np.array(true_labels))
+                print(f"Overall Accuracy: {accuracy:.4f}")
+            # Show some examples
+            print("\nSample predictions:")
+            label_names = ["Negative", "Neutral", "Positive"]
+            for i in range(min(5, len(texts))):
+                pred_label = label_names[predictions[i]]
+                confidence = np.max(probabilities[i])
+                true_label = f" (True: {label_names[true_labels[i]]})" if has_labels else ""
+                print(f"{i+1}. {texts[i][:50]}...")
+                print(f"   Predicted: {pred_label} (Confidence: {confidence:.3f}){true_label}")
+        except Exception as e:
+            print(f"Error evaluating file: {e}")
+    def compare_with_original(self):
+        """Compare fine-tuned model with original model"""
+        print("\n" + "="*60)
+        print("COMPARING WITH ORIGINAL MODEL")
+        print("="*60)
+        test_texts = [
+            "Giảng viên dạy rất hay và tâm huyết.",
+            "Môn học này quá khó và nhàm chán.",
+            "Lớp học ổn định, không có gì đặc biệt."
+        ]
+        original_model = "5CD-AI/Vietnamese-Sentiment-visobert"
+        try:
+            # Load original model
+            print("Loading original model...")
+            original_tokenizer = AutoTokenizer.from_pretrained(original_model)
+            original_model_instance = AutoModelForSequenceClassification.from_pretrained(original_model)
+            original_model_instance.to(self.device)
+            original_model_instance.eval()
+            print("\nComparison Results:")
+            print("-" * 50)
+            label_names = ["Negative", "Neutral", "Positive"]
+            for i, text in enumerate(test_texts, 1):
+                # Fine-tuned model prediction
+                ft_pred, ft_probs = self.predict_sentiment(text, return_probabilities=True)
+                # Original model prediction
+                inputs = original_tokenizer(
+                    text,
+                    return_tensors="pt",
+                    truncation=True,
+                    padding=True,
+                    max_length=512
+                )
+                inputs = {k: v.to(self.device) for k, v in inputs.items()}
+                with torch.no_grad():
+                    outputs = original_model_instance(**inputs)
+                    orig_logits = outputs.logits
+                    orig_probs = torch.softmax(orig_logits, dim=-1)
+                    orig_pred = torch.argmax(orig_probs, dim=-1).item()
+                    orig_probs = orig_probs.cpu().numpy()[0]
+                print(f"\n{i}. Text: {text}")
+                print(f"   Fine-tuned: {label_names[ft_pred]} (Conf: {np.max(ft_probs):.3f})")
+                print(f"   Original:    {label_names[orig_pred]} (Conf: {np.max(orig_probs):.3f})")
+                if ft_pred != orig_pred:
+                    print(f"   *** DIFFERENT PREDICTION ***")
+        except Exception as e:
+            print(f"Error in comparison: {e}")
+def main():
+    parser = argparse.ArgumentParser(description='Test fine-tuned Vietnamese sentiment analysis model')
+    parser.add_argument('--model_path', type=str, default='./vietnamese_sentiment_finetuned',
+                       help='Path to the fine-tuned model')
+    parser.add_argument('--mode', type=str, choices=['custom', 'interactive', 'file', 'compare'],
+                       default='custom', help='Testing mode')
+    parser.add_argument('--file_path', type=str, help='Path to test file (for file mode)')
+    parser.add_argument('--text_column', type=str, default='text', help='Text column name (for file mode)')
+    parser.add_argument('--label_column', type=str, help='Label column name (for file mode)')
+    args = parser.parse_args()
+    # Initialize tester
+    tester = SentimentTester(args.model_path)
+    # Load model
+    tester.load_model()
+    # Run tests based on mode
+    if args.mode == 'custom':
+        tester.test_custom_texts()
+    elif args.mode == 'interactive':
+        tester.interactive_test()
+    elif args.mode == 'file':
+        if not args.file_path:
+            print("Error: --file_path required for file mode")
+            return
+        tester.evaluate_from_file(args.file_path, args.text_column, args.label_column)
+    elif args.mode == 'compare':
+        tester.compare_with_original()
+if __name__ == "__main__":
+    main()