YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
SUML - Car Price Prediction
Machine Learning course project at PJATK. Predicts used car prices using AutoGluon ensemble models.
Setup
Clone from Hugging Face
This repository is hosted on Hugging Face Hub to handle large model and data files.
# Install git-lfs (required for large files)
# Ubuntu/Debian: sudo apt install git-lfs
# Fedora: sudo dnf install git-lfs
# macOS: brew install git-lfs
# Windows: Download from https://git-lfs.com
git lfs install
git clone https://huggingface.co/bunny501/SUML
cd SUML
Alternatively, use the Hugging Face CLI:
pip install huggingface_hub
huggingface-cli download bunny501/SUML --local-dir ./SUML
cd SUML
Install Dependencies
python -m venv .venv
# Linux/macOS:
source .venv/bin/activate
# Windows:
.venv\Scripts\activate
pip install -r requirements.txt
Running the App
streamlit run App/main.py
Project Structure
SUML/
βββ App/ # Streamlit web application
β βββ main.py # Main UI - form inputs and prediction display
β βββ inference.py # Model loading and prediction logic
β βββ feature_defaults.json # Default feature values (dataset averages)
β βββ make_model_mapping.json # Car make/model dropdown data
β βββ column_value_ranges.json # Valid ranges for input validation
β
βββ AutogluonModels/ # Trained model files (WeightedEnsemble_L2)
β
βββ Data/ # Datasets
β βββ Cleaned_train.csv # Main training dataset
β βββ sales_ads_train.csv # Raw training data
β βββ sales_ads_test.csv # Raw test data
β βββ synthetic_*.csv # Synthetic data (MostlyAI, SDV)
β
βββ src/ # Source modules
β βββ Autogluon.py # Model training configuration and utilities
β
βββ Notebooks/ # Jupyter notebooks
β βββ EDA.ipynb # Exploratory data analysis
β βββ ValueRangeExtraction.ipynb
β
βββ Predicting-and-Analyzing.../ # Reference project with XGBoost experiments
β βββ DataCleaning.ipynb
β βββ DataExploration.ipynb
β βββ Prediction.ipynb
β βββ ...
β
βββ DatasetCleanUpPreparation.py # Data preprocessing script
βββ requirements.txt # Python dependencies
βββ README.md
Training a New Model
source .venv/bin/activate
python -c "from src.Autogluon import run_exploration; run_exploration()"
Features
The model uses:
- Basic info: year, mileage, condition, body type, doors
- Engine specs: fuel type, power (HP), engine size, transmission, drivetrain
- 20 equipment features: leather, heated seats, AC, cruise control, alloy wheels, LED lights, parking sensors, GPS, Bluetooth, etc.
- Standard features (auto-assumed): ABS, airbags, central locking, power steering
Inference Providers NEW
This model isn't deployed by any Inference Provider. π Ask for provider support