karthik-2905 commited on
Commit
0a26abe
·
verified ·
1 Parent(s): 7a321da

Upload folder using huggingface_hub

Browse files
Files changed (2) hide show
  1. README.md +39 -0
  2. README_HF.md +38 -0
README.md ADDED
@@ -0,0 +1,39 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Clustering Algorithms for Customer Segmentation
2
+
3
+ This repository contains a comprehensive implementation of various clustering algorithms to perform customer segmentation on a synthetic dataset. The project explores K-Means, Hierarchical Clustering, DBSCAN, and Gaussian Mixture Models (GMM) to identify distinct customer groups based on age and income.
4
+
5
+ ## Project Structure
6
+
7
+ - `implementation.ipynb`: The main Jupyter notebook containing the entire analysis, from data generation to model evaluation and visualization.
8
+ - `data/`: Contains the synthetic `customer_data.csv` file.
9
+ - `models/`: Stores the trained clustering models and the data scaler.
10
+ - `results/`: Includes the algorithm comparison, detailed analysis, and experiment summary.
11
+ - `visualizations/`: Contains the output plots, such as the elbow method analysis and cluster comparisons.
12
+
13
+ ## Features
14
+
15
+ - **Data Generation**: A synthetic customer dataset is generated with clear cluster structures for effective model training and evaluation.
16
+ - **Multiple Algorithms**: Implements and compares four popular clustering algorithms:
17
+ - K-Means
18
+ - Hierarchical Clustering
19
+ - DBSCAN
20
+ - Gaussian Mixture Models (GMM)
21
+ - **Model Evaluation**: Uses the elbow method and silhouette scores to determine the optimal number of clusters and evaluate performance.
22
+ - **Comprehensive Visualization**: Generates plots to visualize the clusters, compare algorithm performance, and analyze the optimal 'k'.
23
+
24
+ ## How to Use
25
+
26
+ 1. **Clone the repository:**
27
+ ```bash
28
+ git clone https://github.com/GruheshKurra/ClusteringAlgorithms.git
29
+ ```
30
+ 2. **Install dependencies:**
31
+ ```bash
32
+ pip install -r requirements.txt
33
+ ```
34
+ 3. **Run the notebook:**
35
+ Open and run the `implementation.ipynb` notebook in a Jupyter environment to see the full analysis.
36
+
37
+ ## License
38
+
39
+ This project is licensed under the MIT License.
README_HF.md ADDED
@@ -0,0 +1,38 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ ---
4
+
5
+ # Clustering Algorithms for Customer Segmentation
6
+
7
+ This repository hosts a project on customer segmentation using various clustering algorithms. It includes the code, a synthetic dataset, trained models, and visualizations.
8
+
9
+ ## Project Overview
10
+
11
+ This project implements and compares the following clustering algorithms for customer segmentation:
12
+ - K-Means
13
+ - Hierarchical Clustering
14
+ - DBSCAN
15
+ - Gaussian Mixture Models (GMM)
16
+
17
+ The goal is to identify distinct customer groups based on their age and income.
18
+
19
+ ## Repository Contents
20
+
21
+ - `implementation.ipynb`: The main Jupyter notebook with the complete analysis.
22
+ - `data/customer_data.csv`: The synthetic dataset used for clustering.
23
+ - `models/`: Contains the saved models for each algorithm and the data scaler.
24
+ - `results/`: Contains detailed analysis and comparison of the algorithms.
25
+ - `visualizations/`: Includes plots for cluster visualization and analysis.
26
+
27
+ ## How to Use
28
+
29
+ You can use the trained models and the dataset from this repository for your own analysis. To get started, you can clone the repository and explore the `implementation.ipynb` notebook.
30
+
31
+ ```bash
32
+ # Clone the repository
33
+ git clone https://huggingface.co/karthik-2905/ClusteringAlgorithms
34
+ ```
35
+
36
+ ## License
37
+
38
+ This project is licensed under the MIT License.