Upload 6 files

Files changed (6)

Credit Card Clustering with Machine Learning.ipynb +0 -0
README.md +136 -2
Screenshot_2.png +0 -0
app.py +49 -0
model.pkl +3 -0
requirements.txt +5 -0

Credit Card Clustering with Machine Learning.ipynb ADDED

The diff for this file is too large to render.

README.md CHANGED

@@ -1,3 +1,137 @@
 ---
-license: mit
----

+# Credit Card Clustering with Machine Learning
+This project focuses on clustering credit card customers based on their usage behavior using unsupervised machine learning techniques. The goal is to segment customers for better targeting, offers, and personalized financial services.
+## 📌 Objective
+- Understand customer behavior from credit card usage.
+- Segment customers into clusters with similar patterns.
+- Help financial institutions create targeted marketing strategies.
+## 📊 Dataset
+- Source: [Aman Kharwal’s GitHub Dataset](https://raw.githubusercontent.com/amankharwal/Website-data/master/credit_card.csv)
+- Contains features like:
+  - `BALANCE`: Average balance
+  - `PURCHASES`: Total purchases
+  - `CREDIT_LIMIT`: Assigned credit limit
+  - `PAYMENTS`: Amount paid
+  - `TENURE`: Months as a customer
+  - `ONEOFF_PURCHASES`, `INSTALLMENTS_PURCHASES`, etc.
+## 🧹 Data Preprocessing
+- Checked for null values and handled them
+- Dropped irrelevant columns (e.g., `CUST_ID`)
+- Scaled data using `StandardScaler`
+## 🧠 Clustering Algorithm
+- Used **KMeans** algorithm
+- Determined optimal number of clusters using:
+  - Elbow Method
+  - Silhouette Score
+## 📉 Dimensionality Reduction
+- Applied **PCA** for visualizing clusters in 2D space
+## 📈 Results & Analysis
+- Clusters represent different types of customers:
+  - High spenders
+  - Low activity users
+  - Customers using mostly installments
+- Visualized clusters using `matplotlib` and `seaborn`
+## 📦 Libraries Used
+- `pandas`
+- `numpy`
+- `matplotlib`, `seaborn`
+- `scikit-learn`
+## 🔍 Future Improvements
+- Try alternative clustering algorithms like DBSCAN, GMM
+- Add deeper feature engineering
+- Include time-based features for trend analysis
+## 💻 How to Run
+1. Clone the repo:
+    ```bash
+    git clone https://github.com/handecrkc/credit-card-clustering.git
+    ```
+2. Install requirements:
+    ```bash
+    pip install -r requirements.txt
+    ```
+3. Run the notebook:
+    Open `Credit Card Clustering with Machine Learning.ipynb` in Jupyter Notebook or VS Code.
 ---
+## 🧑‍💻 Author
+- **Hande Çarkcı**
+- GitHub: [github.com/handecrkc](https://github.com/handecrkc)
+# 💳 Credit Card Clustering – Streamlit App
+This project is a **Machine Learning** application that segments customers according to their credit card usage habits.
+The app, built with Streamlit, predicts which cluster a customer belongs to from the data the user enters.
+## 🎯 Project Goals
+- Group credit card users into **similar behavioral segments**
+- Give financial institutions a basis for **targeted marketing strategies**
+- Predict a user's segment in real time
+## 🧠 Methods Used
+- **KMeans Clustering**
+- Data scaling with **StandardScaler**
+- Web application built with **Streamlit**
+## 🗃️ Dataset
+- Source: [`CC GENERAL.csv`](https://raw.githubusercontent.com/amankharwal/Website-data/master/credit_card.csv)
+- Columns: `BALANCE`, `PURCHASES`, `CREDIT_LIMIT`, `PAYMENTS`, `TENURE`, etc.
+## 🚀 Running the App
+```bash
+git clone https://github.com/kullanici_adin/credit-card-clustering-streamlit.git
+cd credit-card-clustering-streamlit
+pip install -r requirements.txt
+streamlit run app.py
+```
+## 🖼️ App Screenshot
+## 🔍 Cluster Descriptions
+| Cluster | Description |
+|---|---|
+| 0 | 🟢 Low-spending, low-risk customer |
+| 1 | 🟡 Moderate-spending customer |
+| 2 | 🔴 High-spending, active customer |
+| 3 | 🔵 Customer with high installment purchases |
+## 🛠️ Required Libraries
+- `streamlit`
+- `pandas`
+- `numpy`
+- `scikit-learn`
+- `joblib`
+## 📜 License
+This project is open-source under the MIT License.
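
The notebook diff is not rendered, so the preprocessing, cluster-count selection, and PCA steps listed in the README can only be illustrated, not quoted. The following is a minimal sketch under stated assumptions: it uses a small synthetic table in place of the real CSV, fills nulls with the column median (the README does not say how nulls were handled), and picks the cluster count by silhouette score while also recording inertia for an elbow plot. The column subset and all variable names are illustrative.

```python
import numpy as np
import pandas as pd
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA
from sklearn.metrics import silhouette_score
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the credit card CSV (assumption: the real data has
# a CUST_ID column plus numeric usage columns such as these).
rng = np.random.default_rng(0)
n = 300
df = pd.DataFrame({
    "CUST_ID": [f"C{i:05d}" for i in range(n)],
    "BALANCE": rng.gamma(2.0, 800.0, n),
    "PURCHASES": rng.gamma(2.0, 500.0, n),
    "CREDIT_LIMIT": rng.gamma(3.0, 1500.0, n),
    "PAYMENTS": rng.gamma(2.0, 700.0, n),
    "TENURE": rng.integers(6, 13, n),
})
df.loc[::50, "CREDIT_LIMIT"] = np.nan  # inject a few missing values

# Preprocessing: drop the identifier, fill nulls (median is an assumption),
# then standardise each column to zero mean and unit variance.
features = df.drop(columns=["CUST_ID"])
features = features.fillna(features.median())
X = StandardScaler().fit_transform(features)

# Fit KMeans for a range of k. Inertia feeds the elbow plot; the silhouette
# score gives a second opinion on the number of clusters.
inertia, silhouette = {}, {}
for k in range(2, 8):
    km = KMeans(n_clusters=k, n_init=10, random_state=42).fit(X)
    inertia[k] = km.inertia_
    silhouette[k] = silhouette_score(X, km.labels_)

best_k = max(silhouette, key=silhouette.get)
labels = KMeans(n_clusters=best_k, n_init=10, random_state=42).fit_predict(X)

# Project the scaled features onto two principal components for a 2D plot.
coords = PCA(n_components=2).fit_transform(X)
```

Plotting `inertia` against `k` gives the elbow curve, and `coords` coloured by `labels` gives the 2D cluster view the README describes. On real data the two criteria can disagree, so the final choice of `k` remains a judgment call.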

Screenshot_2.png ADDED

app.py ADDED

	@@ -0,0 +1,49 @@

+import streamlit as st
+import pandas as pd
+import numpy as np
+import joblib
+# Load the scaler and the KMeans model
+scaler, kmeans = joblib.load("model.pkl")
+st.title("💳 Credit Card Customer Segmentation")
+st.markdown("Enter customer details to find out which cluster they belong to.")
+# Collect input from the user
+def get_user_input():
+    balance = st.number_input("BALANCE", 0.0, 100000.0, 2000.0)
+    purchases = st.number_input("PURCHASES", 0.0, 100000.0, 3000.0)
+    oneoff = st.number_input("ONEOFF_PURCHASES", 0.0, 50000.0, 1000.0)
+    installments = st.number_input("INSTALLMENTS_PURCHASES", 0.0, 50000.0, 2000.0)
+    credit_limit = st.number_input("CREDIT_LIMIT", 100.0, 100000.0, 5000.0)
+    payments = st.number_input("PAYMENTS", 0.0, 100000.0, 2500.0)
+    tenure = st.slider("TENURE (months as a customer)", 0, 12, 6)
+    data = {
+        'BALANCE': balance,
+        'PURCHASES': purchases,
+        'ONEOFF_PURCHASES': oneoff,
+        'INSTALLMENTS_PURCHASES': installments,
+        'CREDIT_LIMIT': credit_limit,
+        'PAYMENTS': payments,
+        'TENURE': tenure
+    }
+    return pd.DataFrame([data])
+# Predict the cluster
+input_df = get_user_input()
+if st.button("Predict"):
+    scaled_input = scaler.transform(input_df)
+    # Cast to a plain int so the value is a clean dictionary key
+    cluster = int(kmeans.predict(scaled_input)[0])
+    st.subheader(f"🔍 Predicted Cluster: {cluster}")
+    yorumlar = {
+        0: "🟢 Low-spending, low-risk customer.",
+        1: "🟡 Moderate-spending customer.",
+        2: "🔴 High-spending, active customer.",
+        3: "🔵 Customer with high installment purchases."
+    }
+    st.write(yorumlar.get(cluster, "Unknown cluster"))
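
`app.py` unpacks `model.pkl` as a `(scaler, kmeans)` pair, but the commit does not show how that file was written. The sketch below shows one way such a file could be produced and read back, assuming the seven columns from `get_user_input()` in that order and four clusters to match the four descriptions. The training data is synthetic and the temporary path is illustrative.

```python
import os
import tempfile

import joblib
import numpy as np
import pandas as pd
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

# Column order assumed to match the DataFrame built in get_user_input().
COLUMNS = [
    "BALANCE", "PURCHASES", "ONEOFF_PURCHASES", "INSTALLMENTS_PURCHASES",
    "CREDIT_LIMIT", "PAYMENTS", "TENURE",
]

# Synthetic training data standing in for the real CSV.
rng = np.random.default_rng(1)
train = pd.DataFrame(
    rng.gamma(2.0, 1000.0, size=(200, len(COLUMNS))), columns=COLUMNS
)
train["TENURE"] = rng.integers(6, 13, size=len(train))

# Fit the scaler and the clustering model, then store both in one file.
scaler = StandardScaler().fit(train)
kmeans = KMeans(n_clusters=4, n_init=10, random_state=42).fit(
    scaler.transform(train)
)
path = os.path.join(tempfile.mkdtemp(), "model.pkl")
joblib.dump((scaler, kmeans), path)

# Load the pair back and score one customer, as app.py does.
loaded_scaler, loaded_kmeans = joblib.load(path)
row = pd.DataFrame([{
    "BALANCE": 2000.0, "PURCHASES": 3000.0, "ONEOFF_PURCHASES": 1000.0,
    "INSTALLMENTS_PURCHASES": 2000.0, "CREDIT_LIMIT": 5000.0,
    "PAYMENTS": 2500.0, "TENURE": 6,
}])
cluster = int(loaded_kmeans.predict(loaded_scaler.transform(row))[0])
```

KMeans assigns cluster indices arbitrarily, so the mapping from index to description in `yorumlar` only holds for the specific fitted model in `model.pkl`. After retraining, the descriptions would need to be rechecked against the new cluster centers.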

model.pkl ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:2b60f9e959e6ac0efa1dabc07e019fad4495f6bdc1e1c3da7d54ca656b61ee0b
+size 37266
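
The three lines above are a Git LFS pointer, not the pickle itself: `oid` is the SHA-256 of the real file's contents and `size` is its length in bytes. A short sketch of checking a downloaded `model.pkl` against such a pointer, using only the standard library, might look like this. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the demo payload is made up.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data, pointer):
    """Return True if the bytes have the size and SHA-256 the pointer records."""
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


# The pointer from this commit.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:2b60f9e959e6ac0efa1dabc07e019fad4495f6bdc1e1c3da7d54ca656b61ee0b\n"
    "size 37266\n"
)

# Demo with made-up bytes, since the real model.pkl is not available here.
payload = b"example payload"
demo_pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    f"oid sha256:{hashlib.sha256(payload).hexdigest()}\n"
    f"size {len(payload)}\n"
)
demo_ok = matches_pointer(payload, demo_pointer)
```

To check the real file, read `model.pkl` in binary mode and pass its bytes with `pointer` to `matches_pointer`. A mismatch usually means the pointer file was downloaded instead of the LFS object.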

requirements.txt ADDED

	@@ -0,0 +1,5 @@

+streamlit
+pandas
+numpy
+scikit-learn
+joblib