iBrokeTheCode commited on
Commit
453dbe9
·
1 Parent(s): f429f8e

chore: Redact README file

Browse files
Files changed (4) hide show
  1. .gitattributes +1 -0
  2. LICENSE +21 -0
  3. README.md +40 -2
  4. public/app-demo.png +3 -0
.gitattributes CHANGED
@@ -35,3 +35,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
  home_credit_dataset.csv filter=lfs diff=lfs merge=lfs -text
37
  lgbm_model.joblib filter=lfs diff=lfs merge=lfs -text
 
 
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
  home_credit_dataset.csv filter=lfs diff=lfs merge=lfs -text
37
  lgbm_model.joblib filter=lfs diff=lfs merge=lfs -text
38
+ *.png filter=lfs diff=lfs merge=lfs -text
LICENSE ADDED
@@ -0,0 +1,21 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ MIT License
2
+
3
+ Copyright (c) 2025 AlexRTC
4
+
5
+ Permission is hereby granted, free of charge, to any person obtaining a copy
6
+ of this software and associated documentation files (the "Software"), to deal
7
+ in the Software without restriction, including without limitation the rights
8
+ to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
9
+ copies of the Software, and to permit persons to whom the Software is
10
+ furnished to do so, subject to the following conditions:
11
+
12
+ The above copyright notice and this permission notice shall be included in all
13
+ copies or substantial portions of the Software.
14
+
15
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
16
+ IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
17
+ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
18
+ AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
19
+ LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
20
+ OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
21
+ SOFTWARE.
README.md CHANGED
@@ -9,8 +9,40 @@ license: mit
9
  short_description: ML Classification models applied to Home Credit Risk dataset
10
  ---
11
 
12
- Check out marimo at <https://github.com/marimo-team/marimo>
13
- Check out the configuration reference at <https://huggingface.co/docs/hub/spaces-config-reference>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
14
 
15
  ## 3. Technology Stack
16
 
@@ -36,3 +68,9 @@ This project was built using the following technologies and libraries:
36
 
37
  - [Ruff](https://github.com/charliermarsh/ruff): A fast Python linter and code formatter.
38
  - [uv](https://github.com/astral-sh/uv): A fast Python package installer and resolver.
 
 
 
 
 
 
 
9
  short_description: ML Classification models applied to Home Credit Risk dataset
10
  ---
11
 
12
+ # 🏦 Home Credit Default Risk Prediction
13
+
14
+ ## Table of Contents
15
+
16
+ 1. [Project Description](#1-project-description)
17
+ 2. [Methodology & Key Features](#2-methodology--key-features)
18
+ 3. [Technology Stack](#3-technology-stack)
19
+ 4. [Dataset](#4-dataset)
20
+
21
+ ## 1. Project Description
22
+
23
+ This project focuses on building a machine learning pipeline to predict a client's ability to repay a loan. It is a binary classification task that uses a real-world financial dataset to identify clients who may face payment difficulties.
24
+
25
+ The project goes beyond a standard model by including a practical application that:
26
+
27
+ - **Preprocesses and cleans the dataset** for model training.
28
+ - **Trains a machine learning model** to predict loan repayment risk.
29
+ - **Deploys an interactive predictor app** using Marimo, hosted on Hugging Face Spaces.
30
+ - **Allows users to make predictions** by providing the top 10 most influential features.
31
+
32
+ This work showcases a complete end-to-end workflow, transforming raw data into a functional, user-friendly tool for risk assessment.
33
+
34
+ > [!IMPORTANT]
35
+ >
36
+ > - Check out the deployed app here: 👉️ [Home Credit Default Risk Prediction App](https://huggingface.co/spaces/iBrokeTheCode/Home_Credit_Default_Risk_Prediction) 👈️
37
+ > - Check out the Jupyter Notebook for a detailed walkthrough of the project here: 👉️ [Jupyter Notebook](https://huggingface.co/spaces/iBrokeTheCode/Home_Credit_Default_Risk_Prediction/blob/main/tutorial_app.ipynb) 👈️
38
+
39
+ ![App](./public/app-demo.png)
40
+
41
+ ## 2. Methodology & Key Features
42
+
43
+ - **Model Selection:** Four different models were trained and evaluated, with **LightGBM** selected as the final model due to its superior performance, achieving a **ROC AUC score of 0.751** on the test set.
44
+ - **Automated Preprocessing:** The data preprocessing pipeline handles common tasks such as feature scaling and categorical encoding, ensuring the model receives clean and formatted data.
45
+ - **Interactive Predictor:** An application built with **Marimo** allows users to interact with the trained model directly. It uses the **top 10 most important features**—identified from the final LightGBM model—to generate real-time predictions.
46
 
47
  ## 3. Technology Stack
48
 
 
68
 
69
  - [Ruff](https://github.com/charliermarsh/ruff): A fast Python linter and code formatter.
70
  - [uv](https://github.com/astral-sh/uv): A fast Python package installer and resolver.
71
+
72
+ ## 4. Dataset
73
+
74
+ This project utilizes the **Home Credit Default Risk** from Kaggle, a public dataset containing details on over 246,000 of individuals who have made payments on their loans.
75
+
76
+ - **Source**: [Kaggle Dataset](https://www.kaggle.com/competitions/home-credit-default-risk/overview)
public/app-demo.png ADDED

Git LFS Details

  • SHA256: 1b781c368261edccb9a58a19b9e0ee8071b0c2240ea49fc7d2a4b63ddc455e7f
  • Pointer size: 130 Bytes
  • Size of remote file: 69.2 kB