Update README.md
README.md
CHANGED
@@ -10,4 +10,29 @@ pinned: false
license: mit
---
# DistilBERT-based Sentiment Analysis Project for Predicting Shoe Review Ratings

This project implements a sentiment analysis model to predict star ratings for Amazon shoe reviews. It leverages DistilBERT-base-uncased, a pre-trained transformer model from Hugging Face, fine-tuned on a dataset of Amazon shoe reviews.

## Project Structure
- `01. Data Preparation.ipynb`: This notebook handles the entire data pipeline:
  * __Data Collection:__ The Amazon shoe review dataset was collected from [Kaggle](https://www.kaggle.com/datasets/cynthiarempel/amazon-us-customer-reviews-dataset?select=amazon_reviews_us_Shoes_v1_00.tsv).
  * __Data Cleaning & Preprocessing:__ The data was cleaned and preprocessed to prepare it for model training.
  * __Data Sharing:__ After preprocessing, the dataset was pushed to the Hugging Face Hub. [Dataset Link](https://huggingface.co/datasets/mazed/amazon_shoe_review)

- `02. Model Training.ipynb`: This notebook covers:
  * Fine-tuning the pre-trained DistilBERT-base-uncased model from Hugging Face on the preprocessed data to predict shoe review star ratings.

- `03. Save Model to Hub.ipynb`: This notebook handles:
  * __Model Evaluation:__ Predictions are made on a few examples to evaluate the fine-tuned model.
  * __Model Sharing:__ The fine-tuned model is then pushed to the Hugging Face model hub. [Model Link](https://huggingface.co/mazed/distilbert-amazon-shoe-review)

- `requirements.txt`: Lists the dependencies needed for the project:
  - `transformers`
  - `gradio`
  - `torch`

- `app.py`: A script to deploy the model using Gradio for a web-based interface.
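
The README does not spell out which cleaning steps `01. Data Preparation.ipynb` applies, so the following is only a minimal sketch of what such a step might look like. It assumes the TSV exposes `star_rating` and `review_body` columns and that ratings 1 to 5 map to class labels 0 to 4. `clean_text` and `prepare_rows` are hypothetical helper names, and the HTML stripping is an assumed cleaning choice, not the notebook's confirmed logic.

```python
import csv
import html
import io
import re


def clean_text(text: str) -> str:
    """Unescape HTML entities, drop tags such as <br />, and collapse whitespace."""
    text = html.unescape(text)
    text = re.sub(r"<[^>]+>", " ", text)
    return re.sub(r"\s+", " ", text).strip()


def prepare_rows(tsv_file):
    """Yield {"text", "label"} dicts from a TSV stream, skipping unusable rows."""
    # QUOTE_NONE keeps stray quotation marks inside reviews from confusing the parser.
    reader = csv.DictReader(tsv_file, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        text = clean_text(row.get("review_body") or "")
        rating = (row.get("star_rating") or "").strip()
        if not text or rating not in {"1", "2", "3", "4", "5"}:
            continue  # drop rows with an empty review or an invalid rating
        yield {"text": text, "label": int(rating) - 1}  # assumed mapping: stars 1..5 -> labels 0..4


# Tiny in-memory sample standing in for the real TSV file.
sample = io.StringIO(
    "star_rating\treview_body\n"
    "5\tGreat fit!<br />Very   comfy &amp; light.\n"
    "\tmissing rating\n"
    "2\t\n"
    "1\tFell apart in a week.\n"
)
rows = list(prepare_rows(sample))
print(rows)
```

For the real file, the same generator could be fed an open file handle instead of the in-memory sample.
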
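
The fine-tuning step in `02. Model Training.ipynb` could be approached roughly as below. This is a sketch under stated assumptions rather than the notebook's actual code: it assumes the Hub dataset has `train` and `test` splits with `text` and `labels` columns, it uses the `Trainer` API from `transformers` (the notebook may use something else), and the single epoch is a placeholder, not a tuned hyperparameter. `accuracy` and `fine_tune` are hypothetical names, and the `datasets` package is an extra dependency not listed in `requirements.txt`.

```python
def accuracy(logits, labels):
    """Share of rows whose highest-scoring class matches the label."""
    hits = sum(
        max(range(len(row)), key=row.__getitem__) == label
        for row, label in zip(logits, labels)
    )
    return hits / len(labels)


def fine_tune(output_dir="distilbert-amazon-shoe-review"):
    # Heavy imports stay inside the function so the helper above works without them.
    from datasets import load_dataset
    from transformers import (
        AutoModelForSequenceClassification,
        AutoTokenizer,
        DataCollatorWithPadding,
        Trainer,
        TrainingArguments,
    )

    checkpoint = "distilbert-base-uncased"
    tokenizer = AutoTokenizer.from_pretrained(checkpoint)
    # Five output classes, one per star rating.
    model = AutoModelForSequenceClassification.from_pretrained(checkpoint, num_labels=5)

    # Assumption: "train"/"test" splits with "text" and "labels" columns.
    dataset = load_dataset("mazed/amazon_shoe_review")
    tokenized = dataset.map(
        lambda batch: tokenizer(batch["text"], truncation=True), batched=True
    )

    def compute_metrics(eval_pred):
        logits, labels = eval_pred
        return {"accuracy": accuracy(logits.tolist(), labels.tolist())}

    trainer = Trainer(
        model=model,
        args=TrainingArguments(output_dir=output_dir, num_train_epochs=1),
        train_dataset=tokenized["train"],
        eval_dataset=tokenized["test"],
        data_collator=DataCollatorWithPadding(tokenizer=tokenizer),  # pads each batch dynamically
        compute_metrics=compute_metrics,
    )
    trainer.train()
    return trainer
```

Calling `fine_tune()` would download the checkpoint and dataset and start training; `trainer.evaluate()` on the returned object would then report the accuracy computed by the helper.
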
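
To illustrate how a script like `app.py` might connect the fine-tuned model to Gradio, here is a hedged sketch. It is not the repository's actual `app.py`. It assumes the checkpoint keeps the default label names `LABEL_0` to `LABEL_4` and that these correspond to 1 to 5 stars; `to_star_scores`, `LABEL_TO_STARS`, and `build_demo` are hypothetical names.

```python
# Assumption: default label names LABEL_0..LABEL_4 stand for 1..5 stars.
LABEL_TO_STARS = {f"LABEL_{i}": f"{i + 1} star{'s' if i else ''}" for i in range(5)}


def to_star_scores(predictions, label_to_stars):
    """Map pipeline output [{"label": ..., "score": ...}] to {"N stars": score}."""
    return {label_to_stars.get(p["label"], p["label"]): p["score"] for p in predictions}


def build_demo():
    # Imported lazily so the helpers above can be used without these packages.
    import gradio as gr
    from transformers import pipeline

    classifier = pipeline(
        "text-classification", model="mazed/distilbert-amazon-shoe-review"
    )

    def predict(review):
        # Passing a one-element list and top_k=None returns the scores
        # for every class of that single review.
        scores = classifier([review], top_k=None)[0]
        return to_star_scores(scores, LABEL_TO_STARS)

    return gr.Interface(
        fn=predict,
        inputs=gr.Textbox(lines=4, label="Shoe review"),
        outputs=gr.Label(num_top_classes=5, label="Predicted rating"),
    )
```

Running `build_demo().launch()` would start the web interface locally. If the model's config defines its own label names, unknown labels pass through `to_star_scores` unchanged.
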
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference