Update README.md
Browse files
README.md
CHANGED
@@ -1,6 +1,8 @@
|
|
1 |
# prodigy-ecfr-textcat
|
2 |
|
3 |
-
|
|
|
|
|
4 |
|
5 |
## Table of Contents
|
6 |
|
@@ -10,29 +12,49 @@ Brief description of your project.
|
|
10 |
- [Installation](#installation)
|
11 |
- [Usage](#usage)
|
12 |
- [File Structure](#file-structure)
|
13 |
-
- [Contributing](#contributing)
|
14 |
- [License](#license)
|
15 |
- [Acknowledgements](#acknowledgements)
|
16 |
|
17 |
-
## About the Project
|
18 |
-
|
19 |
-
Our goal is to organize these financial institution rules and regulations so financial institutions can go through newly created rules and regulations to know which departments to send the information to and to allow easy retrieval of these regulations when necessary. Text mining and information retrieval will allow a large step of the process to be automated. Automating these steps will allow less time and effort to be contributed for financial institutions employees. This allows more time and work to be used to accomplish other projects.
|
20 |
-
|
21 |
## Getting Started
|
22 |
|
23 |
Instructions on setting up the project on a local machine.
|
24 |
|
25 |
### Prerequisites
|
26 |
|
27 |
-
|
|
|
|
|
|
|
28 |
|
29 |
### Installation
|
30 |
|
31 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
32 |
|
33 |
## Usage
|
34 |
|
35 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
36 |
|
37 |
## File Structure
|
38 |
|
@@ -129,19 +151,17 @@ Describe the organization of files and directories within the project.
|
|
129 |
- `thirdStep-label.py`
|
130 |
- `train_eval_split.ipynb`
|
131 |
- `TerminalCode.txt`
|
|
|
|
|
|
|
132 |
- `README.md`
|
133 |
- `prodigy.json`
|
134 |
|
135 |
-
|
136 |
-
|
137 |
-
## Contributing
|
138 |
-
|
139 |
-
Guidelines for contributing to the project, including how to report bugs or suggest improvements.
|
140 |
-
|
141 |
## License
|
142 |
|
143 |
-
|
|
|
144 |
|
145 |
## Acknowledgements
|
146 |
|
147 |
-
|
|
|
1 |
# prodigy-ecfr-textcat
|
2 |
|
3 |
+
## About the Project
|
4 |
+
|
5 |
+
Our goal is to organize these financial institution rules and regulations so financial institutions can go through newly created rules and regulations to know which departments to send the information to and to allow easy retrieval of these regulations when necessary. Text mining and information retrieval will allow a large step of the process to be automated. Automating these steps will allow less time and effort to be contributed for financial institutions employees. This allows more time and work to be used to accomplish other projects.
|
6 |
|
7 |
## Table of Contents
|
8 |
|
|
|
12 |
- [Installation](#installation)
|
13 |
- [Usage](#usage)
|
14 |
- [File Structure](#file-structure)
|
|
|
15 |
- [License](#license)
|
16 |
- [Acknowledgements](#acknowledgements)
|
17 |
|
|
|
|
|
|
|
|
|
18 |
## Getting Started
|
19 |
|
20 |
Instructions on setting up the project on a local machine.
|
21 |
|
22 |
### Prerequisites
|
23 |
|
24 |
+
Before running the project, ensure you have the following software dependencies installed:
|
25 |
+
- [Python 3.x](https://www.python.org/downloads/)
|
26 |
+
- [spaCy](https://spacy.io/usage)
|
27 |
+
- [Prodigy](https://prodi.gy/docs/) (optional)
|
28 |
|
29 |
### Installation
|
30 |
|
31 |
+
Follow these step-by-step instructions to install and configure the project:
|
32 |
+
|
33 |
+
1. **Clone this repository to your local machine.**
|
34 |
+
```bash
|
35 |
+
git clone <https://github.com/ManjinderUNCC/prodigy-ecfr-textcat.git>
|
36 |
+
2. Install the required dependencies by running:
|
37 |
+
```bash
|
38 |
+
pip install -r requirements.txt
|
39 |
+
```
|
40 |
|
41 |
## Usage
|
42 |
|
43 |
+
To use the project, follow these steps:
|
44 |
+
|
45 |
+
1. **Prepare your data:**
|
46 |
+
- Place your dataset files in the `/data` directory.
|
47 |
+
- Optionally, annotate your data using Prodigy and save the annotations in the `/data` directory.
|
48 |
+
|
49 |
+
2. **Train the text classification model:**
|
50 |
+
- Run the training script located in the `/python_Code` directory.
|
51 |
+
|
52 |
+
3. **Evaluate the model:**
|
53 |
+
- Use the evaluation script to assess the model's performance on labeled data.
|
54 |
+
|
55 |
+
4. **Make predictions:**
|
56 |
+
- Apply the trained model to new, unlabeled data to classify it into relevant categories.
|
57 |
+
|
58 |
|
59 |
## File Structure
|
60 |
|
|
|
151 |
- `thirdStep-label.py`
|
152 |
- `train_eval_split.ipynb`
|
153 |
- `TerminalCode.txt`
|
154 |
+
- `requirements.txt`
|
155 |
+
- `Terminal Commands vs Project.yml`
|
156 |
+
- `Project.yml`
|
157 |
- `README.md`
|
158 |
- `prodigy.json`
|
159 |
|
|
|
|
|
|
|
|
|
|
|
|
|
160 |
## License
|
161 |
|
162 |
+
- Package A: MIT License
|
163 |
+
- Package B: Apache License 2.0
|
164 |
|
165 |
## Acknowledgements
|
166 |
|
167 |
+
Manjinder Sandhu, Dagim Bantikassegn, Alex Brooks, Tyler Dabbs
|