DagimB commited on
Commit
30fc9b7
1 Parent(s): 49f0c5b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +37 -17
README.md CHANGED
@@ -1,6 +1,8 @@
1
  # prodigy-ecfr-textcat
2
 
3
- Brief description of your project.
 
 
4
 
5
  ## Table of Contents
6
 
@@ -10,29 +12,49 @@ Brief description of your project.
10
  - [Installation](#installation)
11
  - [Usage](#usage)
12
  - [File Structure](#file-structure)
13
- - [Contributing](#contributing)
14
  - [License](#license)
15
  - [Acknowledgements](#acknowledgements)
16
 
17
- ## About the Project
18
-
19
- Our goal is to organize these financial institution rules and regulations so financial institutions can go through newly created rules and regulations to know which departments to send the information to and to allow easy retrieval of these regulations when necessary. Text mining and information retrieval will allow a large step of the process to be automated. Automating these steps will allow less time and effort to be contributed for financial institutions employees. This allows more time and work to be used to accomplish other projects.
20
-
21
  ## Getting Started
22
 
23
  Instructions on setting up the project on a local machine.
24
 
25
  ### Prerequisites
26
 
27
- List any software dependencies or libraries required to run the project.
 
 
 
28
 
29
  ### Installation
30
 
31
- Step-by-step instructions on how to install and configure the project.
 
 
 
 
 
 
 
 
32
 
33
  ## Usage
34
 
35
- Explain how to use the project, including any command-line arguments, configurations, or examples.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
36
 
37
  ## File Structure
38
 
@@ -129,19 +151,17 @@ Describe the organization of files and directories within the project.
129
  - `thirdStep-label.py`
130
  - `train_eval_split.ipynb`
131
  - `TerminalCode.txt`
 
 
 
132
  - `README.md`
133
  - `prodigy.json`
134
 
135
-
136
-
137
- ## Contributing
138
-
139
- Guidelines for contributing to the project, including how to report bugs or suggest improvements.
140
-
141
  ## License
142
 
143
- Specify the project's license (e.g., MIT License, Apache License 2.0).
 
144
 
145
  ## Acknowledgements
146
 
147
- Credits to any individuals, organizations, or resources that contributed to the project.
 
1
  # prodigy-ecfr-textcat
2
 
3
+ ## About the Project
4
+
5
+ Our goal is to organize these financial institution rules and regulations so financial institutions can go through newly created rules and regulations to know which departments to send the information to and to allow easy retrieval of these regulations when necessary. Text mining and information retrieval will allow a large step of the process to be automated. Automating these steps will allow less time and effort to be contributed for financial institutions employees. This allows more time and work to be used to accomplish other projects.
6
 
7
  ## Table of Contents
8
 
 
12
  - [Installation](#installation)
13
  - [Usage](#usage)
14
  - [File Structure](#file-structure)
 
15
  - [License](#license)
16
  - [Acknowledgements](#acknowledgements)
17
 
 
 
 
 
18
  ## Getting Started
19
 
20
  Instructions on setting up the project on a local machine.
21
 
22
  ### Prerequisites
23
 
24
+ Before running the project, ensure you have the following software dependencies installed:
25
+ - [Python 3.x](https://www.python.org/downloads/)
26
+ - [spaCy](https://spacy.io/usage)
27
+ - [Prodigy](https://prodi.gy/docs/) (optional)
28
 
29
  ### Installation
30
 
31
+ Follow these step-by-step instructions to install and configure the project:
32
+
33
+ 1. **Clone this repository to your local machine.**
34
+ ```bash
35
+ git clone <https://github.com/ManjinderUNCC/prodigy-ecfr-textcat.git>
36
+ 2. Install the required dependencies by running:
37
+ ```bash
38
+ pip install -r requirements.txt
39
+ ```
40
 
41
  ## Usage
42
 
43
+ To use the project, follow these steps:
44
+
45
+ 1. **Prepare your data:**
46
+ - Place your dataset files in the `/data` directory.
47
+ - Optionally, annotate your data using Prodigy and save the annotations in the `/data` directory.
48
+
49
+ 2. **Train the text classification model:**
50
+ - Run the training script located in the `/python_Code` directory.
51
+
52
+ 3. **Evaluate the model:**
53
+ - Use the evaluation script to assess the model's performance on labeled data.
54
+
55
+ 4. **Make predictions:**
56
+ - Apply the trained model to new, unlabeled data to classify it into relevant categories.
57
+
58
 
59
  ## File Structure
60
 
 
151
  - `thirdStep-label.py`
152
  - `train_eval_split.ipynb`
153
  - `TerminalCode.txt`
154
+ - `requirements.txt`
155
+ - `Terminal Commands vs Project.yml`
156
+ - `Project.yml`
157
  - `README.md`
158
  - `prodigy.json`
159
 
 
 
 
 
 
 
160
  ## License
161
 
162
+ - Package A: MIT License
163
+ - Package B: Apache License 2.0
164
 
165
  ## Acknowledgements
166
 
167
+ Manjinder Sandhu, Dagim Bantikassegn, Alex Brooks, Tyler Dabbs