Update README.md
Browse files
README.md
CHANGED
@@ -1,23 +1,49 @@
|
|
1 |
---
|
2 |
-
tags:
|
3 |
-
- structured-data-classification
|
4 |
library_name: generic
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
5 |
---
|
6 |
-
|
7 |
-
|
8 |
-
|
9 |
-
|
10 |
-
|
11 |
-
|
12 |
-
|
13 |
-
|
14 |
-
|
15 |
-
|
16 |
-
|
17 |
-
|
18 |
-
|
19 |
-
|
20 |
-
cd structured-data-classification
|
21 |
-
git remote set-url origin https://huggingface.co/$YOUR_USER/$YOUR_REPO_NAME
|
22 |
-
git push --force
|
23 |
-
```
|
|
|
1 |
---
|
|
|
|
|
2 |
library_name: generic
|
3 |
+
|
4 |
+
language:
|
5 |
+
- en
|
6 |
+
|
7 |
+
thumbnail:
|
8 |
+
|
9 |
+
tags:
|
10 |
+
- classification
|
11 |
+
- gradient boosted trees
|
12 |
+
- keras
|
13 |
+
- TensorFlow
|
14 |
+
|
15 |
+
license: apache-2.0
|
16 |
+
libraries: TensorBoard
|
17 |
+
|
18 |
+
metrics:
|
19 |
+
- accuracy
|
20 |
+
|
21 |
+
model-index:
|
22 |
+
- name: TF_Decision_Trees
|
23 |
+
results:
|
24 |
+
- task:
|
25 |
+
type: structured-data-classification
|
26 |
+
dataset:
|
27 |
+
type: census
|
28 |
+
name: Census-Income Data Set
|
29 |
+
metrics:
|
30 |
+
- type: accuracy
|
31 |
+
value: 96.57
|
32 |
+
|
33 |
+
pipeline_tag: "structured-data-classification"
|
34 |
+
|
35 |
---
|
36 |
+
|
37 |
+
# Classification with TensorFlow Decision Forests
|
38 |
+
#### Using TensorFlow Decision Forests for structured data classification
|
39 |
+
|
40 |
+
<br />
|
41 |
+
##### This example uses Gradient Boosted Trees model in binary classification of structured data, and covers the following scenarios:
|
42 |
+
|
43 |
+
1. Build a decision forests model by specifying the input feature usage.
|
44 |
+
2. Implement a custom Binary Target encoder as a Keras Preprocessing layer to encode the categorical features with respect to their target value co-occurrences, and then use the encoded features to build a decision forests model.
|
45 |
+
|
46 |
+
The example uses Tensorflow 7.0 or higher. It uses the US Census Income Dataset containing approximately 300k instances with 41 numerical and categorical variables. This is a binary classification problem to determine whether a person makes over 50k a year.
|
47 |
+
|
48 |
+
Author: Khalid Salama <br />
|
49 |
+
Adapted implementation: Tannia Dubon
|
|
|
|
|
|
|
|