FredZhang7
commited on
Commit
•
25e2b57
1
Parent(s):
86d1e89
mistake during training
Browse files
README.md
CHANGED
@@ -51,10 +51,10 @@ language:
|
|
51 |
|
52 |
The classification task is split into two stages:
|
53 |
1. URL features model
|
54 |
-
- **96.5%+
|
55 |
- 2,436,727 rows of labelled URLs
|
56 |
2. Website features model
|
57 |
-
- **
|
58 |
- 911,180 rows of 42 features
|
59 |
|
60 |
## Training Features
|
@@ -72,7 +72,7 @@ params = {
|
|
72 |
'num_boost_round': [500, 750, 800, 900, 1000, 1250, 2000]
|
73 |
}
|
74 |
```
|
75 |
-
To reproduce the
|
76 |
Then train a LightGBM model using the most suited hyperparamters for this task:
|
77 |
```python
|
78 |
params = {
|
|
|
51 |
|
52 |
The classification task is split into two stages:
|
53 |
1. URL features model
|
54 |
+
- **96.5%+ accurate** on training and validation data
|
55 |
- 2,436,727 rows of labelled URLs
|
56 |
2. Website features model
|
57 |
+
- **98.4% accurate** on training data, and **98.9% accurate** on validation data
|
58 |
- 911,180 rows of 42 features
|
59 |
|
60 |
## Training Features
|
|
|
72 |
'num_boost_round': [500, 750, 800, 900, 1000, 1250, 2000]
|
73 |
}
|
74 |
```
|
75 |
+
To reproduce the 98.4% accurate model, you can follow the data analysis in the dataset page to filter out the unimportant features.
|
76 |
Then train a LightGBM model using the most suited hyperparamters for this task:
|
77 |
```python
|
78 |
params = {
|