LiYuan commited on
Commit
3496770
1 Parent(s): fdb3b5f
Files changed (1) hide show
  1. README.md +4 -2
README.md CHANGED
@@ -51,8 +51,10 @@ model = AutoModelForSequenceClassification.from_pretrained("LiYuan/amazon-query-
51
 
52
  Download all the raw [dataset](https://www.aicrowd.com/challenges/esci-challenge-for-improving-product-search/dataset_files) from the Amazon KDD Cup website.
53
 
54
- The three dataset we used like below:
55
-
 
 
56
 
57
 
58
  ## Training procedure
51
 
52
  Download all the raw [dataset](https://www.aicrowd.com/challenges/esci-challenge-for-improving-product-search/dataset_files) from the Amazon KDD Cup website.
53
 
54
+ 1. Concatenate the all product attributes from the product dataset
55
+ 2. Join it with a training query dataset
56
+ 3. Stratified Split the merged data into 571,223-row training, 142,806-row validation, 79,337-row test set
57
+ 4. Train on the full training set
58
 
59
 
60
  ## Training procedure