Afreen commited on
Commit
fb42d1c
1 Parent(s): 94bbb4c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +38 -1
README.md CHANGED
@@ -1,3 +1,40 @@
1
  ---
2
- license: other
 
 
 
 
 
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language:
3
+ - en
4
+ tags:
5
+ - Text Classification
6
+ co2_eq_emissions: 0.1069
7
+ widget:
8
+ - text: "At the every month post-injection monitoring event, TCE, carbon tetrachloride, and chloroform concentrations were above CBSGs in three of the wells"
9
+ example_title: "Remediation Standards"
10
+ - text: "TRPH exceedances were observed in the subsurface soils immediately above the water table and there are no TRPH exceedances in surface soils."
11
+ example_title: "Extent of Contamination"
12
+ - text: "weathered shale was encountered below the surface area with fluvial deposits. Sediments in the coastal plain region are found above and below the bedrock with sandstones and shales that form the basement rock"
13
+ example_title: "Geology"
14
+
15
  ---
16
+
17
+ ## About the Model
18
+ An Environmental due diligence classification model, trained on customized environmental Dataset to detect contamination and remediation activities (both prevailing as well as planned) as a part of site assessment process. This model can identify the source of contamination, the extent of contamination, the types of contaminants present at the site, the flow of contaminants and their interaction with ground water, surface water and other surrounding water bodies .
19
+
20
+ This model was built on top of distilbert-base-uncased model and trained for 10 epochs with a batch size of 16, a learning rate of 5e-5, and a maximum sequence length of 512.
21
+
22
+ - Dataset : Open Source News data + Custom data
23
+ - Carbon emission 0.1069 Kg
24
+
25
+ ## Usage
26
+ The easiest way is to load through the pipeline object offered by transformers library.
27
+ ```python
28
+ from transformers import AutoTokenizer, TFAutoModelForSequenceClassification
29
+ from transformers import pipeline
30
+ tokenizer = AutoTokenizer.from_pretrained("d4data/environmental-due-diligence-model")
31
+ model = TFAutoModelForSequenceClassification.from_pretrained("d4data/environmental-due-diligence-model")
32
+
33
+ classifier = pipeline('text-classification', model=model, tokenizer=tokenizer) # cuda = 0,1 based on gpu availability
34
+ classifier("At the every month post-injection monitoring event, TCE, carbon tetrachloride, and chloroform concentrations were above CBSGs in three of the wells")
35
+ ```
36
+
37
+ ## Author
38
+ This model is part of the Research topic "Environmental Due Diligence" conducted by Deepak John Reji, Afreen Aman, Shaina Raza. If you use this work (code, model or dataset), please cite as:
39
+ > Environmental Due Diligence, (2020), GitHub repository https://github.com/dreji18/environmental-due-diligence
40
+