naughtondale commited on
Commit
9514984
β€’
1 Parent(s): 875392c

Update Project Description

Browse files
Files changed (1) hide show
  1. Project Description +32 -5
Project Description CHANGED
@@ -1,12 +1,39 @@
1
- Oil Flow Predictor (Barrels Per Day) Using Machine Learning Algorithms – Random Forest and Gradient Boosting Machine
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2
 
3
- Executives in the Oil & Gas industry are always seeking ways to optimize the production of oil and gas wells and make informed decisions about drilling and exploration. Machine learning can be used to effectively resolve this optimization problem, but would usually involve extensive or complex coding that the typical executive or non-technical staff would not be able to undertake. As an example, the code in the Appendix of this paper, replicating a Google Colaboratory, notebook loads the data from a CSV file and performs data scaling, PCA dimensionality reduction, DBSCAN clustering, and random forest regression and gradient boosting regression for predicting output variables related to oil and gas production rates – not something the typical executive or non-technical Oil & Gas staff can manage. The analysis was used to gain insights into the factors that affect oil and gas production rates and to predict production rates based on the input variables. Executives are then able to use this knowledge to improve the efficiency of their operations.
4
- This Gradio chatbot reads in a CSV file containing well rate data, performs some preprocessing on the data, and then trains and evaluates two regression models, a Random Forest (RF) and a Gradient Boosting Machine (GBM), on the preprocessed data. The input data is preprocessed using Principal Component Analysis (PCA) to reduce the dimensionality of the input features, and then the DBSCAN clustering algorithm is applied to the PCA-transformed data to filter out outlier data points. The remaining data points are used for training and testing the regression models. The RF and GBM models are both trained using the preprocessed input features and corresponding well rates, and their performance is evaluated using the R-squared metric. Finally, the predicted well rates from the RF and GBM models are plotted against the actual well rates for visualization.
5
- For example, the following inputs are required for predicting the oil flow rate (Qoil) based on input features (BHP, WHP, WHT, Tsep, Psep, and Choke_in):
6
  β€’ BHP: Bottom Hole Pressure
7
  β€’ WHP: Well Head Pressure
8
  β€’ WHT: Well Head Temperature
9
  β€’ Tsep: Separator Temperature
10
  β€’ Psep: Separator Pressure
11
  β€’ Choke_in: Choke Size
12
- These features (BHP, WHP, WHT, Tsep, Psep, and Choke_in) are used as inputs to the machine learning models that were trained in the notebook to predict the oil flow rate (Qoil). We suspect and propose that the same input features (BHP, WHP, WHT, Tsep, Psep, and Choke_in) are provided to the trained model which would then be enabled to predict Qoil or the oil flow rate (number of barrels per day) via the Gradio input interface. The oil executive is able to learn this without having to know machine learning or the complexities of training the model himself.
 
 
 
 
 
 
 
 
1
+ Oil Flow Predictor (Barrels Per Day) Using Machine Learning Algorithms – Random Forest and Gradient Boosting
2
+ Machine. Executives in the Oil & Gas industry are always seeking ways to optimize the production of oil and gas
3
+ wells and make informed decisions about drilling and exploration. Machine learning can be used to effectively
4
+ resolve this optimization problem, but would usually involve extensive or complex coding that the typical
5
+ executive or non-technical staff would not be able to undertake. For example, the code behind this simple Gradio
6
+ bot does thinngs the typical oil executive knows nothing about in that it loads the data from a CSV file and
7
+ performs data scaling, PCA dimensionality reduction, DBSCAN clustering, and random forest regression and
8
+ gradient boosting regression for predicting output variables related to oil and gas production rates – not
9
+ something the typical executive or non-technical Oil & Gas staff can manage. The analysis was used to gain
10
+ insights into the factors that affect oil and gas production rates and to predict production rates based on
11
+ the input variables. Executives are then able to use this knowledge to improve the efficiency of their
12
+ operations.
13
+
14
+ This Gradio chatbot reads in a CSV file containing well rate data, performs some preprocessing on the data, and
15
+ then trains and evaluates two regression models, a Random Forest (RF) and a Gradient Boosting Machine (GBM), on
16
+ the preprocessed data. The input data is preprocessed using Principal Component Analysis (PCA) to reduce the
17
+ dimensionality of the input features, and then the DBSCAN clustering algorithm is applied to the PCA-transformed
18
+ data to filter out outlier data points. The remaining data points are used for training and testing the
19
+ regression models. The RF and GBM models are both trained using the preprocessed input features and
20
+ corresponding well rates, and their performance is evaluated using the R-squared metric. Finally, the predicted
21
+ well rates from the RF and GBM models are plotted against the actual well rates for visualization.
22
+
23
+ For example, the following inputs are required for predicting the oil flow rate (Qoil) based on input features
24
+ (BHP, WHP, WHT, Tsep, Psep, and Choke_in):
25
 
 
 
 
26
  β€’ BHP: Bottom Hole Pressure
27
  β€’ WHP: Well Head Pressure
28
  β€’ WHT: Well Head Temperature
29
  β€’ Tsep: Separator Temperature
30
  β€’ Psep: Separator Pressure
31
  β€’ Choke_in: Choke Size
32
+
33
+ These features (BHP, WHP, WHT, Tsep, Psep, and Choke_in) are used as inputs to the machine learning models that
34
+ were trained in the notebook to predict the oil flow rate (Qoil). We suspect and propose that the same input
35
+ features (BHP, WHP, WHT, Tsep, Psep, and Choke_in) are provided to the trained model which would then be
36
+ enabled to predict Qoil or the oil flow rate (number of barrels per day) via the Gradio input interface.
37
+ The oil executive is able to learn this without having to know machine learning or the complexities of
38
+ training the model himself.
39
+