naughtondale commited on
Commit
875392c
β€’
1 Parent(s): 5a00ac2

Create Project Description

Browse files
Files changed (1) hide show
  1. Project Description +12 -0
Project Description ADDED
@@ -0,0 +1,12 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ Oil Flow Predictor (Barrels Per Day) Using Machine Learning Algorithms – Random Forest and Gradient Boosting Machine
2
+
3
+ Executives in the Oil & Gas industry are always seeking ways to optimize the production of oil and gas wells and make informed decisions about drilling and exploration. Machine learning can be used to effectively resolve this optimization problem, but would usually involve extensive or complex coding that the typical executive or non-technical staff would not be able to undertake. As an example, the code in the Appendix of this paper, replicating a Google Colaboratory, notebook loads the data from a CSV file and performs data scaling, PCA dimensionality reduction, DBSCAN clustering, and random forest regression and gradient boosting regression for predicting output variables related to oil and gas production rates – not something the typical executive or non-technical Oil & Gas staff can manage. The analysis was used to gain insights into the factors that affect oil and gas production rates and to predict production rates based on the input variables. Executives are then able to use this knowledge to improve the efficiency of their operations.
4
+ This Gradio chatbot reads in a CSV file containing well rate data, performs some preprocessing on the data, and then trains and evaluates two regression models, a Random Forest (RF) and a Gradient Boosting Machine (GBM), on the preprocessed data. The input data is preprocessed using Principal Component Analysis (PCA) to reduce the dimensionality of the input features, and then the DBSCAN clustering algorithm is applied to the PCA-transformed data to filter out outlier data points. The remaining data points are used for training and testing the regression models. The RF and GBM models are both trained using the preprocessed input features and corresponding well rates, and their performance is evaluated using the R-squared metric. Finally, the predicted well rates from the RF and GBM models are plotted against the actual well rates for visualization.
5
+ For example, the following inputs are required for predicting the oil flow rate (Qoil) based on input features (BHP, WHP, WHT, Tsep, Psep, and Choke_in):
6
+ β€’ BHP: Bottom Hole Pressure
7
+ β€’ WHP: Well Head Pressure
8
+ β€’ WHT: Well Head Temperature
9
+ β€’ Tsep: Separator Temperature
10
+ β€’ Psep: Separator Pressure
11
+ β€’ Choke_in: Choke Size
12
+ These features (BHP, WHP, WHT, Tsep, Psep, and Choke_in) are used as inputs to the machine learning models that were trained in the notebook to predict the oil flow rate (Qoil). We suspect and propose that the same input features (BHP, WHP, WHT, Tsep, Psep, and Choke_in) are provided to the trained model which would then be enabled to predict Qoil or the oil flow rate (number of barrels per day) via the Gradio input interface. The oil executive is able to learn this without having to know machine learning or the complexities of training the model himself.