Resources

An inventory of my online resources that I have made to help people learn about spatial data analytics, geostatistics and machine learning. I have produced these resources to support my students and I thought they would be useful to my students after completion of the class (an evergreen resource), to other students and working professionals interested in this topic.

The Author:

Michael Pyrcz, Associate Professor, University of Texas at Austin

Novel Data Analytics, Geostatistics and Machine Learning Subsurface Solutions

With over 17 years of experience in subsurface consulting, research and development, Michael has returned to academia driven by his passion for teaching and enthusiasm for enhancing engineers' and geoscientists' impact in subsurface resource development.

One of my vairous roles is as principal investigator at the Texas Center for Geostatistics. For more about Michael check out these links:

About Michael

  1. My story of how I got started in engineering and ended up as a professor at The University of Texas at Austin My Story

  2. More about my experiences and my musings My Musings

  3. Nothing is possible without awesome graduate students My Students

  4. I've written a bit, here's the books My Books

  5. My peer-reviewed publications My Papers and My Other Publications

  6. I do quite a bit on social media My Social Media Efforts

  7. I post a lot of code and course material to support anyone that wants to learn My GitHub

  8. I also blog a little, check it out here My Blog and My Other Blog

  9. I partnered with Prof. John Foster (UT Austin) and Bazean, a technology-enabled energy investment firm, to start the energy-focussed data sceince education company, daytum. We are currently offering short courses in Energy Data Science.

Recorded Lectures

  1. Introduction - Howdy, I'm Michael

  2. YouTube Channel GeostatsGuy Lectures

  3. Introduction to Data Analytics, Geostatistics and Machine Learning Undergraduate Lectures (Lec00-Lec21)

  4. Subsurface Modeling Graduate Course (Lec00 - Lec22)

  5. Geostatistical Workflows for Unconventional Reservoirs)

  6. Geostatistical Workflows for Unconventional Reservoirs at BEG

  7. What Does a Geoscientist Need to Know About Geostatistics? And Why It Would Be Helpful?

  8. Center for Petroleum and Geosystems Engineering Webinar - Big Data Analytics for Petroleum Engineering: Hype or Panacea?

GeostatsPy Python Package Workflows

  1. GeostatsPy: Reimplementation of GSLIB in Python
  2. Confidence Intervals and Hypothesis Testing with GeostatsPy
  3. Monte Carlo Simulation with GeostatsPy
  4. Bootstrap with GeostatsPy
  5. Data Distributions
  6. Declustering with GeostatsPy
  7. Indicator Kriging with GeostatsPy
  8. Kriging with GeostatsPy
  9. Multivariate Analysis with GeostatsPy
  10. Overfitting Models with GeostatsPy
  11. Plotting Spatial Data with GeostatsPy
  12. Directional Spatial Continuity with GeostatsPy
  13. Spatial Updating with GeostatsPy
  14. Data Transformation with GeostatsPy
  15. Spatial Trend Modeling with GeostatsPy
  16. Multivariate Feature Ranking with GeostatsPy
  17. Variogram Calculation with GeostatsPy
  18. Variogram Modeling with GeostatsPy
  19. Spatial Bootstrap

Resources on Statistics and Probability

  1. Probability Theory – my undergraduate lecture
  2. Statistics – undergraduate lecture
  3. Marginal, Joint & Conditional Probability – slides

Parametric Distributions

  1. How to make them in Excel
  2. Poisson distribution in Excel
  3. Gaussian transform in Excel and Python
  4. Log normal distribution in Excel

Hypothesis Testing

  1. Difference in means in Excel and in Python
  2. Difference in variances in Excel and in Python
  3. Difference in distributions in Excel

Demos of Bayesian Statistics

  1. The Coin Problem from Sivia (1996) in Excel
  2. Bayesian updating with Gaussian in Excel
  3. Probability given a positive test in Excel

Other

  1. Bootstrap in Excel, in Python and in R
  2. Spatial Bootstrap in Python
  3. Linear regression in Excel and in R
  4. Loss functions in Excel
  5. Multivariate Analysis

Heterogeneity

  1. Making an example well in Excel
  2. Lorenz coefficient in Excel
  3. Hurst coefficient in R
  4. Ripley Cross K in R

Machine Learning

  1. Dimensional reduction in Python and in R
  2. Decision tree in Python and in R
  3. Support vector machine in Python
  4. Feature Engineering
  5. Linear Regression
  6. Naive Bayes Regression and Classification
  7. Principal Components Analysis
  8. Ridge Regression
  9. Support Vector Machines
  10. Time Series Analysis
  11. Clustering
  12. k Nearest Neighbour
  13. Neural Networks
  14. scikit learn Overview

Geostatistics

  1. GeostatsPy: Reimplementation of GSLIB in Python
  2. Introduction to Data Analytics, Geostatistics and Machine Learning Undergraduate Lectures (Lec00-Lec21)
  3. What Does a Geoscientist Need to Know About Geostatistics? And Why It Would Be Helpful? and PPT
  4. Exercises, hands-on and demonstrations PPT Inventory
  5. Functions that reimplement or call GSLIB exes in Python
  6. Demo of the functions in Python
  7. Declustering in Python and with PyGSLIB Package
  8. Declustering and Debiasing in Excel
  9. Variogram calculation in Excel and in R
  10. Full variogram Calculation and Modeling in Excel and in PyGSLIB Package

Supplemental Slides

  1. Facies criteria in PPT
  2. Value of quantification in PPT
  3. Stationarity in PPT
  4. Uncertainty in PPT
  5. Suggested books in PPT
  6. Simple kriging in Excel and in R
  7. Uncertainty Away from Data in Excel
  8. Convolution methods in Python
  9. LU Simulation in Pyton
  10. Sequential Gaussian simulation in Excel and in R
  11. Truncated Gaussian simulation in Excel
  12. Spatial uncertainty in Excel
  13. Volume-variance relations in Excel
  14. Working with realizations in R
  15. Lecture on value in industry in PPT

I hope these resources are useful.

Want to Work Together?

I hope that this is helpful to those that want to learn more about subsurface modeling, data analytics and machine learning. Students and working professionals are welcome to participate.

  • Want to invite me to visit your company for training, mentoring, project review, workflow design and consulting, I'd be happy to drop by and work with you!

  • Interested in partnering, supporting my graduate student research or my Subsurface Data Analytics and Machine Learning consortium (co-PIs including Profs. Foster, Torres-Verdin and van Oort)? My research combines data analytics, stochastic modeling and machine learning theory with practice to develop novel methods and workflows to add value. We are solving challenging subsurface problems!

  • I can be reached at mpyrcz@austin.utexas.edu.

I'm always happy to discuss,

Michael

Michael Pyrcz, Ph.D., P.Eng. Associate Professor The Hildebrand Department of Petroleum and Geosystems Engineering, Bureau of Economic Geology, The Jackson School of Geosciences, The University of Texas at Austin

More Resources Available at: Twitter | GitHub | Website | GoogleScholar | Book | YouTube | LinkedIn