Skip to main navigation menu Skip to main content Skip to site footer

Geography and Geology

March 31, 2023; Zurich, Switzerland: IV International Scientific and Practical Conference «GRUNDLAGEN DER MODERNEN WISSENSCHAFTLICHEN FORSCHUNG»


APPLIED STATISTICS AND MACHINE LEARNING IN EARTH SCIENCES: CHOOSING THE RIGHT APPROACH FOR MODERN SCIENTIFIC RESEARCH


DOI
https://doi.org/10.36074/logos-31.03.2023.70
Published
14.04.2023

Abstract

In the rapidly evolving domain of Earth Sciences, the application of statistics and machine learning (ML) has become indispensable for advancing research and understanding complex phenomena. With the increasing availability of large, high-quality datasets, researchers are harnessing the power of these methodologies to analyze, interpret, and model diverse Earth Science problems. However, the choice of which technique to apply depends on the specific problem, data type, and desired outcome. This article aims to provide a comprehensive guide to the foundations of applied statistics and ML in Earth Sciences, illustrating the most effective techniques for various research problems and helping researchers make informed decisions on where to use what.

References

  1. Camps-Valls, G., Gómez-Chova, L., Muñoz-Marí, J., Rojo-Álvarez, J. L., & Martín, G. (2019). Machine learning in Earth observation data processing: A review of techniques and applications. ISPRS Journal of Photogrammetry and Remote Sensing, 145, 267-282.
  2. Carranza, E. J. M., & Hale, M. (2001). Logistic regression for geologically constrained mapping of gold potential, Baguio district, Philippines. Economic Geology, 96(4), 685-701.
  3. Cavazos, T. (2000). Using self-organizing maps to investigate extreme climate events: An application to wintertime precipitation in the Balkans. Journal of Climate, 13(10), 1718-1732.
  4. Dawson, C. W., Abrahart, R. J., & See, L. M. (2007). HydroTest: A web-based toolbox of evaluation metrics for the standardised assessment of hydrological forecasts. Environmental Modelling & Software, 22(7), 1034-1052.
  5. Davis, J. C. (2002). Statistics and data analysis in geology. John Wiley & Sons.
  6. Geller, R. J. (1997). Earthquake prediction: a critical review. Geophysical Journal International, 131(3), 425-450.
  7. Goovaerts, P. (1997). Geostatistics for natural resources evaluation. Oxford University Press.
  8. Hansen, J., Sato, M., Ruedy, R., Lacis, A., & Oinas, V. (2000). Global warming in the twenty-first century: An alternative scenario. Proceedings of the National Academy of Sciences, 97(18), 9875-9880.
  9. Hannachi, A., Jolliffe, I. T., & Stephenson, D. B. (2007). Empirical orthogonal functions and related techniques in atmospheric science: A review. International Journal of Climatology, 27(9), 1119-1152.
  10. Karpatne, A., Atluri, G., Faghmous, J. H., Steinbach, M., Banerjee, A., Ganguly, S., ... & Kumar, V. (2017). Theory-guided data science: A new paradigm for scientific discovery from data. IEEE Transactions on Knowledge and Data Engineering, 29(10), 2318-2331.
  11. Krishnamurthy, G., K., Choubey, A., & Mujumdar, P. P. (2020). Deep learning for hydrological streamflow forecasting: A state-of-the-art review. Journal of Hydroinformatics, 22(3), 365-387.
  12. Mann, M. E. (2011). The signal and the noise: The art and science of climate change predictions. Climatic Change, 108(3), 585-606.
  13. Michener, W. K., & Jones, M. B. (2012). Ecoinformatics: supporting ecology as a data-intensive science. Trends in ecology & evolution, 27(2), 85-93.
  14. Pal, M., & Mather, P. M. (2005). Support vector machines for classification in remote sensing. International Journal of Remote Sensing, 26(5), 1007-1011.
  15. Reichstein, M., Camps-Valls, G., Stevens, B., Jung, M., Denzler, J., Carvalhais, N., & Prab hat. (2019). Deep learning and process understanding for data-driven Earth system science. Nature, 566(7743), 195-204.
  16. Rudin, C. (2019). Stop explaining black-box machine learning models for high stakes decisions and use interpretable models instead. Nature Machine Intelligence, 1(5), 206-215.
  17. Stein, S., & Wysession, M. (2003). An introduction to seismology, earthquakes, and Earth structure. John Wiley & Sons.
  18. Zhang, F., Du, B., & Zhang, L. (2016). Scene classification via a gradient boosting random convolutional network framework. IEEE Transactions on Geoscience and Remote Sensing, 54(3), 1793-1802.