The AMS Machine Learning in Python for Environmental Science Problems Short Course is an introductory course for researchers interested in learning about how methods from machine learning and data science can be applied to environmental research questions. This course will give participants an opportunity to interact with real world data and develop ML pipelines in python using Jupyter notebooks.
This course will have both a beginner track and an intermediate track. The beginner track will provide an introduction to machine learning, and will cover topics such as supervised and unsupervised machine learning, with an introduction to deep learning. Participants will be guided through example code and notebooks to try out machine learning methods for themselves. Participants will become familiar with the ML pipeline, starting from an investigation of the dataset and its features. Then, participants will learn how to configure and train models for tabular and image based datasets. Finally, participants will learn techniques for evaluating and comparing models to select the one that fits their needs.
The intermediate track will assume that the participant is familiar with the basic ML pipeline and has some experience with model development. Topics will include automated hyperparameter tuning and statistical testing of model performance. Environmental science modelers are increasingly using explainable AI techniques to investigate their models for debugging and to see if they can extract scientific insight from what the model has learned. However, there are many pitfalls in doing so, especially with complex models. In this course, participants will learn about some of the pitfalls that can cause explainable XAI methods to give misleading explanations, and some techniques to mitigate these issues.
Course participants will learn:
If you have questions regarding the course, please contact Kara Lamb or Evan Krell.
Columbia University
Texas A&M University - Corpus Christi
UC Davis
University Maryland