Jupyter Notebook Tutorials for Machine Learning in Chemistry (MLChem)ยถ
This website hosts the Jupyter Notebook gallery for the undergraduate course Machine Learning in Chemistry (MLChem), first offered at the University of WisconsinโMadison in Spring 2025 as CHEM 361, taught by Prof. Xuhui Huang. In this course, results from applying ML methods to chemical datasets were generated directly from executable Jupyter notebooks used in the lecture slides. These notebooks have been compiled into an online gallery with Sphinx. Each notebook is a self-contained tutorial that can be viewed as a rendered webpage or run interactively in Google Colab. The corresponding lecture slides are also embedded within each notebook.
Course Syllabusยถ
Chapter |
Contents |
||
---|---|---|---|
Chapter 1 |
Introduction and Setup of the Coding Environment |
๐ EnvSetup |
๐ผ HW0 |
Chapter 2 |
From Molecules to Features โ Preparing Training Data |
๐ป Descriptors, ๐ป Fingerprints |
๐ผ HW1 |
Chapter 3 |
Uncovering Patterns in Chemistry โ Unsupervised Learning and Dimensionality Reduction |
๐ป KMeans, ๐ป KCenter, ๐ป DBSCAN, ๐ป AgglomerativeClustering, ๐ป PCA |
|
Chapter 4 |
Predicting Chemical Outcomes โ Supervised Learning for Classification and Regression |
๐ป LinReg, ๐ป OverfitCV, ๐ป Regularization, ๐ป LogReg, ๐ป kNN, ๐ป DecisionTree |
๐ผ HW2 |
Chapter 5 |
Neural Networks โ Fundamentals and Applications in Chemical Modeling |
||
Chapter 6 |
Deep Neural Networks โ Advanced Architectures for Chemical Applications |
๐ผ HW3 |
|
Chapter 7 |
Generating Chemical Data โ AI Generative Models |
๐ป VAE |
|
Chapter 8 |
Transforming Chemistry with Large Language Models โ From Chemical to Protein Language Models |
๐ป ESM2 |
๐ผ HW4 |
Chapter 9 |
Machine Learning Force Fields โ New Energy Functions in Chemical Simulations |
||
Chapter 10 |
Reinforcement Learning for Chemical Discovery โ Enabling Automated Synthesis |
||
Chapter 11 |
Understanding Dynamics โ ML Models for Molecular Simulations |
||
Chapter 12 |
Predicting Protein Structure โ AlphaFold and Its Applications |
How to contributeยถ
To maximize reproducibility and accessibility, we maintain the source code and notebooks at this repository xuhuihuang/mlchem. If you would like to contribute, please feel free to open an issue or a pull request to this repository.
How to citeยถ
If you use materials in this repository, please cite:
โMachine Learning in Chemistry: A Data Centred, Hands-on Introductory Machine Learning Course for Undergraduate Studentsโ, Mingyi Xue, Bojun Liu, and Xuhui Huang, ChemRxiv, DOI: 10.26434/chemrxiv-2025-9zldf.