Data Sources
|
Essential libaries for data online data sources
Data retrieval from the CDS Toolbox
netCDF and GRIB data formats
|
Data Preprocessing and Visualization
|
|
Machine Learning Fundamentals
|
Data representations are crucial for ML in science, including spatial data (vector, raster), point clouds, time series, graphs, and more
ML algorithms like linear regression, k-nearest neighbors,support vector Machine, xgboost and random forests are vital algorithms
Supervised learning is a popular ML approach, with decision trees, random forests, and neural networks being widely used
Fundamentals of data engineering are crucial for building robust ML pipelines, including data storage, processing, and serving
|
Deep Learing Fundamentals
|
Deep Learning algorithms are often represented as graph computation
We have different non-linear activation functions that help in learning different relationships to solve handle non-linearity in nn problems.
-
|
Recurrent Neural Networks (RNNs) Models
|
LSTMs and GRUs are advanced RNN architectures designed to handle long-term dependencies in sequential data.
The application of deep learning, particularly through RNNs and their variants like LSTM, GRU, and TFT, holds significant promise for time series forecasting in the physical sciences
LSTM, GRU and TFT models leverage advanced mechanisms for superior predictive performance in physical sciences applications.
|
Machine Learning For Material Science
|
Artificial intelligence and machine learning are being increasingly used in scientific domains such as computational science
RDKit can be used to calculate various molecular properties for machine learning in materials science
PyMatGen help to query structures from MP repository
Descriptor-based ML approaches focus on using numerical features (descriptors) that summarize key aspects of the material
Graph-based ML techniques are particularly suitable when dealing with materials that can be represented as networks or graphs
In practice, hybrid approaches that combine aspects of both graph-based and descriptor-based methods are also gaining traction, leveraging the strengths of each depending on the task at hand.
Crystal Hamiltonian Graph neural Network is pretrained on the GGA/GGA+U static and relaxation trajectories from Materials Project
Charge-informed molecular dynamics can be simulated with pretrained CHGNet through ASE python interface
CHGNet can perform fast structure optimization and provide site-wise magnetic moments. This makes it ideal for pre-relaxation and MAGMOM initialization in spin-polarized DFT
|