ideas

Exploring emerging concepts and their potential applications in atmospheric science.

Kolmogorov-Arnold Networks (KAN) in Atmospheric Science

I’ve been exploring the potential applications of Kolmogorov-Arnold Networks (KAN) in atmospheric science. KAN offers a promising alternative to traditional Multi-Layer Perceptrons (MLPs) by using learnable activation functions on edges rather than fixed activation functions on nodes.

KAN vs MLP Architecture

Comparison of MLP (left) and KAN (right) architectures. MLPs use fixed activation functions (σ) on nodes with linear weights on edges. KANs use learnable spline functions (φ) on edges with simple summation on nodes.

Why KAN for Atmospheric Science?

Traditional neural networks have been increasingly used in atmospheric modeling, but they often suffer from:

  • Poor interpretability
  • Difficulty capturing complex nonlinear relationships
  • Limited ability to discover underlying physical laws

KAN addresses these challenges by:

  • Improved interpretability: The network structure can reveal underlying mathematical relationships
  • Higher accuracy with smaller networks: KAN can achieve comparable or better accuracy with fewer parameters
  • Scientific discovery potential: KAN can help identify governing equations from data

Potential Applications

  1. Aerosol-cloud interactions: Modeling complex relationships between aerosol properties and cloud formation
  2. Chemical mechanism discovery: Identifying reaction pathways from observational data
  3. Parameterization development: Creating more accurate and interpretable parameterizations for Earth system models
  4. Emission source attribution: Discovering relationships between emissions and atmospheric composition

Future Directions

I plan to explore how KAN can be integrated into atmospheric chemistry and climate models, like CESM and GEOS-Chem, to improve both accuracy and interpretability of aerosol and chemistry parameterizations.


This is an evolving research interest. More updates coming soon.