Impact of Training Dataset Size for ML Load Flow Surrogates
Timon Conrad, Changhun Kim, Johann Jäger, Andreas Maier, Siming Bayer
- Year
- 2026
- Access
- Open access
Abstract
Efficient and accurate load flow calculations are a bedrock of modern power system operation. Classical numerical methods such as the Newton-Raphson algorithm provide highly precise results but are computationally demanding, which limits their applicability in large-scale scenario studies and optimization in time-critical contexts. Research has shown that machine learning approaches can approximate load flow results with high accuracy while substantially reducing computation time. Sample efficiency, i.e., the ability to achieve high accuracy with limited training dataset size, is still insufficiently researched, especially in grids with a fixed topology. This paper presents a systematic investigation of the sample efficiency of a Multilayer Perceptron and two Graph Neural Network variants on a dataset based on a modified IEEE 5-bus system. The results for this grid size show that Graph Neural Networks achieve the lowest losses. However, the availability of large training datasets remains the dominant factor influencing performance compared to architecture choice.
Keywords
Related papers
The Organization of Behavior
D. O. Hebb
2005
Fractional Brownian Motions, Fractional Noises and Applications
Benoît B. Mandelbrot, John W. Van Ness
1968
Review of deep learning: concepts, CNN architectures, challenges, applications, future directions
Laith Alzubaidi, Jinglan Zhang, Amjad J. Humaidi +7 more
2021
A guide to deep learning in healthcare
Andre Esteva, Alexandre Robicquet, Bharath Ramsundar +7 more
2018