ODEMI: One Dimensional Electromagnetic Inversion Dataset to Study Machine Learning and Lessons Learned

Author/Creator

Date

Department

Program

Citation of Original Publication

Rights

This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.

Subjects

Abstract

In the past a few years, electromagnetic inversion with machine learning has been one of the most popular subjects in the computational electromagnetics (CEM) society. Unlike many other more mature fields, e.g., computer vision, there is a lack of large datasets allowing both junior and senior members of the CEM society to experiment their machine learning algorithms with. In this talk, first, I would like to introduce a moderately large dataset: One Dimensional Electromagnetic Inversion (ODEMI), which includes hundreds of thousands of cases assuming different measurement configurations and multilayered medium scenarios [1]. In the second part, I will focus on some fundamental yet important lessons learned from our study conducted on this dataset using four machine learning methods: linear regression, k-nearest neighbors, random forests, and neural-networks [2]. In terms of accuracy, deep-learning outperforms the other aforementioned machine learning techniques at the expense of computational cost. Our numerical studies show that bringing antennas closer to the domain of interest and increasing the number of antennas help with reducing the error but this reduction is not linear. Similar to suggested mesh sampling density in classical CEM applications, use of 20-30 antennas per wavelength is the ideal case for the highest accuracy and further increase of the number of receiver antennas is not helpful. We also confirm an outcome of very recent studies carried by other research groups [3]: training and testing a learning system with broadband excitations outperforms the learning systems deploying single-frequency datasets. Last but not least, we show that training datasets created with random grids rather than uniform grids can significantly increase the inversion accuracy, especially for the single-frequency applications.