Iterative Random Training Sampling Approaches to Hyperspectral Image Classification with and without Background
Links to Files
Permanent Link
Author/Creator
Author/Creator ORCID
Date
Type of Work
Department
Computer Science and Electrical Engineering
Program
Engineering, Electrical
Citation of Original Publication
Rights
Distribution Rights granted to UMBC by the author.
This item may be protected under Title 17 of the U.S. Copyright Law. It is made available by UMBC for non-commercial research and education. For permission to publish or reproduce, please see http://aok.lib.umbc.edu/specoll/repro.php or contact Special Collections at speccoll(at)umbc.edu
This item may be protected under Title 17 of the U.S. Copyright Law. It is made available by UMBC for non-commercial research and education. For permission to publish or reproduce, please see http://aok.lib.umbc.edu/specoll/repro.php or contact Special Collections at speccoll(at)umbc.edu
Subjects
Abstract
As one of fundamental tasks in remote sensing, hyperspectral image classification (HSIC) has attracted considerable interest. Among existing techniques, 3D Convolutional Neural Networks (3D-CNNs) are considered as powerful spectral-spatial classification (SSC) approaches due to their ability in automatically learning hierarchical spectral-spatial features. However, their performance depends heavily on large labeled datasets and requires significant computational resources. In addition, the background (BKG) issue in HSIC has been largely overlooked, despite being a key challenge in real-world applications. The first contribution of this dissertation is the development of a novel deep learning framework, named Iterative Random Training Sampling Convolutional Neural Network (IRTS-CNN). This framework integrates iterative random training sampling spectral-spatial classification (IRTS-SSC) with CNNs, allowing CNN models to iteratively update spatial information through a feedback-based spatial filtering mechanism. IRTS-CNN serves as a generalizable structure that can enhance any baseline CNN classifier through an iterative refinement process. Experimental results show that IRTS-CNN significantly improves CNN classification performance, especially when the training sample size is small.
To overcome the computational inefficiencies of deep learning methods, the second contribution made in this dissertation introduces a concept of the Iterative Gaussian-Laplacian Pyramid Network (IGLPN). A traditional CNN consists of a series of feedforward layers which is composed of convolutional (CL) and pooling (PL) sublayers. This architecture can be interpreted through a Gaussian Pyramid (GP), where each layer uses low-pass filtering and downsampling. Additionally, a Laplacian Pyramid (LP) can be constructed to capture differential information between consecutive layers. IGLPN leverages these structures to realize CNN functionality more efficiently. This structure reduces computational demands while improving classification accuracy.
In the past, most of HSIC methods assume BKG is removed by using ground truth (GT),. Unfortunately, this is impractical in real-world scenarios. HSIC that perform well without background (HSIC-NB) typically do not work well when BKG is included. To address this, a third contribution made in this dissertation develops a new approach to HSIC with background included (HSIC-B), called Hierarchical One-Class Detection (HOCD) which extends One-Class Detection (OCD) to a hierarchical framework guided by Class Classification Priority (CCP). Experiments demonstrate that HOCD achieves robust accuracy with minimal degradation in the presence of background.
These three contributions advance HSIC by enhancing adaptability to limited labeled data samples, reducing resource demands, and addressing background challenges, as validated through extensive experiments.
