Endmember Extraction on the Grassmannian
Endmember extraction plays a prominent role in a variety of data analysis problems as endmembers often correspond to data representing the purest or best representative of some feature. Identifying endmembers then can be useful for further identification and classification tasks. In settings with high-dimensional data, such as hyperspectral imagery, it can be useful to consider endmembers that are subspaces as they are capable of capturing a wider range of variations of a signature. The endmember extraction problem in this setting thus translates to finding the vertices of the convex hull of a set of points on a Grassmannian. In the presence of noise, it can be less clear whether a point should be considered a vertex. In this paper, we propose an algorithm to extract endmembers on a Grassmannian, identify subspaces of interest that lie near the boundary of a convex hull, and demonstrate the use of the algorithm on a synthetic example and on the 220 spectral band AVIRIS Indian Pines hyperspectral image.
Endmember Extraction on the Grassmannian
|Elin Farnell, Henry Kvinge, Michael Kirby, Chris Peterson ††thanks: This paper is based on research partially supported by the National Science Foundation under Grants No. DMS-1513633, and DMS-1322508.|
|Colorado State University|
|Department of Mathematics|
|Fort Collins, CO 80523-1874|
Index Terms— Endmember, Grassmannian, convex hull, hyperspectral imagery
There is increasing evidence that for certain types of high-dimensional data, particularly when the data contains samples of similar objects in a variety of state, it is advantageous to work with subspaces of the ambient data space rather than individual points. Examples of this include improved classification of signals in hyperspectral images [1, 2] and identification of ultra-low resolution grayscale face images via illumination spaces . In these examples, the underlying geometric framework is that of Grassmann manifolds. The real Grassmann manifold is a topological manifold whose points parameterize all possible -dimensional subspaces of . There are many distinct ways to impose a geometric structure on . The most commonly used include geometries imposed by the geodesic, chordal, and Fubini-Study metrics. Each of these geometries has its own distinct characteristics and its own distinct advantages and disadvantages. Although the geometry of the Grassmannian, with respect to each of these various metrics, is well understood from a theoretical perspective, many of the standard methods for studying data in Euclidean space do not yet have analogues on the Grassmannian. In this paper we will describe a method for computing endmembers and other points of interest within a collection of points on the Grassmann manifold .
Given a set of data points in , one can consider the convex hull of which is defined as the smallest convex set containing . If is finite then will be a convex polytope and one can ask for the vertices of this polytope. The vertices are sometimes known as endmembers. Interest in endmembers stems from their relationship to extrema of linear functionals on X and thus endmember identification exposes extremal elements in . Note that for any datapoint lying in the convex hull of we can express as a convex combination of its endmembers, i.e. there exists an expression
where and . Thus, in some sense, the endmembers of represent the most novel signals in and every other point in can be recognized as a convex combination of these fundamental elements. The coefficients are known as the fractional abundances of in . The determination of endmembers of a hyperspectral image is a particularly well-studied problem which is a key step of the more general problem of decomposing each pixel into its pure signals, also known as the spectral unmixing problem . There are a significant number of algorithms for endmember extraction. These include the pixel purity index (PPI) , N-FINDR , optical real-time adaptive spectral identification system (ORASIS) , iterative error analysis (IEA) , convex cone analysis (CCA) , vertex component analysis , orthogonal subspace projection (OSP) technique , automated morphological endmember extraction (AMEE) , and simulated annealing algorithm (SAA) . We note in particular that the algorithm presented in this paper can be seen to address some of the same challenges that the SAA algorithm recognizes; namely those of endmember variability.
While the algorithm described in this paper might be accurately described as non-linear endmember extraction, the framework is different from other established methods falling under this designation . Motivating the latter is the observation that under certain circumstances, the spectral interactions between endmembers in hyperspectral images can be non-linear. In other words, the data is sometimes better modeled as living on a non-linear submanifold (unknown before-hand) which is embedded in . In our case, the non-linearity arises from the fact that we take subspaces of rather than single points. We consider these as points on a Grassmannian, thus giving rise to an immediate non-linear framework (irrespective of the particularities of the data).
2 Endmember Extraction Algorithm
We design our endmember extraction algorithm around the exploitation of an isometric embedding of a Grassmannian manifold into Euclidean space followed by a projection into a lower dimensional Euclidean space. Once in this setting, we can utilize known algorithms for discovering vertices of a convex hull. Consider a set of points on the Grassmannian We define vertices of the convex hull of on to be those points in with indices obtained in the following way.
Construct a distance matrix for using chordal distance on (the projection Frobenius norm).
Use Multidimensional Scaling (MDS) to find an embedding for that is an isometry.
Apply the Convex Hull Stratification Algorithm (CHSA) to identify the indices of the vertices of the convex hull in Euclidean space.
Via the bijection between and the indices are precisely the indices for points in that are vertices of the convex hull of on
To be precise, we elaborate on these steps here. Chordal distance is defined as follows. If then correspond to two -dimensional subspaces of . The chordal distance is defined as
where are the principal angles between the subspaces and For convenience, one may compute chordal distance in the following way. If are orthonormal bases for two -dimensional subspaces and in then
where is the Frobenius matrix norm. For more context on the chordal distance metric, see, e.g., [15, 16]. The choice of chordal distance as a metric on the Grassmannian is notable; as shown in , the Grassmannian can be isometrically embedded when this metric is used.
Multidimensional scaling provides a means of defining a collection of points in Euclidean space whose pairwise distances are as faithful as possible to their original distances on some manifold. See [17, 18, 19, 20, 21, 22] for full details. The procedure is described briefly below. Given a distance matrix compute matrices and where and is the result of double-centering That is, if then The MDS embedding is provided via the eigenvectors and eigenvalues of Specifically, if is the eigendecomposition of then the rows of the matrix give a collection of points in whose pairwise distances come as close as possible to provided each
The Convex Hull Stratification Algorithm proposed in  and  utilizes an optimization problem that attempts to represent each point in a data set as a convex combination of its nearest neighbors. Note that vertices of the convex hull cannot be represented as a convex combination of its nearest neighbors for any . Those points whose coefficient vectors require at least one negative component are deemed to be points of interest and include the vertices of the convex hull. Further, the norm of the coefficient vector can be used to stratify the points by their proximity to the boundary of the convex hull. Specifically, one solves the following optimization problem for each in the data set:
subject to where the set is the set of nearest neighbors to in the data set. Those vectors with an associated vector for which at least one component of is less than zero necessarily contain the vertices of the convex hull. Additionally, a measure of boundary-proximity is provided by the -norm of the associated vectors
To summarize, our algorithm exploits an isometric embedding of the Grassmannian into Euclidean space, where well-known convex hull algorithms may be applied. Thus it is computationally manageable and serves as a useful technique in applied settings where a “pure representative” for a feature may be better represented as a span of several realizations of the “pure representative.”
The computational complexity of the Grassmann Endmember Extraction Algorithm is dominated by the complexity of MDS and CHSA as well as that of constructing a distance matrix for points on Classical MDS is known to be but faster versions exist (see, e.g. ). For details on the complexity of CHSA, see . A chordal distance matrix can be computed in polynomial time. Combining these three complexities, we conclude that the Grassmann Endmember Extraction Algorithm is a polynomial time algorithm.
3 Application: Simplex Embedding
Given a set of subspaces of , a Singular Value Decomposition (SVD) based algorithm can be used to compute a weighted flag mean of the set of . The underlying theory for the construction of the flag mean is based on the chordal metric as this allows for the very direct (and fast) SVD-based algorithm. The input for the algorithm consists of orthonormal bases for each of together with real-valued weights . The output of the flag mean algorithm consists of an ordered sequence of orthonormal vectors (defined only up to sign where is the dimension of the span of all of the considered together). From the one can construct a full flag of nested vector spaces by defining .
If we restrict to all have the same dimension then for each choice of weights we can consider the -dimensional component of the flag . In this manner, we can think of , as determined by the weighted flag mean, as representing a kind of convex combination of the as points on the Grassmann manifold .
In the following example, we consider random points then we determine 4997 random weighted flag means of these points, for a total of 5000 points. We use Grassmann endmember extraction to recover the three -dimensional subspaces of that were used to construct the data set.
In Figure 1, we show the results of the MDS mapping of the set of points on described above to its best -dimensional space. Note that the mapping appears to faithfully represent the geometry and distance relationships present in the original points on the Grassmannian.
After applying our algorithm to this data set, we threshold by the norm of the weight vector and display those points that are thus identified as endmembers on the Grassmannian (Figure 1). The algorithm successfully identified the vertices that generated the data set, providing a proof of concept of the algorithm.
4 Application: Indian Pines Hyperspectral Imagery
We apply our algorithm to the Indian Pines data set and present a visualization of the extracted endmembers in that context.
The Indian Pines data set consists of 220 spectral bands at a resolution of pixels. We use a corrected data set that has bands from the region of water absorption removed, resulting in 200 remaining spectral bands.111The data is available at http://www.ehu.eus/ccwintco/index.php/ Hyperspectral_Remote_Sensing_Scenes. The data was collected with an Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) at the Indian Pines test site in Indiana . The scene contains various classes, such as woods, grass, corn, alfalfa, and buildings.
We begin by obtaining points on the Grassmannian by collecting local patches of spectral data. We choose to take regions of size pixels. The span of the 9 vectors in a fixed region determine a point in Define this set to be We compute pairwise chordal distances between elements of on and define a distance matrix The corresponding MDS embedding into is shown in Figure 3.
For comparison, consider the embedding shown in Figure 2. Here, we define points on by taking spectral information from random collections of pixel locations, where each draw is made without replacement from a set of pixels that has been manually identified to consist of a particular class (e.g. corn, alfalfa). Call this set In Figure 2, an effort has been made to color embedded points by class in such a way that the classes are visually distinguishable. Notably, points within the same class tend to cluster together. Thus, it is reasonable to expect that endmembers extracted from the embedding of spectral information from local patches are likely to correspond to pure forms of various classes.
In Figure 3, we show the embedding of into We note that by varying parameters in the application of CHSA, one may obtain varying numbers of identified vertices. Thus, depending on the application, one may choose to identify all points that are ‘near’ the boundary or to identify only those vertices that are minimally necessary to define a convex hull. In our setting, we choose to define the number of nearest neighbors to be 7, the -norm parameter to be and the -norm parameter to be
We can gain some insight into the meaning of the extracted endmembers by visualizing the locations of the local patches that gave rise to those points on the Grassmannian. In Figure 4, we show the local patches whose corresponding subspaces are vertices. We conjecture that these locations carry valuable information about various substance classes.
The convex hull of a set of data points in the manifold setting has significant meaning for a variety of applications. We propose a method that incorporates manifold geometry appropriately in the setting where data naturally lives on a Grassmann manifold. Our method employs an approximately isometric embedding in Euclidean space to identify vertices of the convex hull of a set of points on the Grassmannian. We provide a proof-of-concept of the algorithm in a synthetic example, where we successfully extract the vertices used to generate a convex collection of points on a Grassmannian. We further demonstrate the potential usefulness of the algorithm via an example with hyperspectral imagery, where the identified points on the Grassmannian correspond to meaningful locations in the Indian Pines scene.
-  S. Chepushtanova and M. Kirby, “Classification of hyperspectral imagery on embedded Grassmannians,” in 2014 6th Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing (WHISPERS), June 2014, pp. 1–4.
-  Sofya Chepushtanova and Michael Kirby, “Sparse Grassmannian embeddings for hyperspectral data representation and classification,” IEEE Geoscience and Remote Sensing Letters, vol. 14, no. 3, pp. 434–438, 2017.
-  Jen-Mei Chang, Michael Kirby, Holger Kley, Chris Peterson, Bruce Draper, and J. Ross Beveridge, “Recognition of digital images of the human face at ultra low resolution via illumination spaces,” in Computer Vision – ACCV 2007, Yasushi Yagi, Sing Bing Kang, In So Kweon, and Hongbin Zha, Eds., Berlin, Heidelberg, 2007, pp. 733–743, Springer Berlin Heidelberg.
-  J. M. Bioucas-Dias, A. Plaza, N. Dobigeon, M. Parente, Q. Du, P. Gader, and J. Chanussot, “Hyperspectral unmixing overview: Geometrical, statistical, and sparse regression-based approaches,” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol. 5, no. 2, pp. 354–379, April 2012.
-  J. W. Boardman, F. A. Kruse, and R.O. Green, “Mapping target signatures via partial unmixing of AVIRIS data,” Fifth JPL Airborne Geoscience Workshop, 1995, pp. 23–26, 1995.
-  Michael E. Winter, “N-findr: an algorithm for fast autonomous spectral end-member determination in hyperspectral data,” 1999.
-  Jeffrey H Bowles, Peter J Palmadesso, John A Antoniades, Mark M Baumback, and Lee J Rickard, “Use of filter vectors in hyperspectral data analysis,” in Infrared Spaceborne Remote Sensing III. International Society for Optics and Photonics, 1995, vol. 2553, pp. 148–158.
-  R Neville, “Automatic endmember extraction from hyperspectral data for mineral exploration,” in International Airborne Remote Sensing Conference and Exhibition, 4 th/21 st Canadian Symposium on Remote Sensing, Ottawa, Canada, 1999.
-  A. Ifarraguerri and C. I. Chang, “Multispectral and hyperspectral image analysis with convex cones,” IEEE Transactions on Geoscience and Remote Sensing, vol. 37, no. 2, pp. 756–770, Mar 1999.
-  J. M. P. Nascimento and J. M. B. Dias, “Vertex component analysis: a fast algorithm to unmix hyperspectral data,” IEEE Transactions on Geoscience and Remote Sensing, vol. 43, no. 4, pp. 898–910, April 2005.
-  J. C. Harsanyi and C. I. Chang, “Hyperspectral image classification and dimensionality reduction: an orthogonal subspace projection approach,” IEEE Transactions on Geoscience and Remote Sensing, vol. 32, no. 4, pp. 779–785, Jul 1994.
-  A. Plaza, P. Martinez, R. Perez, and J. Plaza, “Spatial/spectral endmember extraction by multidimensional morphological operations,” IEEE Transactions on Geoscience and Remote Sensing, vol. 40, no. 9, pp. 2025–2041, Sep 2002.
-  C. A. Bateson, G. P. Asner, and C. A. Wessman, “Endmember bundles: a new approach to incorporating endmember variability into spectral mixture analysis,” IEEE Transactions on Geoscience and Remote Sensing, vol. 38, no. 2, pp. 1083–1094, Mar 2000.
-  N. Dobigeon, J. Y. Tourneret, C. Richard, J. C. M. Bermudez, S. McLaughlin, and A. O. Hero, “Nonlinear unmixing of hyperspectral images: Models and algorithms,” IEEE Signal Processing Magazine, vol. 31, no. 1, pp. 82–94, Jan 2014.
-  John H. Conway, Ronald H. Hardin, and Neil J. A. Sloane, “Packing lines, planes, etc.: Packings in Grassmannian spaces,” Experimental Mathematics, vol. 5, no. 2, pp. 139–159, 1996.
-  Inderjit S Dhillon, Jr RW Heath, Thomas Strohmer, and Joel A Tropp, “Constructing packings in Grassmannian manifolds via alternating projection,” Experimental mathematics, vol. 17, no. 1, pp. 9–35, 2008.
-  Kanti V Mardia, “Some properties of clasical multi-dimesional scaling,” Communications in Statistics-Theory and Methods, vol. 7, no. 13, pp. 1233–1241, 1978.
-  Warren S Torgerson, “Multidimensional scaling: I. theory and method,” Psychometrika, vol. 17, no. 4, pp. 401–419, 1952.
-  Warren S. Torgerson, Theory and methods of scaling, Robert E. Krieger Publishing Co., Inc., Melbourne, FL, 1985, With a foreword by Harold Gulliksen, Reprint of the 1958 original.
-  John C Gower, “Some distance properties of latent root and vector methods used in multivariate analysis,” Biometrika, vol. 53, no. 3-4, pp. 325–338, 1966.
-  Isaac J Schoenberg, “Remarks to Maurice Frechet’s article “sur la definition axiomatique d’une classe d’espace distances vectoriellement applicable sur l’espace de Hilbert,” Annals of Mathematics, pp. 724–732, 1935.
-  Gale Young and Aiston S Householder, “Discussion of a set of points in terms of their mutual distances,” Psychometrika, vol. 3, no. 1, pp. 19–22, 1938.
-  Lori Ziegelmeier, Michael Kirby, and Chris Peterson, “Stratifying high-dimensional data based on proximity to the convex hull boundary,” SIAM Review, vol. 59, no. 2, pp. 346–365, 2017.
-  Lori Beth Ziegelmeier, Exploiting geometry, topology, and optimization for knowledge discovery in big data, Ph.D. thesis, Colorado State University, 2013.
-  Matthew Chalmers, “A linear iteration time layout algorithm for visualising high-dimensional data,” Proceedings of Seventh Annual IEEE Visualization ’96, pp. 127–131, 1996.
-  Bruce Draper, Michael Kirby, Justin Marks, Tim Marrinan, and Chris Peterson, “A flag representation for finite collections of subspaces of mixed dimensions,” Linear Algebra and its Applications, vol. 451, pp. 15–32, 2014.
-  Marion F. Baumgardner, Larry L. Biehl, and David A. Landgrebe, “220 band AVIRIS hyperspectral image data set: June 12, 1992 Indian Pine test site 3,” Sep 2015.