AVS 51st International Symposium
    Applied Surface Science Wednesday Sessions
       Session AS-WeM

Paper AS-WeM7
Maximum Likelihood Principal Component Analysis of ToF-SIMS Spectral Images

Wednesday, November 17, 2004, 10:20 am, Room 210A

Session: Chemometric Analysis of Spectral or Image Data; XPS/TOF-SIMS Applications
Presenter: M.R. Keenan, Sandia National Laboratories
Correspondent: Click to Email

Many modern surface analytical instruments are able to acquire huge amounts of data in the form of spectral images. ToF-SIMS, for instance, can easily generate a complete mass spectrum at each point in a 2D or 3D spatial array. The challenge for the data analyst, then, is to garner the analytically useful information from the overwhelming quantity of raw spectral data. Factor analysis techniques such as Principal Component Analysis (PCA) have proven quite useful in this endeavor. Standard PCA, however, assumes that noise in the data is uniform, that is, that it does not depend on the magnitude of signal. This is clearly not correct for methods that rely on particle counting where the noise is governed by Poisson statistics. In this case, properly accounting for heteroscedasticity is essential to extracting the chemical information into a minimum number of factors while maximally excluding noise. Maximum Likelihood PCA (MLPCA) is one approach to addressing this issue. MLPCA can, in principle, incorporate a separate uncertainty estimate for each individual observation in a data set. This paper will present a MLPCA analysis of a simple and intuitive ToF-SIMS spectral image. The results show that there is a tradeoff between the number of uncertainty parameters included in the model and the quality of each and, in fact, using poor estimates may be worse than doing nothing at all. The best results were obtained by using a low-rank approximation to the noise rather than individual estimates. MLPCA will also be compared with an optimal scaling approach. For the particular example given, the added benefits of MLPCA probably do not outweigh the greatly increased computational demands of the technique. This work was completed at Sandia National Laboratories, a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy's National Nuclear Security Administration under contract DE-AC04-94AL85000.