Äijälä , M , Heikkinen , L , Fröhlich , R , Canonaco , F , Prevot , A S H , Junninen , H , Petäjä , T , Kulmala , M , Worsnop , D & Ehn , M 2017 , ' Resolving anthropogenic aerosol pollution types - deconvolution and exploratory classification of pollution events ' , Atmospheric Chemistry and Physics , vol. 17 , no. 4 , pp. 3165-3197 . https://doi.org/10.5194/acp-17-3165-2017
Title: | Resolving anthropogenic aerosol pollution types - deconvolution and exploratory classification of pollution events |
Author: | Äijälä, Mikko; Heikkinen, Liine; Fröhlich, Roman; Canonaco, Francesco; Prevot, Andre S. H.; Junninen, Heikki; Petäjä, Tuukka; Kulmala, Markku; Worsnop, Douglas; Ehn, Mikael |
Contributor organization: | Department of Physics Polar and arctic atmospheric research (PANDA) |
Date: | 2017-03-01 |
Language: | eng |
Number of pages: | 33 |
Belongs to series: | Atmospheric Chemistry and Physics |
ISSN: | 1680-7316 |
DOI: | https://doi.org/10.5194/acp-17-3165-2017 |
URI: | http://hdl.handle.net/10138/178117 |
Abstract: | Mass spectrometric measurements commonly yield data on hundreds of variables over thousands of points in time. Refining and synthesizing this raw data into chemical information necessitates the use of advanced, statisticsbased data analytical techniques. In the field of analytical aerosol chemistry, statistical, dimensionality reductive methods have become widespread in the last decade, yet comparable advanced chemometric techniques for data classification and identification remain marginal. Here we present an example of combining data dimensionality reduction (factorization) with exploratory classification (clustering), and show that the results cannot only reproduce and corroborate earlier findings, but also complement and broaden our current perspectives on aerosol chemical classification. We find that applying positive matrix factorization to extract spectral characteristics of the organic component of air pollution plumes, together with an unsupervised clustering algorithm, k -means C C, for classification, reproduces classical organic aerosol speciation schemes. Applying appropriately chosen metrics for spectral dissimilarity along with optimized data weighting, the source-specific pollution characteristics can be statistically resolved even for spectrally very similar aerosol types, such as different combustion-related anthropogenic aerosol species and atmospheric aerosols with similar degree of oxidation. In addition to the typical oxidation level and source-driven aerosol classification, we were also able to classify and characterize outlier groups that would likely be disregarded in a more conventional analysis. Evaluating solution quality for the classification also provides means to assess the performance of mass spectral simi-larity metrics and optimize weighting for mass spectral variables. This facilitates algorithm-based evaluation of aerosol spectra, which may prove invaluable for future development of automatic methods for spectra identification and classification. Robust, statistics-based results and data visualizations also provide important clues to a human analyst on the existence and chemical interpretation of data structures. Applying these methods to a test set of data, aerosol mass spectrometric data of organic aerosol from a boreal forest site, yielded five to seven different recurring pollution types from various sources, including traffic, cooking, biomass burning and nearby sawmills. Additionally, three distinct, minor pollution types were discovered and identified as amine-dominated aerosols. |
Subject: |
POSITIVE MATRIX FACTORIZATION
QUALITY INTERACTIONS EUCAARI EUROPEAN INTEGRATED PROJECT FINE-PARTICLE COMPOSITION MASS-SPECTROMETER DATA VOLATILITY BASIS-SET NEW-YORK-CITY ORGANIC-AEROSOL SOURCE APPORTIONMENT HIGH-RESOLUTION 114 Physical sciences |
Peer reviewed: | Yes |
Rights: | cc_by |
Usage restriction: | openAccess |
Self-archived version: | publishedVersion |
Total number of downloads: Loading...
Files | Size | Format | View |
---|---|---|---|
acp_17_3165_2017.pdf | 3.021Mb |
View/ |