What is Indirect Hard Modeling?Indirect Hard Modeling (IHM) is a method for predicting unknown concentrations from spectra of mixtures, in particular mid-infrared (MIR), Raman, and NMR spectra, but any peak-shaped signal may work. Also, literally taken, the name describes a procedure; the procedure of building a Hard Model. Why indirectly? Well, this is explained later.
The Hard Model of IHMA mathematical function is called physical model or Hard Model if it is derived from equations representing the physics behind an underlying process. In contrast, a function is a polynomial model or Soft Model if it is derived empirically or statistically. As for IHM, the Hard Model is derived from the physics of molecular spectroscopy; the function is a mathematical representation of a mixture spectrum.
|Mixture spectrum (black) is composed of superimposed peaks (cyan)|
|Hard Model (red) is a weighted sum of Component Models (blue)|
|Flexibility of |
The Calibration Model of IHMThe goal of calibration is to quantitatively convert measurements made on one measurement scale to another measurement scale. The functional relationship between both scales is established by regression. While a direct conversion from spectra to concentrations would require complex multivariate regression techniques (see e.g. Partial Least Squares regression), the relationship between IHM component weights (resulting from fitting the Hard Model to a measured spectrum) and concentrations can be determined by simple univariate regression. The benefit of univariate regression is a well-established methodology with a sound statistical underpinning, especially regarding validation of the regression function.
|Calibration model of IHM: |
Regression of known concentrations on component weights.
“Indirect” Model BuildingModel building typically involves two steps: determination of the model’s structure and estimation of initial model parameters. The structure of the Hard Model of IHM is already known to be a weighted sum of component models which again are sums of peak-curves, e.g. Gaussian or Lorentzian profiles. Therefore, the task reduces to building the model for a very specific mixture, i.e. choosing the number of peak-curves and their initial parameters (position, intensity, width, shape) in order for the model to represent the mixture's spectrum.
Due to overlapping peaks in the mixture spectrum, one cannot simply choose peak-curves for all measured peaks directly. Instead one needs an indirect approach with a little help of the pure component spectra. For each pure component spectrum a separate Hard Model is built in a first step by automatic peak fitting. In order to set up all individual Component Models one needs to know all pure component spectra. (See related article about HMFA if not all pure component spectra are known). In the final step, these Component Models are combined to a Mixture Model.
A question may arise so far: Why bothering with building a pure component model and not simply use those measured pure component spectra instead? First of all, this idea is already picked up and implemented in a method called Classical Least Squares. But more importantly, the measured pure component spectra are stiff data vectors while the model has flexible parameters that can be adjusted to the many spectral variations that may occur in a mixture spectrum.
Properties of IHM
- All pure components of the mixture must be known (or at least all components contributing to the spectral signal within the selected spectral range)
- The Hard Model must fit the mixture spectrum well. A lack-of-fit would induce systematic errors, especially in case of small concentrations.
- IHM uses a physically motivated Hard Model which is easily interpretable because it resembles a mixture spectrum. This kind of modeling allows for better prediction and is less subject to variation than Soft Models (as long as the underlying physical process doesn't change).
- The component weight parameter of the Hard Model is a very suitable parameter for calibration because it is highly selective for a certain component of interest.
- The Hard Model contains flexible parameters that are automatically adjusted during model fitting to correct spectral effects like peak shifts and shape changes. Error propagation to the important component weight parameter is reduced. Less training samples are necessary for calibration.
- Using surrogate component weights instead of full spectra for calibration allows for simple univariate regression instead of complex multivariate regression. This enables thorough model validation, e.g. uncertainty estimation, prediction intervals, and other figures of merit.
- Instead of classical calibration, IHM uses inverse calibration which has been proven to be better [Tellinghuisen2000, Krutchkoff1967].
- E. Kriesten, F. Alsmeyer, A. Bardow, and W. Marquardt (2008). “Fully automated indirect hard modeling of mixture spectra”, Chemometrics and Intelligent Laboratory Systems, Vol. 91, pp. 181–193.
- F. Alsmeyer, H.-J. Koß, and W. Marquardt (2004). “Indirect Spectral Hard Modeling for the Analysis of Reactive and Interacting Mixtures”, Journal of Applied Spectroscopy, Vol. 58 (8), pp. 975-986.
- J. Tellinghuisen (2000). “Inverse vs. classical calibration for small data sets”, Fresenius Journal of Analytical Chemistry, Vol. 368, pp. 585-588
- R. G. Krutchkoff (1967). “Classical and Inverse Methods of Calibration”, Technometrics, Vol. 9, pp. 425-439.