Overview

The Python package called parmest facilitates model-based parameter estimation along with characterization of uncertainty associated with the estimates. For example, parmest can provide confidence regions around the parameter estimates. Additionally, parameter vectors, each with an attached probability estimate, can be used to build scenarios for design optimization.

Functionality in parmest includes:

Model-based parameter estimation using experimental data
Covariance matrix estimation
Bootstrap resampling for uncertainty quantification
Confidence regions based on single or multi-variate distributions
Likelihood ratio test
Leave-N-out cross validation
Regularization for objective function improvement
Multi-start initialization optimization
Parallel processing

Background

The goal of parameter estimation is to estimate values for a vector, \(\boldsymbol{\theta}\), to use in the functional form

\[\boldsymbol{y}_i = \boldsymbol{f}\left(\boldsymbol{x}_{i}, \boldsymbol{\theta}\right) + \boldsymbol{\varepsilon}_i \quad \forall \; i \in \{1, \ldots, n\}\]

where \(\boldsymbol{y}_{i} \in \mathbb{R}^m\) are observations of the measured or output variables, \(\boldsymbol{f(\cdot)}\) is the model function, \(\boldsymbol{x}_{i} \in \mathbb{R}^{q}\) are the decision or input variables, \(\boldsymbol{\theta} \in \mathbb{R}^p\) are the model parameters, \(\boldsymbol{\varepsilon}_{i} \in \mathbb{R}^m\) are measurement errors, and \(n\) is the number of experiments.

The following least squares objective can be used to estimate model parameters from data assuming that the measurement errors follow a Gaussian distribution:

\[\min_{\boldsymbol{\theta}} \, g(\boldsymbol{x}, \boldsymbol{y};\boldsymbol{\theta}) \;\;\]

where \(g(\boldsymbol{x}, \boldsymbol{y};\boldsymbol{\theta})\) can be:

Sum of squared errors

If the measurement errors (which are assumed to follow a Gaussian distribution) are independent and identically distributed, the objective function can be defined as the sum of squared errors

\[g(\boldsymbol{x}, \boldsymbol{y};\boldsymbol{\theta}) = \sum_{i = 1}^{n} \left(\boldsymbol{y}_{i} - \boldsymbol{f}(\boldsymbol{x}_{i};\boldsymbol{\theta}) \right)^\text{T} \left(\boldsymbol{y}_{i} - \boldsymbol{f}(\boldsymbol{x}_{i};\boldsymbol{\theta})\right)\]
Weighted sum of squared errors

When the measurement errors are correlated and their covariance matrix, \(\boldsymbol{\Sigma}_{\boldsymbol{y}}\), is known a priori, the objective function is defined as the weighted sum of squared errors

\[g(\boldsymbol{x}, \boldsymbol{y};\boldsymbol{\theta}) = \frac{1}{2} \sum_{i = 1}^{n} \left(\boldsymbol{y}_{i} - \boldsymbol{f}(\boldsymbol{x}_{i};\boldsymbol{\theta}) \right)^\text{T} \boldsymbol{\Sigma}_{\boldsymbol{y}}^{-1} \left(\boldsymbol{y}_{i} - \boldsymbol{f}(\boldsymbol{x}_{i};\boldsymbol{\theta})\right)\]

Custom objectives can also be defined for parameter estimation.

In the applications of interest to us, the function \(g(\cdot)\) is usually defined as an optimization problem with a large number of (perhaps constrained) optimization variables, a subset of which are fixed at values \(\boldsymbol{x}\) when the optimization is performed. In other applications, the values of \(\boldsymbol{\theta}\) are fixed parameter values, but for the problem formulation above, the values of \(\boldsymbol{\theta}\) are the primary optimization variables. Note that in general, the function \(g(\cdot)\) will have a large set of parameters that are not included in \(\boldsymbol{\theta}\).