System Identification using Orthonormal Basis Filters

Lemma, D.T.; Ramasamy, M.; Shuhaimi, M.

ABSTRACT

The dynamic models most widely used for identification of linear time-invariant systems in the process industries are Auto Regressive with Exogenous Input (ARX) and Finite Impulse Response (FIR) models. Their popularity stems from the simplicity of developing the model. However, they need a very large amount of data to reduce the variance error and, in addition, ordinary ARX model structures lead to inconsistent model parameters. Orthonormal Basis Filter (OBF) model structures permit the incorporation of prior knowledge of the system in the form of one or more poles, which gives them the capacity to capture the system dynamics with a small number of parameters (parsimonious in parameters). In addition, the resulting OBF models are consistent in parameters. The model parameters can be easily estimated using the linear least squares method. In this study, OBF model development for a simulation case study and a real case study is presented.


INTRODUCTION

Models of real systems are used in practically all fields of science and engineering. In engineering, models are required for the design and development of new processes and for analyzing and improving existing processes. In process industries, models are used in controller design, optimization and fault detection and diagnosis. Models are extensively used in advanced process control design and implementation. Nearly all optimal control design techniques rely on a model of the system to be controlled. In Model Predictive Controllers (MPC), models are used to predict the future values of the output, which are used in calculating the optimum input values. The process of developing system models from experimental data is known as system identification.

A general linear dynamic model consists of deterministic and stochastic parts. According to this general model, the output is the sum of the input u(k) and the noise e(k), each filtered by its respective filter (Ljung, 1999; Nelles, 2001). Equation 1 represents the general linear model shown in Fig. 1.

$$ y(k) = \frac{B(q)}{F(q)A(q)}\,u(k) + \frac{C(q)}{D(q)A(q)}\,e(k) \tag{1} $$

This general model leads to a rather complicated model in which parameter estimation is usually difficult; it is therefore most commonly simplified by making assumptions on the polynomials A, B, C, D and F.


Fig. 1:	Block diagram for the general linear model

Some of the most commonly used linear models derived from this general model are described below.

Auto regressive with exogenous input (ARX): The ARX model is derived from the general linear model by assuming C(q) = D(q) = F(q) = 1. ARX models are very popular in industrial applications because of the simplicity of estimating the model parameters (Nelles, 2001).

$$ y(k) = \frac{B(q)}{A(q)}\,u(k) + \frac{1}{A(q)}\,e(k) \tag{2} $$

Auto regressive moving average with exogenous input (ARMAX): The ARMAX structure is derived from the general linear model by assuming D(q) = F(q) = 1. The parameters of the ARMAX model are calculated by nonlinear optimization or by extended least square method.

$$ y(k) = \frac{B(q)}{A(q)}\,u(k) + \frac{C(q)}{A(q)}\,e(k) \tag{3} $$

Output Error (OE): The output error structure is derived from the general linear model by assuming A(q) = C(q) = D(q) = 1, so that the noise enters the output unfiltered and no noise model is included. Estimation of the model parameters involves nonlinear optimization.

$$ y(k) = \frac{B(q)}{F(q)}\,u(k) + e(k) \tag{4} $$

Box Jenkins (BJ): The Box Jenkins structure is the most flexible among the linear model structures. It is derived from the general structure by assuming A(q) = 1 (Nelles, 2001).

$$ y(k) = \frac{B(q)}{F(q)}\,u(k) + \frac{C(q)}{D(q)}\,e(k) \tag{5} $$

Finite Impulse Response (FIR): The finite impulse response model is the simplest of the linear models. It is a linear combination of delay filters, q^-1, q^-2, ….

$$ y(k) = \sum_{i=1}^{m} b_i\, q^{-i}\, u(k) + e(k) \tag{6} $$

The FIR and ARX models are the most popular linear models in process industries because their parameters can be easily estimated using the linear least squares method. However, both models have major drawbacks. The FIR model requires a large number of parameters (it is non-parsimonious) to capture the system dynamics accurately, and the ARX model, for most practical systems, results in inconsistent parameters (Nelles, 2001). When a model is non-parsimonious, a large input-output data set is required to minimize the variance errors in the model parameters. When a model is inconsistent in parameters, there is a systematic error (bias) in the estimated model parameters that cannot be removed by increasing the number of data points.

The ARMAX is the next most commonly used model structure. Its model parameters can be estimated using nonlinear optimization or the extended least squares method. However, forcing the process and noise models to share the denominator dynamics A(q) may not suit many practical problems, in which the noise is not correlated with the input. BJ models are the most flexible of all the linear models. However, their application is very limited due to the difficulty in estimating the model parameters (Nelles, 2001). Estimation of BJ model parameters involves nonlinear optimization and, because of the large number of parameters, the structure is rarely applied to Multiple-Input Multiple-Output (MIMO) systems. One problem common to all these linear models is that prior knowledge of the time delay is required to estimate the model parameters accurately.

Recently, there has been significant progress in system identification based on Orthonormal Basis Filters (OBF) and their implementation in MPC and fault tolerant control (Patwardhan and Shah, 2005; Patwardhan et al., 2006). OBF models allow the incorporation of a priori knowledge of the system dynamics into the model and, because of this, they can capture the dynamics accurately with fewer parameters. Unlike ARX models, OBF models do not have the parameter inconsistency problem. OBF models are parsimonious in parameters compared to FIR and step response models (Nelles, 2001; Patwardhan and Shah, 2005; Van den Hof et al., 2005). The parameters of OBF models can be easily determined using the linear least squares method, and time delays can also be easily estimated and incorporated into the models (Patwardhan and Shah, 2005).
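The bias caused by parameter inconsistency can be illustrated with a minimal sketch. It assumes a hypothetical first-order system y0(k) = a y0(k-1) + b u(k-1) with white measurement noise added to the output (an output-error setting); the values a = 0.9, b = 0.1 and the noise level are illustrative choices, not taken from the case studies. A first-order ARX model is fitted by least squares for increasing data lengths.

```python
import random

def simulate(n, a=0.9, b=0.1, noise_std=0.2, seed=0):
    """Simulate the hypothetical system and add white noise to its output."""
    rng = random.Random(seed)
    u = [rng.choice((-1.0, 1.0)) for _ in range(n)]      # random binary input
    y_clean = [0.0] * n
    for k in range(1, n):
        y_clean[k] = a * y_clean[k - 1] + b * u[k - 1]
    y = [yc + rng.gauss(0.0, noise_std) for yc in y_clean]
    return u, y

def fit_arx_first_order(u, y):
    """Least-squares fit of y(k) = a*y(k-1) + b*u(k-1) via the 2x2 normal equations."""
    syy = syu = suu = sy1 = su1 = 0.0
    for k in range(1, len(y)):
        yp, up = y[k - 1], u[k - 1]
        syy += yp * yp
        syu += yp * up
        suu += up * up
        sy1 += yp * y[k]
        su1 += up * y[k]
    det = syy * suu - syu * syu
    a_hat = (sy1 * suu - su1 * syu) / det
    b_hat = (su1 * syy - sy1 * syu) / det
    return a_hat, b_hat

# Fit the same ARX structure on data sets of increasing length.
estimates = {n: fit_arx_first_order(*simulate(n)) for n in (1000, 10000, 100000)}
for n, (a_hat, b_hat) in estimates.items():
    print(f"N = {n:6d}: a_hat = {a_hat:.3f} (true 0.9), b_hat = {b_hat:.3f} (true 0.1)")
```

Under these assumptions the estimate of a settles well below the true value and does not approach it as the number of data points grows, which is the systematic error described above.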

The present study compares the accuracy of FIR, ARX and OBF models in two case studies, viz., (1) a simulated example, a SISO system and (2) a pilot-scale distillation column, a MIMO system.

A brief introduction to various basis filters used in OBF based system identification is provided in the next section together with techniques for the estimation of time delay and model parameters.

Orthonormal basis filters: The OBF models can be considered a generalization of FIR models in which the filters q^-1, q^-2, … are replaced with more complex orthonormal basis filters that allow the incorporation of prior knowledge of the system (Patwardhan and Shah, 2005; Van den Hof et al., 2000; Wahlberg, 1991). Two filters, f_m and f_n, are said to be orthonormal if they satisfy the property:

$$ \langle f_m, f_n \rangle = \begin{cases} 1, & m = n \\ 0, & m \neq n \end{cases} \tag{7} $$

where <,> represents the inner product defined on the set of all stable transfer functions. Thus, a stable system, G(q), can be approximately represented by a finite-length generalized Fourier series expansion as:

$$ G(q) = \sum_{i=1}^{n} L_i\, f_i(q) \tag{8} $$

where q is the forward shift operator, {L_i}, i = 1, 2, …, are the model parameters and {f_i(q)} are the orthonormal basis filters for the system G(q).

One of the important steps in OBF model development is the selection of an appropriate type of orthonormal basis filter. The various types of orthonormal basis filters are discussed below.

Laguerre filter: The Laguerre filters are first-order lag filters with one real pole. They are, therefore, more appropriate for well damped processes (Nelles, 2001; Patwardhan and Shah, 2005; Van den Hof et al., 2005). The Laguerre filters are given by:

$$ f_i(q) = \sqrt{1 - p^2}\,\frac{(1 - p\,q)^{i-1}}{(q - p)^{i}} \tag{9} $$

where p is the estimated pole.
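A minimal sketch of how a Laguerre filter bank can be realized in the time domain is given below. It assumes the standard form f_i(q) = sqrt(1 - p^2) (1 - p q)^(i-1) / (q - p)^i, implemented as a first-order lag followed by a cascade of identical all-pass sections; the function names and the pole value 0.8 are illustrative. Orthonormality is checked numerically from the (truncated) impulse responses.

```python
import math

def laguerre_outputs(u, p, n):
    """Filter the sequence u through the first n Laguerre filters (real pole |p| < 1)."""
    gain = math.sqrt(1.0 - p * p)
    outputs = []
    prev = None
    for i in range(n):
        x = [0.0] * len(u)
        for k in range(1, len(u)):
            if i == 0:
                # f_1(q) = sqrt(1 - p^2) / (q - p): first-order lag with one delay
                x[k] = p * x[k - 1] + gain * u[k - 1]
            else:
                # all-pass section (1 - p q) / (q - p) applied to the previous output
                x[k] = p * x[k - 1] + prev[k - 1] - p * prev[k]
        outputs.append(x)
        prev = x
    return outputs

def inner(a, b):
    """Inner product of two impulse responses (sum of products)."""
    return sum(x * y for x, y in zip(a, b))

# Impulse responses of the first four filters, truncated to 400 samples.
impulse = [1.0] + [0.0] * 399
h = laguerre_outputs(impulse, p=0.8, n=4)
gram = [[inner(h[i], h[j]) for j in range(4)] for i in range(4)]
for row in gram:
    print(["%.6f" % v for v in row])
```

The printed Gram matrix should be close to the identity, which is the orthonormality property stated for the basis filters.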

Kautz filter: Kautz filters allow the incorporation of a pair of conjugate complex poles; they are, therefore, effective for modeling weakly damped processes (Nelles, 2001; Patwardhan and Shah, 2005; Van den Hof et al., 2005). The Kautz filters are defined by

(10)

(11)

where

(12)

Generalized orthonormal basis filter: Heuberger et al. (1995) introduced the generalized orthonormal basis filters (GOBF) and showed the existence of orthogonal functions that, in a natural way, are generated by stable linear dynamic systems and form an orthonormal basis for the linear signal space \ell_2^n. They showed that pulse, Laguerre and Kautz filters are generated from inner functions and their minimal balanced realizations. Ninness and Gustafsson (1997) unified the construction of orthonormal basis filters. The GOBFs are formulated as:

$$ f_i(q, p) = \frac{\sqrt{1 - |p_i|^2}}{q - p_i}\,\prod_{j=1}^{i-1} \frac{1 - p_j^{*}\,q}{q - p_j} \tag{13} $$

where p = {p_j : j = 1, 2, …} is an arbitrary sequence of poles inside the unit circle appearing in complex conjugate pairs and p_j^* denotes the complex conjugate of p_j.

Markov-OBF: When a system involves time delay and an estimate of the time delay is available, Markov-OBF can be used. The time delay in Markov-OBF is included by placing some of the poles at the origin (Patwardhan and Shah, 2005). For a SISO system with dead time equal to d samples, the basis function can be selected as:

(14)

(15)

Estimation of time delay: Patwardhan and Shah (2005) presented a two-step method for estimating time delays from step response of GOBF models. In the first step, the time delays in all input-output channels are assumed zero and the model is identified with GOBF. In GOBF models, the time delay is approximated by a non-minimum phase zero and the corresponding step response is an inverse response. The time delay is then estimated from a tangent line drawn at the point of inflection.

A similar approach to determine the time delay is presented by Tufa et al. (2008). In this method, the time delay estimated by the previous method is divided into apparent and contributed time delays. The apparent time delay represents the true time delay and the contributed time delay represents the time delay due to the tail of the sigmoidal response curve which is significant for higher order systems. The latter method gives more accurate estimation of time delay when the order of the system is high.
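The tangent construction can be sketched as follows. This is a minimal illustration, not the procedure of the cited papers: it assumes the tangent at the steepest point of the step response is extended back to the initial level, and it uses a hypothetical critically damped second-order process with a true delay of 5 time units and a time constant of 4 as test data.

```python
import math

def delay_from_step_response(y, ts, y0=0.0):
    """Estimate a delay as the time at which the tangent at the steepest
    (inflection) point of a step response crosses the initial level y0."""
    direction = 1.0 if y[-1] >= y0 else -1.0     # sign of the final change
    best_k, best_slope = None, 0.0
    for k in range(1, len(y) - 1):
        slope = (y[k + 1] - y[k - 1]) / (2.0 * ts)   # central difference
        if direction * slope > direction * best_slope:
            best_k, best_slope = k, slope
    if best_k is None:
        raise ValueError("step response is flat; no inflection point found")
    return best_k * ts - (y[best_k] - y0) / best_slope

def second_order_step(n, ts, delay, tau):
    """Unit step response of a critically damped second-order lag with dead time."""
    y = []
    for k in range(n):
        t = k * ts - delay
        y.append(0.0 if t <= 0.0 else 1.0 - (1.0 + t / tau) * math.exp(-t / tau))
    return y

true_delay, tau, ts = 5.0, 4.0, 0.1
y = second_order_step(600, ts, true_delay, tau)
estimate = delay_from_step_response(y, ts)
print(f"true delay = {true_delay}, tangent estimate = {estimate:.3f}")
```

For this second-order example the tangent estimate exceeds the true delay by roughly (3 - e) times the time constant, which illustrates why a higher-order response contributes an additional delay on top of the apparent one.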

Estimation of GOBF poles: Finding an appropriate estimate of the poles for the filters is an important step in estimating the parameters of the OBF models. An arbitrary choice of poles may lead to a non-parsimonious model unless an iterative technique is used. Van den Hof et al. (2000) showed that for a SISO system with poles the rate of convergence of the model parameters is determined by the lowest magnitude of eigenvalue:

(16)

Therefore, a good approximation with a small number of parameters can be obtained by choosing a basis for which ρ is small. It has been shown that the poles determined by the method of Van den Hof et al. (2005) closely match the dominant poles of the system (Patwardhan and Shah, 2005; Wahlberg, 1991).

Model parameter estimation: Once the dominant poles of the system and the types of filters are determined, the parameter vector, θ, of the model is calculated by the linear least squares method (Eq. 17).

$$ \theta = (X^{T} X)^{-1} X^{T} y \tag{17} $$

where θ is the vector of model parameters, X is the regressor matrix and y is the output sequence.

The regressor matrix, X, is formed by filtering the input sequence u(k) with the corresponding filters f_i(q, p) and arranging them in a matrix form as shown in Eq. 18.


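$$ X = \begin{bmatrix} u_{f1}(1) & u_{f2}(1) & \cdots & u_{fn}(1) \\ u_{f1}(2) & u_{f2}(2) & \cdots & u_{fn}(2) \\ \vdots & \vdots & & \vdots \\ u_{f1}(N) & u_{f2}(N) & \cdots & u_{fn}(N) \end{bmatrix} \tag{18} $$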

where u_fi(k) = f_i(q, p) u(k).
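The estimation step can be sketched in a few lines. The example below is a minimal illustration under stated assumptions: the poles are real, the filters take the cascade form sqrt(1 - p_i^2)/(q - p_i) preceded by all-pass sections for the earlier poles, and the "plant" is a hypothetical noise-free first-order system with pole 0.8. The function names are illustrative, and the normal equations are solved directly for brevity (a QR-based solver would be numerically preferable for larger problems).

```python
import math
import random

def gobf_regressors(u, poles):
    """Columns u_fi(k) = f_i(q, p) u(k) of the regressor matrix (real poles assumed)."""
    n = len(u)
    w = list(u)                 # input after the all-pass sections of earlier poles
    cols = []
    for p in poles:
        gain = math.sqrt(1.0 - p * p)
        x = [0.0] * n
        w_next = [0.0] * n
        if n:
            w_next[0] = -p * w[0]
        for k in range(1, n):
            x[k] = p * x[k - 1] + gain * w[k - 1]              # sqrt(1 - p^2) / (q - p)
            w_next[k] = p * w_next[k - 1] + w[k - 1] - p * w[k]  # (1 - p q) / (q - p)
        cols.append(x)
        w = w_next
    return cols

def least_squares(cols, y):
    """Solve (X^T X) theta = X^T y by Gaussian elimination with partial pivoting."""
    m = len(cols)
    a = [[sum(ci * cj for ci, cj in zip(cols[i], cols[j])) for j in range(m)]
         + [sum(ci * yk for ci, yk in zip(cols[i], y))] for i in range(m)]
    for c in range(m):
        piv = max(range(c, m), key=lambda r: abs(a[r][c]))
        a[c], a[piv] = a[piv], a[c]
        for r in range(c + 1, m):
            f = a[r][c] / a[c][c]
            for j in range(c, m + 1):
                a[r][j] -= f * a[c][j]
    theta = [0.0] * m
    for i in reversed(range(m)):
        s = a[i][m] - sum(a[i][j] * theta[j] for j in range(i + 1, m))
        theta[i] = s / a[i][i]
    return theta

# Hypothetical plant: y(k) = 0.8 y(k-1) + 0.2 u(k-1), random binary input, no noise.
rng = random.Random(1)
u = [rng.choice((-1.0, 1.0)) for _ in range(500)]
y = [0.0] * len(u)
for k in range(1, len(u)):
    y[k] = 0.8 * y[k - 1] + 0.2 * u[k - 1]

theta = least_squares(gobf_regressors(u, poles=[0.8, 0.6]), y)
print(["%.6f" % t for t in theta])
```

Because the first basis pole coincides with the plant pole in this example, a single non-zero parameter (0.2/0.6) describes the plant and the second parameter is estimated as essentially zero, which illustrates how a well-chosen pole keeps the model parsimonious.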

If an estimate of the dominant pole is not available, an iterative technique can be employed where an arbitrary sequence of poles can be used as a starting point and better estimates of the dominant poles are obtained from the noise-free step response of the GOBF model. The iterative technique for estimating the poles and the deterministic part of the OBF model is explained in Tufa et al. (2008).

PRESENT STUDY

In this study, the advantages of OBF models over FIR and ARX models are first illustrated through a simulation case study. OBF and OBF plus ARMA noise models are then developed for a real plant case study of a MIMO system.

Case study 1: In this case study, OBF, ARX and FIR models are developed from the same data set generated by simulation from a system represented by Fig. 1 using SIMULINK. The input is a ‘PRBS’ data set generated using the idinput function in MATLAB. The model of the system to be identified is given by Eq. 19.

(19)

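The prediction capability of the various models is compared using the Percentage Prediction Error (PPE) defined by Eq. 20.

$$ \mathrm{PPE} = \frac{\sum_{k=1}^{N} \left( y(k) - \hat{y}(k) \right)^2}{\sum_{k=1}^{N} \left( y(k) - \bar{y} \right)^2} \times 100 \tag{20} $$

where \bar{y} represents the mean value of the measurements y(k) and \hat{y}(k) is the predicted value of y(k).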
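A minimal sketch of the PPE computation is shown below. It assumes the common definition in which the sum of squared prediction errors is normalized by the sum of squared deviations of the measurements from their mean; the function name and the sample values are illustrative.

```python
def percentage_prediction_error(y, y_hat):
    """PPE = 100 * sum((y - y_hat)^2) / sum((y - mean(y))^2)."""
    if len(y) != len(y_hat) or not y:
        raise ValueError("y and y_hat must be non-empty and of equal length")
    mean_y = sum(y) / len(y)
    num = sum((a - b) ** 2 for a, b in zip(y, y_hat))
    den = sum((a - mean_y) ** 2 for a in y)
    if den == 0.0:
        raise ValueError("measurements are constant; PPE is undefined")
    return 100.0 * num / den

measured = [1.0, 2.0, 3.0, 4.0]
predicted = [1.1, 1.9, 3.2, 3.8]
ppe = percentage_prediction_error(measured, predicted)
print(f"PPE = {ppe:.2f} %")
```

With this definition a perfect prediction gives 0 and a model that only predicts the mean of the measurements gives 100.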


Fig. 2:	The input-output data used for model development for the case of 1000 data points


Fig. 3:	The predictions of the GOBF, FIR and ARX models for 500 validation data points compared to the actual output, using 1000 data points for model development

Two thousand data points are generated and the 1000 data points shown in Fig. 2 are used for model development while the remaining 1000 data points are used for validation. Five hundred of the validation data are depicted in Fig. 3.

The predictions of the GOBF, FIR and ARX models, with 8, 40 and 8 model parameters, respectively, are compared in Fig. 3 with the noise-free response of the original system for the validation data (simulation in all cases). Figure 3 shows that the GOBF model is much closer to the response of the original system than the other two models.

The percentage prediction error, PPE, of the GOBF, FIR and ARX models developed with 500, 1000, 2000, 3000 and 4000 data points and with various numbers of model parameters is given in Table 1. For the comparison, the number of parameters of the GOBF model is fixed at 8.

Table 1:	Percentage prediction error of the various models with different number of parameters and data points

It is observed from the table that the FIR model requires between 40 and 60 parameters to describe the system as accurately as the GOBF model with 8 parameters. In addition, as the number of data points increases, the percentage prediction error decreases for the GOBF model. In the case of the ARX model, increasing the number of model parameters improves the accuracy. However, the accuracy does not improve with an increasing number of data points. This shows the inconsistency problem of ARX models: the bias of the parameters cannot be eliminated by increasing the number of data points.

Case study 2: In this case study, a GOBF model is developed for a binary distillation column. The distillation column is a part of a reaction-separation system where the product stream from the reactor becomes the feed stream for the distillation column. Isopropyl Alcohol (IPA) is dehydrogenated in the catalytic packed bed tubular reactor. The products from the reactor, acetone and hydrogen, together with un-reacted IPA are cooled in a plate heat exchanger and sent to a vapor-liquid separator where hydrogen is separated from condensed acetone and IPA. This acetone-IPA mixture is stored in an intermediate storage vessel and fed to the distillation column for separation. The bottom product of the column consisting mainly of IPA is recycled back to the reactor. In the present study, the distillation column is operated alone with acetone-IPA mixture as the feed and the product streams are recombined. A snapshot of the 5.5 m high distillation column is shown in Fig. 4. The major dimensions and nominal operating conditions of the distillation column are given in Table 2.

The input sequences are designed as low-frequency Pseudo Random Binary Signals (PRBS) generated using the idinput function in MATLAB with band [0 0.04] and levels of 18 and 22 kg h^-1 and 0.4 and 0.8 L min^-1 for the steam and reflux flow rates, respectively. Four thousand data points are collected with a sampling interval of 5 s. The first three thousand data points are used for model identification and the rest are used for validation. The input-output data used for identification of the distillation column are depicted in Fig. 5.

Table 2:	Major dimensions and nominal operating conditions of the distillation column


Fig. 4:	The pilot-scale distillation column

GOBF model: The transfer function of the distillation column is given in the following form:

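$$ \begin{bmatrix} y_1(k) \\ y_2(k) \end{bmatrix} = \begin{bmatrix} G_{11}(q) & G_{12}(q) \\ G_{21}(q) & G_{22}(q) \end{bmatrix} \begin{bmatrix} u_1(k) \\ u_2(k) \end{bmatrix} + \begin{bmatrix} H_1(q) & 0 \\ 0 & H_2(q) \end{bmatrix} \begin{bmatrix} e_1(k) \\ e_2(k) \end{bmatrix} \tag{21} $$

where G_ij are the GOBF models, H_i are the stochastic parts of the model and e₁, e₂ are the innovation sequences.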


Fig. 5:	The input-output sequence used for identification for changes in (a) steam flow rate and (b) reflux flow rate in the pilot scale distillation column


Fig. 6:	The prediction of the GOBF model with and without the noise model compared to the measured data for the distillation column

Individual GOBF models with eight terms each are developed with alternating poles 0.7788 and 0.8187. The estimated GOBF model parameters corresponding to the transfer functions are:

The GOBF model and the GOBF plus noise model are compared to the actual measured output in Fig. 6.

The result shows that a GOBF model can capture the dynamics of the distillation column with good accuracy and that models of MIMO systems can be easily developed using GOBF model structures.

CONCLUSION

OBF models capture the dynamics of linear systems with a much smaller number of parameters than FIR models. GOBF model structures enable consistent parameter estimation, while ARX leads to inconsistent parameter estimation. GOBF model parameters are estimated using the linear least squares method. If the system involves a time delay, an iterative procedure can be employed to estimate the time delay and the model parameters simultaneously. In addition, it is demonstrated that the GOBF model can be extended to develop a MIMO model for a real pilot-scale distillation column. It is also illustrated that the stochastic part of the model can be developed using the residual sequence obtained from the noise-free GOBF model.

REFERENCES

Heuberger, P.S.C., P.M.J. Van den Hof and O.H. Bosgra, 1995. A generalized orthonormal basis for linear dynamical systems. IEEE Trans. Automatic Control, 40: 451-465.
Tufa, L.D., M. Ramasamy, S.C. Patwardhan and M. Shuhaimi, 2008. Development of second order plus time delay (SOPTD) model from orthonormal basis filter (OBF) model. Proceedings of the UKACC International Conference on Control, Sept. 2-4, University of Manchester, pp: 1-6.
Ljung, L., 1999. System Identification: Theory for the User. 2nd Edn., Prentice Hall PTR, London, UK., ISBN-13: 9780136566953, Pages: 609.
Nelles, O., 2001. Nonlinear System Identification. Springer, New York.
Ninness, B.M. and F. Gustafsson, 1997. A unifying construction of orthonormal bases for system identification. IEEE Trans. Automatic Control, 42: 515-521.
Patwardhan, S.C. and S.L. Shah, 2005. From data to diagnosis and control using generalized orthonormal basis filters, Part I: Development of state observers. J. Process Control, 15: 819-835.
Patwardhan, S.C., S. Manuja, S. Narasimhan and S.L. Shah, 2006. From data to diagnosis and control using generalized orthonormal basis filters, Part II: Model predictive and fault tolerant control. J. Process Control, 16: 157-175.
Van den Hof, P., B. Wahlberg, P. Heuberger, B. Ninness, J. Bokor and T. Oliveira e Silva, 2000. Modeling and identification with rational orthonormal basis functions. Proceedings of IFAC SYSID, (IFACSYSID'02), Santa Barbara, California, pp: 445-456.
Van den Hof, P.M.J., P.S.C. Heuberger and B. Wahlberg, 2005. Modeling and Identification with Rational Orthogonal Basis Functions. 1st Edn., Springer, London, ISBN-10: 185233956X, Pages: 397.
Wahlberg, B., 1991. System identification using Laguerre filters. IEEE Trans. Automatic Control, 36: 551-562.

Journal of Applied Sciences

Research Article
