ABSTRACT
This study addresses new face recognition method based on Principal Component Analysis (PCA) and Directional Filter Bank (DFB) responses. Our method consists of two parts. One is the creation of directional images using DFB from the original face image. The other is transforming the directional images into eigenspace by PCA, which is able to optimally classify individual facial representations. PCA analysis is primarily used as a dimensionality reduction technique with least consideration to the recognition aspect. The basic idea of combining PCA and DFB is to provide PCA with some recognition ability. In our system recognition ability of the PCA is enhanced by providing directional images as inputs. The experiment results showed the remarkable improvement of recognition rate of 21.25% in Olivetti data set.
PDF Abstract XML References Citation
How to cite this article
DOI: 10.3923/itj.2004.290.295
URL: https://scialert.net/abstract/?doi=itj.2004.290.295
INTRODUCTION
Face recognition is largely motivated by the need for surveillance and security, telecommunication and digital library, human-computer intelligent interaction and smart environments[1-4]. The face recognition primarily based on the understanding how people process and recognize each others face and the development of corresponding computational modal for automated face recognition.
Developing a computational modal for the recognition of natural objects such as human faces is quite difficult, because they are complex and multidimensional. The general approach is to start with a given set of features and then attempts to derive an optimal subset (under some criteria) of features leading to high recognition performance. Principal Component Analysis (PCA) is a popular technique used to derive a starting set of features for face recognition. Turk and Pentland[5] develop a well-known PCA-based face recognition method, referred to as Eigenfaces method. More recently, a principal component analysis of imagery has also been applied for robust target detection[6,7], nonlinear image interpolation[8], visual learning for object recognition[9,10] and visual servoing for robotics[11].
However, PCA analysis was used primarily as a dimensionality reduction technique and did not the recognition aspect. This is due to the fact that PCA is based on the optimal representation of the data in the sense of mean-square error. One way to improve the PCA stand-alone recognition performance, one needs to combine further this optimal representation criterion with some discrimination criterion.
One widely used discrimination criterion in the face recognition community is the Fisher Linear Discriminant (FLD, a.k.a. Linear Discriminant Analysis, or LDA)[12], which defines a projection that makes the within-class scatter small and the between-class scatter large. As a result, FLD derives compact and well-separated clusters. FLD is behind several face recognition methods[13,14]. As the original image space is high dimensional, most of these methods apply PCA first for dimensionality reduction, as it is the case with the Fisherfaces method due to Belhumeur et al.[14]. Subsequent FLD transformation is used then to build the Most Discriminating Features (MDF) space for classification[13]. The drawback of FLD is that it requires large training sample size for good generalization.
In contrast, one can provide a preprocessing step that outputs discriminating feature classes.
![]() | |
Fig. 1: | Regions used by (a) the DFB (b) the Gabor filter bank-based method for feature extraction |
Then PCA can be used with each class to reduce the dimensionality. This view is described by Ki-chung et al.[15], where the Gabor filter responses are used as input vectors to the PCA. It is mentioned that Gabor filter responses works well with PCA and results in a system that is less sensitive to the rotation and illumination with improved classification. However, as described by Chul-Hyun et al.[16], Gabor filter bank has overlapping and missing subband regions the DFB, on the other hand, is a contiguous subband representation as shown in Fig. 1. Accordingly, a DFB can represent linear patterns, as found around eyes, nose and mouth area, more effectively, than a Gabor filter bank. The positive effect, in case of Gabor filter bank, of extracting special frequency and directional feature and suppressing noise component is much offset by the negligence of useful information existing outside the specified frequency range. This study represents Directional Filter Bank (DFB) as a preprocessing step to provide directional discriminating feature spaces. DFB based directional analysis has played a major role in wavelet image denoising[17], fingerprint image enhancement[18], fingerprint image enhancement in a binary domain[19]. In this study, DFB effectively decomposes the face image into eight directional images and each directional image contains directional features associated with a particular direction. This preprocessing can be regarded as a discrimination process. Then PCA is used with each directional image in isolation. Finally, the PCA outputs based on each directional image is analyzed for recognition purpose.
Creation of directional images: It has two categories as follows:
Directional filter design: The directional analysis employed in this paper decomposes the spectral region of a given image into wedge-shaped passband regions. It is easily shown that these wedge-shaped regions correspond to directional components of an image. The filters related to wedge-shaped regions are commonly referred to as fan filters.
The schematic diagram of DFB structure is in the form of a tree with two-band splits at the end of each stage (Fig. 2) where each split increases the angular resolution by the factor of two. The first stage employs the complementary hour-glass filters. The filters for next two stages are obtained by linear transformation of the first stage hour-glass filters. For implementing linear transformation, the uni-modular matrices M and R are utilized. The rules for the selection of these matrices are presented by Park et al.[20]. Once the filters for each stage are implemented, they can be combined on branch by branch basis to get the required fan filters as shown at the end of third stage in Fig. 2.
![]() |
One important difference between present proposed structure and DFB structure presented[20], is the absence of decimator. It was pointed out that if sub-bands need to be processed for directional energy estimates, the decimation present in the conventional filter bank structure poses problem[21]. This means that two samples located at the same spatial index (n1, n2) in two different sub-bands I and j will not necessarily correspond to same spatial region in the original image. This problem was circumvented by employing nearest neighborhood or bilinear interpolation to make all sub-bands of the same size[21]. However, in the present structure, decimators at each stage are taken out and filters are designed by linear transformation in the frequency domain to get fan filters. Furthermore, to avoid ringing artifact in the output, ideal fan filters are avoided by employing non-ideal hour-glass filters using an FIR lowpass filter.
Directional images: Directional images are obtained by applying all directional filters constructed as above. Eight directional images were obtained (Fig. 3). These directional images can be regarded as decomposition of the original image in eight pieces based on direction. Directional images contain features associated with global directions rather than local directions.
![]() | |
Fig. 2: | Directional filter bank schematic diagram |
![]() | |
Fig. 3: | Creation of directional image: a) Directional image 1, b) Directional image 2, c) Directional image 3, d) Directional image 4, e) Directional image 5, f) Directional image 6, g) Directional image 7, h) Directional image 8 |
By creating directional images, noise of the original image was divided into eight different directions, thus reducing noise energy eight times.
Principal Component Analysis (PCA): PCA generates a set of orthonormal basis vectors, known as principal components (PCS) that maximize the scatter of all projected samples. Let X= [X1, X2, , Xn] be the sample set of the original images. After normalizing the images to unity norm and subtracting grand mean a new image set Y= [Y1, Y2, , Yn] is derived. Where, each
![]() |
The covariance matrix of normalized image is defined as:
![]() | (1) |
and the eigenvector and eigenvalue matrices Φ, Λ are computed as:
![]() | (2) |
Note that YYt is an NxN matrix while YtY is an nxn matrix. If the sample size n is much smaller than the dimensionality N, then the following method saves some computation[5]:
![]() | (3) |
![]() | (4) |
where, Λ1 = diag (λ1, λ2, , λn) and δ = [Y1, Y2, , Yn]. If one assumes that eigenvalues are sorted in decreasing order, λ1≥λ2≥ ≥λn, then the first m leading eigenvectors define matrix P
![]() | (5) |
The new feature set Z with lower dimensionality m (m << N) is then computed as
![]() | (6) |
Proposed system: In proposed system the directional features will be used as the training set images of PCA and recognition process is performed on these images. Block diagram of proposed system is shown in Fig. 4.
The steps involved in the proposed system are as follows:
1. | Obtain face images I1, I2, , IM of Olivetti face dataset shown in Fig. 5. This data set is composed of 400 images from 40 galleries which is constructed under various depth and plane rotations. Dataset is composed of 10 face images per a person so we have defined one as a training set and the other nine sets as testing sets in turn. We have passed whole dataset through DFB and got eight directional images for each dataset image. Then we created eight new datasets by combining same directions of all the images. After that PCA is performed on each directional dataset. |
2. | Constructed 10304x40 matrix where the column vector Vi was the data of training image. |
3. | Compute the average face vector AVG, |
![]() | |
4. | Subtract the mean face, |
di = Vi-AVG | |
5. | Compute the covariance matrix: |
![]() |
![]() | |
Fig. 4: | Block diagram of proposed system |
![]() | |
Fig. 5: | Face images of Olivetti dataset |
where, A = [d1, d2, , dM]. AAt is a very large matrix so Turk and Pentland suggested[5] a way to reduce it. They proposed that instead of AAt, AtA can be used.
6. | Compute the M eigenvectors ui of AtA. |
7. | Keep only K eigenvectors (corresponding to the K largest eigenvalues). |
8. | Calculate weights of the training images by projecting mean subtracted images on the eigenvectors. |
9. | For recognition of any image first calculate weights for this image and then compare it with the weights of the training image. |
![]() | |
Fig. 6: | Comparison of PCA, DFB-PCA and Gabor filter bank-PCA |
![]() | |
Fig. 7: | Bars showing contribution of various directional images for PCA |
Simulation results: Figure 6 shows three curves comparing PCA, DFB-PCA, Gabor Filter Bank-PCA (GFB-PCA). We see that Gabor filter bank-PCA performs well when the number of eigenfaces were less than and equal to five but it is overshadowed by DFB-PCA as the number of eigenfaces are increased. We can confirm the remarkable improvement of recognition rate of 21.25% whereas Gabor filter bank-PCA gives 11% recognition rate against PCA when tested on Olivetti dataset. Further, this study investigated the question of which directional image is most effective input for PCA. The Fig. 7 shows a bar diagram of various directional images. It can be seen that direction 8 is providing 66% recognition rate. This confirms the observation that directional image 8 is providing the optimal discriminating features.
CONCLUSION
This study, presented the face recognition system that combines Directional Filter Bank (DFB) and PCA. Because face images are very sensitive to illumination and pose variation, we anticipated that the drawback could be overcome by using DFB responses as an input of PCA. The experiment results were reasonably acceptable i.e. the directional images of DFB based on principal component vectors were successful in classification and discrimination.
REFERENCES
- Chellappa, R., C.L. Wilson and S. Sirohey, 1995. Human and machine recognition of faces: A survey. Proc. IEEE, 83: 705-741.
CrossRefDirect Link - Daugman, J., 1997. Face and gesture recognition: Overview. IEEE Trans. Pattern Anal. Mach. Intel., 19: 675-676.
Direct Link - Pentland, A., 2000. Looking at people: Sensing for ubiquitous and wearable computing. IEEE Trans. Pattern Anal. Mach. Intel., 22: 107-119.
Direct Link - Turk, M. and A. Pentland, 1991. Eigenfaces for recognition. J. Cognit. Neurosci., 3: 71-86.
CrossRefDirect Link - Burl, M.C., U.M. Fayyad, P. Perona, P. Smyth and M.P. Burl, 1994. Automating the hunt for volcanos on venus. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Jun. 21-23, Seattle, WA., USA., pp: 302-309.
CrossRef - Pentland, A., B. Moghaddam and T. Starner, 1994. Viewbased and modular eigenspaces for face recognition. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, June 21-23, 1994, Seattle, WA., USA., pp: 84-91.
CrossRef - Murase, H. and S.K. Nayar, 1995. Visual learning and recognition of 3-d objects from appearance. Int. J. Comput. Vision, 14: 5-24.
CrossRefDirect Link - Murase, H., S.K. Nayar and S.A. Nene, 1994. General learning algorithm for robot vision. Proc. SPIE, 2304: 10-17.
Direct Link - Belhumeur, P.N., J.P. Hespanha and D.J. Kriegman, 1997. Eigenfaces vs. fisherfaces: Recognition using class specific linear projection. IEEE Trans. Pattern Anal. Mach. Intell., 19: 711-720.
CrossRefDirect Link - Chul-Hyun, P., L. Joon-Jae, M.J.T. Smith, P. Sang-il and P. Kil-Houm, 2004. Directional filter bank-based fingerprint feature extraction and matching. IEEE Trans. Circuits Syst. Video Technol., 14: 74-85.
Direct Link - Rosiles, J.G. and M.J.T. Smith, 2000. Image denoising using directional filter banks. Proceedings of the International Conference on Image Processing, Sept. 10-13, Vancouver, BC., Canada, pp: 292-295.
CrossRef - Randolph, J.R. and M.J.T. Smith, 2001. Fingerprint image enhancement using a binary angular representation. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, May 7-11, Salt Lake City, UT., USA., pp: 1561-1564.
CrossRef - Park, S., M.J.T. Smith and R.M. Mersereau, 1999. A new directional filter bank for image analysis and classification. IEEE Int. Conf. Acoustics Speech Signal Process., 3: 1417-1420.
CrossRefDirect Link