HOME JOURNALS CONTACT

Trends in Applied Sciences Research

Year: 2015 | Volume: 10 | Issue: 1 | Page No.: 68-76
DOI: 10.17311/tasr.2015.68.76
A Novel Approach for Arbitrary-Shape ROI Compression of Medical Images Using Principal Component Analysis (PCA)
Lim Sin Ting, David Yap Fook Weng and Nurulfajar Bin Abdul Manap

Abstract: PCA-based image compression is inherently limited by its matrix form and thus being restricted to work on block information only. In this study a novel approach is proposed to apply PCA technique on arbitrary shape ROI instead of rectangular or square ROI only. Based on factorization, this method successfully compresses arbitrary-shape ROI of an MRI brain image with different compression ratios using PCA. Simulation results show that no visible distortion were apparent on test image for total compression ratio as high as 80%.

Fulltext PDF Fulltext HTML

How to cite this article
Lim Sin Ting, David Yap Fook Weng and Nurulfajar Bin Abdul Manap, 2015. A Novel Approach for Arbitrary-Shape ROI Compression of Medical Images Using Principal Component Analysis (PCA). Trends in Applied Sciences Research, 10: 68-76.

Keywords: medical image compression, Principal component analysis and medical image compression

INTRODUCTION

For over the past 30 years, medical imaging technologies offer the invaluable means for physicians in disease detection, diagnosis and treatment planning. To ensure highest accuracy, medical images produced by the imaging modalities are in high resolution, typically in the size of Mbytes. Hence, the large storage space occupied by the images raises the need to compress the images with high compression ratio, at the same time, preserving the diagnostic value of the images especially in some diagnostically important areas, called Regions of Interest (ROIs).

There are few compression schemes for instance the MAXSHIFT, the EZW-based and the ROI-VQ methods that have incorporated ROI in the algorithms, allowing ROI to be compressed at different quality levels (Doukas and Maglogiannis, 2007). However, only a few of the algorithms support arbitrary ROI coding. Principal Component Analysis (PCA) is a data reduction techniques used in image recognition and compression (Costa and Fiori, 2001). Its purpose is to reduce the high dimensionality of the data space to a smaller dimensionality, resulting in a smaller set of uncorrelated variables. Taur and Tao (1996) was the first to propose a PCA coding scheme that incorporated ROI. In their study, the images were partitioned into blocks with means and variances being computed within the blocks. ROI were distinguished and compressed according to the class belonged by the blocks. In this study, ROI segmented based on mean thresholding yield oversegmentation or undersegmentation problems and may not reflect the real boundary of the ROI. Similarly, abdomen CT images were divided into blocks and ROI were segmented based on mathematical morphology (Gokturk et al., 2001). In the mentioned studies, PCA were applied only on block information because standard PCA can only define for matrices.

In this study, the first attempt to apply PCA technique on arbitrary shape ROI instead of block information only was proposed. The new algorithm will serve to faithfully extract the ROI desired by the user and compress the selected ROI with different degree of compression ratio.

MATERIALS AND METHODS

Simulation of the proposed method was done using MATLAB R2013a Image Processing Toolbox on a Dual Core 2.6 GHz PC. The test image, as shown in Fig. 1, is an MRI brain image obtained from the Osirix public dataset, with a size of 512x512. The test image will then undergo series of operation as summarized in Fig. 2.

Manual segmentation: To test the robustness of the proposed system, the ROI of the image was interactively selected by the user using imfreehand. By placing the mouse over the image, the user is free to outline the contour of the desired arbitrary ROI, regardless of any shape. A mask for the ROI is created to locate and extract the pixel values contained within the ROI.

ROI reshaping: Prior to applying PCA, the segmented ROI will be reshaped to form a matrix. Initially, the arbitrary ROI will first be converted into a row, resulting in a vector of 1xk, as shown in Eq. 1:

(1)

where, k represents the total number of pixel values contained within the ROI. With the purpose of constructing a matrix from a row vector, the divisors of k were found using factorization.

Fig. 1:A slice of brain image (IM-0001-0011) obtained from Osirix brain dataset, BRAINIX

Fig. 2:Flowchart of the proposed method in compressing arbitrary ROI using PCA

However, to have a standardize algorithm in this study, there is only one divisor to be used. Assume that the total number of divisors of k is found to be t, the divisor will be selected based on the following statements given d(0)≤d(t/2)≤d(t):

•  If t is even, then d is selected from position (t/2)
If t is odd, then d is selected from position ((t+1)/2)

The formulated matrix that carries the ROI information by now has the size of (dxk/d) that is ready to be compressed using PCA algorithm.

PCA algorithm: PCA is a sophisticated statistical tool that linearly transforms a large set of correlated variables into a lower dimension of uncorrelated variables. In pattern recognition, PCA is used to reveal hidden trends and patterns from complex data sets. Image compression can then be achieved by removing variables with smaller eigenvalues or lower data significance (Pandey et al., 2011). PCA has been developed based on Eigen Decomposition (ED), Singular Value Decomposition (SVD) and neural networks. In this study, PCA based on ED is used because of its simplicity and its comparable compression performance with SVD and neural network for single image computation (Kumar et al., 2008). Assume now that the ROI data, g(x,y) has a size of mxn with its pixels values represent the gray level at each coordinates, its expression can be defined as:

(2)

To apply PCA by ED, the first step required is to compute the mean from each of the data dimension (n):

(3)

To prepare a dataset with zero mean, ROI data is subtracted with the mean computed above, resulting in mean ROI data :

(4)

Next, covariance matrix of s determined so that the eigenvectors and eigenvalues for the ROI data can be found. The value of covariance matrix intuitively conveys the relationships between the dimensions. A positive non-diagonal element in the covariance matrix indicates a positive correlation between the variables and vice versa. The resulting covariance matrix is a square matrix. Given a matrix with size mxn, the computed covariance matrix size will be nxn. The covariance matrix can be expressed as:

(5)

The interesting part of PCA lies in its ability to linear transform the data to a new coordinate system by the manipulation of the eigenvectors and eigenvalues. An eigenvector is a transformation matrix when it is multiplied with a matrix, the resulting matrix is the multiple of the eigenvector itself. It should be noted that eigenvectors can only be found from square matrix but not every square matrix has eigenvectors. There will be n eigenvectors for nxn matrix that does have eigenvectors and all eigenvectors for a matrix are orthogonal to each other. Eigenvalues indicate by how much the matrix is being scaled by their corresponding eigenvectors such that the higher the eigenvalue, the more significant of the eigenvector in representing the data dimensions. The eigenvalues with their corresponding eigenvectors are then sorted based on their significance as shown in Table 1.

Once obtaining the eigenvalue-eigenvector pairs, data compression is achieved by only selecting the first p eigenvectors and excluding the least important eigenvectors. The resulting orthogonal matrix V contains only the chosen p eigenvectors:

(6)


Table 1:Eigenvalue-eigenvector pairs storage in a specific way as λ1≥λ2≥,..., λn

In this case, the compression rate and image quality is dependent on the number of eigenvectors that are being selected. The compressed data is finally achieved by taking the transpose of V and multiply it with the transposed mean ROI data.

(7)

(8)

At this stage, the dimensionality of the ROI data has been reduced from n to p columns. The compression rate of the ROI is therefore defined as:

(9)

The ROI data is now being compressed but it will not serve meaningful purpose without transforming it back to the original axes. The reconstructed ROI can be obtained by using Eq. 10:

(10)

(11)

Reform PCA-data: Subsequently, the reconstructed ROI is reformed back into its previous arbitrary shape by replacing the original pixels with the ‘new’ pixels as shown in Fig. 3. The performance of the proposed method will be evaluated based on compression ratio and Peak Signal Noise Ratio (PSNR).

Fig. 3(a-c):
(a) Arbitrary ROI selected, (b) Mask created to mark the position of the ROI and (c) Masked pixels replaced by reconstructed data (shaded area)

The total compression ratio for an image is calculated based on the total pixels encompassed by the arbitrary ROI:

(12)

PSNR is an objective indicator for image quality and it is based on the Mean Square Error (MSE):

(13)

(14)

Where:
f = Pixel value of the original image
g = Pixel value of the compressed image
m = Number of rows of the images and i represents the index of that row
n = Number of columns of the image and j represents the index of that column

RESULTS AND DISCUSSION

The ROI drawn in this study is shown in Fig. 4. The ROI comprises a total of 4270 pixels and all the divisors of 4270 were found to be 1, 2, 5, 7, 10, 14, 35, 61, 70, 122, 305, 427, 610, 854, 2135 and 4270. According to the selection criterion that have defined earlier, 61, being situated mid-way of the divisors, is chosen as the divisor and the reshaped ROI thus has the matrix size of 61x70. The ROI was then compressed at CR of 50, 60, 70, 80 and 90%. As seen from Fig. 5, no distortion is visible on the reconstructed image at CR up to 80% while ROI was slightly blurred out on the image at CR of 90%. This finding is consistent with the objective evaluation of the image quality as shown in Table 2 in which PSNR value decreases when higher CR is applied to the ROI. However, the algorithm failed when the total number of pixels in a ROI is a prime number because the divisors of a prime number are only 1 and the prime number itself. This limitation of the algorithm will be addressed an overcome in future study. Based on the results presented, the proposed method supersedes the existing PCA algorithm (Taur and Tao, 1996; Gokturk et al., 2001) by incorporating an arbitrary ROI that suit the need of each users. This study provides insights into the possibility of performing partial compression for image compression efficiency. Future study will also take in the factor of ROI size and the evaluation of the expert panels.

Fig. 4:Arbitrary ROI drawn by holding the mouse and the contour is shown in blue line


Fig. 5(a-e): Comparison of output images obtained with compression rate of (a) 50, (b) 60, (c) 70, (d) 80 and (e) 90%

Table 2:Simulation results achieved by the proposed method

CONCLUSION

This study took the initiative to present a method that applies PCA compression on arbitrary ROI. The initiative of developing the algorithm comes from having a compression scheme that compresses the ROI in various shapes, without restricting it to block information as what is being adopted by previous research. Normal or abnormal features exhibited on medical images are in arbitrary shapes and inclusion of only the exact contour of the ROI will not only improve the compression ratio but also increase the efficiency of the algorithm in preserving the diagnostically important area.

ACKNOWLEDGMENT

This study was supported in part by grant number FRGS/2012/FKEKK/TK02/03/1/F00127 and PRGS/2012/TK02/FKEKK/03/1/T00002 from the Ministry of Higher Education of Malaysia.

http://www.osirix-viewer.com/datasets

REFERENCES

  • Doukas, C. and I. Maglogiannis, 2007. Region of interest coding techniques for medical image compression. IEEE Eng. Med. Biol. Mag., 26: 29-35.
    CrossRef    


  • Costa, S. and S. Fiori, 2001. Image compression using principal component neural networks. Image Vision Comput., 19: 649-668.
    CrossRef    Direct Link    


  • Taur, J.S. and C.W. Tao, 1996. Medical image compression using principal component analysis. Proceedings of the International Conference on Image Processing, Volume 1, September 16-19, 1996, Lausanne, Switzerland, pp: 903-906.


  • Gokturk, S.B., C. Tomasi, B. Girod and C. Beaulieu, 2001. Medical image compression based on region of interest, with application to colon CT images. Proceedings of the 23rd Annual International Conference of the Engineering in Medicine and Biology Society, Volume 3, October 25-28, 2001, Istanbul, Turkey, pp: 2453-2456.


  • Pandey, P.K., Y. Singh and S. Tripathi, 2011. Image processing using principal component analysis. Int. J. Comput. Applic., 15: 37-40.
    CrossRef    


  • Kumar, D., C.S. Rai and S. Kumar, 2008. Principal component analysis for data compression and face recognition. J. Comput. Sci., 7: 48-59.

  • © Science Alert. All Rights Reserved