HOME JOURNALS CONTACT

Research Journal of Information Technology

Year: 2009 | Volume: 1 | Issue: 1 | Page No.: 51-56
DOI: 10.17311/rjit.2009.51.56
Effective Video Analysis Preprocessing Algorithm Based on Rough Sets in Compressed Domain
L. Xiang -wei, Z. Ming-xin, Z. Geng-lie, Z. Ya-lin and Z. Shuang-ping

Abstract: This study is aimed to overcome the characteristic of redundant data for video analysis, the study propose a video analysis preprocessing algorithm based on Rough Sets (RS). Firstly, the representative data of video sequences is extracted. Secondly, the Information Table is constructed by extracted representative data. Finally, the Core of Information Table is achieved by making use of the attributes reduction theory of RS. As present experimental results indicate, the algorithm can get effective and scientific data to complete video analysis such as key frame extraction and shot segment. Compared to existing techniques, thr proposed algorithm enjoys following advantages. Only a subset of frames need to be considered during video analysis. The limitations of requirements for a huge amounts of memory and CPU resource is overcome.

Fulltext PDF Fulltext HTML

How to cite this article
L. Xiang -wei, Z. Ming-xin, Z. Geng-lie, Z. Ya-lin and Z. Shuang-ping, 2009. Effective Video Analysis Preprocessing Algorithm Based on Rough Sets in Compressed Domain. Research Journal of Information Technology, 1: 51-56.

Keywords: information system, Attributes reduction and core

INTRODUCTION

Advances in computer, telecommunications and internet technologies have brought huge amounts of multimedia information to a rapidly growing. More and more digital videos are made available over the internet or other storage device. Since of the characteristics of redundancy information in video data, many video analysis technologies become less available, especially in real-time system. So, video data preprocessing become an important and imperative work before video analysis.

There have been many considerable work reported on video preprocessing. All in all, the existing typical techniques can be categorised into following types: Video shot segmentation based technologies (Cooper et al., 2007; Lee et al., 2006), key frame extraction based technologies (Shuping and Xinggang, 2005; Kin-Wai Sze et al., 2005) and video summarization based technologies (Wen-Nung and Chun-Ming, 2004; Yu-Fei and Hong-Jiang, 2002). A challenging and difficult problem of above technologies is there is no efficient and scientific algorithm to processing redundant data either in uncompressed domain or compressed domain.

Rough Set (RS) is a novel and powerful tool for data analysis. It has successfully been used in many application domains, such as machine learning, expert system and pattern classification (Wojciecn et al., 2008; Neil and Qiang, 2008). The main advantage of rough sets is that it does not need any preliminary or additional information about data, like probability in statistics, or basic probability assignment in Dempster-Shafer theory and grade of membership or the value of possibility in fuzzy sets. Especially, the attributes reduction theory has widely used in many application domain (Changzhong et al., 2008; Yiyu and Yan, 2008). In this study, by using the advantage of RS theory , a RS based video analysis preprocessing algorithm is developed.

ROUGH SETS THEORY

Basic Concepts of Rough Sets
An Information System is a 4-tuple S = (U, A, V, F), where U non-empty finite set of objects, A is a non-empty finite set of attributes, V is the union of attributes domains.

Let U ≠φ be a universe of discourse and X be a subset of U. An equivalence relation, R, classifies U into a set of subsets U/R = {X1, X2,...Xn} in which the following conditions are satisfied:

Xi→U, Xi ≠φ for any i
Xi∩ Xj ≠φ for any i, j
Ui=1, 2,....,n, Xi = U

Any subset Xi, which called a category, class or granule, represents an equivalence class of R. A category in R containing an object x∈U is denoted by [x]R. For a family of equivalence relations P⊆R, an indiscernibility relation over P is denoted by IND (P) and is defined by Eq. 1:

(1)

The set X can be divided according to the basic sets of R, namely a lower approximation set and upper approximation set. Approximation is used to represent the roughness of the knowledge. Suppose a set X⊆U represents a vague concept, then the R-lower and R-upper approximations of X are defined by Eq. 2 and 3.

(2)

Equation 3 is the subset of X, such that X belongs to X in R, is the lower approximation of X.

(3)

Equation 3 is the subsets of all X that possibly belong to X in R, thereby meaning that X may or may not belong to X in R and the upper approximation contains sets that are possibly included in X. R-positive, R-negative and R-boundary regions of X are defined respectively by Eq. 4, 5 and 6.

(4)

(5)

(6)

The positive region includes all objects in U which can be classified into the equivalence classes determined by indiscerniblility relation.

Attributes Reduction and Core

In RS theory, an Information Table is used for describing the object of universe, it consists of two dimension, each row is an object, and each column is an attribute. RS classifies the attributes into two types according to their roles for Information Table: Core attributes and redundant attributes. Here, the minimum condition attribute set can be received, which is called reduction. One Information Table might have several different reductions simultaneously. The intersection of the reductions is the Core of the Information Table and the Core attribute are the important attribute that influences attribute classification.

A subset B of a set of attributes C is a reduction of C with respect to R if and only if:

POSB (R) = POSC (R)
POSB-{a} (R) ≠ POSC (R), for any aεb

And the Core can be defined by Eq. 7:

(7)

THE PROCEDURE OF DATA PREPROCESSING

Acquisition of Raw Data
Raw video data come from two aspects, the first is come from uncompressed domain, i.e., the frame in original video sequences. In this case, each video frame can be represented by the three primary colors of RGB. The second is come from compressed domain. In this case, the data is come from I, P and B frame. To simplify processing model, we only consider the I frame, i.e., DCT coefficients and DC coefficients. The DCT coefficients and DC coefficients can be acquisited easily by Eq. 8 and 9.

(8)

(9)

Equation 8 shows the DCT coefficients can be extracted from I frame of each video sequences. And Eq. 9 shows that by eliminating the AC coefficients of each block, the DC coefficients can be extracted.

The Construction of Preprocessing Model

According to MPEG video compressed standard, DC coefficients represent the major visual information of video sequences. As such, DC coefficients can satisfy the requirements of video analysis sufficiently. We have got the DC coefficients of each frame, so we can construct an Information System with it. Each row is a DC coefficient, and each column is the frame. This process can be described as following Eq. 10.

(10)

where, U is sets, denotes all the object of Information System, A is also a sets, denotes all attributes in Information System, V is the sets of attributes value, f is a function denotes the relations between objects and attributes.

By using above process, we can get Information System table as Table 1. In Table 1, each row represent the DC coefficients of block in various video frame, consequently, this table contained the first and most important information of video sequences.

Table 1: Information system table constructed using DC coefficients

Table 2: The reduced information system

Reduced Sets and Core
The attributes in the Information Table can be divided into two types by their roles: Core attributes and redundant attributes. From Table 1, we can show that the frame 2 and frame 8 can be reduced, the reduced Information System is showed as Table 2. If we introduce a threshold, more attributes can be reduced.

By attributes reduction, we can get a succinct and informative Information Table, this Table 2 contain major visual information of video sequences, so it is sufficient to utilize the table to video analysis.

The Description of Algorithm

EXPERIMENTAL RESULTS

Various MPEG video sequences are selected to examine the performance of proposed algorithm. Figure 1 illustrates some key frames extracted from shots using proposed algorithm. Figure 2 show some shots extracted from experimental video sequences using our proposed algorithm. We can see obviously that the data can complete the video analysis efficiently. Table 3 and 4 show the comparison results.

Fig. 1: Some key frames extraxted from shots using proposed algorithm

Fig. 2: Shots extracted from experimental video sequences using our proposed algorithm

Table 3: Comparison results of shot boundary detection by various video sequences

Table 4: Comparison result of key frame extraction by various video sequences

With above comparison and evaluation, we can see obviously that preprocessed data can satisfy the video analysis for key frame extraction and shot segmentation, while the amounts of the frames and the data of each frame are reduced dramatically. As such, the efficiency is improved.

CONCLUSION

As the characteristics of video stream data, the first and most difficult task in video analysis is how to process redundant data. In this study, we propose a effective and efficient video analysis preprocessing algorithm based on rough sets in compressed domain. The algorithm introduce concepts of rough sets theory, the amount of data is reduced dramatically and scientifically. The experimental results prove that proposed algorithm can process video such as key frame extraction or shots segment without considering full data.

ACKNOWLEDGMENT

The study is supported by Gansu Natural Science Foundation of China under Grant (No. 3ZS051-A25-047).

REFERENCES

  • Wang, C., C. Wu and D. Chen, 2008. A systematic studey on attribute reduction with rough sets based general binary relations. Inform. Sci., 178: 2237-2261.
    CrossRef    Direct Link    


  • Sze, K.W., K.M. Lam and G. Qiu, 2005. A new key frame representation for video segment retrieval. IEEE Trans. Circuits Syst. Video Technol., 15: 1148-1155.
    CrossRef    


  • Lee, M.H., H.W. Yoo and D.S. Jang, 2006. Video scene change detection using neural network: Improved ART2. Exp. Syst. Appl., 31: 13-25.
    CrossRef    Direct Link    


  • Cooper, M., T. Liu and E. Rieffel, 2007. Video segmentation via temporal pattern classification. IEEE. Trans. Multimedia, 9: 610-619.
    CrossRef    Direct Link    


  • Mac-Parthalain, N. and S. Qiang, 2008. Exploring the boundary region of tolerance rough sets for feature selection. Pattern Recognit. (In Press).
    CrossRef    


  • Lie, W.N. and C.M. Lai, 2004. News video summarization based on spatial and motion feature alalysis. Proceedings of the 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, LNCS 3332, November 30-December 3, 2004, Springer, Berlin/Heidelberg, pp: 246-255.


  • Kotłowski, W., K. Dembczyński, S. Greco and R. Słowiński, 2008. Stochastic dominance-based rough set model for ordinal classification. Inform. Sci., 178: 4019-4037.
    CrossRef    Direct Link    


  • Shuping, Y. and L. Xinggang, 2005. Key frame extraction using unsupervised clustering based on a statistical model. Tsinghua Sci. Technol., 10: 169-172.
    CrossRef    Direct Link    


  • Ma, Y.F. and H.J. Zhang, 2002. A model of motion attention for video skimming. Proceeding of the International Conference on Image Process, Volume 1, December 10, 2002, USA., pp: I-129-I-132.


  • Yao, Y. and Y. Zhao, 2008. Attribute reduction in decision-theoretic rough set models. Inform. Sci., 178: 3356-3373.
    CrossRef    Direct Link    

  • © Science Alert. All Rights Reserved