Joint sparse representation based classification of rubimpact in rotating machinery with multiple acoustic emission sensors
Wei Peng^{1} , Jing Li^{2} , Weidong Liu^{3} , Liping Shi^{4} , Han Li^{5}
^{1, 3, 4, 5}School of Electrical and Power Engineering of China University of Mining and Technology, Xuzhou, China
^{2}National Engineering Research Center of Turbogenerator Vibration, Southeast University, Sipailou Campus, Nanjing, China
^{4}Corresponding author
Journal of Vibroengineering, Vol. 20, Issue 1, 2018, p. 762773.
https://doi.org/10.21595/jve.2017.18292
Received 28 February 2017; received in revised form 28 July 2017; accepted 11 August 2017; published 15 February 2018
JVE Conferences
Acoustic emission (AE) technique has been widely used for the classification of rubimpact in rotating machinery due to its high sensitivity, wide frequency response range and dynamic detection property. However, it is still unsatisfied to effectively classify the rubimpact in rotating machinery under complicated environment using traditional classification method tailored to a single AE sensor. Recently, motivated by the theory of compressed sensing, a sparse representation based classification (SRC) method has been successfully used in many classification applications. Moreover, when dealing with multiple measurements the joint sparse representation based classification (JSRC) method could improve the classification accuracy with the aid of employing structural complementary information from each measurement. This paper investigates the use of multiple AE sensors for the classification of rubimpact in rotating machinery based on the JSRC method. First, the cepstral coefficients of each AE sensor are extracted as the features for the rubimpact classification. Then, the extracted cepstral features of all AE sensors are concatenated as the input matrix for the JSRC based classifier. Last, the backtracking simultaneous orthogonal matching pursuit (BSOMP) algorithm is proposed to solve the JSRC problem aiming to get the rubimpact classification results. The BSOMP has the advantages of not requiring the sparsity to be known as well as deleting unreliable atoms. Experiments are carried out on realworld data sets collected from in our laboratory. The results indicate that the JSRC method with multiple AE sensors has higher rubimpact classification accuracies when compared to the SRC method with a single AE sensor and the proposed BSOMP algorithm is more flexible and it performs better than the traditional SOMP algorithm for solving the JSRC method.
Keywords: acoustic emission, backtracking simultaneous orthogonal matching pursuit, joint sparse representation based classification, multiple sensors, rotating machinery, rubimpact.
1. Introduction
Rubimpact classification is one of the most important issues in the research filed of large rotating machinery. Traditional rubimpact classification techniques use vibration signals, which have some technical defects especially in the early rubbing stage [1]. Acoustic emission (AE) technique provides a new approach for the rubimpact classification because of its unique advantages, such as high sensitivity, wide frequency response range and dynamic detection property [2]. During the past few years, plenty of methods have been proposed in order to extract robust features of the AE signal employed in the rubimpact classification in rotating machinery. Modal acoustic emission (MAE) derived from the traditional propagation theory is an effective way to express the AE signal [3]. Following the MAE theory, an analytic expression of the AE signal was given and then used as feature representation for the rubimpact classification [4]. A rubimpact classification method based on Gaussian mixture model (GMM) [5] was proposed in [6] using the cepstral coefficients of the AE signal as the input features. In [7], a new fractal dimension of the AE signal was used as the feature for the rubimpact classification, and simulation results demonstrated the effectiveness of the proposed wavelength based fractal dimension. However, it is still unsatisfied to effectively classify the rubimpact in rotating machinery under complicated environment using traditional classification method tailored to a single AE sensor.
Recently, a sophisticated classification approach based on the theory of sparse representation has been proposed in the fields of compressed sensing [8]. This sparse representation based classification (SRC) scheme represents the test sample as a linear sparse combination of the training samples and then classifies the test sample to the class which yields the minimum representative error [8]. The SRC method has been successfully used in many applications, such as face recognition [8], power system transient recognition [9] and hyperspectral image classification [10]. As an extension of the SRC method, the joint sparse representation based classification (JSRC) method has been proposed for the multiple measurements classification which uses not only the sparse property of each measurement but also the structural sparse information across the multiple measurements [11]. The JSRC method has demonstrated advantageous over the SRC method in the classification problems of multiple measurements [11], multimodality [12] and multiple features [13].
Inspired by the amazing performance of the JSCR method for the multiple measurements classification problem, in this paper we investigate the use of multiple AE sensors for the classification of rubimpact in rotating machinery with the aid of the JSRC method. We build a rubimpact test bed with multiple AE sensors in our laboratory. As far as we known, this is the first attempt to use the JSCR method for the classification of rubimpact in rotating machinery. Previous works rely on simultaneous orthogonal matching pursuit (SOMP) [1416] for solving the JSRC method. However, the SOMP algorithm should have the knowledge of the sparsity in advance which makes the algorithm less flexible. Moreover, once an atom has been wrongly selected, it will not have the chance to be deleted. Accordingly, in this paper we propose a novel algorithm called backtracking SOMP (BSOMP), by employing the backtracking strategy [17] to compensate for these shortcomings.
The rest of this paper is organized as follows. Section 2 introduces cepstral coefficients which are extracted as features of the AE signal for the rubimpact classification, and then in Section 3 we give a brief review of the SRC method. In Section 4 we first present the JSRC method for the rubimpact classification in rotating machinery with multiple AE sensors and then propose a BSOMP algorithm for solving it. Experiments on realworld data sets collected from our laboratory are carried out in Section 5, and final conclusions are given in Section 6.
2. Feature extraction
Feature extraction plays an important role for rubimpact faults classification in rotating machinery. Previous works have demonstrated the effectiveness of using the cepstral coefficients of the AE signal as the features for the rubimpact classification [6]. So, in this paper, we use the cepstral coefficients of each AE measurement to be the features for the rubimpact classification. And in this section, we will describe the method for extracting the cepstral coefficients in details.
The analytic expression of the AE signal based on the modal acoustic emission (MAE) theory was derived in [4]. However, some AE mode waves would be separated or even disappeared in time domain due to different propagation speeds and different distances from source to sensors. So, it is reasonable to classify the rubimpact in rotating machinery using the frequency domain information with the filter banks. The frequency spectrum of the AE signals mainly concentrated form 100 kHz to 300 kHz, and different frequency bins provide different contributions for the rubimpact classification. This assumption is same as the speech recognition problem, so in this paper we employ the cepstral coefficients of the AE signal as the features for the rubimpact classification which have been proven to be effective as the frequency domain features for speech recognition. The procedure for extracting the cepstral coefficients is shown in Fig. 1 and explained in steps as the following [4]:
Step 1: Transform the input AE signal $x\left(n\right)$ from the time domain to the frequency domain $X\left(\omega \right)$ using the shorttime Fourier transform (STFT):
where $w\left[n\right]$ is the window function. In this paper, we use the Hanning window.
Fig. 1. The procedure for extracting the cepstral coefficients
Step 2: Pass $X\left(\omega \right)$ through the triangular filter banks and then calculate the output spectrum energy $E\left(k\right)$ of each sub filter:
where $M$ is the number of the sub filters, ${V}_{k}\left(\omega \right)$ is the frequency response of the $k$th sub filter, ${L}_{k}$ and ${U}_{k}$ are the low frequency limit and the high frequency limit of the $k$th sub filter respectively, ${A}_{k}$ is the energy normalization factor defined as:
The center frequencies of all the sub filters are equal distributed at the logarithm scale, meanwhile the low frequency limit ${L}_{k}$ and the high frequency limit ${U}_{k}$ of the $k$th sub filter equal to the center frequency ${C}_{k1}$ and ${C}_{k+1}$ of its two adjacent sub filters at the logarithm scale, respectively. Accounting for higher attenuation of the AE signal happening in the lower frequency band, the center frequency ${C}_{k}$ of $k$th sub filter, the low frequency limit ${L}_{1}$ and the high frequency limit ${U}_{M}$ of the triangular filter banks should satisfy:
For AE signals we set ${L}_{1}=$100 kHz and ${U}_{M}=$ 300 kHz. From Eq. (4) we can get all the design parameters of the triangular filter banks.
Step 3: Following the logarithm operation and the discrete cosine transform (DCT), we can finally get the cepstral coefficients of the AE signal:
where $L$ is the desired order of the cepstral coefficients which usually ranges from 12 to 16. In this paper, we set $L=$12 as suggested in [4].
3. The sparse representation based classification (SRC) method
Recently the sparse representation based classification (SRC) method motivated by the theory of compressed sensing [1415] has been successfully used in many classification applications, such as face recognition [8], power system transient recognition [9] and hyperspectral image classification [10]. The basic idea of the SRC method is to correctly determine which class the test sample belongs to using the training samples collected from different classes with the sparse representation method [20]. This SRC scheme first constructs the dictionary using training samples, and then represents the test sample as a linear sparse combination of the dictionary atoms and finally classifies the test sample to the class which yields the minimum representative error [21]. The SRC approach is employed as a benchmark for the classification of rubimpact in rotating machinery with a single AE sensor in this paper.
In the SRC approach, all the ${n}_{i}$ training samples from the $i$th class are arranged as the columns of a subdictionary matrix ${\mathrm{{\rm A}}}_{i}=[{\mathbf{a}}_{i1},{\mathbf{a}}_{i2},\cdots ,{\mathbf{a}}_{i{n}_{i}}]\in {\mathfrak{R}}^{L\times {n}_{i}}$. We can define a dictionary $\mathbf{A}$ which includes the entire set of the training samples from all the $K$ classes given as follows:
where $n=\sum _{i=1}^{K}{n}_{i}$ is the total number of the training samples from all the $K$ classes, $L$ is the dimensionality of the cepstral coefficients extracted from its corresponding AE measurement.
The basic assumption of the SRC method is that the test sample $\mathbf{y}$ lies in the linear span of the training samples from the same class. Suppose the test sample $\mathbf{y}$ belongs to the $i$th class linearly represented by all the atoms of the dictionary:
where $\mathbf{x}=[{\mathbf{x}}_{1}^{T},{\mathbf{x}}_{2}^{T},\cdots ,{\mathbf{x}}_{K}^{T}{]}^{T}$ is the representation vector, in which ${\mathbf{x}}_{i}$ is the subrepresentation vector associated with the subdictionary ${\mathbf{A}}_{i}$. In ideal situation when $\mathbf{y}$ belongs to the $i$th class $\mathbf{x}=[0,\cdots ,{\mathbf{x}}_{i}^{T},\cdots ,0{]}^{T}$, so $\mathbf{y}$ can represent as $\mathbf{y}={\mathbf{A}}_{i}{\mathbf{x}}_{i}$, i.e., directly classified to the $i$th class. However, in general case most of the representation coefficients are quite small while only the coefficients of the subrepresentation vector ${\mathbf{x}}_{i}$ have large values. This means that the test sample $\mathbf{y}$ can be accurately classified to the $i$th class by forcing the representation vector $\mathbf{x}$ to be sparse, which leads to the l0norm minimization problem:
For practical classification of the rubimpact in rotating machinery, we should account for the noises and rewrite the l0norm minimization problem Eq. (8) as follows [22]:
However, the l0norm minimization problem Eq. (9) is NPhard which is hard to solve. In practice, problem Eq. (9) can be solved using the greedy algorithm [2326] or relaxing to its convex l1norm minimization form [27, 28]:
In this paper, we use the orthogonal matching pursuit (OMP) algorithm to solve the l0norm problem Eq. (9), the general procedure of the OMP algorithm is described in Algorithm 1 [23]:
Algorithm 1 OMP.
Input: dictionary $\mathbf{A}$, test sample $\mathbf{y}$, maximum number of iteration ${K}_{max}$, error threshold $\epsilon $.
Initialization: the residual ${\mathbf{r}}_{0}=\mathbf{y}$, the index set ${\mathrm{\Lambda}}_{0}=\mathrm{\varnothing}$, the iteration counter $i=$1.
while $\Vert {\mathbf{r}}_{i1}\Vert >\epsilon $ and $i<{K}_{max}$
1. Find the index ${\lambda}_{i}=\mathrm{a}\mathrm{r}\mathrm{g}\mathrm{m}\mathrm{a}{\mathrm{x}}_{j=1,\cdots ,n}\left\u27e8{\mathbf{r}}_{i1},\mathbf{A}(:,j)\u27e9\right$
2. Set ${\mathrm{\Lambda}}_{i}={\mathrm{\Lambda}}_{i1}\cup {\lambda}_{i}$
3. Solve the least squares problem ${\mathbf{s}}_{i}=\mathrm{a}\mathrm{r}\mathrm{g}\mathrm{m}\mathrm{i}{\mathrm{n}}_{\mathbf{s}}{\Vert \mathbf{y}\mathbf{A}(:,{\mathrm{\Lambda}}_{i})\mathbf{s}\Vert}_{2}$
4. Renew the residual ${\mathbf{r}}_{i}=\mathbf{y}\mathbf{A}(:,{\mathrm{\Lambda}}_{i}){\mathbf{s}}_{i}$
5. $i=i+1$
end while
Output: the sparse representation vector $\mathbf{x}$ equals to ${\mathbf{s}}_{i1}$ with the index in ${\mathrm{\Lambda}}_{i1}$ and 0 otherwise.
Having obtained the sparse representation vector $\mathbf{x}$, we can identify the test sample $\mathbf{y}$ to the class with the minimum representative error. The procedure of the SRC method for the classification of rubimpact in rotating machinery with a single AE sensor is described as follows:
1) Use the feature extraction method described in Section 2 to get the training samples of all the $K$ classes to form dictionary $\mathbf{A}=[{\mathbf{A}}_{1},{\mathbf{A}}_{2},\cdots ,{\mathbf{A}}_{K}]$ and then normalize the columns of $\mathbf{A}$.
2) Use the feature extraction method described in Section 2 to get the test sample $\mathbf{y}$.
3) Solve the problem Eq. (9) using the OMP algorithm and get the sparse representation vector $\mathbf{x}$.
4) Calculate the representation error of each class:
5) The class associated with the smallest representation error is classified to be the right one:
4. The joint sparse representation based classification (JSRC) method
From the classification results of previous works [4, 6, 7], we can see that it is still unsatisfied to effectively classify the rubimpact in rotating machinery under complicated environment using traditional classification method tailored to a single AE sensor. Making a classification with multiple measurements using the joint sparse representation based classification (JSRC) method has shown its advantages to improve the classification accuracy in transient acoustic signal classification [11], so in this paper we investigate using the JSRC method for the classification of rubimpact in rotating machinery with multiple AE sensors.
In this section, we first present the JSRC problem formulation for the classification of rubimpact with multiple AE sensors and introduce the SOMP algorithm [15, 16] which is used in the previous work [14] to solve the JSRC problem. Then we propose an algorithm called BSOMP by employing the backtracking strategy [17] and provide the procedure of the JSRC method for rubimpact classification.
4.1. Problem formulation
As an extension of the SRC method, the general idea of the JSRC method is the same with the SRC method with the exception that not only using the sparse property of each measurement but also the joint sparse information across the multiple measurements [11]. Suppose the number of AE sensors is $p$, then for each training sample there are $p$ measurements. So, in the JSRC method, the dictionary $\mathbf{A}$ including the entire set of the training samples from all the $K$ classes can be defined as:
where $n$ is the number of the training samples of all the $K$ classes for each measurement, each subdictionary ${\mathbf{{\rm A}}}_{i}=[{\mathbf{A}}_{i,1},{\mathbf{A}}_{i,2},\cdots ,{\mathbf{A}}_{i,K}]\in {\mathfrak{R}}^{L\times n}$ contains all training samples of all the $K$ classes from the $i$th measurement.
In the JSRC method, the test sample is represented by a matrix $\mathbf{Y}\in {\mathfrak{R}}^{L\times p}$ which could be linearly represented by the dictionary:
where $\mathbf{X}=[{\alpha}_{1},\dots ,{\alpha}_{p}]\in {\mathfrak{R}}^{pn\times p}$ is the representation coefficient matrix.
There are two assumptions for the JSRC method [14]:
First the $i$th measurement of the test sample should lie in the span of the training samples corresponding to the $i$th measurement, i.e. the representation coefficient matrix $\mathbf{X}$ should have a blockdiagonal structure with the columns have the following structure:
where $0$ denotes a zero vector in ${\mathfrak{R}}^{n}$, each subvector ${\left\{{\alpha}_{i,j}\right\}}_{j=1}^{K}$, $i=1,\cdots ,p$ lies in ${\mathfrak{R}}^{{n}_{j}}$ and ${n}_{j}$ denotes the number of the training samples for the $j$th class.
Second the coefficients of the representation coefficient matrix $\mathbf{X}$ corresponding to the $K$ measurements from the same training sample should be activated simultaneously to jointly and sparsely represent the test sample. For this assumption, we should transform the matrix $\mathbf{X}$ to the sparse representation matrix ${\mathbf{X}}^{\mathit{\text{'}}}$^{}by removing the zero coefficients of $\mathbf{X}$:
where $\circ $ denotes the matrix Hadamard product, the matrix $\mathbf{H}$ and $\mathbf{J}$ are defined as:
where $1\in {\mathfrak{R}}^{n}$ is the vector of all ones and ${\mathbf{I}}_{n}$ is the $n$dimensional identity matrix.
For the JSRC method, the matrix ${\mathbf{X}}^{\mathbf{\text{'}}}$ defined in Eq. (16) is constraint to be row wise sparse. Taking into the practical noises into consideration, this problem can be formulated as [14]:
where ${\Vert \cdot \Vert}_{l0\backslash l2}$ the l0\l2 norm is equals to the number of nonzero rows in the matrix.
However, the l0\l2 norm minimization problem Eq. (18) is also NPhard. In practice, it can be solved using the greedy algorithm [16] or relaxing to the l1\l2 norm minimization problem [29, 30]:
4.2. SOMP algorithm
Previous work [14] uses the simultaneous orthogonal matching pursuit (SOMP) algorithm [15, 16] to solve the l0\l2 norm problem Eq. (18), and the general procedure of the SOMP algorithm is described in Algorithm 2.
Algorithm 2 SOMP.
Input: dictionary $\mathbf{A}$, test sample $\mathbf{Y}$, maximum number of iteration ${S}_{max}$, error threshold$\epsilon $.
Initialization: the residual ${\mathbf{R}}_{0}=\mathbf{Y}$, the index set ${\mathrm{\Lambda}}_{0}=\mathrm{\varnothing}$, the iteration counter $k=$1.
while ${\Vert {\mathbf{R}}_{k1}\Vert}_{F}>\epsilon $ and $k<{S}_{max}$
1. Find the index ${\lambda}_{k}=\mathrm{a}\mathrm{r}\mathrm{g}\mathrm{m}\mathrm{a}{\mathrm{x}}_{i=1,\cdots ,n}\sum _{j=1}^{p}{\Vert \u27e8{\mathbf{R}}_{k1}(:,j),{\mathbf{A}}_{j,i}\u27e9\Vert}_{F}$
2. Set ${\mathrm{\Lambda}}_{k}={\mathrm{\Lambda}}_{k1}\cup {\lambda}_{k}$
3. Compute the orthogonal projector ${\mathbf{s}}_{i}^{k}=\mathrm{a}\mathrm{r}\mathrm{g}\mathrm{m}\mathrm{i}{\mathrm{n}}_{\mathbf{s}}{\Vert \mathbf{Y}(:,i){\mathbf{A}}_{i}(:,{\mathrm{\Lambda}}_{k})\mathbf{s}\Vert}_{2}$ for $i=$1,…, $p$
4. Renew the residual ${\mathbf{R}}_{k}=\mathbf{Y}\left[{\mathbf{A}}_{1}\right(:,{\mathrm{\Lambda}}_{k}){\mathbf{s}}_{1}^{k},\cdots ,{\mathbf{A}}_{p}(:,{\mathrm{\Lambda}}_{k}\left){\mathbf{s}}_{p}^{k}\right]$
5. $k=k+1$
end while
Output: the nonzero rows of the sparse representation matrix ${\mathbf{X}}^{\text{'}}$ indexed by ${\mathrm{\Lambda}}_{k1}$equal to $[{\mathbf{s}}_{1}^{k1},\cdots ,{\mathbf{s}}_{p}^{k1}]$.
4.3. BSOMP algorithm
The main drawback of the SOMP is that the sparsity should be known in advance. So crossvalidation should be used in order to obtain a better classification result which makes the algorithm less flexible. Additionally, once the atom has been selected, it will not have the chance to be deleted even if it is a wrong one. In [17] backtracking strategy is employed to improve the OMP algorithm by detecting the previous chosen atoms’ reliability and then deleting the unreliable atoms. With the aid of backtracking strategy, no prior knowledge of the sparsity is needed, so in this paper we propose a BSOMP algorithm by incorporating the backtracking strategy with SOMP. The general procedure of the BSOMP algorithm is described in Algorithm 3.
Algorithm 3 BSOMP.
Input: dictionary $\mathbf{A}$, test sample $\mathbf{Y}$, atomadding threshold ${\mu}_{1}$, atomdeleting threshold ${\mu}_{2}$, error threshold $\epsilon $.
Initialization: the residual ${\mathbf{R}}_{0}=\mathbf{Y}$, the index set ${\mathrm{\Lambda}}_{0}=\mathrm{\varnothing}$, the iteration counter $k=$1.
while ${\Vert {\mathbf{R}}_{k1}\Vert}_{F}>\epsilon $
1. Find the candidate set $\mathbf{C}$ by choosing all the indexes of atoms that satisfying $\sum _{j=1}^{p}{\Vert \u27e8{\mathbf{R}}_{k1}(:,j),{\mathbf{A}}_{j,i}\u27e9\Vert}_{F}\ge {\mu}_{1}\mathrm{m}\mathrm{a}{\mathrm{x}}_{i=1,\cdots ,n}\sum _{j=1}^{p}{\Vert \u27e8{\mathbf{R}}_{k1}(:,j),{\mathbf{A}}_{j,i}\u27e9\Vert}_{F}$
2. Compute the orthogonal projector ${\mathbf{s}}_{i}=\mathrm{a}\mathrm{r}\mathrm{g}\mathrm{m}\mathrm{i}{\mathrm{n}}_{\mathbf{s}}{\Vert \mathbf{Y}(:,i){\mathbf{A}}_{i}(:,{\mathrm{\Lambda}}_{k1}\cup C)\mathbf{s}\Vert}_{2}$ for $i=$1,…, $p$
3. Find the candidate deleting set D by choosing all the indexed of atoms that satisfying $\sqrt{{\mathbf{s}}_{1}(i{)}^{2}+\cdots {\mathbf{s}}_{p}(i{)}^{2}}\le {\mu}_{2}\mathrm{m}\mathrm{a}{\mathrm{x}}_{i\in {\mathrm{\Lambda}}_{k1}\cup C}\sqrt{{\mathbf{s}}_{1}(i{)}^{2}+\cdots {\mathbf{s}}_{p}(i{)}^{2}}$
4. Set ${\mathrm{\Lambda}}_{k}=({\mathrm{\Lambda}}_{k1}\cup C)\backslash D$
5. Compute the orthogonal projector ${\mathbf{s}}_{i}^{k}=\mathrm{a}\mathrm{r}\mathrm{g}\mathrm{m}\mathrm{i}{\mathrm{n}}_{\mathbf{s}}{\Vert \mathbf{Y}(:,i){\mathbf{A}}_{i}(:,{\mathrm{\Lambda}}_{k})\mathbf{s}\Vert}_{2}$ for $i=$1,…, $p$
6. Renew the residual ${\mathbf{R}}_{k}=\mathbf{Y}\left[{\mathbf{A}}_{1}\right(:,{\mathrm{\Lambda}}_{k}){\mathbf{s}}_{1}^{k},\cdots ,{\mathbf{A}}_{p}(:,{\mathrm{\Lambda}}_{k}\left){\mathbf{s}}_{p}^{k}\right]$
7. $k=k+1$
end while
Output: the nonzero rows of the sparse representation matrix ${\mathbf{X}}^{\mathit{\text{'}}}$ indexed by ${\mathrm{\Lambda}}_{k1}$ equal to $[{\mathbf{s}}_{1}^{k1},\cdots ,{\mathbf{s}}_{p}^{k1}]$.
4.4. The procedure of JSRC
The procedure of the JSRC method for the classification of rubimpact in rotating machinery with multiple AE sensors is described as follows:
1. Use the feature extraction method described in Section 2 to get the training samples of all the $K$ classes from $p$ measurements to construct dictionary $\mathbf{A}=[{\mathbf{A}}_{1},{\mathbf{A}}_{2},\cdots ,{\mathbf{A}}_{p}]$ and then normalize the columns of $\mathbf{A}$.
2. Use the feature extraction method described in Section 2 to get the test sample $\mathbf{Y}$.
3. Solve the problem Eq. (18) using the SOMP/BSOMP algorithm and get the sparse representation matrix ${\mathbf{X}}^{\mathbf{\text{'}}}$.
4. Calculate the representation error of each class:
5. The class associated with the smallest representation error is classified to be the right one:
5. Experiment
All the data for the experiment are collected in our laboratory. Data obtained from a rubimpact test bed for fault diagnosis of rotor are used in the paper as shown in Fig. 2(a). The test rig containing three bearings and two rubbing components can simulate two rubimpact faults at the same time through the steel arch case and the rubimpact screws in Fig. 2(a). The case is tightly fixed on the base support of the test rig by four fixed screws and drilled two rubimpact holes on the one side according to the size of the rubimpact screws. Also, the fault degree generated by rubimpact can be adjusted by screws. AE sensors with an operating range of 20 kHz to 1 MHz are marked as A, B, C and D form the left to the right respectively as shown in Fig. 2(b). The output signals from AE sensors are amplified to 40 dB. The AE signals are then passed through a bandpass filter 1 kHz200 kHz to record AE signature rising from runimpact faults. The sampling rate for acquisition of AE signal waveforms is 2MSPS.
Fig. 2. The data collection system
a) 3bearing 2span rotor system
b) AE signal collection system with 4 sensors
Three classes rubimpact events are simulated in the experiments: the nonrubimpact event sampled at common condition, the medium rubimpact event and the heavy rubimpact event. The number of the AE signals collected for each is 300, the maximum number of iteration ${K}_{max}$ and ${S}_{max}$ are all set to 15 and the error threshold $\epsilon $ is set to 1×10^{3}. The atomadding threshold and the atomdeleting threshold have wide choices to get similar good results, and here we set ${\mu}_{1}=$ 0.4 and ${\mu}_{2}=$ 0.6 as suggested in [30]. In order to demonstrate the effectiveness of the JSRC method using multiple AE sensors for the rubimpact classification, we also compare it with the SRC method using AE signal from a single AE senor. Moreover, a modified SVM classifier named concatenated SVM (CSVM) [11], concatenating the cepstral features of all AE measurements into a single vector as the input for the SVM, is also used for comparison. For each class 10 rounds 3foldcrossvalidation are used as the evaluation type and the average performances are reported in Table 1.
Table 1. The classification accuracy of the rubimpact in rotating machinery (noted as ‘%’)
Method

Non

Medium

Heavy

JSRCBSOMP

98.87

91.28

94.58

JSRCSOMP

98.76

90.06

94.16

CSVM

93.87

84.78

90.46

SRCA

95.42

86.63

92.58

SRCB

93.79

85.32

91.14

SRCC

93.56

85.28

90.79

SRCD

93.23

83.87

88.64

It is drawn from Table 1 that, the JSRC method obtains better classification results than the SRC method which demonstrates the advantage of jointly using the information from multiple AE sensors over only using the information from a single sensor or directly concatenating the information from multiple AE sensors. Moreover, the classification accuracy of the proposed JSRCBSOMP method is better than the JSRCSOMP method. For the nonrubimpact event all the methods can get good classification accuracies, while the JSRC method performances best. The classification accuracy for the medium rubimpact is lower than the heavy rubimpact, due to the fact that the medium rubimpact may be misclassified into nonrubimpact or heavy rubimpact. For the SRC method with a single AE sensor, the classification accuracy becomes lower and lower form senor A to senor D because of the distance to the AE source getting farther and farther, which is coincident with the early analysis in paper [4].
Fig. 3. The classification accuracies with various values of sparsity
a) Nonrubimpact
b) Medium rubimpact
c) Heavy rubimpact
Further, in order to show the advantages of the BSOMP algorithm, we also investigate the effects of the parameter ${K}_{max}$ and ${S}_{max}$ namely the sparsity on the classification accuracies of the JSRCSOMP and the SRC methods. The performances of two methods with the sparsity within the range {5, 10, 15, 20, 25, 30, 35, 40} are showed in Fig. 3 for all the three rubimpact events. It is drawn from Fig. 3 that, the classification performances of the JSRCSOMP and the SRC vary heavily with the sparsity which demonstrates the advantages of the proposed BSOMP algorithm. The accuracy performances of the JSRCSOMP and the SRC methods first increase with increased sparsity, and then get the leading classification accuracy around the sparsity of 15. This is the reason for which we set ${K}_{max}$ and ${S}_{max}$ to 15 for fair comparison. The classification performances decease when the sparsity goes beyond 25 which mainly because more atoms of the training dictionary from the incorrect classes are selected with the sparsity increasing thus deteriorating the classification performance. However, with the backtracking strategy the BSOMP algorithm could delete the incorrectly selected atoms, thus obtain the leading performance without the prior knowledge of the sparsity.
6. Conclusions
Inspired by the success of the joint sparse representation based classification (JSCR) method for the multiple measurements classification problem, in this paper we investigate the use of multiple acoustic emission (AE) sensors for the classification of rubimpact in rotating machinery. With the extracted cepstral coefficients of each AE sensor concatenated as the input matrix, the BSOMP algorithm is used to solve the JSRC problem to get the classification result. Experimental results demonstrate that the JSRC method with multiple AE sensors has higher rubimpact classification rate when compared to the SRC method with a single AE sensor. The proposed BSOMP algorithm is more flexible and better than the traditional SOMP algorithm for solving the JSRC method. In our future work, it is expected to employ a learned discriminative dictionary of the training data, in order to further improve the classification accuracy.
References
 Ehrich F. F. Some observations of chaotic vibration phenomena in highspeed rotor dynamics. Journal of Vibration and Acoustics, Vol. 113, Issue 1, 1991, p. 5057. [Publisher]
 Aggelis D. G. Classification of cracking mode in concrete by acoustic emission parameters. Mechanics Research Communications, Vol. 38, Issue 3, 2011, p. 153157. [Publisher]
 Dunegan H. L. Modal analysis of acoustic emission signals. Journal of Acoustic Emission, Vol. 15, 1997, p. 5361. [Search CrossRef]
 Deng A., Zhao L., Zhao Y. Recognition of acoustic emission signal based on mae and propagation theory. IEEE International Conference on Management and Service Science, 2009, p. 14. [Search CrossRef]
 Reynolds D., Rose R. C. Robust textindependent speaker identification using Gaussian mixture speaker models. IEEE Transactions on Speech and Audio Processing, Vol. 3, Issue 1, 1995, p. 7283. [Publisher]
 Deng A., Bao Y., Zhao L. Rubimpact acoustic emission signal recognition of rotating machinery based on Gaussian mixture model. Journal of Mechanical Engineering, Vol. 46, Issue 15, 2010, p. 5258. [Publisher]
 Deng A., Gao W., Bao Y., et al. Study on recognition characteristics of acoustic emission based on fractal dimension. IEEE International Conference on Embedded Software and Systems Symposia, 2008, p. 475478. [Search CrossRef]
 Wright J., Yang A. Y., Ganesh A., et al. Robust face recognition via sparse representation. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 31, Issue 2, 2009, p. 210227. [Publisher]
 Chakraborty S., Chatterjee A., Goswami S. K. A sparse representation based approach for recognition of power system transients. Engineering Applications of Artificial Intelligence, Vol. 30, 2014, p. 137144. [Publisher]
 Chen Y, Nasrabadi N M, Tran T D. Hyperspectral image classification via kernel sparse representation. IEEE Transactions on Geoscience and Remote sensing, Vol. 51, Issue 1, 2013, p. 217231. [Publisher]
 Zhang H., Zhang Y., Nasrabadi N. M., et al. JointStructuredSparsityBased Classification for MultipleMeasurement Transient Acoustic Signals. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), Vol. 42, Issue 6, 2012, p. 15861598. [Publisher]
 Shekhar S., Patel V. M., Nasrabadi N. M., et al. Joint sparse representation for robust multimodal biometrics recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 36, Issue 1, 2014, p. 113126. [Publisher]
 Zhang E., Zhang X., Liu H., et al. Fast multifeature joint sparse representation for hyperspectral image classification. IEEE Geoscience and Remote Sensing Letters, Vol. 12, Issue 7, 2015, p. 13971401. [Publisher]
 Mo X., Monga V., Bala R., et al. Adaptive sparse representations for video anomaly detection. IEEE Transactions on Circuits and Systems for Video Technology, Vol. 24, Issue 4, 2014, p. 631645. [Publisher]
 Jeon C., Monga V., Srinivas U. A Greedy Pursuit Approach to Classification Using Multitask Multivariate Sparse Representations. Technical Report PSUEETR0112, Pennsylvania State University, PA, USA, 2012. [Search CrossRef]
 Tropp J. A., Gilbert A. C., Strauss M. J. Algorithms for simultaneous sparse approximation. Part I: Greedy pursuit. Signal Processing, Vol. 86, Issue 3, 2006, p. 572588. [Publisher]
 Huang H., Makur A. Backtrackingbased matching pursuit method for sparse signal reconstruction. IEEE Signal Processing Letters, Vol. 18, Issue 7, 2011, p. 391394. [Publisher]
 Donoho D. L. Compressed sensing. IEEE Transactions on Information Theory, Vol. 52, Issue 4, 2006, p. 12891306. [Publisher]
 Candès E. J., Romberg J., Tao T. Robust uncertainty principles: Exact signal reconstruction from highly incomplete frequency information. IEEE Transactions on Information Theory, Vol. 52, Issue 2, 2006, p. 489509. [Publisher]
 Donoho D. L., Elad M. Optimally sparse representation in general (nonorthogonal) dictionaries via l1 minimization. Proceedings of the National Academy of Sciences, Vol. 100, Issue 5, 2003, p. 21972202. [Publisher]
 Wright J., Ma Y., Mairal J., et al. Sparse representation for computer vision and pattern recognition. Proceedings of the IEEE, Vol. 98, Issue 6, 2010, p. 10311044. [Publisher]
 Donoho D. L. For most large underdetermined systems of linear equations the minimal l1norm solution is also the sparsest solution. Communications on Pure and Applied Mathematics, Vol. 59, Issue 6, 2006, p. 797829. [Publisher]
 Tropp J. A., Gilbert A. C. Signal recovery from random measurements via orthogonal matching pursuit. IEEE Transactions on Information Theory, Vol. 53, Issue 12, 2007, p. 46554666. [Publisher]
 Donoho D. L., Tsaig Y., Drori I., et al. Sparse solution of underdetermined systems of linear equations by stagewise orthogonal matching pursuit. IEEE Transactions on Information Theory, Vol. 58, Issue 2, 2012, p. 10941121. [Publisher]
 Needell D., Vershynin R. Uniform uncertainty principle and signal recovery via regularized orthogonal matching pursuit. Foundations of Computational Mathematics, Vol. 9, Issue 3, 2009, p. 317334. [Publisher]
 Needell D., Tropp J. A. CoSaMP: Iterative signal recovery from incomplete and inaccurate samples. Applied and Computational Harmonic Analysis, Vol. 26, Issue 3, 2009, p. 301321. [Publisher]
 Chen S. S., Donoho D. L., Saunders M. A. Atomic decomposition by basis pursuit. SIAM Review, Vol. 43, Issue 1, 2001, p. 129159. [Publisher]
 Lu W., Vaswani N. Regularized modified BPDN for noisy sparse reconstruction with partial erroneous support and signal value knowledge. IEEE Transactions on Signal Processing, Vol. 60, Issue 1, 2012, p. 182196. [Publisher]
 Tropp J. A. Algorithms for simultaneous sparse approximation. Part II: Convex relaxation. Signal Processing, Vol. 86, Issue 3, 2006, p. 589602. [Publisher]
 Van Den Berg E., Friedlander M. P. Theoretical and empirical results for recovery from multiple measurements. IEEE Transactions on Information Theory, Vol. 56, Issue 5, 2010, p. 25162527. [Publisher]