Ensembled mechanical fault recognition system based on deep learning algorithm

Primary detection and removal of mechanical fault is vital for the recovery of mechanical and electrical equipment. The conventional mechanical fault recognition modules are not able obtain highly sensitive feature attributes for mechanical fault classification in the absence of prior knowledge. The fault diagnosis via data-driven methods have become a point of expansion with recent development in smart manufacturing and fault recognition techniques using the concept of deep learning. In this work, a combination of feature selection with Artificial Intelligence (AI) algorithm is presented for the mechanical fault recognition to deal with smart machine tools. This article proposes a CNN based fault recognition and classification framework that uses the combination of feature extraction, feature vector decomposition using Empirical Mode Decomposition (EMD) and deep neural network (DNN) for recognising the different fault states of the rotating machinery. The experimental outcomes obtained by the combination of EMD, feature selection module and Convolutional Neural Network (CNN) provides the detailed fault information by selecting the sensitive features from large number of faulty feature attributes. The proposed fault recognition and classification method performs better in terms of all the parameters yielding 99.01 % accuracy with respective cross-entropy loss of 0.325 and time complexity of 18 mins and 31 seconds. The comparative analysis is also done with other mainstream models and other state of the art methods, which reveals that the maximum improvement of 12.29 % is attained in terms of accuracy for the proposed fault recognition method. The presented method is robust in terms of reduction of network size, improvement of mechanical fault recognition, providing classification accuracy along with high fault diagnostic solution.


Introduction
The recovery of mechanical and electrical equipment relies on the early detection and removal of faults using effectively accurate diagnosis. With the development and recent advents in smart manufacturing, the fault diagnosis via data-driven methods have become a point of development. In the current scenario, a significant improvement in the technology has been witnessed in the production of mechanical products with the recent advent of artificial intelligence (AI). The AI paradigms include machine learning (ML) and neural networks based deep learning (DL) algorithms [1,2]. DL networks generally require a huge dataset for the computation of reliable outcomes. These methods re successful in the field of fault diagnosis in different rotating equipment yielding high classification accuracy comparative to the other machine health condition monitoring systems [3,4].
The detection of mechanical faults has become a serious topic of concern for various industries as the mechanical faults are likely to affect the operation, thereby leading to an increased operational cost. Also, mechanical faults when happen at one component of the machinery, can trigger the chain of reactions which may damage the other components also [5][6][7]. The vibration signal analysis is significant for monitoring the normal and abnormal operations of the mechanical system. Multiple component failure is diagnosed using the vibration analysis which is done in time as well as frequency domain. For health state monitoring and recognition, the exert system based diagnosis is being used nowadays to solve the problems of machine diagnosis. Expert system based diagnosis have five different modules which are depicted in Figure 1. The five different modules of expert system are a dynamic dataset, knowledge base, expert monitoring the fault condition and machine health monitoring, an explanation system and a user interface [6].
Over the past few decades, for the implementation of the expert system based approaches, deep learning based Convolutional Neural Networks (CNN) have been used [7][8][9]. The suitability of CNN is well established for image classification and recognition, thereby, being explored for fault recognition and analysis [10,11]. The vibration signals are periodic in nature and therefore, the data collected from such signals is not only correlated with the neighbouring data but also collected in the cycle intervals. Thus, in order to capture the fault features, these 1D signals are to be represented in 2D domain. Such analysis initially uses the 2D transformation methods, so that the signal is converted to 2D image for easy and intuitive interpretation [12]. Using the signal image, periodic features can easily be extracted using the CNN architecture, further leading to fault classification.
This work contributes in fault recognition and classification using the deep learning concept based on convolutional neural networks. The key contributions of this research are highlighted below.
-A CNN based fault recognition and classification framework is proposed in this article which uses the features attained from the frequency spectrum of the original signal to obtain the statistical feature derivatives.
-Feature vector decomposition is accomplished by exploiting EMD followed by the selection of highly sensitive features, which are combined to recognise the different fault states of the rotating machinery using the deep neural network (DNN).
-The combination of EMD and DNN extracts the robust features from the noisy vibration samples and effectively reduces the random factor effect on the machine health information extraction.
-EMD decomposes the vibration signals of the rotating machinery and extract the statistical features of the time as well as frequency domain.
The experimental outcomes obtained by the combination of EMD, feature selection module and deep learning methods provides the detailed fault information by selecting the sensitive features from large number of faulty feature attributes. The selected feature attributes are taken to the further stage for fault classification using the CNN architecture. The method presented in this article is robust in terms of reduction of network size, improvement of mechanical fault recognition, providing classification accuracy along with high fault diagnostic solution.
The rest of the article is structured as: Section 2 presents the literature review of state-of-theart methods in the field of deep learning based mechanical fault diagnosis. Section 3 includes the material and methods followed by results and analysis in Section 4. Section 5 presents the article conclusion including the future research directions for fault analysis.

Literature review
In the recent years, DL has emerged as an advent in several fields like healthcare, image processing, etc. The major benefits of DL have been seen in image recognition domain [13][14][15]. DL can accommodate large number of feature attributes which cannot be easily processed by machine learning. Various researchers have utilized DL for fault recognition.
Qi, et al. [16] presented an ensemble empirical mode decomposition method utilizing auto regressive representation. Authors in [17] used the stacked auto encoders for fault detection and classification is done using the Softmax activation function. The fault detection accuracy computed using this method is comparatively advanced than the other current methods. Authors in [18] used the convolutional sparse encoders for the fault determination by utilizing the auto encoder signals. This approach uses the support hypothesis for flexibility improvement of the system. Some of the researchers investigated the combination of correlation with the convolutional neural networks and augmented convolutional sparse autoencoders (ACSAE) and revealed the superiorities of ACSAE for the precise detection of gear fitting faults [19]. Authors considers gears as the basic building block for damage monitoring in industrial processes. There are numerous researchers who examined the faulty conditions of gears using the data analysis method [20]. Elyassami, et al. [21] proposed a scheme for gear fault classification un order to classify wheel and pinion defects from the good gears. The temporal and spectral analysis is used for the diagnosis of gear condition. A hybrid deep belief network (DBN) system was utilized by Gai et al. [22] for the diagnosis of faulty condition using the optimization based on grasshopper algorithm. Wang et al. [23] presented a novel DBN system for the extraction of liabilities in axial piston pump. This method provides a high precision rate approving the viability and productivity of DBN for fault diagnosis.
A comparative analysis on the mechanical fault diagnosis methods utilizing DL based approaches have been detailed in Table 1 along with the technique specifications and future perspective in this field.
From this literature review, it is suggested that certain drawbacks exist in the prior research on fault diagnosis of mechanical systems using vibration signals. One of the issues lies in raw data acquisition from the sensor nodes as it difficult in the current scenario due to the existence of highly complicated systems. Therefore, there is a need for more precise and accurate sensing elements and placement of sensor nodes is also important for proper investigation of data. Besides this, most of the mechanical equipment contain the signal information that is not essential for diagnosis and leads to misinterpretation of results. Feature extraction from raw signal is a tedious task. The major limitation with the manual computation lies in its dependency on the experts having specific domain knowledge about mechanical engineering.
Therefore, in order to overcome these problems, this article proposes a combination of image processing and convolutional neural networks for enhancing the discrimination between normal and faulty condition in a mechanical system. Further, in this work, the performance comparison will be done with the several other machine learning algorithms as well as deep learning methods to rationalize the efficiency and viability of the proposed approach.

Material and methods
This section presents the explanation of dataset used in this work, fault feature extraction and selection, introduction of the CNN architecture and aspect of performance parameters. Fault diagnosis is a pattern recognition problem that contain various stages like signal acquisition, feature extraction and fault classification for faulty state recognition. The block diagram of the proposed fault recognition and classification system is shown in Fig. 2.
The subsequent stages of various steps involved in the proposed fault recognition and classification system are designated in the following sub-sections.

Signal acquisition and dataset description
For the acquisition of vibration signal of rotating machinery, a massive number of training samples are considered for training the CNN model sufficiently. The experimental data was attained from accelerometers situated on a mechanical system which were sampled at 12 kHz of sampling frequency. Basically, the four types of faults are considered for experimentation; ball faults, inner race faults, outer race faults and normal machinery without fault. This experiment uses 2500 1D vibration signals which are transformed into the 2D images of size 60×40. The entire dataset contains 30000 2D images for training and 7500 testing images which are used to classify the 4 faulty conditions. The details of the dataset are provided in Table 2.

Feature extraction and selection
In this work the features are extracted through artificial neural network that is designed to combine the large number of training data to study the features of that particular data. This deep learning based concept does not only extracts the features but is also able to recognise the different states of fault for classification. In this work, EMD is also combined with the Neural Network (NN) architecture to determine the local characteristics of the non-stationary and non-linear signal. This combination of EMD and NN acts as a denoising encoder which extracts the relevant features from the non-linear and non-stationary signals of the rotating machinery for fault diagnosis. Fig. 3 provides the structure of the denoising encoder that is divided into two modules: encoding and decoding module.
This denoising encoder (EMD+NN) reduces the high dimensional input into a lower dimensional space and decoding module is responsible for restoring the signal in original form without noise interference. This combination of EMD and NN extracts the robust features from the noisy vibration samples and effectively reduces the influence of random factors on the machine health information extraction. Also, at the same time, EMD decomposition takes place to decompose the vibration signals of the rotating machinery and extracts the statistical characteristics of the time and frequency domain. The time domain features extracted using this technique are maximum intensity, absolute mean, peak to peak value, root mean square, kurtosis, skewness, impulse, shape and crest factors. Some of the frequency domain features extracted are centre of gravity and variance in frequency domain, etc. Feature selection is the dimensionality reduction step which should be carried out to obtain the optimal features out of the total features. Feature redundancy is reduces using this step and therefore it is the foremost important step which aids to the accurate faulty state classification. The algorithm used for feature selection in this work is presented in Algorithm 1.
Algorithm 1: Feature selection algorithm for faulty state recognition.
Step 1: Compute the inner distance of the features of class = using the formula expressed in Eq. (1): where indicates the number of samples, is the number of features, number of classes are indicated by , , and , are the feature of samples of th class and features of the n samples of th class respectively.
Step 2: Calculate the average of the inner distance of class of feature, expressed in Eq. (2): Step 3: Calculate the average of feature of the samples in the class , expressed in Eq. (3): Step 4: Calculate the average of inter-distance of class of the feature using Eq. (4): where , and , are the averages of features of samples in and class respectively.
Step 5: Calculate the evaluation factor for feature using Eq. (5): Now, the features are sorted according to the value of ∝ and increase the number of features subsequently. As the number of features exhibit an increase of 5, classification accuracy does not improve and then the feature selection process is stopped. The selected features till this point are the optimal features which will be taken to the further stage for fault classification.

CNN architecture for fault classification
In this article, fault classification is accomplished using the CNN architecture. Convolutional deep neural networks are better suited for image recognition tasks and are thus implemented as part of fault diagnostics. The basic CNN model depicted in Fig. 4 consists of an input layer followed by a stack of convolutional layers, several merging layers, a fully connected layer, followed by an output layer. The initial network layers are responsible for the perception of edge information from the faulty images; and the further layers of the convolutional filter are accountable for finding the features which can recognise and distinguish the faulty images with different mechanical faults. The output layers provides the classification outcomes obtained by the feature information acquired at the fully connected layers.
Convolution Layer: Convolution layer is designed to accept the input and then filter kernels are used trailed by the activation function to obtain the convolution filter outputs. This layer consists of a large set of learnable convolutional filters. For an input image is the image and is the output of convolutional operation. The convolutional layer function is expressed by Eq. (6): * , where is weight vector, is total number of nodes till the maximum number of node and * denotes the convolutional operator. After the convolution operation, Rectified Linear Unit (ReLU) is used to compute the maximum of the output obtained using the function 0, . Pooling Layer: The convolutional layers are appended with the pooling layer after the convolution operation. This is done to obtain the down sampled output. The most used pooling operator is max pooling that computes the local maximum of input features for intermediate dimensionality reduction.
Fully connected Layer: Subsequent fully connected layers are added after the series of subsequent convolution and pooling layers to facilitate the output flattening into a single vector. This layer uses the softmax activation function for computing the output and it is expressed by Eq. (7): where is exponential function of the output vector. The CNN architecture used in this work for fault recognition and analysis consist of 2 convolutional layers trailed by 2 subsequent pooling layers and a fully connected layer. The parameter considerations for convolutional as well as pooling layers are tabulated in Table 3.

Performance evaluation parameters
The Specificity (SEP.): Specificity provides the higher probability of truly indicated non-faulty class without giving the false positive outcomes. It is expressed by Eq. (10): Precision value: This value indicated the correctly or precisely obtained accurate results from the total number of positive predictions. Precision is expressed by Eq. (11): Cross entropy: The cross-entropy loss is able to provide the correct classification capabilities of a neural network in the range of 0 to 1. This value should be inclined towards (0) and the expression is given in Eq. (12): where is the number of classes, true classification is indicated by for observation , prediction probability and output is the binary indicator for precise class label classification.

Results and discussion
The experimentation comprises a MATLAB 2019a simulation environment equipped on a computer system with 8GB RAM, Intel processor, NVIDIA GB graphics card. The proposed work is carried out by initially using the time domain vibration signal generation and then the combination of EMD and NN is used to extract the 2D image features followed by the feature selection module. The 1D vibration signal is measured using acceleration sensor at various operating states and the time domain vibration signal as well as frequency domain signal spectrum is depicted in Fig. 5. This represents the time domain vibration signals as Fig. 5(a), (c) and (e) and the corresponding frequency spectrum is indicated in Fig. 5(b), (d) and (f). The original vibration signal spectrum is decomposed utilizing the EMD method and the statistical features are extracted from the input signal. This feature extraction process using NN extracts the 96 features in total which are further reduced using the feature selection procedure. This procedure further reduces the feature dimensionality and a total of 30 relevant and optimal features are selection which can provide better average accuracy performance.

Classification results of proposed fault recognition and classification system
The classification is done using the CNN architecture and the experimentation is performed utilizing the MATLAB software. The CNN classification results are obtained in terms of various performance matrices like accuracy, sensitivity, specificity, precision, cross entropy and time complexity. The performance outcomes for fault recognition are depicted in Table 4 and are graphically presented in Fig. 6. Both tabular and visual representation depicts that the favourable outcomes are achieved for the proposed fault recognition and classification system. The method achieves the performance parameters as; 99.01 % accuracy, 99.01 % sensitivity, 99.00 % specificity and 99.50 % precision value. The cross-entropy loss and time complexity are used as cost analysis parameters for the proposed framework, which should be minimized for the reliable network. The cross-entropy value of 0.325 and time complexity of 18 mins and 31 seconds is seen for the proposed fault recognition method.

Comparison with different main-stream CNN models
There are various main-stream pre-trained CNN architectures which are used by various researchers for fault and image recognition purpose. A comparative analysis of the proposed framework is done with the existing main-stream CNN models like AlexNet [36], GoogleNet [37], ResNet [38] and VggNet [39] in terms of network accuracy performance, and time complexity. The comparison is drawn in Fig. 7. This graphical analysis reveals that the proposed fault recognition system provides the most significant outcomes among all the pre-trained mainstream CNN models. The performance through network parameters is maximum for the proposed fault recognition framework. However, while comparing the time complexity, the proposed framework maintains a trade-off in this case and is a bit complex comparative to the other mainstream models.

Comparison with the state-of-the-art methods
In this work, a comparison is also done with the other state-of-the-art methods reported in the literature for fault recognition and classification employing the CNN based methodology. This comparison is drawn in Table 5. The comparison reveals that the proposed model outperforms the other state of the art methods in terms of all the performance parameters thereby, shows its competence for faulty state classification. A reliable fault recognition and classification solution is achieved using the proposed framework.

Conclusions
This article proposes a CNN based fault recognition and classification framework that uses the feature extraction process exploiting EMD followed by the selection of highly sensitive features which are combined to recognise the different fault states of the rotating machinery using the DNN. The experimental outcomes obtained by combining the EMD, feature selection and CNN provides the detailed fault information for further fault classification. The method achieves a good quality performance in terms of all the parameters yielding 99.01 % accuracy with respective cross-entropy loss of 0.325 and time complexity of 18 mins and 31 seconds. The comparative analysis is also done with other mainstream models by considering the network performance accuracy and time complexity, which reveals the trade-off among the two parameters. The performance validation with the other existing methods reveals that the maximum accuracy improvement of 12.29 % is achieved for the proposed fault recognition framework. The proposed fault recognition method is robust in terms of reduction of network size, improvement of mechanical fault recognition and provides high classification accuracy along with a viable fault diagnostic solution. Using the proposed approach for real time fault recognition classification scenario will be the key emphasis of research in the future perspective of this work.

Normal University in 2021 (2021B29).
Yubin Liu received his B.Eng. degree in Hebei Normal University in China. He is currently a Associate Professor in Tangshan Normal University. His research interest is mainly in the virtual reality technique and internet of things technology. He has published several research papers in scholarly journals in the above research areas and has participated in several conferences.
Weiying Ding received his M.D. degree in Hebei University of Technology in China, she is currently a Associate Professor in Tangshan Normal University. The main research areas are database, big data, software testing and so on, she has published more than 10 papers published in various journals.