Hongying Meng | Brunel University (original) (raw)
Papers by Hongying Meng
Journal of Biomedical Engineering and Biosciences
Neurocomputing, Jun 1, 2022
IEEE Transactions on Biomedical Engineering, Feb 1, 2006
Clinical electromyography (EMG) interference pattern (IP) signals can reveal more diagnostic info... more Clinical electromyography (EMG) interference pattern (IP) signals can reveal more diagnostic information than their constituents, the motor unit action potentials (MUAPs). Singularities and irregular structures typically characterize the mathematically defined content of information in signals. In this paper, a wavelet transform method is used to detect and quantify the singularity characteristics of EMG IP signals using the Lipschitz Exponent (LE) and measures derived from it. The performance of the method is assessed in terms of its ability to discriminate healthy, myopathic and neuropathic subjects and how it compares with traditionally used Turns Analysis (TA) methods and a method recently developed by the authors, Inter-Scale Wavelet Maximum (ISWM). Highly significant intergroup differences were found using the LE method. Most of the singularity measures have a performance similar to that of ISWM and considerably better than that of TA. Some measures such as the ratio of the mean LE value to the number of singular points in the signal have considerably superior performance to both TA methods. These findings add weight to the view that wavelet analysis methods offer an effective way forward in the quantitative analysis of EMG IP signal to assist the clinician in the diagnosis of neuromuscular disorders.
Lecture Notes in Computer Science, 2018
Social touch is an important form of social interaction. In Human Robot Interaction (HRI), touch ... more Social touch is an important form of social interaction. In Human Robot Interaction (HRI), touch can provide additional information to other modalities, such as audio, visual. One of the application is the robot therapy that has great social significance. In this paper, an ensemble classifier based on threeway decisions is proposed to recognize touch gestures. Firstly, features are extracted from on six perspectives and four classifiers are constructed on different scales with different pre-processing methods. . Then an ensemble classifier is used to combine the four classifiers to classify the gestures. The proposed method is tested on the public Corpus of Social Touch (Cost) dataset. The experiments results not only verify the validity of our method but also show the better accuracy of our ensemble classifier.
Since signal-to-noise ratio and axial resolution are effective decision parameters in ultrasound ... more Since signal-to-noise ratio and axial resolution are effective decision parameters in ultrasound imaging and ultrasonic testing system, we combined nonlinear frequency modulation and Barker code sequence to improve these parameters, thereby improving the effect and performance in the ultrasound imaging and ultrasonic testing system. The theoretical study on the signal generation and decoding has been presented and the cyst phantom simulation has been carried out. The theoretical analysis shows that the new code method can improve 8.46 dB in Contrast ratio compared to simple pulse signal. The simulation produced the similar results.
Applied sciences, Apr 8, 2024
3D facial expression recognition has gained more and more interests from affective computing soci... more 3D facial expression recognition has gained more and more interests from affective computing society due to issues such as pose variations and illumination changes caused by 2D imaging having been eliminated. There are many applications that can benefit from this research, such as medical applications involving the detection of pain and psychological effects in patients, in human-computer interaction tasks that intelligent systems use in today's world. In this paper, we look into 3D Facial Expression Recognition, by investigating many feature extraction methods used on the 2D textured images and 3D geometric data, fusing the 2 domains to increase the overall performance. A One Vs All Multi-class SVM Classifier has been adopted to recognize the expressions Angry, Disgust, Fear, Happy, Neutral, Sad and Surprise from the BU-3DFE and Bosphorus databases. The proposed approach displays an increase in performance when the features are fused together.
arXiv (Cornell University), Mar 4, 2018
Meaningful facial parts can convey key cues for both facial action unit detection and expression ... more Meaningful facial parts can convey key cues for both facial action unit detection and expression prediction. Textured 3D face scan can provide both detailed 3D geometric shape and 2D texture appearance cues of the face which are beneficial for Facial Expression Recognition (FER). However, accurate facial parts extraction as well as their fusion are challenging tasks. In this paper, a novel system for 3D FER is designed based on accurate facial parts extraction and deep feature fusion of facial parts. In particular, each textured 3D face scan is firstly represented as a 2D texture map and a depth map with one-to-one dense correspondence. Then, the facial parts of both texture map and depth map are extracted using a novel 4-stage process consists of facial landmark localization, facial rotation correction, facial resizing, facial parts bounding box extraction and post-processing procedures. Finally, deep fusion Convolutional Neural Networks (CNNs) features of all facial parts are learned from both texture maps and depth maps, respectively and nonlinear SVMs are used for expression prediction. Experiments are conducted on the BU-3DFE database, demonstrating the effectiveness of combing different facial parts, texture and depth cues and reporting the state-of-the-art results in comparison with all existing methods under the same setting.
IEEE Access, 2020
Although automatic fuzzy clustering framework (AFCF) based on improved density peak clustering is... more Although automatic fuzzy clustering framework (AFCF) based on improved density peak clustering is able to achieve automatic and efficient image segmentation, the framework suffers from two problems. The first one is that the adaptive morphological reconstruction (AMR) employed by the AFCF is easily influenced by the initial structuring element. The second one is that the improved density peak clustering using a density balance strategy is complex for finding potential clustering centers. To address these two problems, we propose a fast and automatic image segmentation algorithm using superpixel-based graph clustering (FAS-SGC). The proposed algorithm has two major contributions. First, the AMR based on regional minimum removal (AMR-RMR) is presented to improve the superpixel result generated by the AMR. The binary morphological reconstruction is performed on a regional minimum image, which overcomes the problem that the initial structuring element of the AMR is chosen empirically, since the geometrical information of images is effectively explored and utilized. Second, we use an eigenvalue gradient clustering (EGC) instead of improved density peak (DP) algorithms to obtain potential clustering centers, since the EGC is faster and requires fewer parameters than the DP algorithm. Experiments show that the proposed algorithm is able to achieve automatic image segmentation, providing better segmentation results while requiring less execution time than other state-of-the-art algorithms. INDEX TERMS Image segmentation, fuzzy clustering, graph clustering, density peak (DP) algorithm.
In this paper, we propose a novel approach based on a symmetric fully convolutional network withi... more In this paper, we propose a novel approach based on a symmetric fully convolutional network within pyramid pooling (FCN-PP) for landslide mapping (LM). The proposed approach has three advantages. Firstly, this approach is automatic and insensitive to noise because multivariate morphological reconstruction (MMR) is used for image preprocessing. Secondly, it is able to take into account features from multiple convolutional layers and explore efficiently the context of images, which leads to a good tradeoff between wider receptive field and the use of context. Finally, the selected pyramid pooling module addresses the drawback of global pooling employed by convolutional neural network (CNN), fully convolutional network (FCN), U-Net, etc. Experimental results show that the proposed FCN-PP is effective for LM, and it outperforms state-of-the-art approaches in terms of four metrics, P recision, Recall, F -score, and Accuracy.
IEEE Transactions on Fuzzy Systems, Oct 1, 2018
Frontiers in Psychology, May 31, 2022
With the rapid development of the high-speed railways, the speed of trains is getting faster and ... more With the rapid development of the high-speed railways, the speed of trains is getting faster and faster, and the dynamic load between the wheels and rails of the vehicle increases accordingly. The rolling bearing is a key part of the highspeed train transmission system. The train is subjected to highfrequency vibration for a long time during operation, and the bearing is prone to fatigue damage, which affects the safe operation of the train. Nowadays, many methods have been applied in fault diagnosis like reinforcement learning, convolutional neural networks and autoencoders. One of the typical methods is the reinforcement neural architecture research method. It makes neural network design automatic and eliminates the bottleneck associated with choosing network architectural parameters. However, this method focuses on the time domain signal, and a time domain signal cannot capture the particular properties of a frequency domain signal. In order to solve these problems, we propose a new method containing two Steps: Use FFT to convert the time domain signal to the frequency domain and use Bi-LSTM neural network model to recognize different faults. For each fault, the time series signal has some correlation with some specific frequencies. The frequency domain is more intuitive than the time domain and describes different states of faulty types. For recognition, LSTM is better at classifying sequence data than other methods, and Bi-LSTM can predict the sequence from both directions, achieving higher accuracy. Experiments on public data sets demonstrate the efficiency of the proposed method.
arXiv (Cornell University), Sep 28, 2020
Journal of Biomedical Engineering and Biosciences
Neurocomputing, Jun 1, 2022
IEEE Transactions on Biomedical Engineering, Feb 1, 2006
Clinical electromyography (EMG) interference pattern (IP) signals can reveal more diagnostic info... more Clinical electromyography (EMG) interference pattern (IP) signals can reveal more diagnostic information than their constituents, the motor unit action potentials (MUAPs). Singularities and irregular structures typically characterize the mathematically defined content of information in signals. In this paper, a wavelet transform method is used to detect and quantify the singularity characteristics of EMG IP signals using the Lipschitz Exponent (LE) and measures derived from it. The performance of the method is assessed in terms of its ability to discriminate healthy, myopathic and neuropathic subjects and how it compares with traditionally used Turns Analysis (TA) methods and a method recently developed by the authors, Inter-Scale Wavelet Maximum (ISWM). Highly significant intergroup differences were found using the LE method. Most of the singularity measures have a performance similar to that of ISWM and considerably better than that of TA. Some measures such as the ratio of the mean LE value to the number of singular points in the signal have considerably superior performance to both TA methods. These findings add weight to the view that wavelet analysis methods offer an effective way forward in the quantitative analysis of EMG IP signal to assist the clinician in the diagnosis of neuromuscular disorders.
Lecture Notes in Computer Science, 2018
Social touch is an important form of social interaction. In Human Robot Interaction (HRI), touch ... more Social touch is an important form of social interaction. In Human Robot Interaction (HRI), touch can provide additional information to other modalities, such as audio, visual. One of the application is the robot therapy that has great social significance. In this paper, an ensemble classifier based on threeway decisions is proposed to recognize touch gestures. Firstly, features are extracted from on six perspectives and four classifiers are constructed on different scales with different pre-processing methods. . Then an ensemble classifier is used to combine the four classifiers to classify the gestures. The proposed method is tested on the public Corpus of Social Touch (Cost) dataset. The experiments results not only verify the validity of our method but also show the better accuracy of our ensemble classifier.
Since signal-to-noise ratio and axial resolution are effective decision parameters in ultrasound ... more Since signal-to-noise ratio and axial resolution are effective decision parameters in ultrasound imaging and ultrasonic testing system, we combined nonlinear frequency modulation and Barker code sequence to improve these parameters, thereby improving the effect and performance in the ultrasound imaging and ultrasonic testing system. The theoretical study on the signal generation and decoding has been presented and the cyst phantom simulation has been carried out. The theoretical analysis shows that the new code method can improve 8.46 dB in Contrast ratio compared to simple pulse signal. The simulation produced the similar results.
Applied sciences, Apr 8, 2024
3D facial expression recognition has gained more and more interests from affective computing soci... more 3D facial expression recognition has gained more and more interests from affective computing society due to issues such as pose variations and illumination changes caused by 2D imaging having been eliminated. There are many applications that can benefit from this research, such as medical applications involving the detection of pain and psychological effects in patients, in human-computer interaction tasks that intelligent systems use in today's world. In this paper, we look into 3D Facial Expression Recognition, by investigating many feature extraction methods used on the 2D textured images and 3D geometric data, fusing the 2 domains to increase the overall performance. A One Vs All Multi-class SVM Classifier has been adopted to recognize the expressions Angry, Disgust, Fear, Happy, Neutral, Sad and Surprise from the BU-3DFE and Bosphorus databases. The proposed approach displays an increase in performance when the features are fused together.
arXiv (Cornell University), Mar 4, 2018
Meaningful facial parts can convey key cues for both facial action unit detection and expression ... more Meaningful facial parts can convey key cues for both facial action unit detection and expression prediction. Textured 3D face scan can provide both detailed 3D geometric shape and 2D texture appearance cues of the face which are beneficial for Facial Expression Recognition (FER). However, accurate facial parts extraction as well as their fusion are challenging tasks. In this paper, a novel system for 3D FER is designed based on accurate facial parts extraction and deep feature fusion of facial parts. In particular, each textured 3D face scan is firstly represented as a 2D texture map and a depth map with one-to-one dense correspondence. Then, the facial parts of both texture map and depth map are extracted using a novel 4-stage process consists of facial landmark localization, facial rotation correction, facial resizing, facial parts bounding box extraction and post-processing procedures. Finally, deep fusion Convolutional Neural Networks (CNNs) features of all facial parts are learned from both texture maps and depth maps, respectively and nonlinear SVMs are used for expression prediction. Experiments are conducted on the BU-3DFE database, demonstrating the effectiveness of combing different facial parts, texture and depth cues and reporting the state-of-the-art results in comparison with all existing methods under the same setting.
IEEE Access, 2020
Although automatic fuzzy clustering framework (AFCF) based on improved density peak clustering is... more Although automatic fuzzy clustering framework (AFCF) based on improved density peak clustering is able to achieve automatic and efficient image segmentation, the framework suffers from two problems. The first one is that the adaptive morphological reconstruction (AMR) employed by the AFCF is easily influenced by the initial structuring element. The second one is that the improved density peak clustering using a density balance strategy is complex for finding potential clustering centers. To address these two problems, we propose a fast and automatic image segmentation algorithm using superpixel-based graph clustering (FAS-SGC). The proposed algorithm has two major contributions. First, the AMR based on regional minimum removal (AMR-RMR) is presented to improve the superpixel result generated by the AMR. The binary morphological reconstruction is performed on a regional minimum image, which overcomes the problem that the initial structuring element of the AMR is chosen empirically, since the geometrical information of images is effectively explored and utilized. Second, we use an eigenvalue gradient clustering (EGC) instead of improved density peak (DP) algorithms to obtain potential clustering centers, since the EGC is faster and requires fewer parameters than the DP algorithm. Experiments show that the proposed algorithm is able to achieve automatic image segmentation, providing better segmentation results while requiring less execution time than other state-of-the-art algorithms. INDEX TERMS Image segmentation, fuzzy clustering, graph clustering, density peak (DP) algorithm.
In this paper, we propose a novel approach based on a symmetric fully convolutional network withi... more In this paper, we propose a novel approach based on a symmetric fully convolutional network within pyramid pooling (FCN-PP) for landslide mapping (LM). The proposed approach has three advantages. Firstly, this approach is automatic and insensitive to noise because multivariate morphological reconstruction (MMR) is used for image preprocessing. Secondly, it is able to take into account features from multiple convolutional layers and explore efficiently the context of images, which leads to a good tradeoff between wider receptive field and the use of context. Finally, the selected pyramid pooling module addresses the drawback of global pooling employed by convolutional neural network (CNN), fully convolutional network (FCN), U-Net, etc. Experimental results show that the proposed FCN-PP is effective for LM, and it outperforms state-of-the-art approaches in terms of four metrics, P recision, Recall, F -score, and Accuracy.
IEEE Transactions on Fuzzy Systems, Oct 1, 2018
Frontiers in Psychology, May 31, 2022
With the rapid development of the high-speed railways, the speed of trains is getting faster and ... more With the rapid development of the high-speed railways, the speed of trains is getting faster and faster, and the dynamic load between the wheels and rails of the vehicle increases accordingly. The rolling bearing is a key part of the highspeed train transmission system. The train is subjected to highfrequency vibration for a long time during operation, and the bearing is prone to fatigue damage, which affects the safe operation of the train. Nowadays, many methods have been applied in fault diagnosis like reinforcement learning, convolutional neural networks and autoencoders. One of the typical methods is the reinforcement neural architecture research method. It makes neural network design automatic and eliminates the bottleneck associated with choosing network architectural parameters. However, this method focuses on the time domain signal, and a time domain signal cannot capture the particular properties of a frequency domain signal. In order to solve these problems, we propose a new method containing two Steps: Use FFT to convert the time domain signal to the frequency domain and use Bi-LSTM neural network model to recognize different faults. For each fault, the time series signal has some correlation with some specific frequencies. The frequency domain is more intuitive than the time domain and describes different states of faulty types. For recognition, LSTM is better at classifying sequence data than other methods, and Bi-LSTM can predict the sequence from both directions, achieving higher accuracy. Experiments on public data sets demonstrate the efficiency of the proposed method.
arXiv (Cornell University), Sep 28, 2020