Title
Audio signals encoding for cough classification using convolutional neural networks: A comparative study
Abstract
Cough detection has considerable clinical value, which can provide an objective basis for assessment and diagnosis of respiratory diseases. Motivated by the great achievements of convolutional neural networks (CNNs) in recent years, we adopted 5 different ways to encode audio signals as images and treated them as the input of CNNs, so that image processing technology could be applied to analyze audio signals. In order to explore the optimal audio signals encoding method, we performed comparative experiments on medical dataset containing 70000 audio segments from 26 patients. Experimental results show that RASTA-PLP spectrum is the best method to encode audio signals as images with respect to cough classification task, which gives an average accuracy of 0.9965 in 200 iterations on test batches and a F1-score of 0.9768 on samples re-sampled from the test set. Therefore, the image processing based method is shown to be a promising choice for the process of audio signals.
Year
DOI
Venue
2015
10.1109/BIBM.2015.7359724
IEEE International Conference on Bioinformatics and Biomedicine
Keywords
Field
DocType
cough detection, audio features, convolutional neural networks, image processing, cough classification
ENCODE,Audio signal,Convolutional neural network,Computer science,Image processing,Speech recognition,Artificial intelligence,Hidden Markov model,Machine learning,Encoding (memory),Test set
Conference
ISSN
Citations 
PageRank 
2156-1125
3
0.42
References 
Authors
8
4
Name
Order
Citations
PageRank
Hui-Hui Wang130.42
Jia-Ming Liu2153.14
Mingyu You3102.31
Guo-Zheng Li436842.62