Audio Classification of Bit-Representation Waveform. - Citegraph

Paper Info

Title
Audio Classification of Bit-Representation Waveform.

Abstract
This paper investigates waveform representation for audio signal classification. Recently, many studies on audio waveform classification such as acoustic event detection and music genre classification have been increasing. Most studies on audio waveform classification proposed to use a deep learning (neural network) framework. Generally, a frequency analysis method like the Fourier transform is applied to extract frequency or spectral information of the input audio waveform before inputting the raw audio waveform into a neural network. As against to these previous studies, in this paper, we propose a novel waveform representation method, in which audio waveforms are represented as bit-sequence, for audio classification. In our experiment, we compare the proposed bit-representation waveform, which is directly given to a neural network, to other representation of audio waveforms such as raw audio waveform and power spectrum on two classification tasks: one is an acoustic event classification task, the other is a sound/music classification task. The experimental results showed that the bit-representation waveform got the best classification performances on both the tasks.

Year	DOI	Venue
2019	10.21437/interspeech.2019-1855	Conference of the International Speech Communication Association
DocType	Volume	Citations
Journal	abs/1904.04364	0
PageRank	References	Authors
0.34	0	4

Authors (4 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Masaki Okawa	1	0	0.68
Takuya Saito	2	0	2.70
Naoki Sawada	3	16	5.06
Hiromitsu Nishizaki	4	163	29.49

1