Title
Inferring Speech Activity from Encrypted Skype Traffic
Abstract
Normally, voice activity detection (VAD) refers to speech processing algorithms for detecting the presence or absence of human speech in segments of audio signals. In this paper, however, we focus on speech detection algorithms that take VoIP traffic instead of audio signals as input. We call this category of algorithms network-level VAD. Traditional VAD usually plays a fundamental role in speech processing systems because of its ability to delimit speech segments. Network-level VAD, on the other hand, can be quite helpful in network management, which is the motivation for our study. We propose the first real-time network-level VAD algorithm that can extract voice activity from encrypted and non-silence-suppressed Skype traffic. We evaluate the speech detection accuracy of the proposed algorithm with extensive real-life traces. The results show that our scheme achieve reasonably good performance even high degree of randomness has been injected into the network traffic.
Year
DOI
Venue
2008
10.1109/GLOCOM.2008.ECP.413
IEEE Global Telecommunications Conference (Globecom)
Keywords
Field
DocType
QoS Provisioning,Traffic Classification,User Satisfaction,VoIP,Voice Activity Detection
Speech processing,Audio signal,Speech coding,Detection theory,Voice activity detection,Cryptography,Computer science,Computer network,Speech recognition,Encryption,Voice over IP
Conference
ISSN
Citations 
PageRank 
1930-529X
7
0.74
References 
Authors
4
4
Name
Order
Citations
PageRank
Yu-Chun Chang144825.55
Kuan-Ta Chen21896136.86
Chen-Chi Wu323817.12
Chin-Laung Lei41686201.07