Title
Character usage in Chinese short message service SMS: a real-world study in Mainland China
Abstract
Short message service SMS is an important component of modern mobile services. Given unique characteristics of Chinese language, it is imperative to conduct study to understand characteristic of language usage patterns in Chinese SMS so that important facts like why and how people in China use SMS can be discovered. In this paper, we report an analysis of Chinese SMS logs from three different provinces in China. A computational approach was applied to extract n-grams from logs of SMS. The language usage patterns reported in this paper consist of two aspects: 1 most popular n-grams that represent what types of information were transmitted via SMS; 2 distribution of n-grams in comparison with Zipf laws. We discovered that, compared with other forms of free text in Chinese, SMS contains more conversational elements, which are expressed mostly in bigrams. Trigrams, 4-and 5-grams are less frequent but are closely connected to commercial activities, which may indicate the commercial needs of SMS users.
Year
DOI
Venue
2013
10.1504/IJMC.2013.056954
IJMC
Keywords
DocType
Volume
chinese sms,chinese language,sms user,popular n-grams,character usage,important component,mainland china,chinese sms log,china use,real-world study,commercial need,commercial activity,language usage pattern,chinese short message service,china,short message service,mobile communications,text mining
Journal
11
Issue
ISSN
Citations 
5
1470-949X
2
PageRank 
References 
Authors
0.45
16
4
Name
Order
Citations
PageRank
Xi Chen1283.52
Chenhui Guo2622.89
Michael Chau3147197.79
Weihua Zhou4243.80