Title
Semantic Understanding of General Linguistic Items by Means of Fuzzy Set Theory
Abstract
Modern statistical techniques used in the field of natural language processing are limited in their applications by the fact they suffer from the loss of most of the semantic information contained in text documents. Fuzzy techniques have been proposed as a way to correct this problem through the modelling of the relationships between words while accommodating the ambiguities of natural languages. However, these techniques are currently either restricted to modelling the effects of simple words or are specialized in a single domain. In this paper, we propose a novel statistical-fuzzy methodology to represent the actions described in a variety of text documents by modelling the relationships between subject-verb-object triplets. The research will focus in the first place on the technique used to accurately extract the triplets from the text, on the necessary equations to compute the statistics of the subject-verb and verb-object pairs, and on the formulas needed to interpolate the fuzzy membership functions from these statistics and on those needed to de fuzzify the membership value of unseen triplets. Taken together, these sets of equations constitute a comprehensive system that allows the quantification and evaluation of the meaning of text documents, while being general enough to be applied to any domain. In the second phase, this paper will proceed to experimentally demonstrate the validity of our new methodology by applying it to the implementation of a fuzzy classifier conceived especially for this research. This classifier is trained using a section of the Brown Corpus, and its efficiency is tested with a corpus of 20 unseen documents drawn from three different domains. The positive results obtained from these experimental tests confirm the soundness of our new approach and show that it is a promising avenue of research.
Year
DOI
Venue
2007
10.1109/TFUZZ.2006.889817
IEEE T. Fuzzy Systems
Keywords
DocType
Volume
text document,fuzzy membership function,natural language processing,fuzzy technique,semantic understanding,natural language,new approach,general linguistic items,new methodology,fuzzy classifier,membership value,fuzzy set theory,different domain,fuzzy system,text analysis,fuzzy systems,single domain,natural languages,statistical analysis
Journal
15
Issue
ISSN
Citations 
5
1063-6706
4
PageRank 
References 
Authors
0.45
31
5
Name
Order
Citations
PageRank
R. Khoury140.79
Fakhri Karray21733130.97
Yu Sun35510.37
M. Kamel440.45
O. Basir5454.51