Title
Performance Evaluation of Word Representation Techniques using Deep Learning Methods
Abstract
Word vectors are real-valued numeric representations that allow machine learning algorithms to extract the semantic information associated with words when trained on natural language corpora. This paper explores word representation techniques, together with evaluation criteria for measuring the quality of a representation, using deep learning models such as BERT. The performance of word vectors can be evaluated with two broad classes of measures: intrinsic and extrinsic evaluation. Intrinsic evaluators directly measure syntactic or semantic relationships between words, independent of any language processing task, and focus on subtasks. Extrinsic evaluators use a complete natural language processing task, such as chunking or sentiment analysis, as the measure of performance. The experiments were performed using the BOW model, Word2Vec, and the BERT language model. In this work, the word-similarity task is used for intrinsic evaluation and the part-of-speech (POS) tagging task is used for extrinsic evaluation. The experiments were implemented in Python with the sklearn machine learning toolkit and the Keras deep learning framework. The BERT language model, which has recently emerged as a prominent tool for natural language processing, is used. The results obtained in these experiments indicate that word embedding representation techniques are efficient and perform better than existing traditional models. However, accuracy could be improved further by using larger datasets.
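The intrinsic word-similarity evaluation mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the common setup in which cosine similarity between word vectors is compared against human similarity ratings using Spearman rank correlation. The toy 3-dimensional vectors, the word pairs, and the human ratings below are all made up for illustration, and the helper names (`cosine_similarity`, `ranks`, `spearman`) are hypothetical.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def ranks(values):
    """Rank values from 1 (smallest) upward; assumes there are no ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, idx in enumerate(order, start=1):
        result[idx] = rank
    return result


def spearman(xs, ys):
    """Spearman rank correlation for tie-free data."""
    n = len(xs)
    rx, ry = ranks(xs), ranks(ys)
    d2 = sum((a - b) ** 2 for a, b in zip(rx, ry))
    return 1 - 6 * d2 / (n * (n * n - 1))


# Toy vectors, invented for illustration (not trained embeddings).
vectors = {
    "king": [0.9, 0.8, 0.1],
    "queen": [0.85, 0.9, 0.15],
    "apple": [0.1, 0.2, 0.95],
    "fruit": [0.2, 0.1, 0.9],
}

# Hypothetical human similarity ratings for each word pair.
pairs = [
    ("king", "queen", 9.0),
    ("apple", "fruit", 8.0),
    ("king", "apple", 1.5),
    ("queen", "fruit", 2.0),
]

model_scores = [cosine_similarity(vectors[a], vectors[b]) for a, b, _ in pairs]
human_scores = [h for _, _, h in pairs]

# A higher correlation means the embedding's similarity ordering
# agrees more closely with the human ordering.
rho = spearman(model_scores, human_scores)
print(f"Spearman correlation: {rho:.3f}")
```

In practice the vectors would come from a trained model such as Word2Vec or BERT and the ratings from a published word-similarity benchmark; a tie-aware correlation routine would also be needed for real rating data.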
Year
2020
DOI
10.1109/ICCCS49678.2020.9277190
Venue
2020 5th International Conference on Computing, Communication and Security (ICCCS)
Keywords
Word Vector, Word Embedding, Distributed Representation, Intrinsic Evaluators, Extrinsic Evaluators, BOW, Word2Vec, BERT, Pre-trained Embedding
DocType
Conference
ISBN
978-1-7281-9181-2
Citations
0
PageRank
0.34
References
4
Authors
2
Name | Order | Citations | PageRank
Anjali Bohra | 1 | 0 | 0.34
N. C. Barwar | 2 | 0 | 0.34