Title
Towards Building Arabic Corpus For Drug Information
Abstract
Corpora have opened up many new areas of research in the linguistic domain that would never have been possible without them. Moreover, corpora have proved useful not only in the linguistic domain but also in other domains, such as the medical, economic, legal, and pharmacological domains. English is considered to have the richest language resources in most of these domains, while Arabic shows a gap in most of them. This paper tries to fill the gap in the pharmacological domain, specifically for drugs, by constructing the first Arabic drug corpus. The corpus is composed of 202 drugs, each saved in a text file with UTF-8 character encoding. It was manually annotated with four entity types: generic (for a drug's generic name), brand (for trade names), chemical formula, and class (for drug classes).
Year: 2014
DOI: 10.1145/2668260.2668275
Venue: MEDES
Keywords: design, pharmacological domain, experimentation, annotation, content analysis and indexing, drug-drug interaction, corpus building, arabic language, data collection, drug, information search and retrieval
Field: Drug-drug interaction, Annotation, Arabic, Computer science, Artificial intelligence, Natural language processing, Character encoding, Drug
DocType: Conference
Citations: 0
PageRank: 0.34
References: 2
Authors: 3
Name                       Order  Citations  PageRank
Haifa Al-Ibrahim           1      0          0.34
Hend Suliman Al-Khalifa    2      19         3.98
AbdulMalik S. Al-Salman    3      141        18.35