Towards Building Arabic Corpus For Drug Information - Citegraph

Paper Info

Title
Towards Building Arabic Corpus For Drug Information

Abstract
Corpora have opened up many new areas of research in the linguistic domain, which would never been possible without them. Moreover, corpora have proved their usefulness not only in the linguistic domain but also in other domains, such as medical, economic, legal, pharmacological, etc. English is considered to have the richest language resources in most of these domains, while Arabic reveals a gap in most of them. This paper tries to fill the gap in the pharmacological domain, especially for drugs, by constructing the first Arabic drug corpus, which is composed of 202 drugs, each drug is saved in a text file with UTF-8 character encoding. The corpus was manually annotated with four-entity types: generic (for drug's generic name), brand for (trade names), chemical formula and class (for drug classes).

Year	DOI	Venue
2014	10.1145/2668260.2668275	MEDES
Keywords	Field	DocType
design,pharmacological domain,experimentation,annotation,content analysis and indexing,drug-drug interaction,corpus building,arabic language,data collection,drug,information search and retrieval	Drug-drug interaction,Annotation,Arabic,Computer science,Artificial intelligence,Natural language processing,Character encoding,Drug	Conference
Citations	PageRank	References
0	0.34	2
Authors
3

Authors (3 rows)

Cited by (0 rows)

References (2 rows)

Name	Order	Citations	PageRank
Haifa Al-Ibrahim	1	0	0.34
Hend Suliman Al-Khalifa	2	19	3.98
AbdulMalik S. Al-Salman	3	141	18.35

1