Title
P-Biblio-MetReS, a parallel data mining tool for the reconstruction of molecular networks
Abstract
Biblio-MetReS is a single-thread data mining application that facilitates the reconstruction of molecular networks based on automated text mining analysis of published scientific literature. This application is very CPU-intensive, requiring High Performace Computing (HPC). Due to the amount of execution tasks, it can be quite slow. Those tasks are repetitive and consist in mining the information from large sets of scientific documents, a process where the time-cost of the application could be improved through paralellization. This paper presents a parallel version of Biblio-MetReS. The multithreading application P(arallel)-Biblio-MetReS distributes the work among copies of the same Java class, each mining a collection of documents obtained in a previous search phase from different literature sources of Internet. In this article, we compare performances between the parallel and non-parallel versions of the application and discuss scalability issues on multi-threading systems in the context of this application. Furthermore, we also optimize memory management and reutilization of document parsing results. Our experimental results corroborate the good performance of P-Biblio-MetReS, pinpointing specific aspects that still need to be improved.
Year
DOI
Venue
2013
10.1145/2488551.2488586
EuroMPI
Keywords
Field
DocType
molecular network,high performace computing,different literature source,scientific document,execution task,single-thread data mining application,scientific literature,java class,multithreading application,parallel data mining tool,automated text mining analysis,parallel version,multithreading,scalability,parallel programming,parallelization
Data mining,Scientific literature,Multithreading,Text mining,Computer science,Memory management,Parsing,Java,Database,The Internet,Scalability
Conference
Citations 
PageRank 
References 
0
0.34
8
Authors
8
Name
Order
Citations
PageRank
Ivan Teixido1705.71
Anabel Usié200.34
Josep Ll. Lérida300.68
Francesc Solsona422527.39
Jorge Comas530.76
Nestor Torres600.34
Hiren Karathia718416.08
Rui Alves819632.99