Title
Self-Configured Framework for scalable link prediction in Twitter: Towards autonomous Spark framework
Abstract
Scalable link prediction in social networks enables dynamic gathering of social interactions, suggestion of potential friends, and community detection. Distributed open-source frameworks such as Hadoop and Spark facilitate efficient link prediction, especially in large-scale social networks. These frameworks expose many tunable properties that users configure manually for each application. However, manual configuration is hard to set up, prone to human error, and leads to performance issues as applications scale. This paper proposes a novel Self-Configured Framework (SCF) that adds an autonomous feature to Spark: using an XGBoost classifier, it predicts and sets the best configuration immediately before the application executes. With self-configuration, the framework demonstrates a 40% reduction in prediction time as well as balanced resource consumption that makes full use of the available resources, especially when the number and size of clusters are limited. The presented framework establishes its efficiency for link prediction in large-scale social networks by automatically selecting the configuration best suited to a specific application, given the varying dataset size of the Twitter social network, the workload, and the cluster specification.
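The self-configuration step described in the abstract (predict a configuration class, then apply it before the job starts) can be illustrated with a minimal sketch. It is not the authors' implementation: the feature names, the three candidate configurations, and the threshold rule in `stand_in_classifier` are hypothetical. The rule only stands in for a trained classifier (the paper uses XGBoost) so that the example runs with the standard library alone. The Spark property names and the `--conf key=value` form of `spark-submit` are standard.

```python
from typing import Callable, Dict, List

# Hypothetical candidate configurations, keyed by class label.
# The values are illustrative assumptions, not settings from the paper.
CANDIDATE_CONFIGS: Dict[int, Dict[str, str]] = {
    0: {"spark.executor.instances": "2", "spark.executor.cores": "2",
        "spark.executor.memory": "2g", "spark.sql.shuffle.partitions": "16"},
    1: {"spark.executor.instances": "4", "spark.executor.cores": "4",
        "spark.executor.memory": "4g", "spark.sql.shuffle.partitions": "64"},
    2: {"spark.executor.instances": "8", "spark.executor.cores": "4",
        "spark.executor.memory": "8g", "spark.sql.shuffle.partitions": "200"},
}


def stand_in_classifier(features: Dict[str, float]) -> int:
    """Placeholder for a trained classifier (assumption: simple thresholds).

    Maps dataset size per available core to one of the class labels above.
    """
    data_per_core = features["dataset_gb"] / features["total_cores"]
    if data_per_core < 0.25:
        return 0
    if data_per_core < 1.0:
        return 1
    return 2


def choose_config(
    features: Dict[str, float],
    classifier: Callable[[Dict[str, float]], int] = stand_in_classifier,
) -> Dict[str, str]:
    """Predict a class label and return a copy of the matching configuration."""
    label = classifier(features)
    return dict(CANDIDATE_CONFIGS[label])


def spark_submit_args(config: Dict[str, str], app: str) -> List[str]:
    """Build a spark-submit argument list that applies the chosen properties."""
    args = ["spark-submit"]
    for key in sorted(config):
        args += ["--conf", f"{key}={config[key]}"]
    args.append(app)
    return args


if __name__ == "__main__":
    # Hypothetical workload description: 8 GB of data on a 16-core cluster.
    chosen = choose_config({"dataset_gb": 8.0, "total_cores": 16})
    print(" ".join(spark_submit_args(chosen, "link_prediction.py")))
```

Passing the classifier as a callable keeps the configuration lookup separate from the model, so a trained model's prediction function could replace the threshold rule without touching the rest of the sketch.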
Year
2022
DOI
10.1016/j.knosys.2022.109713
Venue
Knowledge-Based Systems
Keywords
Self-Configured Framework, Link prediction, Social network, Large-scale
DocType
Journal
Volume
255
ISSN
0950-7051
Citations
0
PageRank
0.34
References
0
Authors
6