Title
Extracting Knowledge From Web Data
Abstract
The user behavior on a website triggers a sequence of queries that have a result which is the display of certain pages. The Information about these queries (including the names of the resources requested and responses from the Web server) are stored in a text file called a log file. Analysis of server log file can provide significant and useful information. Web Mining is the extraction of interesting and potentially useful patterns and implicit information from artifacts or activity related to the World Wide Web. Web usage mining is a main research area in Web mining focused on learning about Web users and their interactions with Web sites. The motive of mining is to find users' access models automatically and quickly from the vast Web log file, such as frequent access paths, frequent access page groups and user clustering. Through Web Usage Mining, several information left by user access can be mined which will provide foundation for decision making of organizations, Also the process of Web mining was defined as the set of techniques designed to explore, process and analyze large masses of consecutive information activities on the Internet, has three main steps: data preprocessing, extraction of reasons of the use and the interpretation of results. This paper will start with the presentation of different formats of web log files, then it will present the different preprocessing method that have been used, and finally it presents a system for "Web content and Usage Mining'' for web data extraction and web site analysis using Data Mining Algorithms Apriori, FPGrowth, K-Means, KNN, and ID3.
Year
DOI
Venue
2014
10.4018/jitr.2014100103
JOURNAL OF INFORMATION TECHNOLOGY RESEARCH
Keywords
Field
DocType
Data Cleaning, Preprocessing, Web LogFile, Web Mining, Web Usage Mining
Static web page,Data mining,Web API,World Wide Web,Web mining,Web page,Computer science,Data Web,Web log analysis software,Web modeling,Web navigation
Journal
Volume
Issue
ISSN
7
4
1938-7857
Citations 
PageRank 
References 
0
0.34
1
Authors
4
Name
Order
Citations
PageRank
Hanane Ezzikouri111.72
Mohamed Fakir200.34
Cherki Daoui301.01
Mohamed Erritali401.35