Title
A New Framework for Publishing and Sharing Network and Security Datasets
Abstract
Datasets are very important for network and security research and development. Despite the continuous growth in the number of available datasets, there is no effective publishing and sharing mechanisms so that realistic and representative datasets are not only hard to construct but it is difficult to select from tens of thousands of datasets scattered in online repositories. This work aims to alleviate the difficulties inherent in searching, selecting and comparing datasets as well as to decrease the ambiguity associated with dataset publication and share. In this paper we present the basis and the implementation of a new framework to describe and share network datasets with a special focus on network and security-related datasets. Hereafter, we present the underlying idea of the proposed framework and the key component of this approach: a Dataset Description Language (DDL) to express dataset metadata. Besides that, we explain how we implemented a proof-of-concept prototype to demonstrate its feasibility and usefulness, only from OSOTS (Open Source Off The Shelf). It allows us to overcome the problem of backward dealing with a huge number of already existing datasets where it generates Dataset Description Sheets (DDS) automatically for traffic datasets. The proposed approach provides several benefits where it facilitates searching in dataset repositories according to various criteria. Moreover, its output in XML format can be integrated easily with Security Content Automation Protocol (SCAP) tools. It also, enhances communicating dataset properties in a clear and succinct manner.
Year
DOI
Venue
2012
10.1109/SC.Companion.2012.77
High Performance Computing, Networking, Storage and Analysis
Keywords
Field
DocType
dataset description language,new framework,dataset publication,security datasets,dataset repository,traffic datasets,dataset metadata,available datasets,representative datasets,security-related datasets,dataset property,share network datasets,xml,meta data,publishing,dataset,security,computer network security,network,public domain software
Data mining,Metadata,Off the shelf,XML,Computer science,Network security,Automation,Cataloging,Publishing,Ambiguity
Conference
ISBN
Citations 
PageRank 
978-1-4673-6218-4
0
0.34
References 
Authors
8
2
Name
Order
Citations
PageRank
Mohammed S. GADELRAB111.08
Ali Ghorbani2101.95