Title
Framework for Efficient Indexing and Searching of Scientific Metadata
Abstract
A seamless and intuitive data reduction capability for the vast amount of scientific metadata generated by experiments is critical to ensure effective use of the data by domain specific scientists. The portal environments and scientific gateways currently used by scientists provide search capability that is limited to the pre-defined pull-down menus and conditions set in the portal interface. Currently, data reduction can only be effectively achieved by scientists who have developed expertise in dealing with complex and disparate query languages. A common theme in our discussions with scientists is that data reduction capability, similar to web search in terms of ease-of-use, scalability, and freshness/accuracy of results, is a critical need that can greatly enhance the productivity and quality of scientific research. Most existing search tools are designed for exact string matching, but such matches are highly unlikely given the nature of metadata produced by instruments and a user’s inability to recall exact numbers to search in very large datasets. This paper presents research to locate metadata of interest within a range of values. To meet this goal, we leverage the use of XML in metadata description for scientific datasets, specifically the NeXus datasets generated by the SNS scientists. We have designed a scalable indexing structure for processing data reduction queries. Web semantics and ontology based methodologies are also employed to provide an elegant, intuitive, and powerful free-form query based data reduction interface to end users.
Year
DOI
Venue
2010
10.1109/CCGRID.2010.120
Cluster, Cloud and Grid Computing
Keywords
Field
DocType
data reduction query,efficient indexing,scientific datasets,scientific gateway,data reduction capability,scientific metadata,existing search tool,metadata description,intuitive data reduction capability,data reduction interface,data reduction,productivity,scalability,web pages,indexation,ease of use,query language,xml,database languages,indexing,scientific research,string matching,search engines
Metadata,Metadata repository,Query language,XML,Information retrieval,Web page,Computer science,Data element,Search engine indexing,Scalability
Conference
ISBN
Citations 
PageRank 
978-1-4244-6987-1
1
0.38
References 
Authors
7
2
Name
Order
Citations
PageRank
Chaitali Gupta1203.84
Madhusudhan Govindaraju285496.53