Abstract | ||
---|---|---|
The U.S. Federal Government developed HealthData.gov to disseminate healthcare datasets to the public. Metadata is provided for each datasets and is the sole source of information to find and retrieve data. This study employed automated quality assessments of the HealthData.gov metadata published from 2012 to 2014 to measure completeness, accuracy, and consistency of applying standards. The results demonstrated that metadata published in earlier years had lower completeness, accuracy, and consistency. Also, metadata that underwent modifications following their original creation were of higher quality. HealthData.gov did not uniformly apply Dublin Core Metadata Initiative to the metadata, which is a widely accepted metadata standard. These findings suggested that the HealthData.gov metadata suffered from quality issues, particularly related to information that wasn't frequently updated. The results supported the need for policies to standardize metadata and contributed to the development of automated measures of metadata quality. |
Year | Venue | Field |
---|---|---|
2016 | AMIA | Metadata quality,World Wide Web,Computer science,Information repository |
DocType | Volume | Citations |
Conference | 2016 | 0 |
PageRank | References | Authors |
0.34 | 0 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
David T. Marc | 1 | 0 | 2.70 |
James Beattie | 2 | 0 | 0.68 |
Vitaly Herasevich | 3 | 29 | 17.49 |
L. C. Gatewood | 4 | 4 | 2.61 |
Rui Zhang | 5 | 6 | 5.25 |