Title
Development of a Scalable Method for Creating Food Groups Using the NHANES Dataset and MapReduce.
Abstract
In this paper, we address the need for less subjective food group classifications in dietary datasets such as the National Health and Nutrition Examination Survey (NHANES) by defining a new, objective method that identifies food groups based exclusively on each food's micro- and macro-nutrient content. We first perform extensive preprocessing of the raw NHANES data to mitigate the impact of missing nutrient values, redundancies, and differing food intake quantities and scales. We then apply an unsupervised clustering algorithm to the preprocessed NHANES data to group foods with similar nutrient content. Finally, we parallelize our method to benefit from the scalable MapReduce paradigm. Our results show that our method identifies food groups with smaller diameters and larger cluster separation distances than the standard, expert-informed method of grouping food items.
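The two quality measures named in the abstract, cluster diameter and cluster separation, can be illustrated with a minimal sketch. The abstract does not state the distance metric or the normalization used, so Euclidean distance and rescaling to a 100 g basis are assumptions here, and the food names, nutrient values, and group assignments are hypothetical.

```python
from itertools import combinations
from math import dist


def per_100g(nutrients, grams):
    """Rescale raw nutrient amounts to a common 100 g basis (assumed normalization)."""
    return [value * 100.0 / grams for value in nutrients]


def diameter(cluster):
    """Largest pairwise Euclidean distance inside one cluster (0.0 for a singleton)."""
    return max((dist(a, b) for a, b in combinations(cluster, 2)), default=0.0)


def separation(cluster_a, cluster_b):
    """Smallest Euclidean distance between any point of A and any point of B."""
    return min(dist(a, b) for a in cluster_a for b in cluster_b)


# Hypothetical foods: ([protein g, fat g, carbohydrate g], portion size in grams).
foods = {
    "food_a": ([6.0, 2.0, 0.0], 200.0),
    "food_b": ([4.0, 1.0, 0.0], 100.0),
    "food_c": ([1.0, 0.0, 30.0], 50.0),
    "food_d": ([2.0, 0.0, 56.0], 100.0),
}

# Put every food on the same scale before comparing nutrient profiles.
scaled = {name: per_100g(values, grams) for name, (values, grams) in foods.items()}

# Hypothetical group assignments; a clustering algorithm would normally produce these.
group_1 = [scaled["food_a"], scaled["food_b"]]
group_2 = [scaled["food_c"], scaled["food_d"]]

print("diameter of group 1:", diameter(group_1))
print("diameter of group 2:", diameter(group_2))
print("separation between groups:", separation(group_1, group_2))
```

Under these definitions, a grouping is tighter when its diameters are small and better separated when the distances between groups are large, which is the comparison the abstract reports against the expert-informed grouping.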
Year
2016
DOI
10.1145/2975167.2975179
Venue
BCB
Keywords
Clustering methods, data processing, Apache Spark, dietary data, micro- and macro-nutrients
Field
Data mining, Data processing, Computer science, Raw data, Unsupervised learning, Artificial intelligence, Cluster analysis, National Health and Nutrition Examination Survey, Preprocessor, Bioinformatics, Food group, Machine learning, Scalability
DocType
Conference
Citations
0
PageRank
0.34
References
2
Authors (4)
Name                   Order  Citations  PageRank
Michael R. Wyatt II    1      3          0.72
Travis Johnston        2      11         1.97
Mia Papas              3      0          0.34
Michela Taufer         4      352        53.04