Title | ||
---|---|---|
Development of a Scalable Method for Creating Food Groups Using the NHANES Dataset and MapReduce. |
Abstract | ||
---|---|---|
In this paper we tackle the need for meaningful food group classifications in dietary datasets such as the National Health and Nutrition Examination Survey (NAHNES) that are less subjective in nature by defining a new objective method of identifying food groups exclusively based on the food's micro- and macro-nutrient content. We first perform extensive preprocessing of the NHANES raw data to mitigate impacts of missing nutrient values, redundancies, and different food intake quantities and scales. We then utilize an unsupervised learning clustering algorithm to create food groups within the preprocessed NHANES data and identify food groups with similar nutrient content. Finally we parallelize our method to benefit from the scalable MapReduce paradigm. Our results show that our method identifies food groups with smaller diameter and larger cluster separation distances than the standard, expert-informed, method of grouping food items. |
Year | DOI | Venue |
---|---|---|
2016 | 10.1145/2975167.2975179 | BCB |
Keywords | Field | DocType |
Clustering methods, data processing, Apache Spark, dietary data, micro- and macro-nutrients | Data mining,Data processing,Computer science,Raw data,Unsupervised learning,Artificial intelligence,Cluster analysis,National Health and Nutrition Examination Survey,Preprocessor,Bioinformatics,Food group,Machine learning,Scalability | Conference |
Citations | PageRank | References |
0 | 0.34 | 2 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Michael R. Wyatt II | 1 | 3 | 0.72 |
Travis Johnston | 2 | 11 | 1.97 |
Mia Papas | 3 | 0 | 0.34 |
michela taufer | 4 | 352 | 53.04 |