Development of a Scalable Method for Creating Food Groups Using the NHANES Dataset and MapReduce. - Citegraph

Paper Info

Title
Development of a Scalable Method for Creating Food Groups Using the NHANES Dataset and MapReduce.

Abstract
In this paper, we tackle the need for meaningful food group classifications in dietary datasets such as the National Health and Nutrition Examination Survey (NHANES) that are less subjective in nature by defining a new objective method of identifying food groups based exclusively on the food's micro- and macro-nutrient content. We first perform extensive preprocessing of the NHANES raw data to mitigate the impacts of missing nutrient values, redundancies, and different food intake quantities and scales. We then utilize an unsupervised clustering algorithm to create food groups within the preprocessed NHANES data and identify food groups with similar nutrient content. Finally, we parallelize our method to benefit from the scalable MapReduce paradigm. Our results show that our method identifies food groups with smaller diameters and larger cluster separation distances than the standard, expert-informed method of grouping food items.
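
The abstract compares groupings by cluster diameter and cluster separation distance but does not define either measure on this page. The following is a minimal sketch, assuming that diameter means the largest pairwise Euclidean distance within a group and that separation means the smallest distance between points of two different groups. The function names and the nutrient vectors are hypothetical illustrations, not taken from the paper or from NHANES.

```python
from itertools import combinations
from math import dist


def diameter(cluster):
    """Largest pairwise Euclidean distance within one cluster (0.0 for a singleton)."""
    return max((dist(a, b) for a, b in combinations(cluster, 2)), default=0.0)


def separation(cluster_a, cluster_b):
    """Smallest Euclidean distance between a point in cluster_a and a point in cluster_b."""
    return min(dist(a, b) for a in cluster_a for b in cluster_b)


# Hypothetical nutrient vectors (for example protein, fat, carbohydrate per 100 g).
# The values are made up for illustration only.
group_1 = [(1.0, 0.0, 10.0), (1.0, 0.0, 13.0)]
group_2 = [(20.0, 10.0, 0.0), (20.0, 14.0, 0.0)]

print("diameter of group 1:", diameter(group_1))
print("diameter of group 2:", diameter(group_2))
print("separation between groups:", separation(group_1, group_2))
```

Under these assumed definitions, a grouping is tighter when its diameters are small and better separated when the distances between groups are large, which is the direction of improvement the abstract reports.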

Year	DOI	Venue
2016	10.1145/2975167.2975179	BCB
Keywords	Field	DocType
Clustering methods, data processing, Apache Spark, dietary data, micro- and macro-nutrients	Data mining, Data processing, Computer science, Raw data, Unsupervised learning, Artificial intelligence, Cluster analysis, National Health and Nutrition Examination Survey, Preprocessor, Bioinformatics, Food group, Machine learning, Scalability	Conference
Citations	PageRank	References
0	0.34	2
Authors
4

Authors (4 rows)

Name	Order	Citations	PageRank
Michael R. Wyatt II	1	3	0.72
Travis Johnston	2	11	1.97
Mia Papas	3	0	0.34
Michela Taufer	4	352	53.04