Putting Humans In A Scene: Learning Affordance In 3d Indoor Environments - Citegraph

Paper Info

Title
Putting Humans In A Scene: Learning Affordance In 3d Indoor Environments

Abstract
Affordance(1) modeling plays an important role in visual understanding. In this paper, we aim to predict affordances of 3D indoor scenes, specifically what human poses are afforded by a given indoor environment, such as sitting on a chair or standing on the floor. In order to predict valid affordances and learn possible 3D human poses in indoor scenes, we need to understand the semantic and geometric structure of a scene as well as its potential interactions with a human. To learn such a model, a large-scale dataset of 3D indoor affordances is required. In this work, we build a fully automatic 3D pose synthesizer that fuses semantic knowledge from a large number of 2D poses extracted from TV shows as well as 3D geometric knowledge from voxel representations of indoor scenes. With the data created by the synthesizer, we introduce a 3D pose generative model to predict semantically plausible and physically feasible human poses within a given scene (provided as a single RGB, RGB-D, or depth image). We demonstrate that our human affordance prediction method consistently outperforms existing state-of-the-art methods.

Year	DOI	Venue
2019	10.1109/CVPR.2019.01265	2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019)
Field	DocType	Volume
Semantic memory,Voxel,Computer science,Human–computer interaction,Artificial intelligence,RGB color model,Affordance,Machine learning,Generative model	Journal	abs/1903.05690
ISSN	Citations	PageRank
1063-6919	5	0.40
References	Authors
17	6

Authors (6 rows)

Cited by (5 rows)

References (17 rows)

Name	Order	Citations	PageRank
Xueting Li	1	14	3.22
Sifei Liu	2	227	17.54
Kihwan Kim	3	409	28.22
Xiaolong Wang	4	713	39.04
Yang Ming-Hsuan	5	15303	620.69
Jan Kautz	6	3615	198.77

1