Context-Enhanced Stereo Transformer. - Citegraph

Paper Info

Title
Context-Enhanced Stereo Transformer.

Abstract
Stereo depth estimation is of great interest for computer vision research. However, existing methods struggle to generalize and predict reliably in hazardous regions, such as large uniform regions. To overcome these limitations, we propose Context Enhanced Path (CEP). CEP improves generalization and robustness against common failure cases in existing solutions by capturing long-range global information. We construct our stereo depth estimation model, Context Enhanced Stereo Transformer (CSTR), by plugging CEP into the state-of-the-art stereo depth estimation method Stereo Transformer. CSTR is examined on distinct public datasets, such as Scene Flow, Middlebury-2014, KITTI-2015, and MPI-Sintel. We find CSTR outperforms prior approaches by a large margin. For example, in the zero-shot synthetic-to-real setting, CSTR outperforms the best competing approaches on the Middlebury-2014 dataset by 11%. Our extensive experiments demonstrate that long-range information is critical for the stereo matching task and that CEP successfully captures such information. (Code available at: github.com/guoweiyu/Context-Enhanced-Stereo-Transformer)

Year	DOI	Venue
2022	10.1007/978-3-031-19824-3_16	European Conference on Computer Vision
Keywords	DocType	Citations
Stereo depth estimation, Transformer, Context extraction	Conference	0
PageRank	References	Authors
0.34	0	8

Authors (8 rows)

Name	Order	Citations	PageRank
Weiyu Guo	1	0	0.34
Zhaoshuo Li	2	4	2.46
Yongkui Yang	3	0	0.34
Zheng Wang	4	72	47.08
Russell H. Taylor	5	1970	438.00
Mathias Unberath	6	56	24.46
Alan L. Yuille	7	10339	1902.01
Yingwei Li	8	7	6.35
