Transformer for Single Image Super-Resolution | 0 | 0.34 | 2022 |
Integrating Point and Line Features for Visual-Inertial Initialization | 0 | 0.34 | 2022 |
Attentive Decoupling Network for Cloth-Changing Re-Identification | 0 | 0.34 | 2022 |
Image-to-video person re-identification using three-dimensional semantic appearance alignment and cross-modal interactive learning | 0 | 0.34 | 2022 |
SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization. | 0 | 0.34 | 2022 |
A Robust Pixel-Aware Gyro-Aided KLT Feature Tracker for Large Camera Motions | 0 | 0.34 | 2022 |
When Dictionary Learning Meets Deep Learning: Deep Dictionary Learning and Coding Network for Image Recognition with Limited Data | 1 | 0.35 | 2021 |
Binaural Sound Source Localization Based On Weighted Template Matching | 0 | 0.34 | 2021 |
Optimization-Based Online Initialization And Calibration Of Monocular Visual-Inertial Odometry Considering Spatial-Temporal Constraints | 0 | 0.34 | 2021 |
A Base-Derivative Framework For Cross-Modality Rgb-Infrared Person Re-Identification | 0 | 0.34 | 2020 |
An Online Initialization and Self-Calibration Method for Stereo Visual-Inertial Odometry | 1 | 0.35 | 2020 |
3d Audio-Visual Speaker Tracking With A Novel Particle Filter | 0 | 0.34 | 2020 |
Unsupervised Monocular Visual-inertial Odometry Network | 1 | 0.36 | 2020 |
Mutual Alignment Between Audiovisual Features For End-To-End Audiovisual Speech Recognition | 0 | 0.34 | 2020 |
Efficient High-Resolution High-Level-Semantic Representation Learning For Human Pose Estimation | 0 | 0.34 | 2020 |
Audio-Visual Speech Recognition Using A Two-Step Feature Fusion Strategy | 0 | 0.34 | 2020 |
An Adaptive Method Based On Multiscale Dilated Convolutional Network For Binaural Speech Source Localization | 0 | 0.34 | 2020 |
A Weight-Shared Dual-Branch Convolutional Neural Network For Unsupervised Dense Depth Prediction And Camera Motion Estimation | 0 | 0.34 | 2019 |
Regrasp Planning Using Stable Object Poses Supported by Complex Structures | 0 | 0.34 | 2019 |
Combining Adaptive Hierarchical Depth Motion Maps with Skeletal Joints for Human Action Recognition | 0 | 0.34 | 2019 |
Sample Fusion Network: An End-to-End Data Augmentation Network for Skeleton-based Human Action Recognition. | 1 | 0.41 | 2019 |
Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering | 3 | 0.41 | 2019 |
Multitask learning of time-frequency CNN for sound source localization | 0 | 0.34 | 2019 |
Robust 3D Action Recognition through Sampling Local Appearances and Global Distributions | 2 | 0.36 | 2018 |
Sensor-based complete coverage path planning in dynamic environment for cleaning robot | 3 | 0.39 | 2018 |
An attention-aware model for human action recognition on tree-based skeleton sequences | 0 | 0.34 | 2018 |
Instance Enhancing Loss: Deep Identity-Sensitive Feature Embedding For Person Search | 0 | 0.34 | 2018 |
Multiple Sound Source Counting And Localization Based On Spatial Principal Eigenvector | 1 | 0.36 | 2017 |
Building Endgame Data set to Improve Opponent Modeling Approach | 0 | 0.34 | 2017 |
How do you smile? Towards a comprehensive smile analysis system | 0 | 0.34 | 2017 |
Learning informative pairwise joints with energy-based temporal pyramid for 3D action recognition | 1 | 0.34 | 2017 |
Spontaneous versus posed smile recognition via region-specific texture descriptor and geometric facial dynamics | 2 | 0.37 | 2017 |
3D action recognition using multi-temporal skeleton visualization | 0 | 0.34 | 2017 |
Enhanced skeleton visualization for view invariant human action recognition | 88 | 2.07 | 2017 |
Online growing neural gas for anomaly detection in changing surveillance scenes | 17 | 0.59 | 2017 |
A new descriptor of gradients Self-Similarity for smile detection in unconstrained scenarios. | 7 | 0.47 | 2016 |
Sequential Bag-of-Words model for human action classification | 2 | 0.39 | 2016 |
Salient pairwise spatio-temporal interest points for real-time activity recognition | 2 | 0.36 | 2016 |
Linear canonical correlation analysis based ranking approach for facial age estimation | 0 | 0.34 | 2016 |
Energy-Based Global Ternary Image for Action Recognition Using Sole Depth Sequences | 0 | 0.34 | 2016 |
A Novel Lip Descriptor for Audio-Visual Keyword Spotting Based on Adaptive Decision Fusion | 7 | 0.48 | 2016 |
3D action recognition using multi-temporal depth motion maps and fisher vector | 16 | 0.57 | 2016 |
Scene-adaptive hierarchical data association and depth-invariant part-based appearance model for indoor multiple objects tracking | 1 | 0.35 | 2016 |
Saliency detection via global-object-seed-guided cellular automata | 0 | 0.34 | 2016 |
Binaural cues estimates based on Interaural Matching Filter for sound source localization | 0 | 0.34 | 2015 |
Exploring spatial correlation for visual object retrieval | 3 | 0.38 | 2015 |
An image forensic technique based on 2D lighting estimation using spherical harmonic frames | 0 | 0.34 | 2015 |
Binaural sound source localization based on generalized parametric model and two-layer matching strategy in complex environments | 1 | 0.35 | 2015 |
Body-Structure Based Feature Representation For Person Re-Identification | 0 | 0.34 | 2015 |
Two-level multi-task metric learning with application to multi-classification | 0 | 0.34 | 2015 |