Towards Building A Group-based Unsupervised Representation Disentanglement Framework | 0 | 0.34 | 2022 |
Sparse MLP for Image Recognition: Is Self-Attention Really Necessary? | 0 | 0.34 | 2022 |
Robust Multi-object Tracking by Marginal Inference. | 0 | 0.34 | 2022 |
FPCR-Net: Feature pyramidal correlation and residual reconstruction for optical flow estimation | 0 | 0.34 | 2022 |
Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph. | 0 | 0.34 | 2022 |
Lifelong Unsupervised Domain Adaptive Person Re-identification with Coordinated Anti-forgetting and Adaptation | 0 | 0.34 | 2022 |
Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration. | 0 | 0.34 | 2021 |
Very Important Person Localization In Unconstrained Conditions: A New Benchmark | 0 | 0.34 | 2021 |
Uncertainty-Aware Multi-Shot Knowledge Distillation For Image-Based Object Re-Identification | 1 | 0.35 | 2020 |
Joint Time-Frequency and Time Domain Learning for Speech Enhancement | 0 | 0.34 | 2020 |
Style Normalization And Restitution For Generalizable Person Re-Identification | 8 | 0.45 | 2020 |
Multi-Scale Group Transformer for Long Sequence Modeling in Speech Separation | 0 | 0.34 | 2020 |
Fusing Wearable IMUs With Multi-View Images For Human Pose Estimation: A Geometric Approach | 0 | 0.34 | 2020 |
Spatiotemporal Fusion in 3D CNNs: A Probabilistic View | 0 | 0.34 | 2020 |
EleAtt-RNN: Adding Attentiveness to Neurons in Recurrent Neural Networks. | 4 | 0.39 | 2020 |
Temporal-Spatial Mapping for Action Recognition | 4 | 0.43 | 2020 |
Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition. | 0 | 0.34 | 2019 |
Multi-Modality Multi-Task Recurrent Neural Network for Online Action Detection | 3 | 0.37 | 2019 |
Predicting Future Instance Segmentation with Contextual Pyramid ConvLSTMs | 3 | 0.36 | 2019 |
High-Speed Hyperspectral Video Acquisition By Combining Nyquist and Compressive Sampling. | 4 | 0.39 | 2019 |
Quality-Gated Convolutional LSTM for Enhancing Compressed Video | 5 | 0.42 | 2019 |
Relation-Aware Global Attention. | 0 | 0.34 | 2019 |
Skeleton-Based Action Recognition with Gated Convolutional Neural Networks | 4 | 0.39 | 2019 |
Superimposed Modulation for Soft Video Delivery With Hidden Resources. | 3 | 0.42 | 2018 |
Hybrid Digital-Analog Video Delivery With Shannon-Kotel'nikov Mapping. | 0 | 0.34 | 2018 |
MiCT: Mixed 3D/2D Convolutional Tube for Human Action Recognition | 12 | 0.54 | 2018 |
Skeleton-Indexed Deep Multi-Modal Feature Learning for High Performance Human Action Recognition | 2 | 0.35 | 2018 |
Photo Stylistic Brush: Robust Style Transfer via Superpixel-Based Bipartite Graph. | 2 | 0.36 | 2018 |
Variable Block-Sized Signal-Dependent Transform for Video Coding. | 5 | 0.41 | 2018 |
An End-to-End Spatio-Temporal Attention Model for Human Action Recognition from Skeleton Data. | 20 | 0.57 | 2017 |
RESIDE: A Benchmark for Single Image Dehazing. | 4 | 0.38 | 2017 |
Dropout Prediction in Home Care Training. | 0 | 0.34 | 2017 |
Outlier-Robust Greedy Pursuit Algorithms in ℓp-Space for Sparse Approximation. | 0 | 0.34 | 2016 |
Deeply-Fused Nets. | 9 | 0.49 | 2016 |
On the Connection of Deep Fusion to Ensembling. | 0 | 0.34 | 2016 |
Lossless ROI Privacy Protection of H.264/AVC Compressed Surveillance Videos. | 2 | 0.39 | 2016 |
Progressive Pseudo-Analog Transmission For Mobile Video Live Streaming | 2 | 0.38 | 2015 |
Graph-based video fingerprinting using double optimal projection | 1 | 0.34 | 2015 |
Structural similarity-based video fingerprinting for video copy detection | 2 | 0.37 | 2014 |
Secure and robust image hashing via compressive sensing | 12 | 0.58 | 2014 |
Forging a Close Relationship with Multimedia Communities. | 0 | 0.34 | 2014 |
Context-Adaptive Modeling for Wavelet-Domain Distributed Video Coding | 4 | 0.38 | 2014 |
Towards Cross-Domain Learning for Social Video Popularity Prediction | 51 | 1.21 | 2013 |
Robust sparse channel estimation and equalization in impulsive noise using linear programming | 6 | 0.47 | 2013 |
The Hidden Potential of Movie Genome Communities: Analyzing Fine-Grained Semantic Information in Motion Pictures | 1 | 0.36 | 2013 |
Cognitive canonicalization of natural language queries using semantic strata | 0 | 0.34 | 2013 |
Direction-of-arrival estimation based on spatial-temporal statistics without knowing the source number. | 2 | 0.41 | 2013 |
ℓp-MUSIC: Robust Direction-of-Arrival Estimator for Impulsive Noise Environments. | 0 | 0.34 | 2013 |
Exploring psychophysical factors influencing visibility of virtual image display | 0 | 0.34 | 2013 |
Empowering Cross-Domain Internet Media with Real-Time Topic Learning from Social Streams | 7 | 0.61 | 2012 |