Automatic Data Augmentation Selection and Parametrization in Contrastive Self-Supervised Speech Representation Learning. | 0 | 0.34 | 2022 |
Pretext Tasks Selection for Multitask Self-Supervised Audio Representation Learning | 0 | 0.34 | 2022 |
Neuro-Steered Music Source Separation With EEG-Based Auditory Attention Decoding And Contrastive-NMF | 0 | 0.34 | 2021 |
Early Detection of User Engagement Breakdown in Spontaneous Human-Humanoid Interaction | 1 | 0.37 | 2021 |
Distributed Speech Separation in Spatially Unconstrained Microphone Arrays | 0 | 0.34 | 2021 |
Conditional Independence for Pretext Task Selection in Self-Supervised Speech Representation Learning. | 2 | 0.36 | 2021 |
Attention-based distributed speech enhancement for unconstrained microphone arrays with varying number of nodes | 0 | 0.34 | 2021 |
Weakly Supervised Representation Learning for Audio-Visual Scene Analysis | 1 | 0.40 | 2020 |
On-the-Fly Detection of User Engagement Decrease in Spontaneous Human–Robot Interaction Using Recurrent and Deep Neural Networks | 1 | 0.36 | 2019 |
A Music Structure Informed Downbeat Tracking System Using Skip-chain Conditional Random Fields and Deep Learning | 1 | 0.39 | 2019 |
SAMBASET - A Dataset of Historical Samba de Enredo Recordings for Computational Music Analysis. | 0 | 0.34 | 2019 |
Identify, Locate and Separate: Audio-Visual Object Extraction in Large Video Collections Using Weak Supervision | 1 | 0.35 | 2019 |
EEG-Based Decoding of Auditory Attention to a Target Instrument in Polyphonic Music | 0 | 0.34 | 2019 |
A multimodal movie review corpus for fine-grained opinion mining. | 0 | 0.34 | 2019 |
Audiovisual Analysis of Music Performances: Overview of an Emerging Field. | 1 | 0.39 | 2019 |
Tracking Beats and Microtiming in Afro-Latin American Music Using Conditional Random Fields and Deep Learning. | 0 | 0.34 | 2019 |
From the Token to the Review: A Hierarchical Multimodal approach to Opinion Mining | 0 | 0.34 | 2019 |
A robust audio classification system for detecting pulmonary edema. | 0 | 0.34 | 2018 |
Weakly Supervised Representation Learning for Unsynchronized Audio-Visual Events. | 0 | 0.34 | 2018 |
Analysis of Common Design Choices in Deep Learning Systems for Downbeat Tracking. | 0 | 0.34 | 2018 |
Main Melody Estimation with Source-Filter NMF and CRNN. | 2 | 0.38 | 2018 |
Multi-task Feature Learning for EEG-based Emotion Recognition Using Group Nonnegative Matrix Factorization. | 0 | 0.34 | 2018 |
Structured Output Learning with Abstention: Application to Accurate Opinion Prediction. | 1 | 0.36 | 2018 |
Opinion Dynamics Modeling For Movie Review Transcripts Classification With Hidden Conditional Random Fields | 0 | 0.34 | 2018 |
Feature Learning With Matrix Factorization Applied to Acoustic Scene Classification. | 11 | 0.72 | 2017 |
UE-HRI: a new dataset for the study of user engagement in spontaneous human-robot interactions. | 0 | 0.34 | 2017 |
Nonnegative Feature Learning Methods for Acoustic Scene Classification. | 0 | 0.34 | 2017 |
EMOEEG: A new multimodal dataset for dynamic EEG-based emotion recognition with audiovisual elicitation. | 0 | 0.34 | 2017 |
Downbeat Detection with Conditional Random Fields and Deep Learned Features. | 0 | 0.34 | 2016 |
Mini-batch stochastic approaches for accelerated multiplicative updates in nonnegative matrix factorisation with beta-divergence | 1 | 0.35 | 2016 |
HOG and subband power distribution image features for acoustic scene classification | 8 | 0.48 | 2015 |
Melody Extraction by Contour Classification. | 6 | 0.59 | 2015 |
TPT-Dance&Actions : un corpus multimodal d'activités humaines. | 0 | 0.34 | 2015 |
A Conditional Random Field system for beat tracking | 0 | 0.34 | 2015 |
Assessment of new spectral features for EEG-based emotion recognition | 2 | 0.40 | 2014 |
Soft Nonnegative Matrix Co-Factorization | 1 | 0.40 | 2014 |
Piecewise constant nonnegative matrix factorization | 1 | 0.37 | 2014 |
Gesture recognition using a NMF-based representation of motion-traces extracted from depth silhouettes | 0 | 0.34 | 2014 |
Soft nonnegative matrix co-factorization with application to multimodal speaker diarization | 9 | 0.52 | 2013 |
A Multimodal Approach to Speaker Diarization on TV Talk-Shows | 8 | 0.52 | 2013 |
Multimodal classification of dance movements using body joint trajectories and step sounds | 6 | 0.49 | 2013 |
Non-Negative Matrix Factorization For Single-Channel EEG Artifact Rejection | 5 | 0.58 | 2013 |
Learning Optimal Features for Polyphonic Audio-to-Score Alignment | 11 | 0.57 | 2013 |
Smooth Nonnegative Matrix Factorization for Unsupervised Audiovisual Document Structuring | 4 | 0.41 | 2013 |
Non-negative Tensor Factorization for single-channel EEG artifact rejection | 1 | 0.35 | 2013 |
Probabilistic dance performance alignment by fusion of multimodal features | 1 | 0.35 | 2013 |
A multi-modal dance corpus for research into interaction between humans in virtual environments | 4 | 0.52 | 2013 |
Exploring new features for music classification | 1 | 0.36 | 2013 |
Fusion of Multimodal Information in Music Content Analysis. | 6 | 0.45 | 2012 |
Decomposing the video editing structure of a talk-show using nonnegative matrix factorization | 1 | 0.36 | 2012 |