SNRi Target Training for Joint Speech Enhancement and Recognition | 0 | 0.34 | 2022 |
Deep Griffin–Lim Iteration: Trainable Iterative Phase Reconstruction Using Neural Network | 2 | 0.40 | 2021 |
Noisy-target Training: A Training Strategy for DNN-based Speech Enhancement without Clean Speech | 3 | 0.39 | 2021 |
Sampling-Frequency-Independent Audio Source Separation Using Convolution Layer Based on Impulse Invariant Method | 0 | 0.34 | 2021 |
Description and Discussion on DCASE 2021 Challenge Task 2 - Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Under Domain Shifted Conditions | 0 | 0.34 | 2021 |
DF-Conformer: Integrated Architecture of Conv-Tasnet and Conformer Using Linear Complexity Self-Attention for Speech Enhancement | 1 | 0.36 | 2021 |
A Transformer-based Audio Captioning Model with Keyword Estimation | 1 | 0.37 | 2020 |
Effects of Word-Frequency Based Pre- and Post- Processings for Audio Captioning | 0 | 0.34 | 2020 |
Crossmodal Sound Retrieval Based on Specific Target Co-Occurrence Denoted with Weak Labels | 0 | 0.34 | 2020 |
Description and Discussion on DCASE2020 Challenge Task2 - Unsupervised Anomalous Sound Detection for Machine Condition Monitoring | 0 | 0.34 | 2020 |
Speech Enhancement Using Self-Adaptation and Multi-Head Self-Attention | 4 | 0.42 | 2020 |
Listen to What You Want: Neural Network-based Universal Sound Selector | 1 | 0.37 | 2020 |
Invertible DNN-based nonlinear time-frequency transform for speech enhancement | 0 | 0.34 | 2020 |
SPIDERnet: Attention Network For One-Shot Anomaly Detection In Sounds | 0 | 0.34 | 2020 |
Batch Uniformization for Minimizing Maximum Anomaly Score of DNN-Based Anomaly Detection in Sounds | 2 | 0.43 | 2019 |
First Order Ambisonics Domain Spatial Augmentation for DNN-based Direction of Arrival Estimation | 2 | 0.42 | 2019 |
Context-Aware Neural Voice Activity Detection Using Auxiliary Networks For Phoneme Recognition, Speech Enhancement And Acoustic Scene Classification | 0 | 0.34 | 2019 |
Data-Driven Design Of Perfect Reconstruction Filterbank For DNN-Based Sound Source Enhancement | 0 | 0.34 | 2019 |
A Two-Class Hyper-Spherical Autoencoder For Supervised Anomaly Detection | 1 | 0.35 | 2019 |
Finding Low-Dimensional Dynamical Structure Through Variational Auto-Encoding Dynamic Mode Decomposition | 0 | 0.34 | 2019 |
Deep Griffin–Lim Iteration | 1 | 0.35 | 2019 |
Sniper: Few-Shot Learning For Anomaly Detection To Minimize False-Negative Rate With Ensured True-Positive Rate | 0 | 0.34 | 2019 |
Unsupervised Detection of Anomalous Sound based on Deep Learning and the Neyman-Pearson Lemma | 12 | 0.83 | 2019 |
ToyADMOS: A Dataset of Miniature-Machine Operating Sounds for Anomalous Sound Detection | 2 | 0.43 | 2019 |
Distant Noise Reduction Based on Multi-delay Noise Model Using Distributed Microphone Array | 0 | 0.34 | 2018 |
Trainable Adaptive Window Switching For Speech Enhancement | 0 | 0.34 | 2018 |
Adaflow: Domain-Adaptive Density Estimator With Application To Anomaly Detection And Unpaired Cross-Domain Translation | 1 | 0.35 | 2018 |
DNN-Based Source Enhancement to Increase Objective Sound Quality Assessment Score | 7 | 0.52 | 2018 |
Optimizing acoustic feature extractor for anomalous sound detection based on Neyman-Pearson lemma | 1 | 0.36 | 2017 |
Informative Acoustic Feature Selection to Maximize Mutual Information for Collecting Target Sources | 0 | 0.34 | 2017 |
Intra-note segmentation via sticky HMM with DP emission | 0 | 0.34 | 2014 |