Name
Playground
About
FAQ
GitHub
Playground
Shortest Path Finder
Community Detector
Connected Papers
Author Trending
Claudia Calabrese
U. Hübner
Maria Concetta Palumbo
Ronald M. Summers
Jhonathan Pinzon
Giovanni Venturelli
Chen Ma
Radu Timofte
Kuanrui Yin
Matthew W Segar
Home
/
Author
/
DAVID F. HARWATH
Author Info
Open Visualization
Name
Affiliation
Papers
DAVID F. HARWATH
MIT, Lincoln Lab, 244 Wood St, Lexington, MA 02173 USA
21
Collaborators
Citations
PageRank
40
63
8.34
Referers
Referees
References
119
323
137
Search Limit
100
323
Publications (21 rows)
Collaborators (40 rows)
Referers (100 rows)
Referees (100 rows)
Title
Citations
PageRank
Year
Everything at Once – Multi-modal Fusion Transformer for Video Retrieval
0
0.34
2022
Cascaded Multilingual Audio-Visual Learning from Videos.
0
0.34
2021
Spoken Moments: Learning Joint Audio-Visual Representations from Video Descriptions
0
0.34
2021
Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos.
0
0.34
2021
Pair Expansion for Learning Multilingual Semantic Embeddings Using Disjoint Visually-Grounded Speech Audio Datasets.
0
0.34
2020
Trilingual Semantic Embeddings of Visually Grounded Speech with Self-Attention Mechanisms
0
0.34
2020
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input
2
0.39
2020
Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech
1
0.35
2020
Towards Bilingual Lexicon Discovery From Visually Grounded Speech Audio
1
0.34
2019
Towards Visually Grounded Sub-Word Speech Unit Discovery
1
0.34
2019
Grounding Spoken Words in Unlabeled Video.
1
0.35
2019
Transfer Learning from Audio-Visual Grounding to Speech Recognition
1
0.34
2019
Learning Words By Drawing Images
1
0.34
2019
Learning modality-invariant representations for speech and images
4
0.39
2017
Learning Word-Like Units From Joint Audio-Visual Analysis
14
0.61
2017
Unsupervised Learning of Spoken Language with Visual Context.
4
0.40
2016
Look, listen, and decode: Multimodal speech recognition with images
1
0.34
2016
On the Use of Acoustic Unit Discovery for Language Recognition.
3
0.37
2016
Deep multimodal semantic embeddings for speech and images
16
0.73
2015
Speech recognition without a lexicon - bridging the gap between graphemic and phonetic systems.
2
0.41
2014
Zero Resource Spoken Audio Corpus Analysis
11
0.61
2013
1