Achieving Reliable Human Assessment of Open-Domain Dialogue Systems | 0 | 0.34 | 2022 |
Memento: A Prototype Lifelog Search Engine for LSC'21 | 0 | 0.34 | 2021 |
Improving Unsupervised Question Answering via Summarization-Informed Question Generation. | 0 | 0.34 | 2021 |
Contrasting Human Opinion of Non-factoid Question Answering with Automatic Evaluation | 0 | 0.34 | 2020 |
Assessing Human-Parity in Machine Translation on the Segment Level. | 0 | 0.34 | 2020 |
Findings of the 2020 Conference on Machine Translation (WMT20). | 0 | 0.34 | 2020 |
Improving Document-Level Sentiment Analysis with User and Product Context | 0 | 0.34 | 2020 |
TRECVID 2020: A comprehensive campaign for evaluating video retrieval tasks across multiple application domains | 0 | 0.34 | 2020 |
Incorporating Context and Knowledge for Better Sentiment Analysis of Narrative Text. | 0 | 0.34 | 2020 |
Exploring the Impact of Training Data Bias on Automatic Generation of Video Captions. | 0 | 0.34 | 2019 |
Results of the WMT19 Metrics Shared Task: Segment-Level and Strong MT Systems Pose Big Challenges. | 0 | 0.34 | 2019 |
The Second Multilingual Surface Realisation Shared Task (SR'19) - Overview and Evaluation Results. | 1 | 0.37 | 2019 |
Proceedings of the 2nd Workshop on Multilingual Surface Realisation, MSR@EMNLP-IJCNLP 2019, Hong Kong, China, November 3, 2019. | 0 | 0.34 | 2019 |
Translating Pro-Drop Languages with Reconstruction Models. | 3 | 0.38 | 2018 |
Results of the WMT18 Metrics Shared Task: Both characters and embeddings achieve good performance. | 0 | 0.34 | 2018 |
TRECVID 2018 - Benchmarking Video Activity Detection, Video Captioning and Matching, Video Storytelling Linking and Video Search. | 0 | 0.34 | 2018 |
Evaluation Of Automatic Video Captioning Using Direct Assessment | 3 | 0.38 | 2017 |
Can machine translation systems be evaluated by the crowd alone | 17 | 0.90 | 2017 |
Results of the WMT16 Metrics Shared Task. | 13 | 0.81 | 2017 |
Blend: a Novel Combined MT Metric Based on Direct Assessment - CASICT-DCU submission to WMT17 Metrics Task. | 0 | 0.34 | 2017 |
Further Investigation into Reference Bias in Monolingual Evaluation of Machine Translation. | 0 | 0.34 | 2017 |
Improving Evaluation of Document-level Machine Translation Quality Estimation. | 0 | 0.34 | 2017 |
Achieving Accurate Conclusions in Evaluation of Automatic Machine Translation Metrics. | 5 | 0.44 | 2016 |
Is all that Glitters in Machine Translation Quality Estimation really Gold? | 0 | 0.34 | 2016 |
MaxSD: A Neural Machine Translation Evaluation Metric Optimized by Maximizing Similarity Distance | 0 | 0.34 | 2016 |
Findings of the 2016 Conference on Machine Translation. | 55 | 2.03 | 2016 |
Accurate Evaluation of Segment-level Machine Translation Metrics. | 24 | 1.22 | 2015 |
Re-evaluating Automatic Summarization with BLEU and 192 Shades of ROUGE | 1 | 0.36 | 2015 |
Improving Evaluation Of Machine Translation Quality Estimation | 10 | 0.69 | 2015 |
Testing for Significance of Increased Correlation with Human Judgment. | 7 | 0.65 | 2014 |
Is Machine Translation Getting Better over Time? | 12 | 0.75 | 2014 |
Randomized Significance Tests in Machine Translation | 5 | 0.64 | 2014 |
Crowd-Sourcing of Human Judgments of Machine Translation Fluency | 0 | 0.34 | 2013 |
Umelb: Cross-lingual Textual Entailment with Word Alignment and String Similarity Features | 1 | 0.36 | 2013 |
Continuous Measurement Scales in Human Evaluation of Machine Translation | 17 | 1.02 | 2013 |
A Dependency-Constrained Hierarchical Model with Moses | 0 | 0.34 | 2013 |
Continuous Measurement Scales in Human Evaluation of Machine Translation. | 0 | 0.34 | 2013 |
Measurement of Progress in Machine Translation | 4 | 0.50 | 2012 |
Deep syntax language models and statistical machine translation | 3 | 0.42 | 2010 |
Sulis: An Open Source Transfer Decoder for Deep Syntactic Statistical Machine Translation | 0 | 0.34 | 2010 |
Gap Between Theory and Practice: Noise Sensitive Word Alignment in Machine Translation | 8 | 0.76 | 2010 |
Factor templates for factored machine translation models. | 2 | 0.37 | 2010 |
An Open Source Rule Induction Tool for Transfer-Based SMT | 3 | 0.43 | 2009 |
Guessing the grammatical function of a non-root f-structure in LFG | 0 | 0.34 | 2009 |
Packed Rules for Automatic Transfer-Rule Induction | 3 | 0.39 | 2008 |