Crowd_Frame: A Simple and Complete Framework to Deploy Complex Crowdsourcing Tasks Off-the-shelf | 0 | 0.34 | 2022 |
Task design in complex crowdsourcing experiments: Item assignment optimization. | 0 | 0.34 | 2022 |
Preferences on a Budget: Prioritizing Document Pairs when Crowdsourcing Relevance Judgments | 1 | 0.36 | 2022 |
Automatic Assignment of ICD-10 Codes to Diagnostic Texts using Transformers Based Techniques | 0 | 0.34 | 2021 |
The many dimensions of truthfulness: Crowdsourcing misinformation assessments on a multidimensional scale | 1 | 0.40 | 2021 |
DiLBERT: Cheap Embeddings for Disease Related Medical NLP | 0 | 0.34 | 2021 |
E-BART - Jointly Predicting and Explaining Truthfulness. | 0 | 0.34 | 2021 |
The Impact of Task Abandonment in Crowdsourcing | 1 | 0.35 | 2021 |
Human-in-the-Loop Systems for Truthfulness - A Study of Human and Machine Confidence. | 0 | 0.34 | 2021 |
On the effect of relevance scales in crowdsourcing relevance assessments for Information Retrieval evaluation | 1 | 0.36 | 2021 |
Cheap IR evaluation: fewer topics, no relevance judgements, and crowdsourced assessments | 0 | 0.34 | 2020 |
Crowd Worker Strategies in Relevance Judgment Tasks. | 3 | 0.37 | 2020 |
Fewer topics? A million topics? Both?! On topics subsets in test collections. | 0 | 0.34 | 2020 |
Effectiveness evaluation without human relevance judgments: A systematic analysis of existing methods and of their combinations | 1 | 0.35 | 2020 |
Underlying Cause of Death Identification from Death Certificates via Categorical Embeddings and Convolutional Neural Networks | 0 | 0.34 | 2020 |
Leveraging Behavioral Heterogeneity Across Markets for Cross-Market Training of Recommender Systems | 3 | 0.38 | 2020 |
Can The Crowd Identify Misinformation Objectively? The Effects of Judgment Scale and Assessor's Background | 0 | 0.34 | 2020 |
Twitter goes to the Doctor - Detecting Medical Tweets using Machine Learning and BERT. | 0 | 0.34 | 2020 |
Detection of HER2 from Haematoxylin-Eosin Slides Through a Cascade of Deep Learning Classifiers via Multi-Instance Learning. | 1 | 0.36 | 2020 |
The COVID-19 Infodemic: Can the Crowd Judge Recent Misinformation Objectively? | 1 | 0.35 | 2020 |
On Transforming Relevance Scales | 1 | 0.35 | 2019 |
HITS Hits Readersourcing - Validating Peer Review Alternatives Using Network Analysis. | 0 | 0.34 | 2019 |
All Those Wasted Hours - On Task Abandonment in Crowdsourcing. | 1 | 0.35 | 2019 |
Bias and Fairness in Effectiveness Evaluation by Means of Network Analysis and Mixture Models. | 0 | 0.34 | 2019 |
On Topic Difficulty in IR Evaluation: The Effect of Systems, Corpora, and System Components | 2 | 0.37 | 2019 |
Towards Stochastic Simulations of Relevance Profiles | 0 | 0.34 | 2019 |
Query Performance Prediction and Effectiveness Evaluation Without Relevance Judgments: Two Sides of the Same Coin. | 2 | 0.38 | 2018 |
IRevalOO: An Object Oriented Framework for Retrieval Evaluation. | 0 | 0.34 | 2018 |
CHEERS: CHeap & Engineered Evaluation of Retrieval Systems. | 0 | 0.34 | 2018 |
How Many Truth Levels? Six? One Hundred? Even More? Validating Truthfulness of Statements via Crowdsourcing. | 0 | 0.34 | 2018 |
Effectiveness Evaluation with a Subset of Topics: A Practical Approach. | 0 | 0.34 | 2018 |
On Fine-Grained Relevance Scales. | 4 | 0.40 | 2018 |
Reproduce. Generalize. Extend. On Information Retrieval Evaluation without Relevance Judgments | 1 | 0.35 | 2018 |
Reproduce and Improve: An Evolutionary Approach to Select a Few Good Topics for Information Retrieval Evaluation | 0 | 0.34 | 2018 |
Considering Assessor Agreement In Ir Evaluation | 4 | 0.47 | 2017 |
Let's Agree to Disagree: Fixing Agreement Measures for Crowdsourcing. | 0 | 0.34 | 2017 |
Economic Evaluation of Recommender Systems: A Proposal. | 0 | 0.34 | 2017 |
Do Easy Topics Predict Effectiveness Better Than Difficult Topics? | 2 | 0.36 | 2017 |
Improving the Efficiency of Retrieval Effectiveness Evaluation: Finding a Few Good Topics with Clustering? | 0 | 0.34 | 2016 |