CROWDLAB: Supervised learning to infer consensus labels and quality scores for data with multiple annotators

Goh, Hui Wen; Tkachenko, Ulyana; Mueller, Jonas

Computer Science > Machine Learning

arXiv:2210.06812 (cs)

[Submitted on 13 Oct 2022 (v1), last revised 27 Jan 2023 (this version, v2)]

Title:CROWDLAB: Supervised learning to infer consensus labels and quality scores for data with multiple annotators

Authors:Hui Wen Goh, Ulyana Tkachenko, Jonas Mueller

View PDF

Abstract:Real-world data for classification is often labeled by multiple annotators. For analyzing such data, we introduce CROWDLAB, a straightforward approach to utilize any trained classifier to estimate: (1) A consensus label for each example that aggregates the available annotations; (2) A confidence score for how likely each consensus label is correct; (3) A rating for each annotator quantifying the overall correctness of their labels. Existing algorithms to estimate related quantities in crowdsourcing often rely on sophisticated generative models with iterative inference. CROWDLAB instead uses a straightforward weighted ensemble. Existing algorithms often rely solely on annotator statistics, ignoring the features of the examples from which the annotations derive. CROWDLAB utilizes any classifier model trained on these features, and can thus better generalize between examples with similar features. On real-world multi-annotator image data, our proposed method provides superior estimates for (1)-(3) than existing algorithms like Dawid-Skene/GLAD.

Subjects:	Machine Learning (cs.LG); Human-Computer Interaction (cs.HC); Machine Learning (stat.ML)
Cite as:	arXiv:2210.06812 [cs.LG]
	(or arXiv:2210.06812v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2210.06812
Journal reference:	NeurIPS 2022 Human in the Loop Learning Workshop

Submission history

From: Jonas Mueller [view email]
[v1] Thu, 13 Oct 2022 07:54:07 UTC (1,289 KB)
[v2] Fri, 27 Jan 2023 18:53:11 UTC (1,305 KB)

Computer Science > Machine Learning

Title:CROWDLAB: Supervised learning to infer consensus labels and quality scores for data with multiple annotators

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:CROWDLAB: Supervised learning to infer consensus labels and quality scores for data with multiple annotators

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators