
Towards intelligent crowdsourcing for audio data annotation: Integrating active learning in the real world

  • Universität Passau
  • Technical University of Munich
  • Imperial College London

Research output: Contribution to journal › Conference article › peer-review

24 Scopus citations

Abstract

In this contribution, we combine the advantages of traditional crowdsourcing with contemporary machine learning algorithms, with the aim of obtaining reliable training data for audio processing faster, more cheaply, and therefore more efficiently than has previously been possible. We propose a novel crowdsourcing approach that brings a simulated active learning annotation scenario into a real-world environment, creating an intelligent and gamified crowdsourcing platform for manual audio annotation. Our platform combines two active learning query strategies with an internally calculated trustability score to efficiently reduce manual labelling efforts. This reduction is achieved in a twofold manner: first, our system automatically decides if an instance requires annotation; second, it dynamically decides, depending on the quality of previously gathered annotations, exactly how many annotations are needed to reliably label an instance. The results presented indicate that our approach drastically reduces the annotation load and is considerably more efficient than conventional methods.
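The twofold reduction described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the query strategies or how the trustability score is computed, so everything below is an assumption made for illustration: a plain confidence threshold stands in for the query strategy, each annotator's trust is taken as a given number between 0 and 1, and the names `needs_annotation`, `aggregate_label`, `threshold`, `required_weight` and `max_votes` are hypothetical.

```python
def needs_annotation(model_confidence, threshold=0.8):
    """Step 1 (assumed rule): ask humans only when the classifier is unsure."""
    return model_confidence < threshold


def aggregate_label(votes, required_weight=1.5, max_votes=5):
    """Step 2 (assumed rule): read (label, trust) votes in order and stop as
    soon as one label has accumulated enough trust-weighted support.

    Returns (label, votes_used). The label is None if no label reaches
    required_weight within max_votes votes.
    """
    totals = {}
    used = 0
    for label, trust in votes[:max_votes]:
        used += 1
        totals[label] = totals.get(label, 0.0) + trust
        if totals[label] >= required_weight:
            return label, used
    return None, used


if __name__ == "__main__":
    # A confident prediction is skipped; an uncertain one is sent to the crowd.
    print(needs_annotation(0.95), needs_annotation(0.55))

    # Two highly trusted annotators who agree are enough to settle the label.
    print(aggregate_label([("speech", 0.9), ("speech", 0.8)]))

    # Less trusted or disagreeing annotators lead to more votes being collected.
    print(aggregate_label([("speech", 0.5), ("noise", 0.4),
                           ("speech", 0.5), ("speech", 0.6)]))
```

In this sketch the number of annotations per instance is not fixed in advance: agreement among trusted annotators ends collection early, while disagreement or low trust extends it up to the cap.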

Original language: English
Pages (from-to): 3951-3955
Number of pages: 5
Journal: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Volume: 2017-August
State: Published - 2017
Externally published: Yes
Event: 18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017 - Stockholm, Sweden
Duration: 20 Aug 2017 to 24 Aug 2017

Keywords

  • Active Learning
  • Annotation Reduction
  • Audio Processing
  • Intelligent Crowdsourcing
