Skip to main navigation Skip to search Skip to main content

Convolutional neural networks with data augmentation for classifying speakers' native language

  • Universität Passau
  • Imperial College London

Research output: Contribution to journalConference articlepeer-review

19 Scopus citations

Abstract

We use a feedforward Convolutional Neural Network to classify speakers' native language for the INTERSPEECH 2016 Computational Paralinguistic Challenge Native Language Sub-Challenge, using no specialized features for computational paralinguistics tasks, but only MFCCs with their first and second order deltas. In addition, we augment the training data by replacing the original examples with shorter overlapping samples extracted from them, thus multiplying the number of training examples by almost 40. With the augmented training dataset and enhancements to neural network models such as Batch Normalization, Dropout, and Maxout activation function, we managed to improve upon the challenge baseline by a large margin, both for the development and the test set.

Original languageEnglish
Pages (from-to)2393-2397
Number of pages5
JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Volume08-12-September-2016
DOIs
StatePublished - 2016
Externally publishedYes
Event17th Annual Conference of the International Speech Communication Association, INTERSPEECH 2016 - San Francisco, United States
Duration: 8 Sep 201616 Sep 2016

Keywords

  • Computational paralinguistics
  • Convolutional neural networks
  • Deep learning

Fingerprint

Dive into the research topics of 'Convolutional neural networks with data augmentation for classifying speakers' native language'. Together they form a unique fingerprint.

Cite this