F Burkhardt, J Wagner, H Wierstorf, F Eyben, BW Schuller, "Nkululeko: A Tool For Rapid Speaker Characteristics Detection," in Proceedings of the Thirteenth Language Resources and Evaluation Conference, (2022). [ paper ]

Bibtex

@inproceedings{Burkhardt2022b,
    title     = {Nkululeko: A Tool For Rapid Speaker Characteristics Detection},
    author    = {Burkhardt, Felix and Wagner, Johannes and Wierstorf, Hagen
                 and Eyben, Florian and Schuller, Bj\"{o}rn W.},
    booktitle = {Proceedings of the Thirteenth Language Resources
                 and Evaluation Conference},
    publisher = {European Language Resources Association},
    address   = {Marseille, France},
    month     = {June},
    pages     = {1925--1932},
    year      = {2022}
}

Abstract

We present advancements with a software tool called Nkululeko, that lets users perform (semi-) supervised machine learning experiments in the speaker characteristics domain. It is based on audformat, a format for speech database metadata description. Due to an interface based on configurable templates, it supports best practise and very fast setup of experiments without the need to be proficient in the underlying language: Python. The paper explains the handling of Nkululeko and presents two typical experiments: comparing the expert acoustic features with artificial neural net embeddings for emotion classification and speaker age regression.