The FiCa speech dataset is a private dataset consisting of 92 minutes of audio from a single female speaker. This dataset was originally created in order to train a TTS system capable of synthesizing short feedback responses such as "mhm", "oh", "wow". This work was published at SigDial 2024.
Access to the feedback imitations and conversational feedback responses can be requested. Please contact the first author Carol Figueroa
Feedback imitations | Conversational feedback responses | |
---|---|---|
@inproceedings{figueroa2024mhm, title={Mhm... Yeah? Okay! Evaluating the Naturalness and Communicative Function of Synthesized Feedback Responses in Spoken Dialogue}, author={Figueroa, Carol and de Korte, Marcel and Ochs, Magalie and Skantze, Gabriel}, booktitle={Proceedings of the 25th Annual Meeting of the Special Interest Group on Discourse and Dialogue}, pages={544--553}, year={2024} }