1. IEMOCAP dataset

    3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition

2. CREMA dataset

    CREMA-D: Crowd-Sourced Emotional Multimodal Actors Dataset

    LEAF: A Learnable Frontend for Audio Classification