Passer au contenu principal
Publication

Improving speech embedding using crossmodal transfer learning with audio-visual data