Skip to main content
Publication

Improving speech embedding using crossmodal transfer learning with audio-visual data