This paper describes the ODI-ANLP team's submission to WAT 2020. We participated in the English→Hindi Multimodal task and the Indic task. We used the state-of-the-art Transformer model for the translation task and InceptionResNetV2 for the Hindi ...
In this paper, we describe the participation of the Idiap Research Institute at GermEval 2020 shared task on the Classification and Regression of Cognitive and Motivational style from Text, specifically on subtask 2, Classification of the Operant Motive Te ...
Advances in Automatic Speech Recognition (ASR) over the last decade have opened new areas of speech-based automation, such as in Air-Traffic Control (ATC) environments. Currently, voice communication and Controller Pilot Data Link Communications are the only way ...
This paper presents the MuMMER data set, a data set for human-robot interaction scenarios that is available for research purposes. It comprises 1 h 29 min of multimodal recordings of people interacting with the social robot Pepper in entertainment scenario ...
In i-vector based speaker recognition systems, back-end classifiers are trained to factor out nuisance information and retain only the speaker identity. As a result, variabilities arising due to gender, language and accent (among many others) are suppress ...
This paper explores novel ideas for building an end-to-end deep neural network (DNN) based text-dependent speaker verification (SV) system. The baseline approach consists of mapping a variable-length speech segment to a fixed-dimensional speaker vector by esti ...
We propose to use neural networks for simultaneous detection and localization of multiple sound sources in human-robot interaction. In contrast to conventional signal processing techniques, neural network-based sound source localization methods require few ...
We propose a novel multi-task neural network-based approach for joint sound source localization and speech/non-speech classification in noisy environments. The network takes the raw short-time Fourier transform as input and outputs the likelihood values for th ...
In Deep Neural Network (DNN) i-vector based speaker recognition systems, acoustic models trained for Automatic Speech Recognition are employed to estimate sufficient statistics for i-vector modeling. The DNN based acoustic model is typically trained on a w ...
In the last decade, i-vector and Joint Factor Analysis (JFA) approaches to speaker modeling have become ubiquitous in the area of automatic speaker recognition. Both of these techniques involve the computation of posterior probabilities, using either Gauss ...
Model-based approaches to Speaker Verification (SV), such as Joint Factor Analysis (JFA), i-vector and relevance Maximum-a-Posteriori (MAP), have been shown to provide state-of-the-art performance for text-dependent systems with fixed phrases. The performance o ...
Multi-session training conditions are becoming increasingly common in recent benchmark datasets for both text-independent and text-dependent speaker verification. In the state-of-the-art i-vector framework for speaker verification, such conditions are addr ...
This paper describes SIIP (Speaker Identification Integrated Project), a high-performance, innovative and sustainable Speaker Identification (SID) solution running over a large voice sample database. The solution is based on the development, integration and fusi ...
Acoustic modeling based on deep architectures has recently gained remarkable success, with substantial improvement of speech recognition accuracy in several automatic speech recognition (ASR) tasks. For distant speech recognition, the multi-channel deep ne ...
The aim of the domain-adaptation task for speaker verification is to exploit unlabelled target domain data by using the labelled source domain data effectively. The i-vector based Probabilistic Linear Discriminant Analysis (PLDA) framework approaches thi ...
The i-vector and Joint Factor Analysis (JFA) systems for text-dependent speaker verification use sufficient statistics computed from a speech utterance to estimate speaker models. These statistics average the acoustic information over the utterance ther ...
Performing speaker diarization while uniquely identifying the speakers in a collection of audio recordings is a challenging task. Based on our previous work on speaker diarization and linking, we developed a system for diarizing longitudinal TV show data s ...