Speaker recognitionSpeaker recognition is the identification of a person from characteristics of voices. It is used to answer the question "Who is speaking?" The term voice recognition can refer to speaker recognition or speech recognition. Speaker verification (also called speaker authentication) contrasts with identification, and speaker recognition differs from speaker diarisation (recognizing when the same speaker is speaking).
Speech recognitionSpeech recognition is an interdisciplinary subfield of computer science and computational linguistics that develops methodologies and technologies that enable the recognition and translation of spoken language into text by computers. It is also known as automatic speech recognition (ASR), computer speech recognition or speech to text (STT). It incorporates knowledge and research in the computer science, linguistics and computer engineering fields. The reverse process is speech synthesis.
Vocal pedagogyVocal pedagogy is the study of the art and science of voice instruction. It is used in the teaching of singing and assists in defining what singing is, how singing works, and how proper singing technique is accomplished. Vocal pedagogy covers a broad range of aspects of singing, ranging from the physiological process of vocal production to the artistic aspects of interpretation of songs from different genres or historical eras. Typical areas of study include: Human anatomy and physiology as it relates to the physical process of singing.
Euler methodIn mathematics and computational science, the Euler method (also called the forward Euler method) is a first-order numerical procedure for solving ordinary differential equations (ODEs) with a given initial value. It is the most basic explicit method for numerical integration of ordinary differential equations and is the simplest Runge–Kutta method. The Euler method is named after Leonhard Euler, who first proposed it in his book Institutionum calculi integralis (published 1768–1870).
Speech processingSpeech processing is the study of speech signals and the processing methods of signals. The signals are usually processed in a digital representation, so speech processing can be regarded as a special case of digital signal processing, applied to speech signals. Aspects of speech processing includes the acquisition, manipulation, storage, transfer and output of speech signals. Different speech processing tasks include speech recognition, speech synthesis, speaker diarization, speech enhancement, speaker recognition, etc.
Human voiceThe human voice consists of sound made by a human being using the vocal tract, including talking, singing, laughing, crying, screaming, shouting, humming or yelling. The human voice frequency is specifically a part of human sound production in which the vocal folds (vocal cords) are the primary sound source. (Other sound production mechanisms produced from the same general area of the body involve the production of unvoiced consonants, clicks, whistling and whispering.
Iterative methodIn computational mathematics, an iterative method is a mathematical procedure that uses an initial value to generate a sequence of improving approximate solutions for a class of problems, in which the n-th approximation is derived from the previous ones. A specific implementation with termination criteria for a given iterative method like gradient descent, hill climbing, Newton's method, or quasi-Newton methods like BFGS, is an algorithm of the iterative method.
Place of articulationIn articulatory phonetics, the place of articulation (also point of articulation) of a consonant is a location along the vocal tract where its production occurs. It is a point where a constriction is made between an active and a passive articulator. Active articulators are organs capable of voluntary movement which create the constriction, while passive articulators are so called because they are normally fixed and are the parts with which an active articulator makes contact.
Heun's methodIn mathematics and computational science, Heun's method may refer to the improved or modified Euler's method (that is, the explicit trapezoidal rule), or a similar two-stage Runge–Kutta method. It is named after Karl Heun and is a numerical procedure for solving ordinary differential equations (ODEs) with a given initial value. Both variants can be seen as extensions of the Euler method into two-stage second-order Runge–Kutta methods.
VocologyVocology is the science and practice of vocal habilitation, or vocal training and therapy. Its concerns include the nature of speech and language pathology, the defects of the vocal tract (laryngology), the remediation of speech therapy, and the voice training (voice therapy) and voice pedagogy of song and speech for actors and public speakers. In its broadest sense, vocology is the study of voice, but as a professional discipline it has a narrower focus: the science and practice of voice habilitation, which includes evaluation, diagnosis, and intervention.
PseudonymizationPseudonymization is a data management and de-identification procedure by which personally identifiable information fields within a data record are replaced by one or more artificial identifiers, or pseudonyms. A single pseudonym for each replaced field or collection of replaced fields makes the data record less identifiable while remaining suitable for data analysis and data processing. Pseudonymization (or pseudonymisation, the spelling under European guidelines) is one way to comply with the European Union's new General Data Protection Regulation (GDPR) demands for secure data storage of personal information.
ParalanguageParalanguage, also known as vocalics, is a component of meta-communication that may modify meaning, give nuanced meaning, or convey emotion, by using techniques such as prosody, pitch, volume, intonation, etc. It is sometimes defined as relating to nonphonemic properties only. Paralanguage may be expressed consciously or unconsciously. The study of paralanguage is known as paralinguistics and was invented by George L. Trager in the 1950s, while he was working at the Foreign Service Institute of the U.S.
Jacobi methodIn numerical linear algebra, the Jacobi method (a.k.a. the Jacobi iteration method) is an iterative algorithm for determining the solutions of a strictly diagonally dominant system of linear equations. Each diagonal element is solved for, and an approximate value is plugged in. The process is then iterated until it converges. This algorithm is a stripped-down version of the Jacobi transformation method of matrix diagonalization. The method is named after Carl Gustav Jacob Jacobi.
Respiratory tractThe respiratory tract is the subdivision of the respiratory system involved with the process of respiration in mammals. The respiratory tract is lined with respiratory epithelium as respiratory mucosa. Air is breathed in through the nose to the nasal cavity, where a layer of nasal mucosa acts as a filter and traps pollutants and other harmful substances found in the air. Next, air moves into the pharynx, a passage that contains the intersection between the oesophagus and the larynx.
De-identificationDe-identification is the process used to prevent someone's personal identity from being revealed. For example, data produced during human subject research might be de-identified to preserve the privacy of research participants. Biological data may be de-identified in order to comply with HIPAA regulations that define and stipulate patient privacy laws. When applied to metadata or general data about identification, the process is also known as data anonymization.
PseudonymA pseudonym (ˈsjuːdənɪm; ) or alias (ˈeɪli.əs) is a fictitious name that a person or group assumes for a particular purpose, which differs from their original or true name (orthonym). This also differs from a new name that entirely or legally replaces an individual's own. Many pseudonym holders use pseudonyms because they wish to remain anonymous, but anonymity is difficult to achieve and often fraught with legal issues.
Vocal cord dysfunctionVocal cord dysfunction (VCD) is a pathology affecting the vocal folds (commonly referred to as the vocal cords) characterized by full or partial vocal fold closure causing difficulty and distress during respiration, especially during inhalation. Due to the similarity in symptoms, VCD attack are often mistaken for asthma attacks or laryngospasms. Symptoms of VCD are not always present. Rather, they often occur episodically, often defined as "attacks," where the patient will be symptomatic for a short period of time.
Linguistic discriminationLinguistic discrimination (also called glottophobia, linguicism and languagism) is unfair treatment of people which is based on their use of language and the characteristics of their speech, including their first language, their accent, the perceived size of their vocabulary (whether or not the speaker uses complex and varied words), their modality, and their syntax. For example, an Occitan speaker in France will probably be treated differently from a French speaker.