Perceptual Error Identification of Human and Synthesized Voices

Englert, Marina [UNIFESP]; Madazio, Glaucya; Gielow, Ingrid; Lucero, Jorge; Behlau, Mara [UNIFESP]

dc.contributor.author	Englert, Marina [UNIFESP]
dc.contributor.author	Madazio, Glaucya
dc.contributor.author	Gielow, Ingrid
dc.contributor.author	Lucero, Jorge
dc.contributor.author	Behlau, Mara [UNIFESP]
dc.date.accessioned	2019-07-22T15:46:47Z
dc.date.available	2019-07-22T15:46:47Z
dc.date.issued	2016
dc.description.abstract	Objectives/Hypothesis. To verify listeners' ability to discriminate between human and synthesized voice samples. Study Design. This is a prospective study. Methods. A total of 70 subjects, 20 voice specialist speech-language pathologists (V-SLPs), 20 general SLPs (G-SLPs), and 30 naive listeners (NLs), participated in a listening task in which they simply classified each stimulus as human or synthesized. Samples of 36 voices, 18 human and 18 synthesized vowels, male and female (9 each), with different types and degrees of deviation, were presented with 50% repetition to verify intrarater consistency. Human voices were collected from a vocal clinic database. Voice disorders were simulated by perturbations of vocal frequency, jitter (roughness), additive noise (breathiness) and by increasing tension and decreasing separation of the vocal folds (strain). Results. The average amount of error considering all groups was 37.8%: 31.9% for V-SLP, 39.3% for G-SLP, and 40.8% for NL. V-SLP had a smaller mean percentage error for synthesized (24.7%), breathy (36.7%), synthesized breathy (30.8%), tense (25%), and female (27.5%) voices. G-SLP and NL presented equal mean percentage error for all voice classifications. All groups together presented no difference in the mean percentage error between human and synthesized voices (P value = 0.452). Conclusions. The quality of the synthesized samples was very high. V-SLP presented a lower amount of error, which allows us to infer that auditory training assists in vocal analysis tasks.	en
dc.description.affiliation	Univ Fed Sao Paulo, Dept Speech Language Pathol & Audiol, Sao Paulo, Brazil
dc.description.affiliation	Ctr Estudos Voz, Voice Dept, Sao Paulo, Brazil
dc.description.affiliation	[Lucero, Jorge] Univ Brasilia, Brasilia, DF, Brazil
dc.description.affiliationUnifesp	Univ Fed Sao Paulo, Dept Speech Language Pathol & Audiol, Sao Paulo, Brazil
dc.description.source	Web of Science
dc.format.extent	-
dc.identifier	http://dx.doi.org/10.1016/j.jvoice.2015.07.017
dc.identifier.citation	Journal Of Voice. New York, v. 30, n. 5, p. -, 2016.
dc.identifier.doi	10.1016/j.jvoice.2015.07.017
dc.identifier.issn	0892-1997
dc.identifier.uri	http://repositorio.unifesp.br/handle/11600/51092
dc.identifier.wos	WOS:000384010300023
dc.language.iso	eng
dc.publisher	Mosby-Elsevier
dc.rights	info:eu-repo/semantics/restrictedAccess
dc.subject	Voice	en
dc.subject	Dysphonia	en
dc.subject	Auditory perception	en
dc.subject	Evaluation	en
dc.subject	Judgment	en
dc.title	Perceptual Error Identification of Human and Synthesized Voices	en
dc.type	info:eu-repo/semantics/article

Collections

EPM - Artigos