Perceptual Error Analysis of Human and Synthesized Voices

Englert, Marina [UNIFESP]; Madazio, Glaucya; Gielow, Ingrid; Lucero, Jorge; Behlau, Mara [UNIFESP]

Perceptual Error Analysis of Human and Synthesized Voices

dc.citation.issue	4	]
dc.citation.volume	31	]
dc.contributor.author	Englert, Marina [UNIFESP]
dc.contributor.author	Madazio, Glaucya
dc.contributor.author	Gielow, Ingrid
dc.contributor.author	Lucero, Jorge
dc.contributor.author	Behlau, Mara [UNIFESP]
dc.coverage	New York
dc.date.accessioned	2020-06-26T16:30:29Z
dc.date.available	2020-06-26T16:30:29Z
dc.date.issued	2017
dc.description.abstract	Objective/ Hypothesis. To assess the quality of synthesized voices through listeners' skills in discriminating human and synthesized voices. Study Design. Prospective study. Methods. Eighteen human voices with different types and degrees of deviation (roughness, breathiness, and strain, with three degrees of deviation: mild, moderate, and severe) were selected by three voice specialists. Synthesized samples with the same deviations of human voices were produced by the VoiceSim system. The manipulated parameters were vocal frequency perturbation (roughness), additive noise (breathiness), increasing tension, subglottal pressure, and decreasing vocal folds separation (strain). Two hundred sixty-nine listeners were divided in three groups: voice specialist speech language pathologists (V-SLPs), general clinician SLPs (G-SLPs), and naive listeners (NLs). The SLP listeners also indicated the type and degree of deviation. Results. The listeners misclassified 39.3% of the voices, both synthesized (42.3%) and human (36.4%) samples (P = 0.001). V-SLPs presented the lowest error percentage considering the voice nature (34.6%)	en
dc.description.abstract	G-SLPs and NLs identified almost half of the synthesized samples as human (46.9%, 45.6%). The male voices were more susceptible for misidentification. The synthesized breathy samples generated a greater perceptual confusion. The samples with severe deviation seemed to be more susceptible for errors. The synthesized female deviations were correctly classified. The male breathiness and strain were identified as roughness. Conclusion. VoiceSim produced stimuli very similar to the voices of patients with dysphonia. V-SLPs had a better ability to classify human and synthesized voices. VoiceSim is better to simulate vocal breathiness and female deviations	en
dc.description.abstract	the male samples need adjustment.	en
dc.description.affiliation	Univ Fed Sao Paulo, Sao Paulo, Brazil
dc.description.affiliation	CEV, R Machado Bittencourt 361-1001, BR-04044001 Sao Paulo, SP, Brazil
dc.description.affiliation	Univ Brasilia, Brasilia, DF, Brazil
dc.description.affiliationUnifesp	Univ Fed Sao Paulo, Sao Paulo, Brazil
dc.description.source	Web of Science
dc.format.extent	-
dc.identifier	http://dx.doi.org/10.1016/j.jvoice.2016.12.015	]
dc.identifier.citation	Journal Of Voice. New York, v. 31, n. 4, p. -, 2017.
dc.identifier.doi	10.1016/j.jvoice.2016.12.015
dc.identifier.issn	0892-1997
dc.identifier.uri	https://repositorio.unifesp.br/handle/11600/53569
dc.identifier.wos	WOS:000406147000054
dc.language.iso	eng
dc.publisher	Mosby-Elsevier
dc.relation.ispartof	Journal Of Voice
dc.rights	info:eu-repo/semantics/restrictedAccess
dc.subject	Voice	en
dc.subject	Voice disorders	en
dc.subject	Auditory perception	en
dc.subject	Speech acoustic	en
dc.subject	Judgment	en
dc.title	Perceptual Error Analysis of Human and Synthesized Voices	en
dc.type	info:eu-repo/semantics/article

Coleções

EPM - Artigos

Perceptual Error Analysis of Human and Synthesized Voices

Arquivos

Coleções