Perceptual Error Analysis of Human and Synthesized Voices

dc.citation.issue4]
dc.citation.volume31]
dc.contributor.authorEnglert, Marina [UNIFESP]
dc.contributor.authorMadazio, Glaucya
dc.contributor.authorGielow, Ingrid
dc.contributor.authorLucero, Jorge
dc.contributor.authorBehlau, Mara [UNIFESP]
dc.coverageNew York
dc.date.accessioned2020-06-26T16:30:29Z
dc.date.available2020-06-26T16:30:29Z
dc.date.issued2017
dc.description.abstractObjective/ Hypothesis. To assess the quality of synthesized voices through listeners' skills in discriminating human and synthesized voices. Study Design. Prospective study. Methods. Eighteen human voices with different types and degrees of deviation (roughness, breathiness, and strain, with three degrees of deviation: mild, moderate, and severe) were selected by three voice specialists. Synthesized samples with the same deviations of human voices were produced by the VoiceSim system. The manipulated parameters were vocal frequency perturbation (roughness), additive noise (breathiness), increasing tension, subglottal pressure, and decreasing vocal folds separation (strain). Two hundred sixty-nine listeners were divided in three groups: voice specialist speech language pathologists (V-SLPs), general clinician SLPs (G-SLPs), and naive listeners (NLs). The SLP listeners also indicated the type and degree of deviation. Results. The listeners misclassified 39.3% of the voices, both synthesized (42.3%) and human (36.4%) samples (P = 0.001). V-SLPs presented the lowest error percentage considering the voice nature (34.6%)en
dc.description.abstractG-SLPs and NLs identified almost half of the synthesized samples as human (46.9%, 45.6%). The male voices were more susceptible for misidentification. The synthesized breathy samples generated a greater perceptual confusion. The samples with severe deviation seemed to be more susceptible for errors. The synthesized female deviations were correctly classified. The male breathiness and strain were identified as roughness. Conclusion. VoiceSim produced stimuli very similar to the voices of patients with dysphonia. V-SLPs had a better ability to classify human and synthesized voices. VoiceSim is better to simulate vocal breathiness and female deviationsen
dc.description.abstractthe male samples need adjustment.en
dc.description.affiliationUniv Fed Sao Paulo, Sao Paulo, Brazil
dc.description.affiliationCEV, R Machado Bittencourt 361-1001, BR-04044001 Sao Paulo, SP, Brazil
dc.description.affiliationUniv Brasilia, Brasilia, DF, Brazil
dc.description.affiliationUnifespUniv Fed Sao Paulo, Sao Paulo, Brazil
dc.description.sourceWeb of Science
dc.format.extent-
dc.identifierhttp://dx.doi.org/10.1016/j.jvoice.2016.12.015]
dc.identifier.citationJournal Of Voice. New York, v. 31, n. 4, p. -, 2017.
dc.identifier.doi10.1016/j.jvoice.2016.12.015
dc.identifier.issn0892-1997
dc.identifier.urihttps://repositorio.unifesp.br/handle/11600/53569
dc.identifier.wosWOS:000406147000054
dc.language.isoeng
dc.publisherMosby-Elsevier
dc.relation.ispartofJournal Of Voice
dc.rightsinfo:eu-repo/semantics/restrictedAccess
dc.subjectVoiceen
dc.subjectVoice disordersen
dc.subjectAuditory perceptionen
dc.subjectSpeech acousticen
dc.subjectJudgmenten
dc.titlePerceptual Error Analysis of Human and Synthesized Voicesen
dc.typeinfo:eu-repo/semantics/article
Arquivos
Coleções