Perceptual Error Analysis of Human and Synthesized Voices
dc.citation.issue | 4 | ] |
dc.citation.volume | 31 | ] |
dc.contributor.author | Englert, Marina [UNIFESP] | |
dc.contributor.author | Madazio, Glaucya | |
dc.contributor.author | Gielow, Ingrid | |
dc.contributor.author | Lucero, Jorge | |
dc.contributor.author | Behlau, Mara [UNIFESP] | |
dc.coverage | New York | |
dc.date.accessioned | 2020-06-26T16:30:29Z | |
dc.date.available | 2020-06-26T16:30:29Z | |
dc.date.issued | 2017 | |
dc.description.abstract | Objective/ Hypothesis. To assess the quality of synthesized voices through listeners' skills in discriminating human and synthesized voices. Study Design. Prospective study. Methods. Eighteen human voices with different types and degrees of deviation (roughness, breathiness, and strain, with three degrees of deviation: mild, moderate, and severe) were selected by three voice specialists. Synthesized samples with the same deviations of human voices were produced by the VoiceSim system. The manipulated parameters were vocal frequency perturbation (roughness), additive noise (breathiness), increasing tension, subglottal pressure, and decreasing vocal folds separation (strain). Two hundred sixty-nine listeners were divided in three groups: voice specialist speech language pathologists (V-SLPs), general clinician SLPs (G-SLPs), and naive listeners (NLs). The SLP listeners also indicated the type and degree of deviation. Results. The listeners misclassified 39.3% of the voices, both synthesized (42.3%) and human (36.4%) samples (P = 0.001). V-SLPs presented the lowest error percentage considering the voice nature (34.6%) | en |
dc.description.abstract | G-SLPs and NLs identified almost half of the synthesized samples as human (46.9%, 45.6%). The male voices were more susceptible for misidentification. The synthesized breathy samples generated a greater perceptual confusion. The samples with severe deviation seemed to be more susceptible for errors. The synthesized female deviations were correctly classified. The male breathiness and strain were identified as roughness. Conclusion. VoiceSim produced stimuli very similar to the voices of patients with dysphonia. V-SLPs had a better ability to classify human and synthesized voices. VoiceSim is better to simulate vocal breathiness and female deviations | en |
dc.description.abstract | the male samples need adjustment. | en |
dc.description.affiliation | Univ Fed Sao Paulo, Sao Paulo, Brazil | |
dc.description.affiliation | CEV, R Machado Bittencourt 361-1001, BR-04044001 Sao Paulo, SP, Brazil | |
dc.description.affiliation | Univ Brasilia, Brasilia, DF, Brazil | |
dc.description.affiliationUnifesp | Univ Fed Sao Paulo, Sao Paulo, Brazil | |
dc.description.source | Web of Science | |
dc.format.extent | - | |
dc.identifier | http://dx.doi.org/10.1016/j.jvoice.2016.12.015 | ] |
dc.identifier.citation | Journal Of Voice. New York, v. 31, n. 4, p. -, 2017. | |
dc.identifier.doi | 10.1016/j.jvoice.2016.12.015 | |
dc.identifier.issn | 0892-1997 | |
dc.identifier.uri | https://repositorio.unifesp.br/handle/11600/53569 | |
dc.identifier.wos | WOS:000406147000054 | |
dc.language.iso | eng | |
dc.publisher | Mosby-Elsevier | |
dc.relation.ispartof | Journal Of Voice | |
dc.rights | info:eu-repo/semantics/restrictedAccess | |
dc.subject | Voice | en |
dc.subject | Voice disorders | en |
dc.subject | Auditory perception | en |
dc.subject | Speech acoustic | en |
dc.subject | Judgment | en |
dc.title | Perceptual Error Analysis of Human and Synthesized Voices | en |
dc.type | info:eu-repo/semantics/article |