acoustic voice analysis: Topics by WorldWideScience.org

Sample records for acoustic voice analysis

Diagnostic value of voice acoustic analysis in assessment of occupational voice pathologies in teachers.

Science.gov (United States)

Niebudek-Bogusz, Ewa; Fiszer, Marta; Kotylo, Piotr; Sliwinska-Kowalska, Mariola

2006-01-01

It has been shown that teachers are at risk of developing occupational dysphonia, which accounts for over 25% of all occupational diseases diagnosed in Poland. The most frequently used method of diagnosing voice diseases is videostroboscopy. However, to facilitate objective evaluation of voice efficiency as well as medical certification of occupational voice disorders, it is crucial to implement quantitative methods of voice assessment, particularly voice acoustic analysis. The aim of the study was to assess the results of acoustic analysis in 66 female teachers (aged 40-64 years), including 35 subjects with occupational voice pathologies (e.g., vocal nodules) and 31 subjects with functional dysphonia. The acoustic analysis was performed using the IRIS software, before and after a 30-minute vocal loading test. All participants were subjected also to laryngological and videostroboscopic examinations. After the vocal effort, the acoustic parameters displayed statistically significant abnormalities, mostly lowered fundamental frequency (Fo) and incorrect values of shimmer and noise to harmonic ratio. To conclude, quantitative voice acoustic analysis using the IRIS software seems to be an effective complement to voice examinations, which is particularly helpful in diagnosing occupational dysphonia.
Acoustic Analysis of Voice in Singers: A Systematic Review

Science.gov (United States)

Gunjawate, Dhanshree R.; Ravi, Rohit; Bellur, Rajashekhar

2018-01-01

Purpose: Singers are vocal athletes having specific demands from their voice and require special consideration during voice evaluation. Presently, there is a lack of standards for acoustic evaluation in them. The aim of the present study was to systematically review the available literature on the acoustic analysis of voice in singers. Method: A…
Acoustic analysis with vocal loading test in occupational voice disorders: outcomes before and after voice therapy.

Science.gov (United States)

Niebudek-Bogusz, Ewa; Kotyło, Piotr; Politański, Piotr; Sliwińska-Kowalska, Mariola

2008-01-01

To assess the usefulness of acoustic analysis with vocal loading test for evaluating the treatment outcomes in occupational voice disorders. Fifty-one female teachers with dysphonia were examined (Voice Handicap Index--VHI, laryngovideostroboscopy and acoustic analysis with vocal loading) before and after treatment. The outcomes of teachers receiving vocal training (group I) were referred to outcomes of group II receiving only voice hygiene instructions. The results of subjective assessment (VHI score) and objective evaluation (acoustic analysis) improved more significantly in group I than in group II. The post-treatment examination revealed a decreased percentage of subjects with deteriorated jitter parameters after vocal loading, particularly in group I. Acoustic analysis with vocal loading test can be a helpful tool in the diagnosis and evaluation of treatment efficacy in occupational dysphonia.
[Assessment of voice acoustic parameters in female teachers with diagnosed occupational voice disorders].

Science.gov (United States)

Niebudek-Bogusz, Ewa; Fiszer, Marta; Sliwińska-Kowalska, Mariola

2005-01-01

Laryngovideostroboscopy is the method most frequently used in the assessment of voice disorders. However, the employment of quantitative methods, such as voice acoustic analysis, is essential for evaluating the effectiveness of prophylactic and therapeutic activities as well as for objective medical certification of larynx pathologies. The aim of this study was to examine voice acoustic parameters in female teachers with occupational voice diseases. Acoustic analysis (IRIS software) was performed in 66 female teachers, including 35 teachers with occupational voice diseases and 31 with functional dysphonia. The teachers with occupational voice diseases presented the lower average fundamental frequency (193 Hz) compared to the group with functional dysphonia (209 Hz) and to the normative value (236 Hz), whereas other acoustic parameters did not differ significantly in both groups. Voice acoustic analysis, when applied separately from vocal loading, cannot be used as a testing method to verify the diagnosis of occupational voice disorders.
Acoustic analysis of voice in children with cleft palate and velopharyngeal insufficiency.

Science.gov (United States)

Villafuerte-Gonzalez, Rocio; Valadez-Jimenez, Victor M; Hernandez-Lopez, Xochiquetzal; Ysunza, Pablo Antonio

2015-07-01

Acoustic analysis of voice can provide instrumental data concerning vocal abnormalities. These findings can be used for monitoring clinical course in cases of voice disorders. Cleft palate severely affects the structure of the vocal tract. Hence, voice quality can also be also affected. To study whether the main acoustic parameters of voice, including fundamental frequency, shimmer and jitter are significantly different in patients with a repaired cleft palate, as compared with normal children without speech, language and voice disorders. Fourteen patients with repaired unilateral cleft lip and palate and persistent or residual velopharyngeal insufficiency (VPI) were studied. A control group was assembled with healthy volunteer subjects matched by age and gender. Hypernasality and nasal emission were perceptually assessed in patients with VPI. Size of the gap as assessed by videonasopharyngoscopy was classified in patients with VPI. Acoustic analysis of voice including Fundamental frequency (F0), shimmer and jitter were compared between patients with VPI and control subjects. F0 was significantly higher in male patients as compared with male controls. Shimmer was significantly higher in patients with VPI regardless of gender. Moreover, patients with moderate VPI showed a significantly higher shimmer perturbation, regardless of gender. Although future research regarding voice disorders in patients with VPI is needed, at the present time it seems reasonable to include strategies for voice therapy in the speech and language pathology intervention plan for patients with VPI. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Associations between the Transsexual Voice Questionnaire (TVQMtF ) and self-report of voice femininity and acoustic voice measures.

Science.gov (United States)

Dacakis, Georgia; Oates, Jennifer; Douglas, Jacinta

2017-11-01

The Transsexual Voice Questionnaire (TVQ MtF ) was designed to capture the voice-related perceptions of individuals whose gender identity as female is the opposite of their birth-assigned gender (MtF women). Evaluation of the psychometric properties of the TVQ MtF is ongoing. To investigate associations between TVQ MtF scores and (1) self-perceptions of voice femininity and (2) acoustic parameters of voice pitch and voice quality in order to evaluate further the validity of the TVQ MtF . A strong correlation between TVQ MtF scores and self-ratings of voice femininity was predicted, but no association between TVQ MtF scores and acoustic measures of voice pitch and quality was proposed. Participants were 148 MtF women (mean age 48.14 years) recruited from the La Trobe Communication Clinic and the clinics of three doctors specializing in transgender health. All participants completed the TVQ MtF and 34 of these participants also provided a voice sample for acoustic analysis. Pearson product-moment correlation analysis was conducted to examine the associations between TVQ MtF scores and (1) self-perceptions of voice femininity and (2) acoustic measures of F0, jitter (%), shimmer (dB) and harmonic-to-noise ratio (HNR). Strong negative correlations between the participants' perceptions of their voice femininity and the TVQ MtF scores demonstrated that for this group of MtF women a low self-rating of voice femininity was associated with more frequent negative voice-related experiences. This association was strongest with the vocal-functioning component of the TVQ MtF . These strong correlations and high levels of shared variance between the TVQ MtF and a measure of a related construct provides evidence for the convergent validity of the TVQ MtF . The absence of significant correlations between the TVQ MtF and the acoustic data is consistent with the equivocal findings of earlier research. This finding indicates that these two measures assess different aspects of the voice
Acoustic analysis after radiotherapy in T1 vocal cord carcinoma: a new approach to the analysis of voice quality

International Nuclear Information System (INIS)

Rovirosa, Angeles; Martinez-Celdran, Eugenio; Ortega, Alicia; Ascaso, Carlos; Abellana, Rosa; Velasco, Mercedes; Bonet, Montserrat; Herrera, Carmen; Casas, Francesc; Francisco, Rosa Maria; Arenas, Meritxell; Hernandez, Victor; Sanchez-Reyes, Alberto; Leon, Concha; Traserra, Jordi; Biete, Albert

2000-01-01

Purpose: The study of acoustic voice parameters (fundamental frequency, jitter, shimmer, and harmonics-to-noise ratio) in extended vowel production, oral reading of a standard paragraph, spontaneous speech and a song in irradiated patients for Tis-T1 vocal cord carcinoma. Methods and Materials: Eighteen male patients irradiated for Tis-T1 vocal cord carcinoma and a control group of 31 nonirradiated subjects of the same age were included in a study of acoustic voice analysis. The control group had been rigorously selected for voice quality and the irradiated group had previous history of smoking in two-thirds of the cases and a vocal cord biopsy. Radiotherapy patients were treated with a 6MV Linac receiving a total dose of 66 Gy, 2 Gy/day, with median treatment areas of 28 cm 2 . Acoustic voice analysis was performed 1 year after radiotherapy, the voice of patients in extended vowel production, oral reading of a standard paragraph, spontaneous speech, and in a song was tape registered and analyzed by a Kay Elemetric's Computerized Speech Lab (model CSL no. 4300). Fundamental frequency, jitter, shimmer, and harmonics-to-noise ratio were obtained in each case. Mann Whitney analysis was used for statistical tests. Results: The irradiated group presented higher values of fundamental frequency, jitter, shimmer, and harmonics-to-noise ratio. Mann-Whitney analysis showed significant differences for fundamental frequency and jitter in vowel production, oral reading, spontaneous speech, and song. Shimmer only showed differences in vowel production and harmonics-to-noise ratio in oral reading and song. Conclusions: In our study only fundamental frequency and jitter showed significant increased values to the control group in all the acoustic situations. Sustained vowel production showed the worst values of the acoustic parameters in comparison with the other acoustic situations. This study seems to suggest that more work should be done in this field
Correlations between Sportsmen’s Morpho-Functional Measurements and Voice Acoustic Variables

Directory of Open Access Journals (Sweden)

Rexhepi Agron M.

2016-12-01

Full Text Available Purpose. Since human voice characteristics are specific to each individual, numerous anthropological studies have been oriented to find significant relationships between voice and morpho-functional features. The goal of this study was to identify the correlation between seven morpho-functional variables and six voice acoustic parameters in sportsmen. Methods. Following the protocols of the International Biological Program, seven morpho-functional variables and six voice acoustic parameters have been measured in 88 male professional athletes from Kosovo, aged 17-35 years, during the period of April-October 2013. The statistical analysis was accomplished through the SPSS program, version 20. The obtained data were analysed through descriptive parameters and with Spearman’s method of correlation analysis. Results. Spearman’s method of correlation showed significant negative correlations (R = -0.215 to -0.613; p = 0.05 between three voice acoustic variables of the fundamental frequency of the voice sample (Mean, Minimum, and Maximum Pitch and six morpho-functional measures (Body Height, Body Weight, Margaria-Kalamen Power Test, Sargent Jump Test, Pull-up Test, and VO2max.abs. Conclusions. The significant correlations imply that the people with higher stature have longer vocal cords and a lower voice. These results encourage investigations on predicting sportsmen’s functional abilities on the basis of their voice acoustic parameters.
Comparison of acoustic voice characteristics in smoking and nonsmoking teachers

Directory of Open Access Journals (Sweden)

Šehović Ivana

2012-01-01

Full Text Available Voice of vocal professionals is exposed to great temptations, i.e. there is a high probability of voice alterations. Smoking, allergies and respiratory infections greatly affect the voice, which can change its acoustic characteristics. In smokers, the vocal cords mass increases, resulting in changes in vocal fold vibratory cycle. Pathological changes of vocal folds deform the acoustic signal and affect voice production. As vocal professionals, teachers are much more affected by voice disorders than average speakers. The aim of this study was to examine the differences in acoustic parameters of voice between smoking and nonsmoking teachers, in a sample of vocal professionals. The sample consisted of 60 female subjects, aged from 25 to 59. For voice analysis we used Computer lab, model 4300, 'Kay Elemetrics Corporation'. The statistical significance of differences in the values of acoustic parameters between smokers and nonsmokers was tested by ANOVA. Results showed that in the sample of female teachers, professional use of voice combined with the smoking habit can be linked to the changes in voice parameters. Comparing smokers and nonsmokers, average values of the parameters in short-term and long-term disturbances of frequency and amplitude proved to be significantly different.
Correlation between acoustic parameters and Voice Handicap Index in dysphonic teachers.

Science.gov (United States)

Niebudek-Bogusz, E; Woznicka, E; Zamyslowska-Szmytke, E; Sliwinska-Kowalska, M

2010-01-01

The aim of this study was to investigate the relationship between acoustic analysis and biopsychosocial implications of voice problems, evaluated by the Voice Handicap Index (VHI). The study comprised 120 female teachers with voice disorders, evaluated by videolaryngostroboscopy. 60.8% of this group were diagnosed as having functional dysphonia and 39.2% had dysphonia with benign vocal fold masses (nodules and polyps). The controls consisted of 30 euphonic women. The correlations between VHI and acoustic analysis were assessed in both groups using the Pearson correlation coefficient and regression analysis. In teachers, the total VHI score was over 5 times as high as in controls (p teachers, significant positive correlations were found between the total VHI score and the frequency perturbation parameters and amplitude perturbation parameters when both statistical methods were used. These acoustic parameters also significantly correlated with the score on the functional and emotional subscales, but rarely with the physical subscale of the VHI. The study revealed a significant relationship between the objective voice measurements and the VHI. The results confirmed that VHI can be a valuable tool for assessing biopsychosocial implications of occupational dysphonia and should be incorporated in multidimensional voice evaluation. (c) 2010 S. Karger AG, Basel.
Perceptual-Auditory and Acoustical Analysis of the Voices of Transgender Women.

Science.gov (United States)

Schwarz, Karine; Fontanari, Anna Martha Vaitses; Costa, Angelo Brandelli; Soll, Bianca Machado Borba; da Silva, Dhiordan Cardoso; de Sá Villas-Bôas, Anna Paula; Cielo, Carla Aparecida; Bastilha, Gabriele Rodrigues; Ribeiro, Vanessa Veis; Dorfman, Maria Elza Kazumi Yamaguti; Lobato, Maria Inês Rodrigues

2017-09-28

Voice is an important gender marker in the transition process as a transgender individual accepts a new gender identity. The objectives of this study were to describe and relate aspects of a perceptual-auditory analysis and the fundamental frequency (F0) of male-to-female (MtF) transsexual individuals. A case-control study was carried out with individuals aged 19-52 years who attended the Gender Identity Program of the Hospital de Clínicas of Porto Alegre. Vocal recordings from the MtF transgender and cisgender individuals (vowel /a:/ and six phrases of Consensus Auditory Perceptual Evaluation Voice [CAPE-V]) were edited and randomly coded before storage in a Dropbox folder. The voices (vowel /a:/) were analyzed by consensus on the same day by two judge speech therapists who had more than 10 years of experience in the voice area using the GRBASI perceptual-auditory vocal evaluation scale. Acoustic analysis of the voices was performed using the advanced Multi-Dimensional Voice Program software. The resonance focus and the degrees of masculinity and femininity for each voice recording were determined by listening to the CAPE-V phrases, for the same judges. There were significant differences between the groups regarding a greater frequency of subjects with F0 between 80 and 150 Hz (P = 0.003), and a greater frequency of hypernasal resonant focus (P < 0.001) in the MtF cases and greater frequency of subjects with absence of roughness (P = 0.031) in the control group. The MtF group of individuals showed altered vertical resonant focus, more masculine voices, and lower fundamental frequencies. The control group showed a significant absence of roughness. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
[Acoustic and aerodynamic characteristics of the oesophageal voice].

Science.gov (United States)

Vázquez de la Iglesia, F; Fernández González, S

2005-12-01

The aim of the study is to determine the physiology and pathophisiology of esophageal voice according to objective aerodynamic and acoustic parameters (quantitative and qualitative parameters). Our subjects were comprised of 33 laryngectomized patients (all male) that underwent aerodynamic, acoustic and perceptual protocol. There is a statistical association between acoustic and aerodynamic qualitative parameters (phonation flow chart type, sound spectrum, perceptual analysis) among quantitative parameters (neoglotic pressure, phonation flow, phonation time, fundamental frequency, maximum intensity sound level, speech rate). Nevertheles, not always such observations bring practical resources to clinical practice. We consider that the facts studied may enable us to add, pragmatically, new resources to the more effective vocal rehabilitation to these patients. The physiology of esophageal voice is well understood by the method we have applied, also seeking for rehabilitation, improving oral communication skills in the laryngectomee population.
Immediate acoustic effects of straw phonation exercises in subjects with dysphonic voices.

Science.gov (United States)

Guzman, Marco; Higueras, Diego; Fincheira, Catherine; Muñoz, Daniel; Guajardo, Carlos; Dowdall, Jayme

2013-04-01

Abstract This study sought to measure any acoustic changes in the speaking voice immediately after phonation exercises involving plastic straws versus phonation exercises with the open vowel /a/. Forty-one primary school teachers with slightly dysphonic voices were asked to participate in four phonatory tasks. Phonetically balanced text at habitual intensity level and speaking fundamental frequency was recorded. Acoustical analysis with long-term average spectrum was performed. Significant changes after therapy for the experimental group include the alpha ratio, L1-L0 ratio and ratio between 1-5 kHz and 5-8 kHz. The results indicate that the use of phonatory tasks with straw exercises can have immediate therapeutic acoustic effects in dysphonic voices. Long-term effects were not assessed in this study.
Evaluation of voice acoustic parameters related to the vocal-loading test in professionally active teachers with dysphonia.

Science.gov (United States)

Niebudek-Bogusz, Ewa; Kotyło, Piotr; Sliwińska-Kowalska, Mariola

2007-01-01

Teachers are at risk of developing voice disorders. A clinical battery of vocal function tests should include non-invasive and accurate measurements. The quantitative methods (e.g., voice acoustic analysis) make it possible to objectively evaluate voice efficiency and outcomes of dysphonia treatment. To identify possible signs of vocal fatigue, acoustic waveform perturbations during sustained phonation were measured before and after the vocal-loading test in 51 professionally active female teachers with functional voice disorders, using IRIS software. All the participants were also subjected to laryngological/phoniatric examination involving videostroboscopy combined with self-estimation by voice handicap index (VHI)-based scale. The phoniatric examination revealed glottal insufficiency with bowed vocal folds in 35.2%, soft vocal nodules in 31.4%, and hyperfunctional dysphonia with a tendency towards vestibular phonation in 19.6% of the patients. In the VHI scale, 66% of the female teachers estimated their own voice problems as moderate disability. An acoustic analysis performed after the vocal-loading test showed an increased rate of abnormal frequency perturbation parameters (pitch perturbation quotient (Jitter), relative average perturbation (RAP), and pitch period perturbation quotient (PPQ)) compared to the pre-test outcomes. The same was true of pitch-intensity contour of vowel /a:/, an indication of voice instability during sustained phonation. The recorded impairments of voice acoustic parameters related to vocal loading provide further evidence of dysphonia. The voice acoustic analysis performed before and after the vocal-loading test can significantly contribute to objective voice examinations useful in diagnosis of dysphonia among teachers.
Effect of classic uvulopalatopharyngoplasty and laser-assisted uvulopalatopharyngoplasty on voice acoustics and speech nasalance

International Nuclear Information System (INIS)

Mahmoud Y Abu El-ella

2010-01-01

Uvulopalatopharyngoplasty (UPPP) is a commonly used surgical technique for oropharyngeal reconstruction in patients with obstructive sleep apnea (OSA). This procedure can be done either through the classic or the laser-assisted uvulopalatopharyngoplasty (LAUP) technique. The purpose of this study was to evaluate the effect of classic UPPP and LAUP on acoustics of voice and speech nasalance, and to compare the effect of each operation on these two domains. Patients and The study included 27 patients with a mean age of 46 years. All patients were diagnosed with OSA based on polysomnographic examination. Patients were divided into two groups according to the type of surgical procedure. Fifteen patients underwent classic UPPP, whereas 12 patients were subjected to LAUP. A full assessment was done for all patients preoperatively and postoperatively, including auditory perceptual assessment (APA) of voice and speech, objective assessment using acoustic voice analysis and nasometry. Auditory perceptual assessment of speech and voice, acoustic analysis of voice and nasometric analysis of speech did not show statistically significant differences between the preoperative and postoperative evaluations in either group (P>.05).The results of this study demonstrated that in patients with OSA, the surgical technique, whether classic UPPP or LAUP, does not have significant effects on the patients' voice quality or their speech outcomes (Author).
The sound of trustworthiness: Acoustic-based modulation of perceived voice personality.

Directory of Open Access Journals (Sweden)

Pascal Belin

Full Text Available When we hear a new voice we automatically form a "first impression" of the voice owner's personality; a single word is sufficient to yield ratings highly consistent across listeners. Past studies have shown correlations between personality ratings and acoustical parameters of voice, suggesting a potential acoustical basis for voice personality impressions, but its nature and extent remain unclear. Here we used data-driven voice computational modelling to investigate the link between acoustics and perceived trustworthiness in the single word "hello". Two prototypical voice stimuli were generated based on the acoustical features of voices rated low or high in perceived trustworthiness, respectively, as well as a continuum of stimuli inter- and extrapolated between these two prototypes. Five hundred listeners provided trustworthiness ratings on the stimuli via an online interface. We observed an extremely tight relationship between trustworthiness ratings and position along the trustworthiness continuum (r = 0.99. Not only were trustworthiness ratings higher for the high- than the low-prototypes, but the difference could be modulated quasi-linearly by reducing or exaggerating the acoustical difference between the prototypes, resulting in a strong caricaturing effect. The f0 trajectory, or intonation, appeared a parameter of particular relevance: hellos rated high in trustworthiness were characterized by a high starting f0 then a marked decrease at mid-utterance to finish on a strong rise. These results demonstrate a strong acoustical basis for voice personality impressions, opening the door to multiple potential applications.
Acoustic markers to differentiate gender in prepubescent children's speaking and singing voice.

Science.gov (United States)

Guzman, Marco; Muñoz, Daniel; Vivero, Martin; Marín, Natalia; Ramírez, Mirta; Rivera, María Trinidad; Vidal, Carla; Gerhard, Julia; González, Catalina

2014-10-01

Investigation sought to determine whether there is any acoustic variable to objectively differentiate gender in children with normal voices. A total of 30 children, 15 boys and 15 girls, with perceptually normal voices were examined. They were between 7 and 10 years old (mean: 8.1, SD: 0.7 years). Subjects were required to perform the following phonatory tasks: (1) to phonate sustained vowels [a:], [i:], [u:], (2) to read a phonetically balanced text, and (3) to sing a song. Acoustic analysis included long-term average spectrum (LTAS), fundamental frequency (F0), speaking fundamental frequency (SFF), equivalent continuous sound level (Leq), linear predictive code (LPC) to obtain formant frequencies, perturbation measures, harmonic to noise ratio (HNR), and Cepstral peak prominence (CPP). Auditory perceptual analysis was performed by four blinded judges to determine gender. No significant gender-related differences were found for most acoustic variables. Perceptual assessment showed good intra and inter rater reliability for gender. Cepstrum for [a:], alpha ratio in text, shimmer for [i:], F3 in [a:], and F3 in [i:], were the parameters that composed the multivariate logistic regression model to best differentiate male and female children's voices. Since perceptual assessment reliably detected gender, it is likely that other acoustic markers (not evaluated in the present study) are able to make clearer gender differences. For example, gender-specific patterns of intonation may be a more accurate feature for differentiating gender in children's voices. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Correlation Between Acoustic Measurements and Self-Reported Voice Disorders Among Female Teachers.

Science.gov (United States)

Lin, Feng-Chuan; Chen, Sheng Hwa; Chen, Su-Chiu; Wang, Chi-Te; Kuo, Yu-Ching

2016-07-01

Many studies focused on teachers' voice problems and most of them were conducted using questionnaires, whereas little research has investigated the relationship between self-reported voice disorders and objective quantification of voice. This study intends to explore the relationship of acoustic measurements according to self-reported symptoms and its predictive value of future dysphonia. This is a case-control study. Voice samples of 80 female teachers were analyzed, including 40 self-reported voice disorders (VD) and 40 self-reported normal voice (NVD) subjects. The acoustic measurements included jitter, shimmer, and noise-to-harmonics ratio (NHR). Levene's t test and logistic regression were used to analyze the differences between VD and NVD and the relationship between self-reported voice conditions and the acoustic measurements. To examine whether acoustic measurements can be used to predict further voice disorders, we applied a receiver operating characteristic (ROC) curve to determine the cutoff values and the associated sensitivity and specificity. The results showed that jitter, shimmer, and the NHR of VD were significantly higher than those of NVD. Among the parameters, the NHR and shimmer demonstrated the highest correlation with self-reported voice disorders. By using the NHR ≥0.138 and shimmer ≥0.470 dB as the cutoff values, the ROC curve displayed 72.5% of sensitivity and 75% of specificity, and the overall positive predictive value for subsequent dysphonia achieved 60%. This study demonstrated a significant correlation between acoustic measurements and self-reported dysphonic symptoms. NHR and ShdB are two acoustic parameters that are more able to reflect vocal abnormalities and, probably, to predict subsequent subjective voice disorder. Future research recruiting more subjects in other occupations and genders shall validate the preliminary results revealed in this study. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All
Mobile Communication Devices, Ambient Noise, and Acoustic Voice Measures.

Science.gov (United States)

Maryn, Youri; Ysenbaert, Femke; Zarowski, Andrzej; Vanspauwen, Robby

2017-03-01

The ability to move with mobile communication devices (MCDs; ie, smartphones and tablet computers) may induce differences in microphone-to-mouth positioning and use in noise-packed environments, and thus influence reliability of acoustic voice measurements. This study investigated differences in various acoustic voice measures between six recording equipments in backgrounds with low and increasing noise levels. One chain of continuous speech and sustained vowel from 50 subjects with voice disorders (all separated by silence intervals) was radiated and re-recorded in an anechoic chamber with five MCDs and one high-quality recording system. These recordings were acquired in one condition without ambient noise and in four conditions with increased ambient noise. A total of 10 acoustic voice markers were obtained in the program Praat. Differences between MCDs and noise condition were assessed with Friedman repeated-measures test and posthoc Wilcoxon signed-rank tests, both for related samples, after Bonferroni correction. (1) Except median fundamental frequency and seven nonsignificant differences, MCD samples have significantly higher acoustic markers than clinical reference samples in minimal environmental noise. (2) Except median fundamental frequency, jitter local, and jitter rap, all acoustic measures on samples recorded with the reference system experienced significant influence from room noise levels. Fundamental frequency is resistant to recording system, environmental noise, and their combination. All other measures, however, were impacted by both recording system and noise condition, and especially by their combination, often already in the reference/baseline condition without added ambient noise. Caution is therefore warranted regarding implementation of MCDs as clinical recording tools, particularly when applied for treatment outcomes assessments. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Influence of classroom acoustics on the voice levels of teachers with and without voice problems: a field study

DEFF Research Database (Denmark)

Pelegrin Garcia, David; Lyberg-Åhlander, Viveka; Rydell, Roland

2010-01-01

of the classroom. The results thus suggest that teachers with voice problems are more aware of classroom acoustic conditions than their healthy colleagues and make use of the more supportive rooms to lower their voice levels. This behavior may result from an adaptation process of the teachers with voice problems...... of the voice problems was made with a questionnaire and a laryngological examination. During teaching, the sound pressure level at the teacher’s position was monitored. The teacher’s voice level and the activity noise level were separated using mixed Gaussians. In addition, objective acoustic parameters...... of Reverberation Time and Voice Support were measured in the 30 empty classrooms of the study. An empirical model shows that the measured voice levels depended on the activity noise levels and the voice support. Teachers with and without voice problems were differently affected by the voice support...

Voice Quality After a Semi-Occluded Vocal Tract Exercise With a Ventilation Mask in Contemporary Commercial Singers: Acoustic Analysis and Self-Assessments.

Science.gov (United States)

Fantini, Marco; Succo, Giovanni; Crosetti, Erika; Borragán Torre, Alfonso; Demo, Roberto; Fussi, Franco

2017-05-01

The current study aimed at investigating the immediate effects of a semi-occluded vocal tract exercise with a ventilation mask in a group of contemporary commercial singers. A randomized controlled study was carried out. Thirty professional or semi-professional singers with no voice complaints were randomly divided into two groups on recruitment: an experimental group and a control group. The same warm-up exercise was performed by the experimental group with an occluded ventilation mask placed over the nose and the mouth and by the control group without the ventilation mask. Voice was recorded before and after the exercise. Acoustic and self-assessment analysis were accomplished. The acoustic parameters of the voice samples recorded before and after training were compared, as well as the parameters' variations between the experimental and the control group. Self-assessment results of the experimental and the control group were compared too. Significant changes after the warm-up exercise included jitter, shimmer, and singing power ratio (SPR) in the experimental group. No significant changes were recorded in the control group. Significant differences between the experimental and the control group were found for ΔShimmer and ΔSPR. Self-assessment analysis confirmed a significantly higher phonatory comfort and voice quality perception for the experimental group. The results of the present study support the immediate advantageous effects on singing voice of a semi-occluded vocal tract exercise with a ventilation mask in terms of acoustic quality, phonatory comfort, and voice quality perception in contemporary commercial singers. Long-term effects still remain to be studied. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Acoustic and aerodynamic measures of the voice during pregnancy.

Science.gov (United States)

Hancock, Adrienne B; Gross, Heather E

2015-01-01

Known influences of sex hormones on the voice would suggest pregnancy hormones could have an effect, yet studies using acoustic measures have not indicated changes. Additionally, no examination of the voice before the third trimester has been reported. Effect of pregnancy on the voice is relatively unexplored yet could be quite relevant to female speakers and singers. It is possible that spectral and aerodynamic measures would be more sensitive to tissue-level changes caused by pregnancy hormones. In this first longitudinal study of a 32-year-old woman's pregnancy, weekly voice samples were analyzed for acoustic (fundamental frequency, perturbation ratios of shimmer and jitter, Harmonic-to-Noise Ratio, spectral measures, and maximum phonation time) and aerodynamic (average airflow, peak flow, AC/DC ratio, open quotient, and speed quotient) parameters. All measures appeared generally stable during weeks 11-39 of pregnancy compared with 21 weeks postpartum. Slight decrease in minimum airflow and open speed quotient may reflect suspected vocal fold tissue changes. It is recommended that future studies monitor and test correlations among hormone levels, visual analyses of vocal fold mucosa, aerodynamic function, and glottal efficiency. Copyright © 2015 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Comparison of Medical and Voice Therapy for reflux Laryngitis Based on Acoustic and Laryngeal Characteristics

Directory of Open Access Journals (Sweden)

Abbas Dehestani Ardakani

2011-12-01

Full Text Available Background and Aim: Reflux laryngitis is extremely common among patients with voice disorder. Medical therapy approaches are not efficient enough. The main goal of this study is to assess the acoustic and laryngeal characteristics of patients with dysphonia before and after medical or voice therapy, and to evaluate the effectiveness of each.Methods: In this retrospective study, 16 reflux laryngitis patients were assessed. Five received complete voice therapy, tow ceased voice therapy and nine received medication. Perceptual voice evaluation was performed by a speech-language pathologist, the severity of voice problem was calculated, based on the affected acoustic and laryngeal characteristics pre- and post-treatment.Results: Post-treatment evaluation in patients who received complete voice therapy indicates 80 percent improvement in the severity of disorder and 100 percent improvement in the perceptual voice evaluation. After medical therapy, we observed that voice disorder and perceptual voice evaluation are improved 44 and 66 percent respectively. The improvement was statistically significant in both treatment approaches: complete voice therapy (P=0.039 and medical therapy (p=0.017.Conclusion: In patients with reflux laryngitis, most acoustic and laryngeal characteristics were normal and satisfying after the treatment. It can be concluded that the proficiency of voice therapy in improving the acoustic and laryngeal characteristics is comparable to medical therapy
Acoustic cues for the recognition of self-voice and other-voice

Directory of Open Access Journals (Sweden)

Mingdi eXu

2013-10-01

Full Text Available Self-recognition, being indispensable for successful social communication, has become a major focus in current social neuroscience. The physical aspects of the self are most typically manifested in the face and voice. Compared with the wealth of studies on self-face recognition, self-voice recognition (SVR has not gained much attention. Converging evidence has suggested that the fundamental frequency (F0 and formant structures serve as the key acoustic cues for other-voice recognition (OVR. However, little is known about which, and how, acoustic cues are utilized for SVR as opposed to OVR. To address this question, we independently manipulated the F0 and formant information of recorded voices and investigated their contributions to SVR and OVR. Japanese participants were presented with recorded vocal stimuli and were asked to identify the speaker—either themselves or one of their peers. Six groups of 5 peers of the same sex participated in the study. Under conditions where the formant information was fully preserved and where only the frequencies lower than the third formant (F3 were retained, accuracies of SVR deteriorated significantly with the modulation of the F0, and the results were comparable for OVR. By contrast, under a condition where only the frequencies higher than F3 were retained, the accuracy of SVR was significantly higher than that of OVR throughout the range of F0 modulations, and the F0 scarcely affected the accuracies of SVR and OVR. Our results indicate that while both F0 and formant information are involved in SVR, as well as in OVR, the advantage of SVR is manifested only when major formant information for speech intelligibility is absent. These findings imply the robustness of self-voice representation, possibly by virtue of auditory familiarity and other factors such as its association with motor/articulatory representation.
VOICE QUALITY BEFORE AND AFTER THYROIDECTOMY

Directory of Open Access Journals (Sweden)

Dora CVELBAR

2016-04-01

Full Text Available Introduction: Voice disorders are a well-known complication which is often associated with thyroid gland diseases and because voice is still the basic mean of communication it is very important to maintain its quality healthy. Objectives: The aim of this study referred to questions whether there is a statistically significant difference between results of voice self-assessment, perceptual voice assessment and acoustic voice analysis before and after thyroidectomy and whether there are statistically significant correlations between variables of voice self-assessment, perceptual assessment and acoustic analysis before and after thyroidectomy. Methods: This scientific research included 12 participants aged between 41 and 76. Voice self-assessment was conducted with the help of Croatian version of Voice Handicap Index (VHI. Recorded reading samples were used for perceptual assessment and later evaluated by two clinical speech and language therapists. Recorded samples of phonation were used for acoustic analysis which was conducted with the help of acoustic program Praat. All of the data was processed through descriptive statistics and nonparametric statistical methods. Results: Results showed that there are statistically significant differences between results of voice self-assessments and results of acoustic analysis before and after thyroidectomy. Statistically significant correlations were found between variables of perceptual assessment and acoustic analysis. Conclusion: Obtained results indicate the importance of multidimensional, preoperative and postoperative assessment. This kind of assessment allows the clinician to describe all of the voice features and provides appropriate recommendation for further rehabilitation to the patient in order to optimize voice outcomes.
[THE APPLICATION OF SHORT-TERM EFFICIENCY ANALYSIS IN DIAGNOSING OCCUPATIONAL VOICE DISORDERS].

Science.gov (United States)

Niebudek-Bogusz, Ewa; Just, Marcin; Tyc, Michał; Wiktorowicz, Justyna; Morawska, Joanna; Śliwińska-Kowalska, Mariola

2015-01-01

An objective determination of the range of vocal efficiency is rather difficult. The aim of the study was to assess the possibility of application of short-term acoustic efficiency analysis in diagnosing occupational voice disorders. The study covered 98 people (87 women and 11 men) diagnosed with occupational dysphonia throuigh videostroboscopic examination. The control group comprised 100 people (81 women and 19 men) with normal voices. The short-term acoustic analysis was carried out by means of DiagnoScope software, including classical parameters (Jitter group, Shimmer group and the assessment of noise degree NHR), as well as new short-term efficiency parameters determined in a short time period during sustained phonation of the vowel "a." The results were then compared. Results: The values of all the examined classical parameters were considerably higher in the study group of pathological voices than in the control group of normal voices (p = 0.00). The aerodynamic parameter, maximum phonation time, was significantly shorter by over 0.5 s in the study group than in the control group. The majority of the acoustic efficiency parameters were also considerably worse in the study group of subjects with occupational dysphonia than in the control group (p = 0.00). Moreover, the correlation between the efficiency parameters and most of the classical acoustic parameters in the study group implies that for the voices with occupational pathology the decreased efficiency of the vocal apparatus is reflected in the acoustic voice structure. Effliciency parameters determined during short-term acoustic analysis can be an objective indicator of the decreased phonatory function of the larnx, useful in diagnosing occupational vocal pathology.
The Belt voice: Acoustical measurements and esthetic correlates

Science.gov (United States)

Bounous, Barry Urban

This dissertation explores the esthetic attributes of the Belt voice through spectral acoustical analysis. The process of understanding the nature and safe practice of Belt is just beginning, whereas the understanding of classical singing is well established. The unique nature of the Belt sound provides difficulties for voice teachers attempting to evaluate the quality and appropriateness of a particular sound or performance. This study attempts to provide answers to the question "does Belt conform to a set of measurable esthetic standards?" In answering this question, this paper expands on a previous study of the esthetic attributes of the classical baritone voice (see "Vocal Beauty", NATS Journal 51,1) which also drew some tentative conclusions about the Belt voice but which had an inadequate sample pool of subjects from which to draw. Further, this study demonstrates that it is possible to scientifically investigate the realm of musical esthetics in the singing voice. It is possible to go beyond the "a trained voice compared to an untrained voice" paradigm when evaluating quantitative vocal parameters and actually investigate what truly beautiful voices do. There are functions of sound energy (measured in dB) transference which may affect the nervous system in predictable ways and which can be measured and associated with esthetics. This study does not show consistency in measurements for absolute beauty (taste) even among belt teachers and researchers but does show some markers with varying degrees of importance which may point to a difference between our cognitive learned response to singing and our emotional, more visceral response to sounds. The markers which are significant in determining vocal beauty are: (1) Vibrancy-Characteristics of vibrato including speed, width, and consistency (low variability). (2) Spectral makeup-Ratio of partial strength above the fundamental to the fundamental. (3) Activity of the voice-The quantity of energy being produced. (4
The application of short-term efficiency analysis in diagnosing occupational voice disorders

Directory of Open Access Journals (Sweden)

Ewa Niebudek-Bogusz

2015-06-01

Full Text Available Background: An objective determination of the range of vocal efficiency is rather difficult. The aim of the study was to assess the possibility of application of short-term acoustic efficiency analysis in diagnosing occupational voice disorders. Material and Methods: The study covered 98 people (87 women and 11 men diagnosed with occupational dysphonia through videostroboscopic examination. The control group comprised 100 people (81 women and 19 men with normal voices. The short-term acoustic analysis was carried out by means of DiagnoScope software, including classical parameters (Jitter group, Shimmer group and the assessment of noise degree NHR, as well as new short-term efficiency parameters determined in a short time period during sustained phonation of the vowel “a.” The results were then compared. Results: The values of all the examined classical parameters were considerably higher in the study group of pathological voices than in the control group of normal voices (p = 0.00. The aerodynamic parameter, maximum phonation time, was significantly shorter by over 0.5 s in the study group than in the control group. The majority of the acoustic efficiency parameters were also considerably worse in the study group of subjects with occupational dysphonia than in the control group (p = 0.00. Moreover, the correlation between the efficiency parameters and most of the classical acoustic parameters in the study group implies that for the voices with occupational pathology the decreased efficiency of the vocal apparatus is reflected in the acoustic voice structure. Conclusions: Efficiency parameters determined during short-term acoustic analysis can be an objective indicator of the decreased phonatory function of the larynx, useful in diagnosing occupational vocal pathology. Med Pr 2015;66(2:225–234
Changes after voice therapy in objective and subjective voice measurements of pediatric patients with vocal nodules.

Science.gov (United States)

Tezcaner, Ciler Zahide; Karatayli Ozgursoy, Selmin; Ozgursoy, Selmin Karatayli; Sati, Isil; Dursun, Gursel

2009-12-01

The aim of this study was to analyze the efficiency of the voice therapy in children with vocal nodules by using the acoustic analysis and subjective assessment. Thirty-nine patients with vocal fold nodules, aged between 7 and 14, were included in the study. Each subject had voice therapy led by an experienced voice therapist once a week. All diagnostic and follow-up workouts were performed before the voice therapy and after the third or the sixth month. Transoral and/or transnasal videostroboscopic examination and acoustic analysis were achieved using multi-dimensional voice program (MDVP) and subjective analysis with GRBAS scale. As for the perceptual assessment, the difference was significant for four parameters out of five. A significant improvement was found in the acoustic analysis parameters of jitter, shimmer, and noise-to-harmonic ratio. The voice therapy which was planned according to patients' needs, age, compliance and response to therapy had positive effects on pediatric patients with vocal nodules. Acoustic analysis and GRBAS may be used successfully in the follow-up of pediatric vocal nodule treatment.
A study of VHI scores and acoustic features in street vendors as occupational voice users.

Science.gov (United States)

Natour, Yaser S; Darawsheh, Wesam B; Bashiti, Sara; Wari, Majd; Taha, Juhayna; Odeh, Thair

to investigate acoustic features of phonation and perception of voice handicap in street vendors. Eighty-eight participants (44 street vendors, 44 controls) were recruited. The mean age of the group was 38.9±16.0 years (range: 20-78 years). Scores of the Arabic version of the Voice Handicap Index (VHI-Arab) were used for analysis. Acoustic measures of fundamental frequency (F 0 ), jitter, shimmer, and signal-to-noise ratio (SNR) were also analyzed. Analysis showed a significant difference between street vendors and controls in the total score of the VHI-Arab (p<0.001) as well as scores of all three VHI-Arab subsections: functional (p<0.001), physical (p<0.001), and emotional (p=0.025). Weak correlations were found among all of the VHI scores and acoustic measures (-0.219≤ r≤0.355), except for SNR where a moderate negative correlations were found (r=-0.555; -0.4) between the VHI (physical and total) scores and SNR values. Significant differences also were found in F 0 , jitter, and SNR among specific subgroups of street vendors when stratified by weekly hours worked (p<0.05), and in jitter (p=0.39) when stratified by educational level. Perception of voice handicap and a possible effect on vocal quality in street vendors were noted. The effect of factors, namely work hours and educational level, on voice quality should be further studied. Copyright © 2017. Published by Elsevier Inc.
Validation of the Acoustic Voice Quality Index Version 03.01 and the Acoustic Breathiness Index in the Spanish language.

Science.gov (United States)

Delgado Hernández, Jonathan; León Gómez, Nieves M; Jiménez, Alejandra; Izquierdo, Laura M; Barsties V Latoszek, Ben

2018-05-01

The aim of this study was to validate the Acoustic Voice Quality Index 03.01 (AVQIv3) and the Acoustic Breathiness Index (ABI) in the Spanish language. Concatenated voice samples of continuous speech (cs) and sustained vowel (sv) from 136 subjects with dysphonia and 47 vocally healthy subjects were perceptually judged for overall voice quality and breathiness severity. First, to reach a higher level of ecological validity, the proportions of cs and sv were equalized regarding the time length of 3 seconds sv part and voiced cs part, respectively. Second, concurrent validity and diagnostic accuracy were verified. A moderate reliability of overall voice quality and breathiness severity from 5 experts was used. It was found that 33 syllables as standardization of the cs part, which represents 3 seconds of voiced cs, allows the equalization of both speech tasks. A strong correlation was revealed between AVQIv3 and overall voice quality and ABI and perceived breathiness severity. Additionally, the best diagnostic outcome was identified at a threshold of 2.28 and 3.40 for AVQIv3 and ABI, respectively. The AVQIv3 and ABI showed in the Spanish language valid and robust results to quantify abnormal voice qualities regarding overall voice quality and breathiness severity.
Dosimetric complication probability and acoustic analysis of vocal cord region in oropharyngeal carcinoma treated with voice-sparing intensity modulated radiotherapy

International Nuclear Information System (INIS)

Jain, S.; Gupta, T.; Agarwal, J.P.; Baccher, G.; Shrivastava, S.K.; Reenadevi; Master, J.

2008-01-01

Radiation to larynx has long been associated with speech and voice dysfunction. The objective is to study dosimetric parameters and complication probability of vocal cord region (VCR) and the effect of voice-sparing (VS) in the patients treated with intensity modulated radiotherapy (IMRT). The secondary objective is to describe the post-radiation acoustic voice characteristics and correlate them with the dosimetric parameters. (author)
What makes a voice masculine: physiological and acoustical correlates of women's ratings of men's vocal masculinity.

Science.gov (United States)

Cartei, Valentina; Bond, Rod; Reby, David

2014-09-01

Men's voices contain acoustic cues to body size and hormonal status, which have been found to affect women's ratings of speaker size, masculinity and attractiveness. However, the extent to which these voice parameters mediate the relationship between speakers' fitness-related features and listener's judgments of their masculinity has not yet been investigated. We audio-recorded 37 adult heterosexual males performing a range of speech tasks and asked 20 adult heterosexual female listeners to rate speakers' masculinity on the basis of their voices only. We then used a two-level (speaker within listener) path analysis to examine the relationships between the physiological (testosterone, height), acoustic (fundamental frequency or F0, and resonances or ΔF) and perceptual dimensions (listeners' ratings) of speakers' masculinity. Overall, results revealed that male speakers who were taller and had higher salivary testosterone levels also had lower F0 and ΔF, and were in turn rated as more masculine. The relationship between testosterone and perceived masculinity was essentially mediated by F0, while that of height and perceived masculinity was partially mediated by both F0 and ΔF. These observations confirm that women listeners attend to sexually dimorphic voice cues to assess the masculinity of unseen male speakers. In turn, variation in these voice features correlate with speakers' variation in stature and hormonal status, highlighting the interdependence of these physiological, acoustic and perceptual dimensions. Copyright © 2014. Published by Elsevier Inc.
The Relationship Between Acoustic Signal Typing and Perceptual Evaluation of Tracheoesophageal Voice Quality for Sustained Vowels.

Science.gov (United States)

Clapham, Renee P; van As-Brooks, Corina J; van Son, Rob J J H; Hilgers, Frans J M; van den Brekel, Michiel W M

2015-07-01

To investigate the relationship between acoustic signal typing and perceptual evaluation of sustained vowels produced by tracheoesophageal (TE) speakers and the use of signal typing in the clinical setting. Two evaluators independently categorized 1.75-second segments of narrow-band spectrograms according to acoustic signal typing and independently evaluated the recording of the same segments on a visual analog scale according to overall perceptual acoustic voice quality. The relationship between acoustic signal typing and overall voice quality (as a continuous scale and as a four-point ordinal scale) was investigated and the proportion of inter-rater agreement as well as the reliability between the two measures is reported. The agreement between signal type (I-IV) and ordinal voice quality (four-point scale) was low but significant, and there was a significant linear relationship between the variables. Signal type correctly predicted less than half of the voice quality data. There was a significant main effect of signal type on continuous voice quality scores with significant differences in median quality scores between signal types I-IV, I-III, and I-II. Signal typing can be used as an adjunct to perceptual and acoustic evaluation of the same stimuli for TE speech as part of a multidimensional evaluation protocol. Signal typing in its current form provides limited predictive information on voice quality, and there is significant overlap between signal types II and III and perceptual categories. Future work should consider whether the current four signal types could be refined. Copyright © 2015 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Robotic vehicle uses acoustic sensors for voice detection and diagnostics

Science.gov (United States)

Young, Stuart H.; Scanlon, Michael V.

2000-07-01

An acoustic sensor array that cues an imaging system on a small tele- operated robotic vehicle was used to detect human voice and activity inside a building. The advantage of acoustic sensors is that it is a non-line of sight (NLOS) sensing technology that can augment traditional LOS sensors such as visible and IR cameras. Acoustic energy emitted from a target, such as from a person, weapon, or radio, will travel through walls and smoke, around corners, and down corridors, whereas these obstructions would cripple an imaging detection system. The hardware developed and tested used an array of eight microphones to detect the loudest direction and automatically setter a camera's pan/tilt toward the noise centroid. This type of system has applicability for counter sniper applications, building clearing, and search/rescue. Data presented will be time-frequency representations showing voice detected within rooms and down hallways at various ranges. Another benefit of acoustics is that it provides the tele-operator some situational awareness clues via low-bandwidth transmission of raw audio data for the operator to interpret with either headphones or through time-frequency analysis. This data can be useful to recognize familiar sounds that might indicate the presence of personnel, such as talking, equipment, movement noise, etc. The same array also detects the sounds of the robot it is mounted on, and can be useful for engine diagnostics and trouble shooting, or for self-noise emanations for stealthy travel. Data presented will characterize vehicle self noise over various surfaces such as tiles, carpets, pavement, sidewalk, and grass. Vehicle diagnostic sounds will indicate a slipping clutch and repeated unexpected application of emergency braking mechanism.
The Effects of Size and Type of Vocal Fold Polyp on Some Acoustic Voice Parameters

Directory of Open Access Journals (Sweden)

Elaheh Akbari

2018-03-01

Full Text Available Background: Vocal abuse and misuse would result in vocal fold polyp. Certain features define the extent of vocal folds polyp effects on voice acoustic parameters. The present study aimed to define the effects of polyp size on acoustic voice parameters, and compare these parameters in hemorrhagic and non-hemorrhagic polyps. Methods: In the present retrospective study, 28 individuals with hemorrhagic or non-hemorrhagic polyps of the true vocal folds were recruited to investigate acoustic voice parameters of vowel/ æ/ computed by the Praat software. The data were analyzed using the SPSS software, version 17.0. According to the type and size of polyps, mean acoustic differences and correlations were analyzed by the statistical t test and Pearson correlation test, respectively; with significance level below 0.05. Results: The results indicated that jitter and the harmonics-to-noise ratio had a significant positive and negative correlation with the polyp size (P=0.01, respectively. In addition, both mentioned parameters were significantly different between the two types of the investigated polyps. Conclusion: Both the type and size of polyps have effects on acoustic voice characteristics. In the present study, a novel method to measure polyp size was introduced. Further confirmation of this method as a tool to compare polyp sizes requires additional investigations.
Acoustic passaggio pedagogy for the male voice.

Science.gov (United States)

Bozeman, Kenneth Wood

2013-07-01

Awareness of interactions between the lower harmonics of the voice source and the first formant of the vocal tract, and of the passive vowel modifications that accompany them, can assist in working out a smooth transition through the passaggio of the male voice. A stable vocal tract length establishes the general location of all formants, including the higher formants that form the singer's formant cluster. Untrained males instinctively shorten the tube to preserve the strong F1/H2 acoustic coupling of voce aperta, resulting in 'yell' timbre. If tube length and shape are kept stable during pitch ascent, the yell can be avoided by allowing the second harmonic to rise above the first formant, creating the balanced timbre of voce chiusa.
Remote Capture of Human Voice Acoustical Data by Telephone: A Methods Study

Science.gov (United States)

Cannizzaro, Michael S.; Reilly, Nicole; Mundt, James C.; Snyder, Peter J.

2005-01-01

In this pilot study we sought to determine the reliability and validity of collecting speech and voice acoustical data via telephone transmission for possible future use in large clinical trials. Simultaneous recordings of each participant's speech and voice were made at the point of participation, the local recording (LR), and over a telephone…
Acoustic characteristics of voice after severe traumatic brain injury.

Science.gov (United States)

McHenry, M

2000-07-01

To describe the acoustic characteristics of voice in individuals with motor speech disorders after traumatic brain injury (TBI). Prospective study of 100 individuals with TBI based on consecutive referrals for motor speech evaluations. Subjects were audio tape-recorded while producing sustained vowels and single word and sentence intelligibility tests. Laryngeal airway resistance was estimated, and voice quality was rated perceptually. None of the subjects evidenced vocal parameters within normal limits. The most frequently occurring abnormal parameter across subjects was amplitude perturbation, followed by voice turbulence index. Twenty-three percent of subjects evidenced deviation in all five parameters measured. The perceptual ratings of breathiness were significantly correlated with both the amplitude perturbation quotient and the noise-to-harmonics ratio. Vocal quality deviation is common in motor speech disorders after TBI and may impact intelligibility.
Acoustic and capacity analysis of voice academic teachers with diagnosed hyperfunctional dysphonia by using DiagnoScope Specialist software.

Science.gov (United States)

Zielińska-Bliźniewska, Hanna; Pietkiewicz, Piotr; Miłoński, Jarosław; Urbaniak, Joanna; Olszewski, Jurek

2013-01-01

The aim of the study was to assess the acoustic and capacity analyses of voice in academic teachers with hyperfunctional dysphonia using DiagnoScope Specialist software. The study covered 46 female academic teachers aged 34-48 years. The women were diagnosed with hyperfunctional dysphonia (with absence of organic pathologies). Having obtained the informed consent, a primary medical history was taken, videolaryngoscopic and stroboscopic examinations were performed and diagnostic voice acoustic and capacity analyses were carried out using DiagnoScope Specialist software. The acoustic analysis carried out of academic teachers with diagnosed hyperfunctional dysphonia showed enhancement in the following parameters: fundamental frequency (FO) by 1.2%; relative average perturbation (Jitter by 100.0% and RAP by 81.8%); relative amplitude perturbation quotient (APQ) by 2.9%; non-harmonic to harmonic ratio (U2H) by 16.0%; and noise to harmonic ratio (NHR) by 13.4%. A decrease of 2.5% from normal values was noted in relative amplitude perturbation (Shimmer). Formant frequencies also showed reduction (F1 by 10.7%, F2 by 5.1%, F3 by 2.2%, and F4 by 3.5%). The harmonic perturbation quotient (HPQ) was 0.8% lower and the residual harmonic perturbation quotient (RHPQ) 16.8% lower, with the residual to harmonic (R2H) decreasing by 35.1 per cent; the sub-harmonic to harmonic (S2H) by 2.4%; and the Yanagihara coefficient by 20.2%. The capacity analysis with the DiagnoScope Specialist software showed figures significantly lower than normal values of the following parameters: phonation time, true phonation time, phonation break coefficients, vocal capacity coefficient and mean vocal capacity. Copyright © 2013 Polish Otorhinolaryngology - Head and Neck Surgery Society. Published by Elsevier Urban & Partner Sp. z.o.o. All rights reserved.

Analysis of the Auditory Feedback and Phonation in Normal Voices.

Science.gov (United States)

Arbeiter, Mareike; Petermann, Simon; Hoppe, Ulrich; Bohr, Christopher; Doellinger, Michael; Ziethe, Anke

2018-02-01

The aim of this study was to investigate the auditory feedback mechanisms and voice quality during phonation in response to a spontaneous pitch change in the auditory feedback. Does the pitch shift reflex (PSR) change voice pitch and voice quality? Quantitative and qualitative voice characteristics were analyzed during the PSR. Twenty-eight healthy subjects underwent transnasal high-speed video endoscopy (HSV) at 8000 fps during sustained phonation [a]. While phonating, the subjects heard their sound pitched up for 700 cents (interval of a fifth), lasting 300 milliseconds in their auditory feedback. The electroencephalography (EEG), acoustic voice signal, electroglottography (EGG), and high-speed-videoendoscopy (HSV) were analyzed to compare feedback mechanisms for the pitched and unpitched condition of the phonation paradigm statistically. Furthermore, quantitative and qualitative voice characteristics were analyzed. The PSR was successfully detected within all signals of the experimental tools (EEG, EGG, acoustic voice signal, HSV). A significant increase of the perturbation measures and an increase of the values of the acoustic parameters during the PSR were observed, especially for the audio signal. The auditory feedback mechanism seems not only to control for voice pitch but also for voice quality aspects.
Voice acoustic patterns of patients diagnosed with vibroacoustic disease

Directory of Open Access Journals (Sweden)

Ana Mendes

2006-07-01

Full Text Available Background: Long-term low frequency noise exposure (LFN (â¤Â 500Â Hz, including infrasound may lead to the development of vibroacoustic disease (VAD, a systemic pathology characterized by the abnormal growth of extra-cellular matrices. The respiratory system is a target for LFN. Fibrosis of the respiratory tract epithelia was observed in VAD patients through biopsy, and confirmed in animal models exposed to LFN. Voice acoustic analysis can detect vocal fold variations of mass, tension, muscular and neural activity. Frequency perturbation (jitter, amplitude perturbation (shimmer and harmonicto- noise ratio (HNR are used in the evaluation of the vocal function, and can be indicators of the presence and degree of severity of vocal pathology. Since the respiratory system is the energy source of the phonation process, this raises questions about the effects of VAD on voice production. The purpose of this study was to determine if voice acoustic parameters of VAD patients are different from normative data. Methods: Nine individuals (5 males and 4 females diagnosed with VAD were recorded performing spoken and sung tasks. The spoken tasks included sustaining vowels and fricatives. The sung tasks consisted of maximum phonational frequency range (MPFR. Voice acoustic parameters analysed were: fundamental frequency (F0, jitter, shimmer, HNR and temporal measures. Results: Compared with normative data, both males and females diagnosed with VAD exhibited increased F0, shimmer and HNR. Jitter, MPFR and one temporal measure were reduced. Conclusions: VAD individuals presented voice acoustic parameter differences in spectral, temporal and perturbation measures, which may be indicative of small morphological changes in the phonatory system. Resumo: Enquadramento: A exposiÃ§Ã£o crÃ³nica ao ruÃdo de baixa frequÃªncia (RBF (â¤Â 500Â Hz, incluindo infra-sons pode conduzir ao desenvolvimento da doenÃ§a vibroacÃºstica (VAD �
Objective and subjective analysis of women's voice with idiopathic Parkinson's disease

Directory of Open Access Journals (Sweden)

Riviana Rodrigues das Graças

2012-07-01

Full Text Available OBJECTIVE: To compare the voice quality of women with idiopathic Parkinson's disease and those without it. METHODS: An evaluation was performed including 19 female patients diagnosed with idiopathic Parkinson's disease, with an average age of 66 years, and 27 women with an average of 67 years-old in the Control Group. The assessment was performed by computed acoustic analysis and perceptual evaluation. RESULTS: Parkinson's disease patients presented moderate rough and unstable voice quality. The parameters of grade, roughness, and instability had higher scores in Parkinson's disease patients with statistically significant differences. Acoustic measures of Jitter and period perturbation quotient (PPQ significantly differ between groups. CONCLUSIONS: Parkinson's disease female individuals showed more vocal alterations compared to the Control Group, when both perceptual and acoustic evaluations were analyzed.
Multidimensional assessment of strongly irregular voices such as in substitution voicing and spasmodic dysphonia: a compilation of own research.

Science.gov (United States)

Moerman, Mieke; Martens, Jean-Pierre; Dejonckere, Philippe

2015-04-01

This article is a compilation of own research performed during the European COoperation in Science and Technology (COST) action 2103: 'Advance Voice Function Assessment', an initiative of voice and speech processing teams consisting of physicists, engineers, and clinicians. This manuscript concerns analyzing largely irregular voicing types, namely substitution voicing (SV) and adductor spasmodic dysphonia (AdSD). A specific perceptual rating scale (IINFVo) was developed, and the Auditory Model Based Pitch Extractor (AMPEX), a piece of software that automatically analyses running speech and generates pitch values in background noise, was applied. The IINFVo perceptual rating scale has been shown to be useful in evaluating SV. The analysis of strongly irregular voices stimulated a modification of the European Laryngological Society's assessment protocol which was originally designed for the common types of (less severe) dysphonia. Acoustic analysis with AMPEX demonstrates that the most informative features are, for SV, the voicing-related acoustic features and, for AdSD, the perturbation measures. Poor correlations between self-assessment and acoustic and perceptual dimensions in the assessment of highly irregular voices argue for a multidimensional approach.
Outcomes Measurement in Voice Disorders: Application of an Acoustic Index of Dysphonia Severity

Science.gov (United States)

Awan, Shaheen N.; Roy, Nelson

2009-01-01

Purpose: The purpose of this experiment was to assess the ability of an acoustic model composed of both time-based and spectral-based measures to track change following voice disorder treatment and to serve as a possible treatment outcomes measure. Method: A weighted, four-factor acoustic algorithm consisting of shimmer, pitch sigma, the ratio of…
The acoustic and perceptual differences to the non-singer's singing voice before and after a singing vocal warm-up

Science.gov (United States)

DeRosa, Angela

The present study analyzed the acoustic and perceptual differences in non-singer's singing voice before and after a vocal warm-up. Experiments were conducted with 12 females who had no singing experience and considered themselves to be non-singers. Participants were recorded performing 3 tasks: a musical scale stretching to their most comfortable high and low pitches, sustained productions of the vowels /a/ and /i/, and singing performance of the "Star Spangled Banner." Participants were recorded performing these three tasks before a vocal warm-up, after a vocal warm-up, and then again 2-3 weeks later after 2-3 weeks of practice. Acoustical analysis consisted of formant frequency analysis, singer's formant/singing power ratio analysis, maximum phonation frequency range analysis, and an analysis of jitter, noise to harmonic ratio (NHR), relative average perturbation (RAP), and voice turbulence index (VTI). A perceptual analysis was also conducted with 12 listeners rating comparison performances of before vs. after the vocal warm-up, before vs. after the second vocal warm-up, and after both vocal warm-ups. There were no significant findings for the formant frequency analysis of the vowel /a/, but there was significance for the 1st formant frequency analysis of the vowel /i/. Singer's formant analyzed via Singing Power Ratio analysis showed significance only for the vowel /i/. Maximum phonation frequency range analysis showed a significant increase after the vocal warm-ups. There were no significant findings for the acoustic measures of jitter, NHR, RAP, and VTI. Perceptual analysis showed a significant difference after a vocal warm-up. The results indicate that a singing vocal warm-up can have a significant positive influence on the singing voice of non-singers.
Perception of recorded singing voice quality and expertise: cognitive linguistics and acoustic approaches.

Science.gov (United States)

Morange, Séverine; Dubois, Danièle; Fontaine, Jean-Marc

2010-07-01

The objective of the present pluridisciplinary study was to contribute to determine how a diversity of audience differently appreciates several versions resulting from different "restoration" treatments of one single original lyrical recording. We present here a joint analysis coupling psychological and linguistic analyses with acoustic descriptions on a unique research object: a Caruso's piece of song diversely remastered on commercial CDs. Thirty-two subjects were selected contrasted on age ("younger than 30 years" and "older than 60 years") related with their different experience of earlier technical recording devices (rendering through devices such as radio, 78rpm records, CD...) and on expertise concerning musical acoustics (acousticians and/or musicians vs ordinary music lovers). Eleven excerpts of reediting of an opera record interpreted by Caruso were selected from what could found on the market. The listening protocol involved a free categorization task and the selection of excerpts on preference judgments. Each task involved subjects' free commentaries about their choices as a joint output from psychological processing. A cluster analysis scaffold by a psycholinguistic processing of the verbal comments of the categories allowed to identify both commonalities and differences in groupings excerpts by the different groups of the subjects, along a diversity of criteria, varying according to age and expertise. Each excerpt can therefore be characterized both according to psychological and to acoustic criteria. This study has enabled us to develop the idea that a lyric voice is a multifaced object (cultural, esthetic, technical, physical), acoustic parameters being linked to the various sensory experiences and expertises of appraisers. Such pluridisciplinary research and the coupling of the correlated multiplicity of methodologies we developed acknowledge for a better understanding of listening practices and music-lover assessments here concerned with a
Voice parameters and videonasolaryngoscopy in children with vocal nodules: a longitudinal study, before and after voice therapy.

Science.gov (United States)

Valadez, Victor; Ysunza, Antonio; Ocharan-Hernandez, Esther; Garrido-Bustamante, Norma; Sanchez-Valerio, Araceli; Pamplona, Ma C

2012-09-01

Vocal Nodules (VN) are a functional voice disorder associated with voice misuse and abuse in children. There are few reports addressing vocal parameters in children with VN, especially after a period of vocal rehabilitation. The purpose of this study is to describe measurements of vocal parameters including Fundamental Frequency (FF), Shimmer (S), and Jitter (J), videonasolaryngoscopy examination and clinical perceptual assessment, before and after voice therapy in children with VN. Voice therapy was provided using visual support through Speech-Viewer software. Twenty patients with VN were studied. An acoustical analysis of voice was performed and compared with data from subjects from a control group matched by age and gender. Also, clinical perceptual assessment of voice and videonasolaryngoscopy were performed to all patients with VN. After a period of voice therapy, provided with visual support using Speech Viewer-III (SV-III-IBM) software, new acoustical analyses, perceptual assessments and videonasolaryngoscopies were performed. Before the onset of voice therapy, there was a significant difference (ptherapy period, a significant improvement (pvocal nodules were no longer discernible on the vocal folds in any of the cases. SV-III software seems to be a safe and reliable method for providing voice therapy in children with VN. Acoustic voice parameters, perceptual data and videonasolaryngoscopy were significantly improved after the speech therapy period was completed. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Acoustic Analysis of Soccer Fans in Acute Phonotrauma After the Match.

Science.gov (United States)

Pinarbasli, Mehmet Ozgur; Kaya, Ercan; Ozudogru, Erkan; Gurbuz, Melek Kezban; Colak, Ertugrul; Aksoy, Mehmet Akif; Birdane, Leman; Guney, Fatma Ozgur

2017-11-13

Acute phonotrauma is the result of sound production by shouting or straining one's voice. In this study, we aimed to investigate the acute changes in the vocal folds and voices of soccer fans who voluntarily applied to our clinic after the soccer match where they engaged in acute phonotrauma. There are no other studies in the literature conducted on a similar sample group. This is a case-control study. Videolaryngostroboscopic (VLS) examination, acoustic voice analysis, and Voice Handicap Index (VHI) questionnaire were performed on 29 voluntary soccer fans included to the study before the match and at the first hour after the match. The values obtained were compared statistically with each other and with 29 control groups without voice pathology. The jitter, shimmer, and normalized noise energy values measured after the match increased significantly statistically compared with the pre-match level, but harmonic noise ratio value decreased significantly (P < 0.05). VHI scores increased significantly after the match according to the pre-match scores (P < 0.05). In the VLS examinations, there was no difference in the images before and after the match. It has been concluded that people who are using their voices loudly and intensely by shouting during the match are exposed to sound changes after the match, and if this situation becomes persistent, it may cause permanent voice pathologies. It is thought that VHI and acoustic voice analysis should be done together with VLS for diagnosis and follow-up of voice changes for which the VLS examination alone is not sufficient. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Effects of Bel Canto Training on Acoustic and Aerodynamic Characteristics of the Singing Voice.

Science.gov (United States)

McHenry, Monica A; Evans, Joseph; Powitzky, Eric

2016-03-01

This study was designed to assess the impact of 2 years of operatic training on acoustic and aerodynamic characteristics of the singing voice. This is a longitudinal study. Participants were 21 graduate students and 16 undergraduate students. They completed a variety of tasks, including laryngeal videostroboscopy, audio recording of pitch range, and singing of syllable trains at full voice in chest, passaggio, and head registers. Inspiration, intraoral pressure, airflow, and sound pressure level (SPL) were captured during the syllable productions. Both graduate and undergraduate students significantly increased semitone range and SPL. The contributions to increased SPL were typically increased inspiration, increased airflow, and reduced laryngeal resistance, although there were individual differences. Two graduate students increased SPL without increased airflow and likely used supraglottal strategies to do so. Students demonstrated improvements in both acoustic and aerodynamic components of singing. Increasing SPL primarily through respiratory drive is a healthy strategy and results from intensive training. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Acoustic Correlates of Fatigue in Laryngeal Muscles: Findings for a Criterion-Based Prevention of Acquired Voice Pathologies

Science.gov (United States)

Boucher, Victor J.

2008-01-01

Purpose: The objective was to identify acoustic correlates of laryngeal muscle fatigue in conditions of vocal effort. Method: In a previous study, a technique of electromyography (EMG) served to define physiological signs of "voice fatigue" in laryngeal muscles involved in voicing. These signs correspond to spectral changes in contraction…
Characterization of the voice of children with mouth breathing caused by four different etiologies using perceptual and acoustic analyses

Directory of Open Access Journals (Sweden)

Rosana Tiepo Arévalo

2005-09-01

Full Text Available Objective: To describe vocal characteristics in children aged fiveto twelve years with mouth breathing caused by four etiologies:chronic rhinitis, hypertrophy, hypertrophy + chronic rhinitis andfunctional condition, using perceptual evaluation and acousticanalysis. Methods: Voice recordings of 120 mouth breathers judgedby four speech pathologists using the software Multi-Speech.Results: The perceptual evaluation of the voice revealed highincidence of breathy and hoarse voices, especially in the rhinitisgroup. Most cases were moderate, with low pitch and normalloudness. Hyponasality was found in over 50% of sample, asexpected, but we also found high occurrence of laryngealresonance, especially in the rhinitis group. Mean fundamentalfrequency was 24.81Hz, SD = 15.02; jitter = 2.17; shimmer =0.44, and HNR = 2.11. Values did not show statistically significantdifference among the groups. Conclusion: Perceptual evaluation ofthe voice revealed that most mouth breathers presented hoarseand breathy voice, low pitch, normal loudness and hyponasal andlaryngeal resonance. However, the acoustic analysis did not resultin any significant condition.
Estimating RASATI scores using acoustical parameters

International Nuclear Information System (INIS)

Agüero, P D; Tulli, J C; Moscardi, G; Gonzalez, E L; Uriz, A J

2011-01-01

Acoustical analysis of speech using computers has reached an important development in the latest years. The subjective evaluation of a clinician is complemented with an objective measure of relevant parameters of voice. Praat, MDVP (Multi Dimensional Voice Program) and SAV (Software for Voice Analysis) are some examples of software for speech analysis. This paper describes an approach to estimate the subjective characteristics of RASATI scale given objective acoustical parameters. Two approaches were used: linear regression with non-negativity constraints, and neural networks. The experiments show that such approach gives correct evaluations with ±1 error in 80% of the cases.
A Phonemic and Acoustic Analysis of Hindko Oral Stops

Directory of Open Access Journals (Sweden)

Haroon Ur RASHID

2014-12-01

Full Text Available Hindko is an Indo-Aryan language that is mainly spoken in Khyber Pukhtoonkhaw province of Pakistan. This work aims to identify the oral stops of Hindko and determine the intrinsic acoustic cues for them. The phonemic analysis is done with the help of minimal pairs and phoneme distribution in contrastive environments which reveals that Hindko has twelve oral stops with three way series. The acoustic analysis of these segments shows that intrinsically voice onset time (VOT, closure duration and burst are reliable and distinguishing cues of stops in Hindko.
Perceptual and acoustic outcomes of voice therapy for male-to-female transgender individuals immediately after therapy and 15 months later.

Science.gov (United States)

Gelfer, Marylou Pausewang; Tice, Ruthanne M

2013-05-01

The present study examined how effectively listeners' perceptions of gender could be changed from male to female for male-to-female (MTF) transgender (TG) clients based on the voice signal alone, immediately after voice therapy and at long-term follow-up. Short- and long-term changes in masculinity and femininity ratings and acoustic measures of speaking fundamental frequency (SFF) and vowel formant frequencies were also investigated. Prospective treatment study. Five MTF TG clients, five control female speakers, and five control male speakers provided a variety of speech samples for later analysis. The TG clients then underwent 8 weeks of voice therapy. Voice samples were collected immediately at the termination of therapy and again 15 months later. Two groups of listeners were recruited to evaluate gender and provide masculinity and femininity ratings. Perceptual results revealed that TG subjects were perceived as female 1.9% of the time in the pretest, 50.8% of the time in the immediate posttest, and 33.1% of the time in the long-term posttest. The TG speakers were also perceived as significantly less masculine and more feminine in the immediate posttest and the long-term posttest compared with the pre-test. Some acoustic measures showed significant differences between the pretest and the immediate posttest and long-term posttest. It appeared that 8 weeks of voice therapy could result in vocal changes in MTF TG individuals that persist at least partially for up to 15 months. However, some TG subjects were more successful with voice feminization than others. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
[Comparison of cepstral coefficients to other voice evaluation parameters in patients with occupational dysphonia].

Science.gov (United States)

Niebudek-Bogusz, Ewa; Strumiłło, Paweł; Wiktorowicz, Justyna; Sliwińska-Kowalska, Mariola

2013-01-01

BACKGROUND Special consideration has recently been given to cepstral analysis with mel-frequency cepstral coefficients (MFCCs). The aim of this study was to assess the applicability of MFCCs in acoustic analysis for diagnosing occupational dysphonia in comparison to subjective and objective parameters of voice evaluation. The study comprised 2 groups, one of 55 female teachers (mean age: 45 years) with occupational dysphonia confirmed by videostroboscopy and 40 female controls with normal voice (mean age: 43 years). The acoustic samples involving sustained vowels "a" and four standardized sentences were analyzed by computed analysis of MFCCs. The results were compared to acoustic parameters of jitter and shimmer groups, noise to harmonic ratio, Yanagihara index evaluating the grade of hoarseness, the aerodynamic parameter: maximum phonation time and also subjective parameters: GRBAS perceptual scale and Voice Handicap Index (VHI). The compared results revealed differences between the study and control groups, significant for MFCC2, MFCC3, MFCC5, MFCC6, MFCC8, MFCC10, particularly for MFCC6 (p teachers correlated with all eight objective parameters, also showed the significant relation with perceptual voice feature A (asthenity) of subjective scale GRBAS, characteristic of weak tired voice. The cepstral analysis with mel frequency cepstral coefficients is a promising tool for evaluating occupational voice disorders, capable of reflecting the perceptual voice features better than other methods of acoustic analysis.
Acoustic Measures of Voice and Physiologic Measures of Autonomic Arousal during Speech as a Function of Cognitive Load.

Science.gov (United States)

MacPherson, Megan K; Abur, Defne; Stepp, Cara E

2017-07-01

This study aimed to determine the relationship among cognitive load condition and measures of autonomic arousal and voice production in healthy adults. A prospective study design was conducted. Sixteen healthy young adults (eight men, eight women) produced a sentence containing an embedded Stroop task in each of two cognitive load conditions: congruent and incongruent. In both conditions, participants said the font color of the color words instead of the word text. In the incongruent condition, font color differed from the word text, creating an increase in cognitive load relative to the congruent condition in which font color and word text matched. Three physiologic measures of autonomic arousal (pulse volume amplitude, pulse period, and skin conductance response amplitude) and four acoustic measures of voice (sound pressure level, fundamental frequency, cepstral peak prominence, and low-to-high spectral energy ratio) were analyzed for eight sentence productions in each cognitive load condition per participant. A logistic regression model was constructed to predict the cognitive load condition (congruent or incongruent) using subject as a categorical predictor and the three autonomic measures and four acoustic measures as continuous predictors. It revealed that skin conductance response amplitude, cepstral peak prominence, and low-to-high spectral energy ratio were significantly associated with cognitive load condition. During speech produced under increased cognitive load, healthy young adults show changes in physiologic markers of heightened autonomic arousal and acoustic measures of voice quality. Future work is necessary to examine these measures in older adults and individuals with voice disorders. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Fundamental frequency and voice perturbation measures in smokers and non-smokers: An acoustic and perceptual study

Science.gov (United States)

Freeman, Allison

This research examined the fundamental frequency and perturbation (jitter % and shimmer %) measures in young adult (20-30 year-old) and middle-aged adult (40-55 year-old) smokers and non-smokers; there were 36 smokers and 36 non-smokers. Acoustic analysis was carried out utilizing one task: production of sustained /a/. These voice samples were analyzed utilizing Multi-Dimensional Voice Program (MDVP) software, which provided values for fundamental frequency, jitter %, and shimmer %.These values were analyzed for trends regarding smoking status, age, and gender. Statistical significance was found regarding the fundamental frequency, jitter %, and shimmer % for smokers as compared to non-smokers; smokers were found to have significantly lower fundamental frequency values, and significantly higher jitter % and shimmer % values. Statistical significance was not found regarding fundamental frequency, jitter %, and shimmer % for age group comparisons. With regard to gender, statistical significance was found regarding fundamental frequency; females were found to have statistically higher fundamental frequencies as compared to males. However, the relationships between gender and jitter % and shimmer % lacked statistical significance. These results indicate that smoking negatively affects voice quality. This study also examined the ability of untrained listeners to identify smokers and non-smokers based on their voices. Results of this voice perception task suggest that listeners are not accurately able to identify smokers and non-smokers, as statistical significance was not reached. However, despite a lack of significance, trends in data suggest that listeners are able to utilize voice quality to identify smokers and non-smokers.
Influence of Smartphones and Software on Acoustic Voice Measures.

OpenAIRE

Elizabeth U. Grillo; Jenna N. Brosious; Staci L. Sorrell; Supraja Anand

2016-01-01

This study assessed the within-subject variability of voice measures captured using different recording devices (i.e., smartphones and head mounted microphone) and software programs (i.e., Analysis of Dysphonia in Speech and Voice (ADSV), Multi-dimensional Voice Program (MDVP), and Praat). Correlations between the software programs that calculated the voice measures were also analyzed. Results demonstrated no significant within-subject variability across devices and software and that some o...
Singer's preferred acoustic condition in performance in an opera house and self-perception of the singer's voice

Science.gov (United States)

Noson, Dennis; Kato, Kosuke; Ando, Yoichi

2004-05-01

Solo singers have been shown to over estimate the relative sound pressure level of a delayed, external reproduction of their own voice, singing single syllables, which, in turn, appears to influence the preferred delay of simulated stage reflections [Noson, Ph.D. thesis, Kobe University, 2003]. Bone conduction is thought to be one factor separating singer versus instrumental performer judgments of stage acoustics. Using a parameter derived from the vocal signal autocorrelation function (ACF envelope), the changes in singer preference for delayed reflections is primarily explained by the ACF parameter, rather than internal bone conduction. An auditory model of a singer's preferred reflection delay is proposed, combining the effects of acoustical environment (reflection amplitude), bone conduction, and performer vocal overestimate, which may be applied to the acoustic design of reflecting elements in both upstage and forestage environments of opera stages. For example, soloists who characteristically underestimate external voice levels (or overestimate their own voice) should be provided shorter distances to reflective panels-irrespective of their singing style. Adjustable elements can be deployed to adapt opera houses intended for bel canto style performances to other styles. Additional examples will also be discussed. a)Now at Kumamoto Univ., Kumamoto, Japan. b)Now at: 1-10-27 Yamano Kami, Kumamoto, Japan.

Automatic Assessment of Acoustic Parameters of the Singing Voice: Application to Professional Western Operatic and Jazz Singers.

Science.gov (United States)

Manfredi, Claudia; Barbagallo, Davide; Baracca, Giovanna; Orlandi, Silvia; Bandini, Andrea; Dejonckere, Philippe H

2015-07-01

The obvious perceptual differences between various singing styles like Western operatic and jazz rely on specific dissimilarities in vocal technique. The present study focuses on differences in vibrato acoustics and in singer's formant as analyzed by a novel software tool, named BioVoice, based on robust high-resolution and adaptive techniques that have proven its validity on synthetic voice signals. A total of 48 professional singers were investigated (29 females; 19 males; 29 Western operatic; and 19 jazz). They were asked to sing "a cappella," but with artistic expression, a well-known musical phrase from Gershwin's Porgy and Bess, in their own style: either operatic or jazz. A specific sustained note was extracted for detailed vibrato analysis. Beside rate (s(-1)) and extent (cents), duration (seconds) and regularity were computed. Two new concepts are introduced: vibrato jitter and vibrato shimmer, by analogy with the traditional jitter and shimmer of voice signals. For the singer's formant, on the same sustained tone, the ratio of the acoustic energy in formants 1-2 to the energy in formants 3, 4, and 5 was automatically computed, providing a quality ratio (QR). Vibrato rates did not differ among groups. Extent was significantly larger in operatic singers, particularly females. Vibrato jitter and vibrato shimmer were significantly smaller in operatic singers. Duration of vibrato was also significantly longer in operatic singers. QR was significantly lower in male operatic singers. Some vibrato characteristics (extent, regularity, and duration) very clearly differentiate the Western operatic singing style from the jazz singing style. The singer's formant is typical of male operatic singers. The new software tool is well suited to provide useful feedback in a pedagogical context. Copyright © 2015 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Influence of Smartphones and Software on Acoustic Voice Measures.

Directory of Open Access Journals (Sweden)

Elizabeth U. Grillo

2016-12-01

Full Text Available This study assessed the within-subject variability of voice measures captured using different recording devices (i.e., smartphones and head mounted microphone and software programs (i.e., Analysis of Dysphonia in Speech and Voice (ADSV, Multi-dimensional Voice Program (MDVP, and Praat. Correlations between the software programs that calculated the voice measures were also analyzed. Results demonstrated no significant within-subject variability across devices and software and that some of the measures were highly correlated across software programs. The study suggests that certain smartphones may be appropriate to record daily voice measures representing the effects of vocal loading within individuals. In addition, even though different algorithms are used to compute voice measures across software programs, some of the programs and measures share a similar relationship.
Materials of acoustic analysis: sustained vowel versus sentence.

Science.gov (United States)

Moon, Kyung Ray; Chung, Sung Min; Park, Hae Sang; Kim, Han Su

2012-09-01

Sustained vowel is a widely used material of acoustic analysis. However, vowel phonation does not sufficiently demonstrate sentence-based real-life phonation, and biases may occur depending on the test subjects intent during pronunciation. The purpose of this study was to investigate the differences between the results of acoustic analysis using each material. An individual prospective study. Two hundred two individuals (87 men and 115 women) with normal findings in videostroboscopy were enrolled. Acoustic analysis was done using the speech pattern element acquisition and display program. Fundamental frequency (Fx), amplitude (Ax), contact quotient (Qx), jitter, and shimmer were measured with sustained vowel-based acoustic analysis. Average fundamental frequency (FxM), average amplitude (AxM), average contact quotient (QxM), Fx perturbation (CFx), and amplitude perturbation (CAx) were measured with sentence-based acoustic analysis. Corresponding data of the two methods were compared with each other. SPSS (Statistical Package for the Social Sciences, Version 12.0; SPSS, Inc., Chicago, IL) software was used for statistical analysis. FxM was higher than Fx in men (Fx, 124.45 Hz; FxM, 133.09 Hz; P=0.000). In women, FxM seemed to be lower than Fx, but the results were not statistically significant (Fx, 210.58 Hz; FxM, 208.34 Hz; P=0.065). There was no statistical significance between Ax and AxM in both the groups. QxM was higher than Qx in men and women. Jitter was lower in men, but CFx was lower in women. Both Shimmer and CAx were higher in men. Sustained vowel phonation could not be a complete substitute for real-time phonation in acoustic analysis. Characteristics of acoustic materials should be considered when choosing the material for acoustic analysis and interpreting the results. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Familiarity and Voice Representation: From Acoustic-Based Representation to Voice Averages

Directory of Open Access Journals (Sweden)

Maureen Fontaine

2017-07-01

Full Text Available The ability to recognize an individual from their voice is a widespread ability with a long evolutionary history. Yet, the perceptual representation of familiar voices is ill-defined. In two experiments, we explored the neuropsychological processes involved in the perception of voice identity. We specifically explored the hypothesis that familiar voices (trained-to-familiar (Experiment 1, and famous voices (Experiment 2 are represented as a whole complex pattern, well approximated by the average of multiple utterances produced by a single speaker. In experiment 1, participants learned three voices over several sessions, and performed a three-alternative forced-choice identification task on original voice samples and several “speaker averages,” created by morphing across varying numbers of different vowels (e.g., [a] and [i] produced by the same speaker. In experiment 2, the same participants performed the same task on voice samples produced by familiar speakers. The two experiments showed that for famous voices, but not for trained-to-familiar voices, identification performance increased and response times decreased as a function of the number of utterances in the averages. This study sheds light on the perceptual representation of familiar voices, and demonstrates the power of average in recognizing familiar voices. The speaker average captures the unique characteristics of a speaker, and thus retains the information essential for recognition; it acts as a prototype of the speaker.
Comparison of cepstral coefficients to other voice evaluation parameters in patients with occupational dysphonia

Directory of Open Access Journals (Sweden)

Ewa Niebudek-Bogusz

2013-12-01

Full Text Available Background: Special consideration has recently been given to cepstral analysis with mel-frequency cepstral coefficients (MFCCs. The aim of this study was to assess the applicability of MFCCs in acoustic analysis for diagnosing occupational dysphonia in comparison to subjective and objective parameters of voice evaluation. Materials and Methods: The study comprised 2 groups, one of 55 female teachers (mean age: 45 years with occupational dysphonia confirmed by videostroboscopy and 40 female controls with normal voice (mean age: 43 years. The acoustic samples involving sustained vowels "a" and four standardized sentences were analyzed by computed analysis of MFCCs. The results were compared to acoustic parameters of jitter and shimmer groups, noise to harmonic ratio, Yanagihara index evaluating the grade of hoarseness, the aerodynamic parameter: maximum phonation time and also subjective parameters: GRBAS perceptual scale and Voice Handicap Index (VHI. Results: The compared results revealed differences between the study and control groups, significant for MFCC2, MFCC3, MFCC5, MFCC6, MFCC8, MFCC10, particularly for MFCC6 (p < 0.001 and MFCC8 (p < 0.009, which may suggest their clinical applicability. In the study group, MFCC4, MFCC8 and MFCC10 correlated significantly with the major objective parameters of voice assessment. Moreover, MFCC8 coefficient, which in the female teachers correlated with all eight objective parameters, also showed the significant relation with perceptual voice feature A (asthenity of subjective scale GRBAS, characteristic of weak tired voice. Conclusions: The cepstral analysis with mel frequency cepstral coefficients is a promising tool for evaluating occupational voice disorders, capable of reflecting the perceptual voice features better than other methods of acoustic analysis. Med Pr 2013;64(6:805–816
Lax Vox as a Voice Training Program for Teachers: A Pilot Study.

Science.gov (United States)

Mailänder, Eva; Mühre, Lea; Barsties, Ben

2017-03-01

The objective of this study was to explore the effectiveness of a 3-week training program with the voice therapy "Lax Vox" for teachers. Four healthy female teachers participated as volunteers for the study. Several voice measurements of perception, acoustics, aerodynamics, and self-evaluation were investigated. Furthermore, a survey to rate the applicability of Lax Vox was also part of the study. To assess the treatment effects of the Lax Vox training, an effect size analysis (d unb ) was conducted. After 3 weeks of training, medium and large improvements were found in some parameters of perceptual and acoustic voice quality assessments (d unb >0.50 and d unb >0.80, respectively). Furthermore, medium improvements were revealed in some parameters of self-evaluation (ie, physical and total scale of the Voice Handicap Index) and aerodynamic (ie, maximum phonation time) assessments (all d unb >0.50). Additionally, acoustic measures of vocal function showed an expansion in the upper contour of voice range profiles after training. Particularly, the main improvements in the voice range profile was found in the modal and the beginning of the falsetto voice registers. There was an increase of the intensity levels of about 4.6 dB. No changes were revealed in some acoustic measures of the voice range profile, self-evaluation measurements, and the perception of breathy voice quality (all d unb teachers appears to improve select measures of voice quality, maximum phonation time, vocal function, self-evaluation, and perceived applicability. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Updating signal typing in voice: addition of type 4 signals.

Science.gov (United States)

Sprecher, Alicia; Olszewski, Aleksandra; Jiang, Jack J; Zhang, Yu

2010-06-01

The addition of a fourth type of voice to Titze's voice classification scheme is proposed. This fourth voice type is characterized by primarily stochastic noise behavior and is therefore unsuitable for both perturbation and correlation dimension analysis. Forty voice samples were classified into the proposed four types using narrowband spectrograms. Acoustic, perceptual, and correlation dimension analyses were completed for all voice samples. Perturbation measures tended to increase with voice type. Based on reliability cutoffs, the type 1 and type 2 voices were considered suitable for perturbation analysis. Measures of unreliability were higher for type 3 and 4 voices. Correlation dimension analyses increased significantly with signal type as indicated by a one-way analysis of variance. Notably, correlation dimension analysis could not quantify the type 4 voices. The proposed fourth voice type represents a subset of voices dominated by noise behavior. Current measures capable of evaluating type 4 voices provide only qualitative data (spectrograms, perceptual analysis, and an infinite correlation dimension). Type 4 voices are highly complex and the development of objective measures capable of analyzing these voices remains a topic of future investigation.
Effects of melody and technique on acoustical and musical features of western operatic singing voices.

Science.gov (United States)

Larrouy-Maestri, Pauline; Magis, David; Morsomme, Dominique

2014-05-01

The operatic singing technique is frequently used in classical music. Several acoustical parameters of this specific technique have been studied but how these parameters combine remains unclear. This study aims to further characterize the Western operatic singing technique by observing the effects of melody and technique on acoustical and musical parameters of the singing voice. Fifty professional singers performed two contrasting melodies (popular song and romantic melody) with two vocal techniques (with and without operatic singing technique). The common quality parameters (energy distribution, vibrato rate, and extent), perturbation parameters (standard deviation of the fundamental frequency, signal-to-noise ratio, jitter, and shimmer), and musical features (fundamental frequency of the starting note, average tempo, and sound pressure level) of the 200 sung performances were analyzed. The results regarding the effect of melody and technique on the acoustical and musical parameters show that the choice of melody had a limited impact on the parameters observed, whereas a particular vocal profile appeared depending on the vocal technique used. This study confirms that vocal technique affects most of the parameters examined. In addition, the observation of quality, perturbation, and musical parameters contributes to a better understanding of the Western operatic singing technique. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Speaking comfort and voice use of teachers in classrooms

DEFF Research Database (Denmark)

Brunskog, Jonas; Pelegrin Garcia, David

2010-01-01

Teachers suffer from voice problems more often than the rest of the population, as a consequence of the intensive use of their voices during teaching. Noise and classroom acoustics have been defined as hazards eventually leading to voice problems. In order to make a good classroom acoustic design...... to preserve the teachers’ voices and maximize their comfort, it is necessary to understand the underlaying relationship between classroom acoustics and teachers’ voice production. This paper presents a brief summary of investigations looking into this relationship. A pilot study, carried out in different...... located at various distances, in rooms with very different acoustics. A field study in schools of southern Sweden found out that teachers with and without voice problems, during actual teaching, are affected differently by the support of the classroom. A last laboratory experiment was carried out...
Combined Functional Voice Therapy in Singers With Muscle Tension Dysphonia in Singing.

Science.gov (United States)

Sielska-Badurek, Ewelina; Osuch-Wójcikiewicz, Ewa; Sobol, Maria; Kazanecka, Ewa; Rzepakowska, Anna; Niemczyk, Kazimierz

2017-07-01

The purpose of this study was to evaluate vocal tract function and the voice quality in singers with muscle tension dysphonia (MTD) after undergoing combined functional voice therapy of the singing voice. This is a prospective, randomized study. Forty singers (29 females and 11 males, mean age: 24.6 ± 8.8 years) with MTD were enrolled in the study. The study group consisted of 20 singers who underwent combined functional voice therapy (10-15 individual sessions, 30-40 minutes each). Singers who did not opt for vocal rehabilitation consisted of the control group. Effects of rehabilitation were assessed with videolaryngostroboscopy, palpation of the vocal tract structures, flexible fiberoptic evaluation of the pharynx and the larynx, perceptual speaking and singing voice assessment, acoustic analysis, maximal phonation time, and the Voice Handicap Index. After combined functional voice therapy in the study group, great improvement was noticed in palpation of the vocal tract structures (P singing range obtained from acoustic analysis of glissando (P singing. Development of palpation and perceptual singing voice examination protocols enables one to compare results before and after rehabilitation in clinics. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
External Validation of the Acoustic Voice Quality Index Version 03.01 With Extended Representativity.

Science.gov (United States)

Barsties, Ben; Maryn, Youri

2016-07-01

The Acoustic Voice Quality Index (AVQI) is an objective method to quantify the severity of overall voice quality in concatenated continuous speech and sustained phonation segments. Recently, AVQI was successfully modified to be more representative and ecologically valid because the internal consistency of AVQI was balanced out through equal proportion of the 2 speech types. The present investigation aims to explore its external validation in a large data set. An expert panel of 12 speech-language therapists rated the voice quality of 1058 concatenated voice samples varying from normophonia to severe dysphonia. The Spearman rank-order correlation coefficients (r) were used to measure concurrent validity. The AVQI's diagnostic accuracy was evaluated with several estimates of its receiver operating characteristics (ROC). Finally, 8 of the 12 experts were chosen because of reliability criteria. A strong correlation was identified between AVQI and auditoryperceptual rating (r = 0.815, P = .000). It indicated that 66.4% of the auditory-perceptual rating's variation was explained by AVQI. Additionally, the ROC results showed again the best diagnostic outcome at a threshold of AVQI = 2.43. This study highlights external validation and diagnostic precision of the AVQI version 03.01 as a robust and ecologically valid measurement to objectify voice quality. © The Author(s) 2016.
Speech pattern recognition for forensic acoustic purposes

OpenAIRE

Herrera Martínez, Marcelo; Aldana Blanco, Andrea Lorena; Guzmán Palacios, Ana María

2014-01-01

The present paper describes the development of a software for analysis of acoustic voice parameters (APAVOIX), which can be used for forensic acoustic purposes, based on the speaker recognition and identification. This software enables to observe in a clear manner, the parameters which are sufficient and necessary when performing a comparison between two voice signals, the suspicious and the original one. These parameters are used according to the classic method, generally used by state entit...
Interactive Augmentation of Voice Quality and Reduction of Breath Airflow in the Soprano Voice.

Science.gov (United States)

Rothenberg, Martin; Schutte, Harm K

2016-11-01

In 1985, at a conference sponsored by the National Institutes of Health, Martin Rothenberg first described a form of nonlinear source-tract acoustic interaction mechanism by which some sopranos, singing in their high range, can use to reduce the total airflow, to allow holding the note longer, and simultaneously enrich the quality of the voice, without straining the voice. (M. Rothenberg, "Source-Tract Acoustic Interaction in the Soprano Voice and Implications for Vocal Efficiency," Fourth International Conference on Vocal Fold Physiology, New Haven, Connecticut, June 3-6, 1985.) In this paper, we describe additional evidence for this type of nonlinear source-tract interaction in some soprano singing and describe an analogous interaction phenomenon in communication engineering. We also present some implications for voice research and pedagogy. Copyright Â© 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Computerized Analysis of Acoustic Characteristics of Patients with Internal Nasal Valve Collapse Before and After Functional Rhinoplasty

Science.gov (United States)

Rezaei, Fariba; Omrani, Mohammad Reza; Abnavi, Fateme; Mojiri, Fariba; Golabbakhsh, Marzieh; Barati, Sohrab; Mahaki, Behzad

2015-01-01

Acoustic analysis of sounds produced during speech provides significant information about the physiology of larynx and vocal tract. The analysis of voice power spectrum is a fundamental sensitive method of acoustic assessment that provides valuable information about the voice source and characteristics of vocal tract resonance cavities. The changes in long-term average spectrum (LTAS) spectral tilt and harmony to noise ratio (HNR) were analyzed to assess the voice quality before and after functional rhinoplasty in patients with internal nasal valve collapse. Before and 3 months after functional rhinoplasty, 12 participants were evaluated and HNR and LTAS spectral tilt in /a/ and /i/ vowels were estimated. It was seen that an increase in HNR and a decrease in LTAS spectral tilt existed after surgery. Mean LTAS spectral tilt in vowel /a/ decreased from 2.37 ± 1.04 to 2.28 ± 1.17 (P = 0.388), and it was decreased from 4.16 ± 1.65 to 2.73 ± 0.69 in vowel /i/ (P = 0.008). Mean HNR in the vowel /a/ increased from 20.71 ± 3.93 to 25.06 ± 2.67 (P = 0.002), and it was increased from 21.28 ± 4.11 to 25.26 ± 3.94 in vowel /i/ (P = 0.002). Modification of the vocal tract caused the vocal cords to close sufficiently, and this showed that although rhinoplasty did not affect the larynx directly, it changes the structure of the vocal tract and consequently the resonance of voice production. The aim of this study was to investigate the changes in voice parameters after functional rhinoplasty in patients with internal nasal valve collapse by computerized analysis of acoustic characteristics. PMID:26955564
Voice after radiotherapy of the larynx carcinoma

International Nuclear Information System (INIS)

Niedzielska, Grazyna; Niedzielski, Antoni; Toman, Danuta

2010-01-01

Background: The study presents the evaluation of the phonatory function of the larynx after radiotherapy. The research covered the patients from the rural areas of Poland who revealed neoplastic changes in the glottis area. Material and methods: The test group consisted of 45 men aged 41-78 years with the carcinoma of the larynx with T1 and T2 progression types of cancer, according to the TNM classification. The analysis of laryngeal tone was performed with the digital analyzer Kay Elemetrics Model CSL 4300 and Multi Dimensional Voice Program (MDVP). A stroboscopic test in all the patients with T1 progression revealed the reduction of vibrations. Results: The acoustic analysis of the voice in the pre-treatment group as compared with the control group allowed for differentiation of the following parameters of a definitely pathologic character: Jita, Jitter, RAP, PPQ, vFo, Shimmer, APQ, vAm, NHR, VTI, SPI, and DUV. Conclusions: In the acoustic analysis of voice in the post-radiotherapy group, the following parameters reached values close to the norm: JITA, JITT, RAP, PPQ, vF0, vAM, DUV, and Schimmer dB.
Predicting Voice Disorder Status From Smoothed Measures of Cepstral Peak Prominence Using Praat and Analysis of Dysphonia in Speech and Voice (ADSV).

Science.gov (United States)

Sauder, Cara; Bretl, Michelle; Eadie, Tanya

2017-09-01

The purposes of this study were to (1) determine and compare the diagnostic accuracy of a single acoustic measure, smoothed cepstral peak prominence (CPPS), to predict voice disorder status from connected speech samples using two software systems: Analysis of Dysphonia in Speech and Voice (ADSV) and Praat; and (2) to determine the relationship between measures of CPPS generated from these programs. This is a retrospective cross-sectional study. Measures of CPPS were obtained from connected speech recordings of 100 subjects with voice disorders and 70 nondysphonic subjects without vocal complaints using commercially available ADSV and freely downloadable Praat software programs. Logistic regression and receiver operating characteristic (ROC) analyses were used to evaluate and compare the diagnostic accuracy of CPPS measures. Relationships between CPPS measures from the programs were determined. Results showed acceptable overall accuracy rates (75% accuracy, ADSV; 82% accuracy, Praat) and area under the ROC curves (area under the curve [AUC] = 0.81, ADSV; AUC = 0.91, Praat) for predicting voice disorder status, with slight differences in sensitivity and specificity. CPPS measures derived from Praat were uniquely predictive of disorder status above and beyond CPPS measures from ADSV (χ 2 (1) = 40.71, P disorder status using either program. Clinicians may consider using CPPS to complement clinical voice evaluation and screening protocols. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Analysis of Measured and Simulated Supraglottal Acoustic Waves.

Science.gov (United States)

Fraile, Rubén; Evdokimova, Vera V; Evgrafova, Karina V; Godino-Llorente, Juan I; Skrelin, Pavel A

2016-09-01

To date, although much attention has been paid to the estimation and modeling of the voice source (ie, the glottal airflow volume velocity), the measurement and characterization of the supraglottal pressure wave have been much less studied. Some previous results have unveiled that the supraglottal pressure wave has some spectral resonances similar to those of the voice pressure wave. This makes the supraglottal wave partially intelligible. Although the explanation for such effect seems to be clearly related to the reflected pressure wave traveling upstream along the vocal tract, the influence that nonlinear source-filter interaction has on it is not as clear. This article provides an insight into this issue by comparing the acoustic analyses of measured and simulated supraglottal and voice waves. Simulations have been performed using a high-dimensional discrete vocal fold model. Results of such comparative analysis indicate that spectral resonances in the supraglottal wave are mainly caused by the regressive pressure wave that travels upstream along the vocal tract and not by source-tract interaction. On the contrary and according to simulation results, source-tract interaction has a role in the loss of intelligibility that happens in the supraglottal wave with respect to the voice wave. This loss of intelligibility mainly corresponds to spectral differences for frequencies above 1500 Hz. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
FonaDyn - A system for real-time analysis of the electroglottogram, over the voice range

Science.gov (United States)

Ternström, Sten; Johansson, Dennis; Selamtzis, Andreas

2018-01-01

From soft to loud and low to high, the mechanisms of human voice have many degrees of freedom, making it difficult to assess phonation from the acoustic signal alone. FonaDyn is a research tool that combines acoustics with electroglottography (EGG). It characterizes and visualizes in real time the dynamics of EGG waveforms, using statistical clustering of the cycle-synchronous EGG Fourier components, and their sample entropy. The prevalence and stability of different EGG waveshapes are mapped as colored regions into a so-called voice range profile, without needing pre-defined thresholds or categories. With appropriately 'trained' clusters, FonaDyn can classify and map voice regimes. This is of potential scientific, clinical and pedagogical interest.
Assessment of dysphonia due to benign vocal fold lesions by acoustic and aerodynamic indices: a multivariate analysis.

Science.gov (United States)

Cantarella, Giovanna; Baracca, Giovanna; Pignataro, Lorenzo; Forti, Stella

2011-04-01

The goal was to identify acoustic and aerodynamic indices that allow the discrimination of a benign organic dysphonic voice from a normal voice. Fifty-three patients affected by dysphonia caused by vocal folds benign lesions, and a control group were subjected to maximum phonation time (MPT) measurements, GRB perceptual evaluations and acoustic/aerodynamic tests. All analyzed variables except the airflow variation coefficient were significantly different between the two groups. The unique significant factors in the discrimination between healthy and dysphonic subjects were the aerodynamic indices of MPT and Glottal efficiency index, and the acoustic index Shimmer. These results show that a combination of three parameters can discriminate a voice deviance and highlight the importance of a multidimensional assessment for objective voice evaluation.
Voice-to-Phoneme Conversion Algorithms for Voice-Tag Applications in Embedded Platforms

Directory of Open Access Journals (Sweden)

Yan Ming Cheng

2008-08-01

Full Text Available We describe two voice-to-phoneme conversion algorithms for speaker-independent voice-tag creation specifically targeted at applications on embedded platforms. These algorithms (batch mode and sequential are compared in speech recognition experiments where they are first applied in a same-language context in which both acoustic model training and voice-tag creation and application are performed on the same language. Then, their performance is tested in a cross-language setting where the acoustic models are trained on a particular source language while the voice-tags are created and applied on a different target language. In the same-language environment, both algorithms either perform comparably to or significantly better than the baseline where utterances are manually transcribed by a phonetician. In the cross-language context, the voice-tag performances vary depending on the source-target language pair, with the variation reflecting predicted phonological similarity between the source and target languages. Among the most similar languages, performance nears that of the native-trained models and surpasses the native reference baseline.

Voice similarity in identical twins.

Science.gov (United States)

Van Gysel, W D; Vercammen, J; Debruyne, F

2001-01-01

If people are asked to discriminate visually the two individuals of a monozygotic twin (MT), they mostly get into trouble. Does this problem also exist when listening to twin voices? Twenty female and 10 male MT voices were randomly assembled with one "strange" voice to get voice trios. The listeners (10 female students in Speech and Language Pathology) were asked to label the twins (voices 1-2, 1-3 or 2-3) in two conditions: two standard sentences read aloud and a 2.5-second midsection of a sustained /a/. The proportion correctly labelled twins was for female voices 82% and 63% and for male voices 74% and 52% for the sentences and the sustained /a/ respectively, both being significantly greater than chance (33%). The acoustic analysis revealed a high intra-twin correlation for the speaking fundamental frequency (SFF) of the sentences and the fundamental frequency (F0) of the sustained /a/. So the voice pitch could have been a useful characteristic in the perceptual identification of the twins. We conclude that there is a greater perceptual resemblance between the voices of identical twins than between voices without genetic relationship. The identification however is not perfect. The voice pitch possibly contributes to the correct twin identifications.
Electroglottographic analysis of actresses and nonactresses' voices in different levels of intensity.

Science.gov (United States)

Master, Suely; Guzman, Marco; Carlos de Miranda, Helder; Lloyd, Adam

2013-03-01

Previous studies with long-term average spectrum (LTAS) showed the importance of the glottal source for understanding the projected voices of actresses. In this study, electroglottographic (EGG) analysis was used to investigate the contribution of the glottal source to the projected voice, comparing actresses and nonactresses' voices, in different levels of intensity. Thirty actresses and 30 nonactresses sustained vowels in habitual, moderate, and loud intensity levels. The EGG variables were contact quotient (CQ), closing quotient (QCQ), and opening quotient (QOQ). Other variables were sound pressure level (SPL) and fundamental frequency (F0). A KayPENTAX EGG was used. Variables were inputted in a general linear model. Actresses showed significantly higher values for SPL, in all levels, and both groups increased SPL significantly while changing from habitual to moderate and further to loud. There were no significant differences between groups for EGG quotients. There were significant differences between the levels only for F0 and CQ for both groups. SPL was significantly higher among actresses in all intensity levels, but in the EGG analysis, no differences were found. This apparently weak contribution of the glottal source in the supposedly projected voices of actresses, contrary to previous LTAS studies, might be because of a higher subglottal pressure or perhaps greater vocal tract contribution in SPL. Results from the present study suggest that trained subjects did not produce a significant higher SPL than untrained individuals by increasing the cost in terms of higher vocal fold collision and hence more impact stress. Future researches should explore the difference between trained and nontrained voices by aerodynamic measurements to evaluate the relationship between physiologic findings and the acoustic and EGG data. Moreover, further studies should consider both types of vocal tasks, sustained vowel and running speech, for both EGG and LTAS analysis
Assessments of Voice Use and Voice Quality among College/University Singing Students Ages 18–24 through Ambulatory Monitoring with a Full Accelerometer Signal

Science.gov (United States)

Schloneger, Matthew; Hunter, Eric

2016-01-01

The multiple social and performance demands placed on college/university singers could put their still developing voices at risk. Previous ambulatory monitoring studies have analyzed the duration, intensity, and frequency (in Hz) of voice use among such students. Nevertheless, no studies to date have incorporated the simultaneous acoustic voice quality measures into the acquisition of these measures to allow for direct comparison during the same voicing period. Such data could provide greater insight into how young singers use their voices, as well as identify potential correlations between vocal dose and acoustic changes in voice quality. The purpose of this study was to assess the voice use and estimated voice quality of college/university singing students (18–24 y/o, N = 19). Ambulatory monitoring was conducted over three full, consecutive weekdays measuring voice from an unprocessed accelerometer signal measured at the neck. From this signal were analyzed traditional vocal dose metrics such as phonation percentage, dose time, cycle dose, and distance dose. Additional acoustic measures included perceived pitch, pitch strength, LTAS slope, alpha ratio, dB SPL 1–3 kHz, and harmonic-to-noise ratio. Major findings from more than 800 hours of recording indicated that among these students (a) higher vocal doses correlated significantly with greater voice intensity, more vocal clarity and less perturbation; and (b) there were significant differences in some acoustic voice quality metrics between non-singing, solo singing and choral singing. PMID:26897545
Long-term voice handicap index after type II thyroplasty using titanium bridges for adductor spasmodic dysphonia.

Science.gov (United States)

Sanuki, Tetsuji; Yumoto, Eiji; Kodama, Narihiro; Minoda, Ryosei; Kumai, Yoshihiko

2014-06-01

To determine the long-term functional outcomes of type II thyroplasty using titanium bridges for adductor spasmodic dysphonia (AdSD) by perceptual analysis using the Voice Handicap Index-10 (VHI-10) and by acoustic analysis. Fifteen patients with AdSD underwent type II thyroplasty using titanium brides between August 2006 and February 2011. VHI-10 scores, a patient-based survey that quantifies a patient's perception of his or her vocal handicap, were determined before and at least 2 years after surgery. Concurrent with the VHI-10 evaluation, acoustic parameters were assessed, including jitter, shimmer, harmonic-to-noise ratio (HNR), standard deviation of F0 (SDF0), and degree of voice breaks (DVB). The average follow-up interval was 30.1 months. No patient had strangulation of the voice, and all were satisfied with the voice postoperatively. In the perceptual analysis, the mean VHI-10 score improved significantly, from 26.7 to 4.1 two years after surgery. All patients had significantly improved each score of three different aspects of VHI-10, representing improved functional, physical, and emotional well-being. All acoustic parameters improved significantly 2 years after surgery. The treatment of AdSD with type II thyroplasty significantly improved the voice-related quality of life and acoustic parameters 2 years after surgery. The results of the study suggest that type II thyroplasty using titanium bridges provides long-term relief of vocal symptoms in patients with AdSD. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Vocal effectiveness of speech-language pathology students: Before and after voice use during service delivery

Science.gov (United States)

Couch, Stephanie; Zieba, Dominique; van der Merwe, Anita

2015-01-01

Background As a professional voice user, it is imperative that a speech-language pathologist's (SLP) vocal effectiveness remain consistent throughout the day. Many factors may contribute to reduced vocal effectiveness, including prolonged voice use, vocally abusive behaviours, poor vocal hygiene and environmental factors. Objectives To determine the effect of service delivery on the perceptual and acoustic features of voice. Method A quasi-experimental., pre-test–post-test research design was used. Participants included third- and final-year speech-language pathology students at the University of Pretoria (South Africa). Voice parameters were evaluated in a pre-test measurement, after which the participants provided two consecutive hours of therapy. A post-test measurement was then completed. Data analysis consisted of an instrumental analysis in which the multidimensional voice programme (MDVP) and the voice range profile (VRP) were used to measure vocal parameters and then calculate the dysphonia severity index (DSI). The GRBASI scale was used to conduct a perceptual analysis of voice quality. Data were processed using descriptive statistics to determine change in each measured parameter after service delivery. Results A change of clinical significance was observed in the acoustic and perceptual parameters of voice. Conclusion Guidelines for SLPs in order to maintain optimal vocal effectiveness were suggested. PMID:26304213
Adductor spasmodic dysphonia: Relationships between acoustic indices and perceptual judgments

Science.gov (United States)

Cannito, Michael P.; Sapienza, Christine M.; Woodson, Gayle; Murry, Thomas

2003-04-01

This study investigated relationships between acoustical indices of spasmodic dysphonia and perceptual scaling judgments of voice attributes made by expert listeners. Audio-recordings of The Rainbow Passage were obtained from thirty one speakers with spasmodic dysphonia before and after a BOTOX injection of the vocal folds. Six temporal acoustic measures were obtained across 15 words excerpted from each reading sample, including both frequency of occurrence and percent time for (1) aperiodic phonation, (2) phonation breaks, and (3) fundamental frequency shifts. Visual analog scaling judgments were also obtained from six voice experts using an interactive computer interface to quantify four voice attributes (i.e., overall quality, roughness, brokenness, breathiness) in a carefully psychoacoustically controlled environment, using the same reading passages as stimuli. Number and percent aperiodicity and phonation breaks correlated significanly with perceived overall voice quality, roughness, and brokenness before and after the BOTOX injection. Breathiness was correlated with aperidocity only prior to injection, while roughness also correlated with frequency shifts following injection. Factor analysis reduced perceived attributes to two principal components: glottal squeezing and breathiness. The acoustic measures demonstrated a strong regression relationship with perceived glottal squeezing, but no regression relationship with breathiness was observed. Implications for an analysis of pathologic voices will be discussed.
Aeroacoustic analysis of the human phonation process based on a hybrid acoustic PIV approach

Science.gov (United States)

Lodermeyer, Alexander; Tautz, Matthias; Becker, Stefan; Döllinger, Michael; Birk, Veronika; Kniesburges, Stefan

2018-01-01

The detailed analysis of sound generation in human phonation is severely limited as the accessibility to the laryngeal flow region is highly restricted. Consequently, the physical basis of the underlying fluid-structure-acoustic interaction that describes the primary mechanism of sound production is not yet fully understood. Therefore, we propose the implementation of a hybrid acoustic PIV procedure to evaluate aeroacoustic sound generation during voice production within a synthetic larynx model. Focusing on the flow field downstream of synthetic, aerodynamically driven vocal folds, we calculated acoustic source terms based on the velocity fields obtained by time-resolved high-speed PIV applied to the mid-coronal plane. The radiation of these sources into the acoustic far field was numerically simulated and the resulting acoustic pressure was finally compared with experimental microphone measurements. We identified the tonal sound to be generated downstream in a small region close to the vocal folds. The simulation of the sound propagation underestimated the tonal components, whereas the broadband sound was well reproduced. Our results demonstrate the feasibility to locate aeroacoustic sound sources inside a synthetic larynx using a hybrid acoustic PIV approach. Although the technique employs a 2D-limited flow field, it accurately reproduces the basic characteristics of the aeroacoustic field in our larynx model. In future studies, not only the aeroacoustic mechanisms of normal phonation will be assessable, but also the sound generation of voice disorders can be investigated more profoundly.
[Acoustic characteristics of adductor spasmodic dysphonia].

Science.gov (United States)

Yang, Yang; Wang, Li-Ping

2008-06-01

To explore the acoustic characteristics of adductor spasmodic dysphonia. The acoustic characteristics, including acoustic signal of recorded voice, three-dimensional sonogram patterns and subjective assessment of voice, between 10 patients (7 women, 3 men) with adductor spasmodic dysphonia and 10 healthy volunteers (5 women, 5 men), were compared. The main clinical manifestation of adductor spasmodic dysphonia included the disorders of sound quality, rhyme and fluency. It demonstrated the tension dysphonia when reading, acoustic jitter, momentary fluctuation of frequency and volume, voice squeezing, interruption, voice prolongation, and losing normal chime. Among 10 patients, there were 1 mild dysphonia (abnormal syllable number dysphonia (abnormal syllable number 25%-49%), 1 severe dysphonia (abnormal syllable number 50%-74%) and 2 extremely severe dysphonia (abnormal syllable number > or = 75%). The average reading time in 10 patients was 49 s, with reading time extension and aphasia area interruption in acoustic signals, whereas the average reading time in health control group was 30 s, without voice interruption. The aphasia ratio averaged 42%. The respective symptom syllable in different patients demonstrated in the three-dimensional sonogram. There were voice onset time prolongation, irregular, interrupted and even absent vowel formants. The consonant of symptom syllables displayed absence or prolongation of friction murmur in the block-friction murmur occasionally. The acoustic characteristics of adductor spasmodic dysphonia is the disorders of sound quality, rhyme and fluency. The three-dimensional sonogram of the symptom syllables show distinctive changes of proportional vowels or consonant phonemes.
Behavioural evidence of a dissociation between voice gender categorization and phoneme categorization using auditory morphed stimuli

Directory of Open Access Journals (Sweden)

Cyril R Pernet

2014-01-01

Full Text Available Both voice gender and speech perception rely on neuronal populations located in the peri-sylvian areas. However, whilst functional imaging studies suggest a left versus right hemisphere and anterior versus posterior dissociation between voice and speech categorization, psycholinguistic studies on talker variability suggest that these two processes (voice and speech categorization share common mechanisms. In this study, we investigated the categorical perception of voice gender (male vs. female and phonemes (/pa/ vs. /ta/ using the same stimulus continua generated by morphing. This allowed the investigation of behavioural differences while controlling acoustic characteristics, since the same stimuli were used in both tasks. Despite a higher acoustic dissimilarity between items during the phoneme categorization task (a male and female voice producing the same phonemes than the gender task (the same person producing 2 phonemes, results showed that speech information is being processed much faster than voice information. In addition, f0 or timbre equalization did not affect RT, which disagrees with the classical psycholinguistic models in which voice information is stripped away or normalized to access phonetic content. Also, despite similar response (percentages and perceptual (d’ curves, a reverse correlation analysis on acoustic features revealed, as expected, that the formant frequencies of the consonant distinguished stimuli in the phoneme task, but that only the vowel formant frequencies distinguish stimuli in the gender task. The 2nd set of results thus also disagrees with models postulating that the same acoustic information is used for voice and speech. Altogether these results suggest that voice gender categorization and phoneme categorization are dissociated at an early stage on the basis of different enhanced acoustic features that are diagnostic to the task at hand.
Reproducibility of Automated Voice Range Profiles, a Systematic Literature Review

DEFF Research Database (Denmark)

Printz, Trine; Rosenberg, Tine; Godballe, Christian

2018-01-01

literature on test-retest accuracy of the automated voice range profile assessment. Study design: Systematic review. Data sources: PubMed, Scopus, Cochrane Library, ComDisDome, Embase, and CINAHL (EBSCO). Methods: We conducted a systematic literature search of six databases from 1983 to 2016. The following......Objective: Reliable voice range profiles are of great importance when measuring effects and side effects from surgery affecting voice capacity. Automated recording systems are increasingly used, but the reproducibility of results is uncertain. Our objective was to identify and review the existing...... keywords were used: phonetogram, voice range profile, and acoustic voice analysis. Inclusion criteria were automated recording procedure, healthy voices, and no intervention between test and retest. Test-retest values concerning fundamental frequency and voice intensity were reviewed. Results: Of 483...
Voice stress analysis and evaluation

Science.gov (United States)

Haddad, Darren M.; Ratley, Roy J.

2001-02-01

Voice Stress Analysis (VSA) systems are marketed as computer-based systems capable of measuring stress in a person's voice as an indicator of deception. They are advertised as being less expensive, easier to use, less invasive in use, and less constrained in their operation then polygraph technology. The National Institute of Justice have asked the Air Force Research Laboratory for assistance in evaluating voice stress analysis technology. Law enforcement officials have also been asking questions about this technology. If VSA technology proves to be effective, its value for military and law enforcement application is tremendous.
The normative study of acoustic parameters in normal Egyptian ...

African Journals Online (AJOL)

Yehia A. Abo-Ras

2013-03-21

Mar 21, 2013 ... all children were subjected to computerized acoustic analysis using Multidimensional voice program ... cal quality is important for social relations to happen effectively. ... lish comparative parameters with the normal values of the acoustic ... from lower age ranges in the normative studies since the child's.
Probing echoic memory with different voices.

Science.gov (United States)

Madden, D J; Bastian, J

1977-05-01

Considerable evidence has indicated that some acoustical properties of spoken items are preserved in an "echoic" memory for approximately 2 sec. However, some of this evidence has also shown that changing the voice speaking the stimulus items has a disruptive effect on memory which persists longer than that of other acoustical variables. The present experiment examined the effect of voice changes on response bias as well as on accuracy in a recognition memory task. The task involved judging recognition probes as being present in or absent from sets of dichotically presented digits. Recognition of probes spoken in the same voice as that of the dichotic items was more accurate than recognition of different-voice probes at each of three retention intervals of up to 4 sec. Different-voice probes increased the likelihood of "absent" responses, but only up to a 1.4-sec delay. These shifts in response bias may represent a property of echoic memory which should be investigated further.
Investigating the Effects of Glottal Stop Productions on Voice in Children With Cleft Palate Using Multidimensional Voice Assessment Methods.

Science.gov (United States)

Aydınlı, Fatma Esen; Özcebe, Esra; Kulak Kayıkçı, Maviş E; Yılmaz, Taner; Özgür, Fatma F

2016-11-01

The aim was to investigate the effects of glottal stop productions (GS) on voice in children with cleft palate using multidimensional voice assessment methods. This is a prospective case-control study. Children with repaired cleft palate (n = 34) who did not have any vocal fold lesions were separated into two groups based on the results of the articulation test. The glottal stop group (GSG) consisted of 17 children who had GS. The control group (CG) consisted of an equal number of age- and gender-matched children who did not have GS. The voice evaluation protocol included acoustic analysis, Pediatric Voice Handicap Index (pVHI), and perceptual analysis (Grade, Roughness, Breathiness, Asthenia, Strain method). The velopharyngeal statuses of the groups were compared using the nasopharyngoscopy and the nasometer. The total pVHI score and the subscales of the pVHI were found to be significantly higher in the GSG. The F0, jitter, and shimmer were found to be numerically higher in the GSG with the difference being statistically significant in jitter (P speech and language pathology intervention including voice therapy techniques. Copyright Â© 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Forensic Automatic Speaker Recognition Based on Likelihood Ratio Using Acoustic-phonetic Features Measured Automatically

Directory of Open Access Journals (Sweden)

Huapeng Wang

2015-01-01

Full Text Available Forensic speaker recognition is experiencing a remarkable paradigm shift in terms of the evaluation framework and presentation of voice evidence. This paper proposes a new method of forensic automatic speaker recognition using the likelihood ratio framework to quantify the strength of voice evidence. The proposed method uses a reference database to calculate the within- and between-speaker variability. Some acoustic-phonetic features are extracted automatically using the software VoiceSauce. The effectiveness of the approach was tested using two Mandarin databases: A mobile telephone database and a landline database. The experiment's results indicate that these acoustic-phonetic features do have some discriminating potential and are worth trying in discrimination. The automatic acoustic-phonetic features have acceptable discriminative performance and can provide more reliable results in evidence analysis when fused with other kind of voice features.
Part Summary of the Project ‘Speakers’ Comfort’: Teachers’ Voice use in Teaching Environments

DEFF Research Database (Denmark)

Lyberg-Åhlander, Viveka; Rydell, Roland; Löfqvist, Anders

2015-01-01

Classroom acoustics not always take the speaker’s comfort into consideration. The purpose of the presented papers was to investigate voice use, vocal behavior and prevalence of voice problems in Swedish teaching staff. Ratings of features in the work-environment on voice use were explored in n...... = 487 teachers. Based on their answers the respondents were split into two groups: teachers with self-assessed voice problems and voice-healthy teachers. Teachers with voice problems and were matched to a voice-healthy colleague from the same school and were investigated and compared for clinical...... findings and for vocal behavior. Acoustic properties of their teaching environments were measured. Teachers with voice-problems were more affected by any loading factor in the work-environment and were more aware of the room acoustics. Differences between the groups were found during field...
Perceptual evaluation and acoustic analysis of pneumatic artificial larynx.

Science.gov (United States)

Xu, Jie Jie; Chen, Xi; Lu, Mei Ping; Qiao, Ming Zhe

2009-12-01

To investigate the perceptual and acoustic characteristics of the pneumatic artificial larynx (PAL) and evaluate its speech ability and clinical value. Prospective study. The study was conducted in the Voice Lab, Department of Otorhinolaryngology, The First Affiliated Hospital of Nanjing Medical University. Forty-six laryngectomy patients using the PAL were rated for intelligibility and fluency of speech. The voice signals of sustained vowel /a/ for 40 healthy controls and 42 successful patients using the PAL were measured by a computer system. The acoustic parameters and sound spectrographs were analyzed and compared between the two groups. Forty-two of 46 patients using the PAL (91.3%) acquired successful speech capability. The intelligibility scores of 42 successful PAL speakers ranged from 71 to 95 percent, and the intelligibility range of four unsuccessful speakers was 30 to 50 percent. The fluency was judged as good or excellent in 42 successful patients, and poor or fair in four unsuccessful patients. There was no significant difference in average fundamental frequency, maximum intensity, jitter, shimmer, and normalized noise energy (NNE) between 42 successful PAL speakers and 40 healthy controls, while the maximum phonation time (MPT) of PAL speakers was slightly lower than that of the controls. The sound spectrographs of the patients using the PAL approximated those of the healthy controls. The PAL has the advantage of a high percentage of successful vocal rehabilitation. PAL speech is fluent and intelligible. The acoustic characteristics of the PAL are similar to those of a normal voice.
Effects of Voice Therapy on Laryngeal Motor Units During Phonation in Chronic Superior Laryngeal Nerve Paresis Dysphonia.

Science.gov (United States)

Kaneko, Mami; Hitomi, Takefumi; Takekawa, Takashi; Tsuji, Takuya; Kishimoto, Yo; Hirano, Shigeru

2017-09-26

Injury to the superior laryngeal nerve can result in dysphonia, and in particular, loss of vocal range. It can be an especially difficult problem to address with either voice therapy or surgical intervention. Some clinicians and scientists suggest that combining vocal exercises with adjunctive neuromuscular electrical stimulation may enhance the positive effects of voice therapy for superior laryngeal nerve paresis (SLNP). However, the effects of voice therapy without neuromuscular electrical stimulation are unknown. The purpose of this retrospective study was to demonstrate the clinical effectiveness of voice therapy for rehabilitating chronic SLNP dysphonia in two subjects, using interspike interval (ISI) variability of laryngeal motor units by laryngeal electromyography (LEMG). Both patients underwent LEMG and were diagnosed with having 70% recruitment of the cricothyroid muscle, and 70% recruitment of the cricothyroid and thyroarytenoid muscles, respectively. Both patients received voice therapy for 3 months. Grade, roughness, breathiness, asthenia, and strain (GRBAS) scale, stroboscopic examination, aerodynamic assessment, acoustic analysis, and Voice Handicap Index-10 were performed before and after voice therapy. Mean ISI variability during steady phonation was also assessed. After voice therapy, both patients showed improvement in vocal assessments by acoustic, aerodynamic, GRBAS, and Voice Handicap Index-10 analysis. LEMG indicated shortened ISIs in both cases. This study suggests that voice therapy for chronic SLNP dysphonia can be useful for improving SLNP and voice quality. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Perceptual structure of adductor spasmodic dysphonia and its acoustic correlates.

Science.gov (United States)

Cannito, Michael P; Doiuchi, Maki; Murry, Thomas; Woodson, Gayle E

2012-11-01

To examine the perceptual structure of voice attributes in adductor spasmodic dysphonia (ADSD) before and after botulinum toxin treatment and identify acoustic correlates of underlying perceptual factors. Reliability of perceptual judgments is considered in detail. Pre- and posttreatment trial with comparison to healthy controls, using single-blind randomized listener judgments of voice qualities, as well as retrospective comparison with acoustic measurements. Oral readings were recorded from 42 ADSD speakers before and after treatment as well as from their age- and sex-matched controls. Experienced judges listened to speech samples and rated attributes of overall voice quality, breathiness, roughness, and brokenness, using computer-implemented visual analog scaling. Data were adjusted for regression to the mean and submitted to principal components factor analysis. Acoustic waveforms, extracted from the reading samples, were analyzed and measurements correlated with perceptual factor scores. Four reliable perceptual variables of ADSD voice were effectively reduced to two underlying factors that corresponded to hyperadduction, most strongly associated with roughness, and hypoadduction, most strongly associated with breathiness. After treatment, the hyperadduction factor improved, whereas the hypoadduction factor worsened. Statistically significant (P<0.01) correlations were observed between perceived roughness and four acoustic measures, whereas breathiness correlated with aperiodicity and cepstral peak prominence (CPPs). This study supported a two-factor model of ADSD, suggesting perceptual characterization by both hyperadduction and hypoadduction before and after treatment. Responses of the factors to treatment were consistent with previous research. Correlations among perceptual and acoustic variables suggested that multiple acoustic features contributed to the overall impression of roughness. Although CPPs appears to be a partial correlate of perceived
Speech masking and cancelling and voice obscuration

Science.gov (United States)

Holzrichter, John F.

2013-09-10

A non-acoustic sensor is used to measure a user's speech and then broadcasts an obscuring acoustic signal diminishing the user's vocal acoustic output intensity and/or distorting the voice sounds making them unintelligible to persons nearby. The non-acoustic sensor is positioned proximate or contacting a user's neck or head skin tissue for sensing speech production information.

Connections between voice ergonomic risk factors in classrooms and teachers' voice production.

Science.gov (United States)

Rantala, Leena M; Hakala, Suvi; Holmqvist, Sofia; Sala, Eeva

2012-01-01

The aim of the study was to investigate if voice ergonomic risk factors in classrooms correlated with acoustic parameters of teachers' voice production. The voice ergonomic risk factors in the fields of working culture, working postures and indoor air quality were assessed in 40 classrooms using the Voice Ergonomic Assessment in Work Environment - Handbook and Checklist. Teachers (32 females, 8 males) from the above-mentioned classrooms recorded text readings before and after a working day. Fundamental frequency, sound pressure level (SPL) and the slope of the spectrum (alpha ratio) were analyzed. The higher the number of the risk factors in the classrooms, the higher SPL the teachers used and the more strained the males' voices (increased alpha ratio) were. The SPL was already higher before the working day in the teachers with higher risk than in those with lower risk. In the working environment with many voice ergonomic risk factors, speakers increase voice loudness and use more strained voice quality (males). A practical implication of the results is that voice ergonomic assessments are needed in schools. Copyright © 2013 S. Karger AG, Basel.
Mobile Digital Recording: Adequacy of the iRig and iOS Device for Acoustic and Perceptual Analysis of Normal Voice.

Science.gov (United States)

Oliveira, Gisele; Fava, Gaetano; Baglione, Melody; Pimpinella, Michael

2017-03-01

To determine whether the iRig and iOS device recording system is comparable with a standard computer recording system for digital voice recording. Thirty-seven vocally healthy adults, between ages 20 and 62, with a mean age of 33.9 years, 13 males and 24 females, were recruited. Recordings were simultaneously digitalized in an iPad and iPhone using a unidirectional condenser microphone for smartphones/tablets (iRig Mic, IK Multimedia) and in a computer laptop (Dell-Inspiron) using a unidirectional condenser microphone (Samson-CL5) connected to a preamplifier with phantom power. Both microphones were lined up at an equal fixed distance from the subject's mouth. Speech tasks consisted of a sustained vowel "ah" at comfortable pitch/loudness, counting from 1 to 10, and a glissando "ah" from a low to a high note. The samples captured on the iOS devices were transferred via SoundCloud in WAV format, and analyzed using the Praat software. The acoustic parameters measured were mean, min, and max F0, SD F0, jitter local, jitter rap, jitter ppq5, jitter ddp, shimmer local, shimmer local-dB, shimmer apq3, shimmer apq5, shimmer apq11, shimmer dda, NHR, and HNR. There were no statistically significant differences for any parameter and speech task analyzed for both iOS devices as compared with the gold standard computer/preamp system (all P values > 0.050). In addition, there were no statistical differences in the perceptual identification of the recordings among devices (P device may provide reliable digital recording of normal voices. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
[The combined treatment of dysphonia in the subjects engaged in the voice and speech professions].

Science.gov (United States)

Stepanova, Yu E; Gotovyakhina, T V; Korneenkov, A A; Koren', E E

The objective of the present study was to evaluate the effectiveness of the application of homeovox for the combined treatment of small vocal cord nodules and acute laryngitis in the professional voice users. A total of 40 subjects presenting with dysphonia were examined after they were divided into two study groups and two groups of comparison depending on the nosological form of the pathological condition. The subjects comprising the study groups were given traditional therapy in the combination with the intake of homeovox whereas the patients included in the two groups of comparison received the traditional treatment alone. The outcome of the treatment was evaluated on days 1, 5, and 10 after the initiation of therapy based on the analysis of the changes in the videoendostroboscopic picture of the larynx and the acoustic characteristics obtained by the computer-assisted analysis of the voice. The analysis of the results of the combined treatment has demonstrated the statistically significant differences in some acoustic parameters of the voice between the subjects with small vocal cord nodules and acute laryngitis belonging to the study groups and the groups of comparison. It is concluded that the introduction of homeovox in the combined treatment of the patients presenting with the small nodules in the vocal cords and acute catarrhal laryngitis accelerates the recovery of the acoustic characteristics of the voice within various periods after the onset of the treatment in comparison with the patients treated with the use of traditional therapy alone.
Measurement of Voice Onset Time in Maxillectomy Patients

OpenAIRE

Hattori, Mariko; Sumita, Yuka I.; Taniguchi, Hisashi

2014-01-01

Objective speech evaluation using acoustic measurement is needed for the proper rehabilitation of maxillectomy patients. For digital evaluation of consonants, measurement of voice onset time is one option. However, voice onset time has not been measured in maxillectomy patients as their consonant sound spectra exhibit unique characteristics that make the measurement of voice onset time challenging. In this study, we established criteria for measuring voice onset time in maxillectomy patients ...
Vocal parameters and voice-related quality of life in adult women with and without ovarian function.

Science.gov (United States)

Ferraz, Pablo Rodrigo Rocha; Bertoldo, Simão Veras; Costa, Luanne Gabrielle Morais; Serra, Emmeliny Cristini Nogueira; Silva, Eduardo Magalhães; Brito, Luciane Maria Oliveira; Chein, Maria Bethânia da Costa

2013-05-01

To identify the perceptual and acoustic parameters of voice in adult women with and without ovarian function and its impact on quality of life related to voice. Cross-sectional and analytical study with 106 women divided into, two groups: G1, with ovarian function (n=43) and G2, without physiological ovarian function (n=63). The women were instructed to sustain the vowel "a" and the sounds of /s/ and /z/ in habitual pitch and loudness. They were also asked to classify their voices and answer the voice-related quality of life (V-RQOL) questionnaire. The perceptual analysis of the vocal samples was performed by three speech-language pathologists using the GRBASI (G: grade; R: roughness; B: breathness; A: asthenia; S: strain; I: instability) scale. The acoustic analysis was carried out with the software VoxMetria 2.7h (CTS Informatica). The data were analyzed using descriptive statistics. In the perceptual analysis, both groups showed a mild deviation for the parameters roughness, strain, and instability, but only G2 showed a mild impact for the overall degree of dysphonia. The mean of fundamental frequency was significantly lower for the G2, with a difference of 17.41Hz between the two groups. There was no impact on V-RQOL in any of the V-RQOL domains for this group. With the menopause, there is a change in women's voices, impacting on some voice parameters. However, there is no direct impact on their quality of life related to voice. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Is OperaVOX a clinically useful tool for the assessment of voice in a general ENT clinic?

Directory of Open Access Journals (Sweden)

Richard Teck Kee Siau

2017-04-01

Full Text Available Abstract Background Objective acoustic analysis is a key component of multidimensional voice assessment. OperaVOX is an iOS app which has been shown to be comparable to Multi Dimensional Voice Program for most principal measures of vocal function. As a relatively cheap, portable and easily accessible form of acoustic analysis, OperaVOX may be more clinically useful than laboratory-based software in many situations. This study aims to determine whether correlation exists between acoustic measurements obtained using OperaVOX, and perceptual evaluation of voice. Methods Forty-four voices from the multidisciplinary voice clinic were examined. Each voice was assessed blindly by a single experienced voice therapist using the GRBAS scale, and analysed using OperaVOX. The Spearman rank correlation co-efficient was calculated between each element of the GRBAS scale and acoustic measurements obtained by OperaVOX. Results Significant correlations were identified between GRBAS scores and OperaVOX parameters. Grade correlated significantly with jitter (ρ = 0.495, p < 0.05, shimmer (ρ = 0.385, p < 0.05, noise-to-harmonic ratio (NHR; ρ = 0.526, p < 0.05 and maximum phonation time (MPT; ρ = −0.415, p < 0.05. Roughness did not correlate with any of the measured variables. Breathiness correlated significantly with jitter (ρ = 0.342, p < 0.05, NHR (ρ = 0.344, p < 0.05 and MPT (ρ = −0.336, p < 0.05. Aesthenia correlated with NHR (ρ = 0.413, p < 0.05 and MPT (ρ = −0.399, p < 0.05. Strain correlated with Jitter (ρ = 0.560, p < 0.05, NHR (ρ = 0.600, p < 0.05 and MPT (ρ = −0.356, p < 0.05. Conclusions OperaVOX provides objective acoustic analysis which has shown statistically significant correlation to perceptual evaluation using the GRBAS scale. The accessibility of the software package makes it possible for a wide range of health practitioners, e.g. general ENT
Acute effects of inhaling Oud incense on voice of Saudi adults.

Science.gov (United States)

Mesallam, Tamer A; Farahat, Mohamed; Shoeib, Rasha; Alharethy, Sami; Alshahwan, Abdulaziz; Murry, Thomas; Almalkia, Khalid

2015-01-01

Like in most of the Arab countries, incense burning, including Oud, is widely used in Saudi Arabia. The widespread effects of the Oud incense on voice have not been examined. Thus, the aim of this study was to examine the short-term effects of Oud incense on laryngeal symptoms and voice acoustics in normal Saudi adults. A prospective study that has been carried out at King Abdulaziz University Hospital between July 2012 and Jan 2014. Study subjects were recruited on a volunteer basis. A total of 72 adults (44.4% males and 55.6 % females), were exposed to Oud incense smoke for 5 minutes while sitting 1 m away from an electrical sensor in a closed room. Symptom and acoustic voice analyses were performed pre-exposure and immediately post-exposure. A total of 27.8% of the subjects reported throat and voice symptoms after 5 minutes of exposure. Some frequency-related acoustic measures increased in male and female subjects after exposure to Oud incense. However, the difference between the pre- and post-exposure measures was not statistically significant. One third of the study subjects reported voice-related symptoms following exposure to Oud incense. Despite the absence of statistical significant difference, some frequency-based acoustic parameters increased following exposure to Oud incense smoke.
The interaction of tone with voicing and foot structure: evidence from Kera phonetics and phonology

Science.gov (United States)

Pearce, Mary Dorothy

This thesis uses acoustic measurements as a basis for the phonological analysis of the interaction of tone with voicing and foot structure in Kera (a Chadic language). In both tone spreading and vowel harmony, the iambic foot acts as a domain for spreading. Further evidence for the foot comes from measurements of duration, intensity and vowel quality. Kera is unusual in combining a tone system with a partially independent metrical system based on iambs. In words containing more than one foot, the foot is the tone bearing unit (TBU), but in shorter words, the TBU is the syllable. In perception and production experiments, results show that Kera speakers, unlike English and French, use the fundamental frequency as the principle cue to 'Voicing" contrast. Voice onset time (VOT) has only a minor role. Historically, tones probably developed from voicing through a process of tonogenesis, but synchronically, the feature voice is no longer contrastive and VOT is used in an enhancing role. Some linguists have claimed that Kera is a key example for their controversial theory of long-distance voicing spread. But as voice is not part of Kera phonology, this thesis gives counter-evidence to the voice spreading claim. An important finding from the experiments is that the phonological grammars are different between village women, men moving to town and town men. These differences are attributed to French contact. The interaction between Kera tone and voicing and contact with French have produced changes from a 2-way voicing contrast, through a 3-way tonal contrast, to a 2-way voicing contrast plus another contrast with short VOT. These diachronic and synchronic tone/voicing facts are analysed using laryngeal features and Optimality Theory. This thesis provides a body of new data, detailed acoustic measurements, and an analysis incorporating current theoretical issues in phonology, which make it of interest to Africanists and theoreticians alike.
Voice Quality and Gender Stereotypes: A Study of Lebanese Women With Reinke's Edema.

Science.gov (United States)

Matar, Nayla; Portes, Cristel; Lancia, Leonardo; Legou, Thierry; Baider, Fabienne

2016-12-01

Women with Reinke's edema (RW) report being mistaken for men during telephone conversations. For this reason, their masculine-sounding voices are interesting for the study of gender stereotypes. The study's objective is to verify their complaint and to understand the cues used in gender identification. Using a self-evaluation study, we verified RW's perception of their own voices. We compared the acoustic parameters of vowels produced by 10 RW to those produced by 10 men and 10 women with healthy voices (hereafter referred to as NW) in Lebanese Arabic. We conducted a perception study for the evaluation of RW, healthy men's, and NW voices by naïve listeners. RW self-evaluated their voices as masculine and their gender identities as feminine. The acoustic parameters that distinguish RW from NW voices concern fundamental frequency, spectral slope, harmonicity of the voicing signal, and complexity of the spectral envelope. Naïve listeners very often rate RW as surely masculine. Listeners may rate RW's gender incorrectly. These incorrect gender ratings are correlated with acoustic measures of fundamental frequency and voice quality. Further investigations will reveal the contribution of each of these parameters to gender perception and guide the treatment plan of patients complaining of a gender ambiguous voice.
Improvement of electrolaryngeal speech quality using a supraglottal voice source with compensation of vocal tract characteristics.

Science.gov (United States)

Wu, Liang; Wan, Congying; Wang, Supin; Wan, Mingxi

2013-07-01

Electrolarynx (EL) is a medical speech-recovery device designed for patients who have lost their original voice box due to laryngeal cancer. As a substitute for human larynx, the current commercial EL voice source cannot reconstruct natural EL speech under laryngectomy conditions. To eliminate the abnormal acoustic properties of EL speech, a supraglottal voice source with compensation of vocal tract characteristics was proposed and provided through an experimental EL(SGVS-EL) system. The acoustic analyses of simulated EL speech and reconstructed EL speech produced with different voice sources were performed in the normal subject and laryngectomee. The results indicated that the supraglottal voice source was successful in improving the acoustic properties of EL speech by enhancing low- frequency energy, correcting the shifted formants to normal range, and eliminating the visible spectral zeros. Both normal subject and laryngectomee also produced more natural vowels using SGVS-EL than commercial EL, even if the vocal tract parameter was substituted and the supraglottal voice source was biased to a certain degree. Therefore, supraglottal voice source is a feasible and effective approach to improving the acoustic quality of EL speech.
Speaker comfort and increase of voice level in lecture rooms

DEFF Research Database (Denmark)

Brunskog, Jonas; Gade, Anders Christian; Bellester, G P

2008-01-01

Teachers often suffer health problems or tension related to their voice. These problems may be related to there working environment, including room acoustics of the lecture rooms which forces them to stress their voices. The present paper describes a first effort in finding relationships between...... were also measured in the rooms and subjective impressions from about 20 persons who had experience talking in these rooms were collected as well. Analysis of the data revealed significant differences in the sound power produced by the speaker in the different rooms. It was also found...
Role of the Internal Superior Laryngeal Nerve in the Motor Responses of Vocal Cords and the Related Voice Acoustic Changes

Science.gov (United States)

Seifpanahi, Sadegh; Izadi, Farzad; Jamshidi, Ali-Ashraf; Torabinezhad, Farhad; Sarrafzadeh, Javad; Mohammadi, Siavash

2016-01-01

Background: Repeated efforts by researchers to impose voice changes by laryngeal surface electrical stimulation (SES) have come to no avail. This present pre-experimental study employed a novel method for SES application so as to evoke the motor potential of the internal superior laryngeal nerve (ISLN) and create voice changes. Methods: Thirty-two normal individuals (22 females and 10 males) participated in this study. The subjects were selected from the students of Iran University of Medical Sciences in 2014. Two monopolar active electrodes were placed on the thyrohyoid space at the location of the ISLN entrance to the larynx and 1 dispersive electrode was positioned on the back of the neck. A current with special programmed parameters was applied to stimulate the ISLN via the active electrodes and simultaneously the resultant acoustic changes were evaluated. All the means of the acoustic parameters during SES and rest periods were compared using the paired t-test. Results: The findings indicated significant changes (P=0.00) in most of the acoustic parameters during SES presentation compared to them at rest. The mean of fundamental frequency standard deviation (SD F0) at rest was 1.54 (SD=0.55) versus 4.15 (SD=3.00) for the SES period. The other investigated parameters comprised fundamental frequency (F0), minimum F0, jitter, shimmer, harmonic-to-noise ratio (HNR), mean intensity, and minimum intensity. Conclusion: These findings demonstrated significant changes in most of the important acoustic features, suggesting that the stimulation of the ISLN via SES could induce motor changes in the vocal folds. The clinical applicability of the method utilized in the current study in patients with vocal fold paralysis requires further research. PMID:27582586
Role of the Internal Superior Laryngeal Nerve in the Motor Responses of Vocal Cords and the Related Voice Acoustic Changes

Directory of Open Access Journals (Sweden)

Sadegh Seifpanahi

2016-09-01

Full Text Available Background: Repeated efforts by researchers to impose voice changes by laryngeal surface electrical stimulation (SES have come to no avail. This present pre-experimental study employed a novel method for SES application so as to evoke the motor potential of the internal superior laryngeal nerve (ISLN and create voice changes. Methods: Thirty-two normal individuals (22 females and 10 males participated in this study. The subjects were selected from the students of Iran University of Medical Sciences in 2014. Two monopolar active electrodes were placed on the thyrohyoid space at the location of the ISLN entrance to the larynx and 1 dispersive electrode was positioned on the back of the neck. A current with special programmed parameters was applied to stimulate the ISLN via the active electrodes and simultaneously the resultant acoustic changes were evaluated. All the means of the acoustic parameters during SES and rest periods were compared using the paired t-test. Results: The findings indicated significant changes (P=0.00 in most of the acoustic parameters during SES presentation compared to them at rest. The mean of fundamental frequency standard deviation (SD F0 at rest was 1.54 (SD=0.55 versus 4.15 (SD=3.00 for the SES period. The other investigated parameters comprised fundamental frequency (F0, minimum F0, jitter, shimmer, harmonic-to-noise ratio (HNR, mean intensity, and minimum intensity. Conclusion: These findings demonstrated significant changes in most of the important acoustic features, suggesting that the stimulation of the ISLN via SES could induce motor changes in the vocal folds. The clinical applicability of the method utilized in the current study in patients with vocal fold paralysis requires further research.
[The impact of vibratory stimulation therapy on voice quality in hyperfunctional occupational dysphonia].

Science.gov (United States)

Kosztyła-Hojna, Bożena; Kuryliszyn-Moskal, Anna; Rogowski, Marek; Moskal, Diana; Dakowicz, Agnieszka; Falkowski, Dawid; Kasperuk, Joanna

2012-01-01

Hyperfunctional dysphonia is the most frequent type of occupational functional dysphonia. Pharmacotherapy, physiotherapy and psychotherapy are used in the treatment of occupational dysphonia. Vibratory massages of the regions of the larynx relax the external muscles of neck, which have an indirect impact on the tension of the vocal folds. The aim of the study is to assess the impact of vibratory stimulation therapy on voice quality in patients with hyperfunctional occupational dysphonia treated pharmacologically. Forty patients with hyperfunctional occupational dysphonia treated phoniatrically in the Phoniatric Outpatient Clinic were included in the study. Patients were divided into two groups. Group I consisted of 20 patients treated pharmacologically. In group II, including 20 patients, apart from pharmacotherapy the vibratory stimulation therapy by the device of VR type (CyberBioMed LLC) was used. In the analysis of voice quality the evaluation of the vocal folds vibration using videolaryngostroboscopy and acoustic assessment of voice were conducted. The perceptual assessment of voice, the visualization of the vocal folds vibration in stroboscopic examination of the larynx and the acoustic assessment of voice enable the appropriate diagnostics of the clinical type and voice quality in hyperfunctional dysphonia. The tension of superficial and deep muscles of neck has the impact on the phonatory function of the larynx. Pharmacological treatment improves the voice quality in hyperfunctional occupational dysphonia. Pharmacological treatment combines with the relaxation of muscles of neck using the device of VR type significantly improve voice quality in hyperfunctional occupational dysphonia. Copyright © 2012 Polish Otolaryngology Society. Published by Elsevier Urban & Partner Sp. z.o.o. All rights reserved.
Effect of voice therapy in sulcus vocalis: A single case study

Directory of Open Access Journals (Sweden)

R. Rajasudhakar

2016-02-01

Full Text Available Background: Sulcus vocalis is a structural deformity of the vocal ligament. It is the focal invagination of the epithelium deeply attaching to the vocal ligament. There is a dearth of literature on the outcome of voice therapy in sulcus vocalis condition.Objective: The primary objective of this study was to document voice characteristics of sulcus vocalis and the secondary objective was to establish the efficacy of voice therapy in a patient with sulcus vocalis.Method: A trial of voice therapy was given to the client who was diagnosed as having sulcus vocalis. Boon’s facilitation techniques were used in voice therapy along with other techniques such as breath holding and push and pull approach prior to surgery. Acoustic, aerodynamic, perceptual, quantitative measures of voice quality and self-rating measurements were performed before and after voice therapy.Results: Improvement was noticed in 10/10 acoustic, 4/4 aerodynamic, perceptual, dysphonia severity index and voice handicap index scores, which hinted that voice therapy can be an option critically for clients with sulcus vocalis in the initial stage.Conclusion: Voice therapy showed promising improvement in the study and it must be recommended as the initial treatment option before any surgical management.
The role of classroom acoustics on vocal intensity regulation and speakers’ comfort

DEFF Research Database (Denmark)

Pelegrin Garcia, David

Teachers are one of the professional groups with the highest risk of suffering from voice disorders. Teachers point out classroom acoustics among the potential hazards affecting their vocal health, together with air dryness, background noise, and other environmental factors. The present project has...... investigated the relationships between the classroom acoustic condition and teachers’ voice, focusing on their vocal intensity, and between the classroom acoustic condition and the sensation of acoustic comfort for a speaker. In the presence of low background noise levels, teachers were found to adjust...... their vocal intensity according to the room gain or voice support of the classroom, which are equivalent objective measures that quantify the amplification of one’s own voice in a room due to the reflections at the room boundaries. Most of the vocal intensity variation among classrooms was due to differences...
Finite element modelling of vocal tract changes after voice therapy

Czech Academy of Sciences Publication Activity Database

Vampola, T.; Laukkanen, A. M.; Horáček, Jaromír; Švec, J. G.

2011-01-01

Roč. 5, č. 1 (2011), s. 77-88 ISSN 1802-680X R&D Projects: GA ČR GA101/08/1155 Institutional research plan: CEZ:AV0Z20760514 Keywords : biomechanics of human voice * voice production modelling * vocal excersing * voice training Subject RIV: BI - Acoustics http://www.kme.zcu.cz/acm/index.php/acm/article/view/138
Risk factors for voice quality after radiotherapy for early glottic cancer

International Nuclear Information System (INIS)

Hocevar-Boltezar, Irena; Zargi, Miha; Strojan, Primoz

2009-01-01

Background and purpose: In the majority of patients irradiated for early glottic cancer an abnormal voice was reported. The purpose of the study was to determine the factors influencing voice quality after radiotherapy for T1 glottic cancer. Methods: The voices of 75 male patients irradiated for T1 glottic carcinoma were assessed subjectively and objectively by acoustic analyses and aerodynamic measurements. The laryngeal function and morphology were evaluated by videolaryngostroboscopy. The data on smoking habits, the associated diseases influencing voice quality, the extent of the tumor, the type of biopsy, and the irradiation technique were collected from the medical records. The data on the factors influencing voice quality were compared for patients with a normal/near-normal voice and those with a hoarse voice. Results: Voice quality was at least slightly abnormal in 94.7% and 81.3% of patients, when assessed perceptively and objectively, respectively. Smoking after the completed treatment, more severe morphologic alterations of the vocal folds, dryness of the throat, incomplete closure of the vocal folds and functional voice disorders expressed as supraglottic activity adversely influenced the voice quality. A good correlation between the perceptive voice assessment and the acoustic analyses was established. Conclusions: After the successful irradiation for T1 glottic carcinoma, the great majority of the patients have at least a slightly hoarse voice. A better voice outcome could be achieved if radiotherapy was followed by the patient's cessation of smoking and the appropriate voice therapy.
Trends in Singing Voice Research: An Innovative Approach.

Science.gov (United States)

Pestana, Pedro Melo; Vaz-Freitas, Susana; Manso, Maria Conceição

2018-01-11

The objectives of this study were to trace and describe research patterns in singing voice, to compare the amount of published research over time, to identify journals that published most papers on "singing voice," and to establish the most frequent research topics. The study uses qualitative and quantitative approaches through descriptive statistics, text mining, and clustering. The authors conducted a search to identify scientific papers. The titles and abstracts were analyzed regarding word frequency and relations between them, through hierarchical cluster analysis and co-occurrence networks. The frequency of journals was calculated, as well as the amount of papers across time. Since 1949, 754 papers were published and an increase was noticed. Even though 162 journals were identified by the authors, the Journal of Voice holds the majority of papers, in every analyzed period. An evolution of studied topics is described. Up to 2010, the main theme was professional singers, especially classical and opera interpreters. Since then, voice quality and the effects of training gathered more attention. The growing interest in singing has been conspicuous since the first indexed paper. However, it has been slightly slowing down. Until 2010, great importance was given to the voice quality of singers and their occupational demands. Acoustic analysis was widely used to study the effects of training. Since 2010, the concern with functionality is increasing, rather than the organic voice structures. Musical perception studies have been a trend, as well as the use of electroglottography. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Voice Quality after Treatment for T1a Glottic Carcinoma - Radiotherapy Versus Laser Cordectomy

International Nuclear Information System (INIS)

Krengli, Marco; Policarpo, Mario; Manfredda, Irene; Aluffi, Paolo; Gambaro, Giuseppina; Panella, Massimiliano; Pia, Francesco

2004-01-01

The purpose of this study was to assess the anatomic and functional outcomes and compare the voice quality in patients affected by T1a glottic carcinoma treated with curative intent with radiotherapy or laser cordectomy. Fifty-seven cases were analysed: 27 after curative radiotherapy and 30 after laser cordectomy. All patients were studied with videolaryngostroboscopy, voice analysis by narrow spectrogram, and vocal parameters (Jitter, Shimmer, noise/harmonic ratio, and diplophonia). Videolaryngostroboscopy showed severe glottic inadequacy in 25% of cases treated with radiation and insufficient compensation 'ventricular band' or 'with arytenoid hyperadduction' in 65% of cases after surgery. Severe dysphonia on the electro-acoustic analysis of voice was observed in 25% of cases after radiation and 70% after laser (p<0.001). Fundamental frequency and vocal parameters showed more favourable results in the radiation group (p<0.001). Voice assessment showed better results after radiotherapy compared with laser cordectomy. Voice outcome should be carefully considered in the treatment decision for T1 glottic carcinoma

FE Modeling of Human Vocal Tract Acoustics. Part I: Production of Czech Vowels

Czech Academy of Sciences Publication Activity Database

Vampola, T.; Horáček, Jaromír; Švec, J. G.

2008-01-01

Roč. 94, č. 3 (2008), s. 433-447 ISSN 1610-1928 R&D Projects: GA ČR GA106/04/1025 Institutional research plan: CEZ:AV0Z20760514; CEZ:AV0Z10100502 Keywords : biomechanics of voice * FE models of human vocaltract * acoustic modal analysis Subject RIV: BI - Acoustics Impact factor: 0.538, year: 2008
Relationship Between Voice and Motor Disabilities of Parkinson's Disease.

Science.gov (United States)

Majdinasab, Fatemeh; Karkheiran, Siamak; Soltani, Majid; Moradi, Negin; Shahidi, Gholamali

2016-11-01

To evaluate voice of Iranian patients with Parkinson's disease (PD) and find any relationship between motor disabilities and acoustic voice parameters as speech motor components. We evaluated 27 Farsi-speaking PD patients and 21 age- and sex-matched healthy persons as control. Motor performance was assessed by the Unified Parkinson's Disease Rating Scale part III and Hoehn and Yahr rating scale in the "on" state. Acoustic voice evaluation, including fundamental frequency (f0), standard deviation of f0, minimum of f0, maximum of f0, shimmer, jitter, and harmonic to noise ratio, was done using the Praat software via /a/ prolongation. No difference was seen between the voice of the patients and the voice of the controls. f0 and its variation had a significant correlation with the duration of the disease, but did not have any relationships with the Unified Parkinson's Disease Rating Scale part III. Only limited relationship was observed between voice and motor disabilities. Tremor is an important main feature of PD that affects motor and phonation systems. Females had an older age at onset, more prolonged disease, and more severe motor disabilities (not statistically significant), but phonation disorders were more frequent in males and showed more relationship with severity of motor disabilities. Voice is affected by PD earlier than many other motor components and is more sensitive to disease progression. Tremor is the most effective part of PD that impacts voice. PD has more effect on voice of male versus female patients. Copyright Â© 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Performance of Phonatory Deviation Diagrams in Synthesized Voice Analysis.

Science.gov (United States)

Lopes, Leonardo Wanderley; da Silva, Karoline Evangelista; da Silva Evangelista, Deyverson; Almeida, Anna Alice; Silva, Priscila Oliveira Costa; Lucero, Jorge; Behlau, Mara

2018-05-02

To analyze the performance of a phonatory deviation diagram (PDD) in discriminating the presence and severity of voice deviation and the predominant voice quality of synthesized voices. A speech-language pathologist performed the auditory-perceptual analysis of the synthesized voice (n = 871). The PDD distribution of voice signals was analyzed according to area, quadrant, shape, and density. Differences in signal distribution regarding the PDD area and quadrant were detected when differentiating the signals with and without voice deviation and with different predominant voice quality. Differences in signal distribution were found in all PDD parameters as a function of the severity of voice disorder. The PDD area and quadrant can differentiate normal voices from deviant synthesized voices. There are differences in signal distribution in PDD area and quadrant as a function of the severity of voice disorder and the predominant voice quality. However, the PDD area and quadrant do not differentiate the signals as a function of severity of voice disorder and differentiated only the breathy and rough voices from the normal and strained voices. PDD density is able to differentiate only signals with moderate and severe deviation. PDD shape shows differences between signals with different severities of voice deviation. © 2018 S. Karger AG, Basel.
Preliminary study of acoustic analysis for evaluating speech-aid oral prostheses: Characteristic dips in octave spectrum for comparison of nasality.

Science.gov (United States)

Chang, Yen-Liang; Hung, Chao-Ho; Chen, Po-Yueh; Chen, Wei-Chang; Hung, Shih-Han

2015-10-01

Acoustic analysis is often used in speech evaluation but seldom for the evaluation of oral prostheses designed for reconstruction of surgical defect. This study aimed to introduce the application of acoustic analysis for patients with velopharyngeal insufficiency (VPI) due to oral surgery and rehabilitated with oral speech-aid prostheses. The pre- and postprosthetic rehabilitation acoustic features of sustained vowel sounds from two patients with VPI were analyzed and compared with the acoustic analysis software Praat. There were significant differences in the octave spectrum of sustained vowel speech sound between the pre- and postprosthetic rehabilitation. Acoustic measurements of sustained vowels for patients before and after prosthetic treatment showed no significant differences for all parameters of fundamental frequency, jitter, shimmer, noise-to-harmonics ratio, formant frequency, F1 bandwidth, and band energy difference. The decrease in objective nasality perceptions correlated very well with the decrease in dips of the spectra for the male patient with a higher speech bulb height. Acoustic analysis may be a potential technique for evaluating the functions of oral speech-aid prostheses, which eliminates dysfunctions due to the surgical defect and contributes to a high percentage of intelligible speech. Octave spectrum analysis may also be a valuable tool for detecting changes in nasality characteristics of the voice during prosthetic treatment of VPI. Copyright © 2014. Published by Elsevier B.V.
Speakers comfort and voice use in different environments and babble-noise. What are the effects on effort and cognition?

DEFF Research Database (Denmark)

Lyberg-Åhlander, Viveka; von Lochow, Heike; Brunskog, Jonas

2017-01-01

Teachers often report voice problems related to the occupational environment, and voice problems are more prevalent in teaching than in other occupations. Relationships between objectively measurable acoustical parameters and voice use have been shown. Speakers have been shown to be able to predi...... a correlation to cognitive aspects. Listener assessments and the data from the voice accumulator will be presented. This knowledge may contribute to the area of classroom acoustics and speakers’ comfort in general....
Comparative analysis of perceptual evaluation, acoustic analysis and indirect laryngoscopy for vocal assessment of a population with vocal complaint.

Science.gov (United States)

Nemr, Kátia; Amar, Ali; Abrahão, Marcio; Leite, Grazielle Capatto de Almeida; Köhle, Juliana; Santos, Alexandra de O; Correa, Luiz Artur Costa

2005-01-01

As a result of technology evolution and development, methods of voice evaluation have changed both in medical and speech and language pathology practice. To relate the results of perceptual evaluation, acoustic analysis and medical evaluation in the diagnosis of vocal and/or laryngeal affections of the population with vocal complaint. Clinical prospective. 29 people that attended vocal health protection campaign were evaluated. They were submitted to perceptual evaluation (AFPA), acoustic analysis (AA), indirect laryngoscopy (LI) and telelaryngoscopy (TL). Correlations between medical and speech language pathology evaluation methods were established, verifying possible statistical signification with the application of Fischer Exact Test. There were statistically significant results in the correlation between AFPA and LI, AFPA and TL, LI and TL. This research study conducted in a vocal health protection campaign presented correlations between speech language pathology evaluation and perceptual evaluation and clinical evaluation, as well as between vocal affection and/or laryngeal medical exams.
Acoustic analysis assessment in speech pathology detection

Directory of Open Access Journals (Sweden)

Panek Daria

2015-09-01

Full Text Available Automatic detection of voice pathologies enables non-invasive, low cost and objective assessments of the presence of disorders, as well as accelerating and improving the process of diagnosis and clinical treatment given to patients. In this work, a vector made up of 28 acoustic parameters is evaluated using principal component analysis (PCA, kernel principal component analysis (kPCA and an auto-associative neural network (NLPCA in four kinds of pathology detection (hyperfunctional dysphonia, functional dysphonia, laryngitis, vocal cord paralysis using the a, i and u vowels, spoken at a high, low and normal pitch. The results indicate that the kPCA and NLPCA methods can be considered a step towards pathology detection of the vocal folds. The results show that such an approach provides acceptable results for this purpose, with the best efficiency levels of around 100%. The study brings the most commonly used approaches to speech signal processing together and leads to a comparison of the machine learning methods determining the health status of the patient
Your Cheatin' Voice Will Tell on You: Detection of Past Infidelity from Voice.

Science.gov (United States)

Hughes, Susan M; Harrison, Marissa A

2017-01-01

Evidence suggests that many physical, behavioral, and trait qualities can be detected solely from the sound of a person's voice, irrespective of the semantic information conveyed through speech. This study examined whether raters could accurately assess the likelihood that a person has cheated on committed, romantic partners simply by hearing the speaker's voice. Independent raters heard voice samples of individuals who self-reported that they either cheated or had never cheated on their romantic partners. To control for aspects that may clue a listener to the speaker's mate value, we used voice samples that did not differ between these groups for voice attractiveness, age, voice pitch, and other acoustic measures. We found that participants indeed rated the voices of those who had a history of cheating as more likely to cheat. Male speakers were given higher ratings for cheating, while female raters were more likely to ascribe the likelihood to cheat to speakers. Additionally, we manipulated the pitch of the voice samples, and for both sexes, the lower pitched versions were consistently rated to be from those who were more likely to have cheated. Regardless of the pitch manipulation, speakers were able to assess actual history of infidelity; the one exception was that men's accuracy decreased when judging women whose voices were lowered. These findings expand upon the idea that the human voice may be of value as a cheater detection tool and very thin slices of vocal information are all that is needed to make certain assessments about others.
Perceptual Adaptation of Voice Gender Discrimination with Spectrally Shifted Vowels

Science.gov (United States)

Li, Tianhao; Fu, Qian-Jie

2011-01-01

Purpose: To determine whether perceptual adaptation improves voice gender discrimination of spectrally shifted vowels and, if so, which acoustic cues contribute to the improvement. Method: Voice gender discrimination was measured for 10 normal-hearing subjects, during 5 days of adaptation to spectrally shifted vowels, produced by processing the…
Panel acoustic contribution analysis.

Science.gov (United States)

Wu, Sean F; Natarajan, Logesh Kumar

2013-02-01

Formulations are derived to analyze the relative panel acoustic contributions of a vibrating structure. The essence of this analysis is to correlate the acoustic power flow from each panel to the radiated acoustic pressure at any field point. The acoustic power is obtained by integrating the normal component of the surface acoustic intensity, which is the product of the surface acoustic pressure and normal surface velocity reconstructed by using the Helmholtz equation least squares based nearfield acoustical holography, over each panel. The significance of this methodology is that it enables one to analyze and rank relative acoustic contributions of individual panels of a complex vibrating structure to acoustic radiation anywhere in the field based on a single set of the acoustic pressures measured in the near field. Moreover, this approach is valid for both interior and exterior regions. Examples of using this method to analyze and rank the relative acoustic contributions of a scaled vehicle cabin are demonstrated.
Voice Quality after Treatment for T1a Glottic Carcinoma - Radiotherapy Versus Laser Cordectomy

Energy Technology Data Exchange (ETDEWEB)

Krengli, Marco; Policarpo, Mario; Manfredda, Irene; Aluffi, Paolo; Gambaro, Giuseppina; Panella, Massimiliano; Pia, Francesco [Univ. of Piemonte Orientale ' Amedeo Avogadro' , Novara (Italy). Div. of Radiotherapy

2004-04-01

The purpose of this study was to assess the anatomic and functional outcomes and compare the voice quality in patients affected by T1a glottic carcinoma treated with curative intent with radiotherapy or laser cordectomy. Fifty-seven cases were analysed: 27 after curative radiotherapy and 30 after laser cordectomy. All patients were studied with videolaryngostroboscopy, voice analysis by narrow spectrogram, and vocal parameters (Jitter, Shimmer, noise/harmonic ratio, and diplophonia). Videolaryngostroboscopy showed severe glottic inadequacy in 25% of cases treated with radiation and insufficient compensation 'ventricular band' or 'with arytenoid hyperadduction' in 65% of cases after surgery. Severe dysphonia on the electro-acoustic analysis of voice was observed in 25% of cases after radiation and 70% after laser (p<0.001). Fundamental frequency and vocal parameters showed more favourable results in the radiation group (p<0.001). Voice assessment showed better results after radiotherapy compared with laser cordectomy. Voice outcome should be carefully considered in the treatment decision for T1 glottic carcinoma.
Start/End Delays of Voiced and Unvoiced Speech Signals

Energy Technology Data Exchange (ETDEWEB)

Herrnstein, A

1999-09-24

Recent experiments using low power EM-radar like sensors (e.g, GEMs) have demonstrated a new method for measuring vocal fold activity and the onset times of voiced speech, as vocal fold contact begins to take place. Similarly the end time of a voiced speech segment can be measured. Secondly it appears that in most normal uses of American English speech, unvoiced-speech segments directly precede or directly follow voiced-speech segments. For many applications, it is useful to know typical duration times of these unvoiced speech segments. A corpus, assembled earlier of spoken ''Timit'' words, phrases, and sentences and recorded using simultaneously measured acoustic and EM-sensor glottal signals, from 16 male speakers, was used for this study. By inspecting the onset (or end) of unvoiced speech, using the acoustic signal, and the onset (or end) of voiced speech using the EM sensor signal, the average duration times for unvoiced segments preceding onset of vocalization were found to be 300ms, and for following segments, 500ms. An unvoiced speech period is then defined in time, first by using the onset of the EM-sensed glottal signal, as the onset-time marker for the voiced speech segment and end marker for the unvoiced segment. Then, by subtracting 300ms from the onset time mark of voicing, the unvoiced speech segment start time is found. Similarly, the times for a following unvoiced speech segment can be found. While data of this nature have proven to be useful for work in our laboratory, a great deal of additional work remains to validate such data for use with general populations of users. These procedures have been useful for applying optimal processing algorithms over time segments of unvoiced, voiced, and non-speech acoustic signals. For example, these data appear to be of use in speaker validation, in vocoding, and in denoising algorithms.
How do you say 'hello'? Personality impressions from brief novel voices.

Science.gov (United States)

McAleer, Phil; Todorov, Alexander; Belin, Pascal

2014-01-01

On hearing a novel voice, listeners readily form personality impressions of that speaker. Accurate or not, these impressions are known to affect subsequent interactions; yet the underlying psychological and acoustical bases remain poorly understood. Furthermore, hitherto studies have focussed on extended speech as opposed to analysing the instantaneous impressions we obtain from first experience. In this paper, through a mass online rating experiment, 320 participants rated 64 sub-second vocal utterances of the word 'hello' on one of 10 personality traits. We show that: (1) personality judgements of brief utterances from unfamiliar speakers are consistent across listeners; (2) a two-dimensional 'social voice space' with axes mapping Valence (Trust, Likeability) and Dominance, each driven by differing combinations of vocal acoustics, adequately summarises ratings in both male and female voices; and (3) a positive combination of Valence and Dominance results in increased perceived male vocal Attractiveness, whereas perceived female vocal Attractiveness is largely controlled by increasing Valence. Results are discussed in relation to the rapid evaluation of personality and, in turn, the intent of others, as being driven by survival mechanisms via approach or avoidance behaviours. These findings provide empirical bases for predicting personality impressions from acoustical analyses of short utterances and for generating desired personality impressions in artificial voices.
How do you say 'hello'? Personality impressions from brief novel voices.

Directory of Open Access Journals (Sweden)

Phil McAleer

Full Text Available On hearing a novel voice, listeners readily form personality impressions of that speaker. Accurate or not, these impressions are known to affect subsequent interactions; yet the underlying psychological and acoustical bases remain poorly understood. Furthermore, hitherto studies have focussed on extended speech as opposed to analysing the instantaneous impressions we obtain from first experience. In this paper, through a mass online rating experiment, 320 participants rated 64 sub-second vocal utterances of the word 'hello' on one of 10 personality traits. We show that: (1 personality judgements of brief utterances from unfamiliar speakers are consistent across listeners; (2 a two-dimensional 'social voice space' with axes mapping Valence (Trust, Likeability and Dominance, each driven by differing combinations of vocal acoustics, adequately summarises ratings in both male and female voices; and (3 a positive combination of Valence and Dominance results in increased perceived male vocal Attractiveness, whereas perceived female vocal Attractiveness is largely controlled by increasing Valence. Results are discussed in relation to the rapid evaluation of personality and, in turn, the intent of others, as being driven by survival mechanisms via approach or avoidance behaviours. These findings provide empirical bases for predicting personality impressions from acoustical analyses of short utterances and for generating desired personality impressions in artificial voices.
Effect of Spinal Manipulative Therapy on the Singing Voice.

Science.gov (United States)

Fachinatto, Ana Paula A; Duprat, André de Campos; Silva, Marta Andrada E; Bracher, Eduardo Sawaya Botelho; Benedicto, Camila de Carvalho; Luz, Victor Botta Colangelo; Nogueira, Maruan Nogueira; Fonseca, Beatriz Suster Gomes

2015-09-01

This study investigated the effect of spinal manipulative therapy (SMT) on the singing voice of male individuals. Randomized, controlled, case-crossover trial. Twenty-nine subjects were selected among male members of the Heralds of the Gospel. This association was chosen because it is a group of persons with similar singing activities. Participants were randomly assigned to two groups: (A) chiropractic SMT procedure and (B) nontherapeutic transcutaneous electrical nerve stimulation (TENS) procedure. Recordings of the singing voice of each participant were taken immediately before and after the procedures. After a 14-day period, procedures were switched between groups: participants who underwent SMT on the first day were subjected to TENS and vice versa. Recordings were subjected to perceptual audio and acoustic evaluations. The same recording segment of each participant was selected. Perceptual audio evaluation was performed by a specialist panel (SP). Recordings of each participant were randomly presented thus making the SP blind to intervention type and recording session (before/after intervention). Recordings compiled in a randomized order were also subjected to acoustic evaluation. No differences in the quality of the singing on perceptual audio evaluation were observed between TENS and SMT. No differences in the quality of the singing voice of asymptomatic male singers were observed on perceptual audio evaluation or acoustic evaluation after a single spinal manipulative intervention of the thoracic and cervical spine. Copyright © 2015 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Does CPAP treatment affect the voice?

Science.gov (United States)

Saylam, Güleser; Şahin, Mustafa; Demiral, Dilek; Bayır, Ömer; Yüceege, Melike Bağnu; Çadallı Tatar, Emel; Korkmaz, Mehmet Hakan

2016-12-20

The aim of this study was to investigate alterations in voice parameters among patients using continuous positive airway pressure (CPAP) for the treatment of obstructive sleep apnea syndrome. Patients with an indication for CPAP treatment without any voice problems and with normal laryngeal findings were included and voice parameters were evaluated before and 1 and 6 months after CPAP. Videolaryngostroboscopic findings, a self-rated scale (Voice Handicap Index-10, VHI-10), perceptual voice quality assessment (GRBAS: grade, roughness, breathiness, asthenia, strain), and acoustic parameters were compared. Data from 70 subjects (48 men and 22 women) with a mean age of 44.2 ± 6.0 years were evaluated. When compared with the pre-CPAP treatment period, there was a significant increase in the VHI-10 score after 1 month of treatment and in VHI- 10 and total GRBAS scores, jitter percent (P = 0.01), shimmer percent, noise-to-harmonic ratio, and voice turbulence index after 6 months of treatment. Vague negative effects on voice parameters after the first month of CPAP treatment became more evident after 6 months. We demonstrated nonsevere alterations in the voice quality of patients under CPAP treatment. Given that CPAP is a long-term treatment it is important to keep these alterations in mind.
Teachers’ voice use in teaching environment. Aspects on speakers’ comfort

DEFF Research Database (Denmark)

Lyberg-Åhlander, Viveka; Rydell, Roland; Löfqvist, Anders

2015-01-01

use and prevalence of voice problems in teachers and to explore their ratings of vocally loading aspects of their working environment. Method: A questionnaire-survey in 467 teachers aiming to explore the prevalence of voice problems in teaching staff identified teachers with voice problems and vocally...... in the teaching environment and aspects of the classroom environment were also measured. Results: Teachers with voice problems were more affected by any loading factor in the work-environment and were more perceptive of the room acoustics. Differences between the groups were found during field......-measurements of the voice, while there were no differences in the findings from the clinical examinations of larynx and voice. Conclusion: Teachers suffering from voice problems react stronger to loading factors in the teaching environment. It is in the interplay between the individual and the work environment that voice...
Double Fourier analysis for Emotion Identification in Voiced Speech

International Nuclear Information System (INIS)

Sierra-Sosa, D.; Bastidas, M.; Ortiz P, D.; Quintero, O.L.

2016-01-01

We propose a novel analysis alternative, based on two Fourier Transforms for emotion recognition from speech. Fourier analysis allows for display and synthesizes different signals, in terms of power spectral density distributions. A spectrogram of the voice signal is obtained performing a short time Fourier Transform with Gaussian windows, this spectrogram portraits frequency related features, such as vocal tract resonances and quasi-periodic excitations during voiced sounds. Emotions induce such characteristics in speech, which become apparent in spectrogram time-frequency distributions. Later, the signal time-frequency representation from spectrogram is considered an image, and processed through a 2-dimensional Fourier Transform in order to perform the spatial Fourier analysis from it. Finally features related with emotions in voiced speech are extracted and presented. (paper)
Quality of the voice after injection of hyaluronic acid into the vocal fold.

Science.gov (United States)

Szkiełkowska, Agata; Miaśkiewicz, Beata; Remacle, Marc; Krasnodębska, Paulina; Skarżyński, Henryk

2013-04-17

Voice disorders resulting from glottic insufficiency are a significant clinical problem in everyday phoniatric practice. One method of treatment is injection laryngoplasty. Our study aimed to assess the voice quality of patients treated with hyaluronic acid injection into the vocal fold. We studied 25 patients suffering from dysphonia, conducting laryngological and phoniatric examination, including videostroboscopy and acoustic voice analysis, before the operation and 1, 3, and 6 months later. In all cases there was complete or almost complete glottic closure after the operation. One month after the procedure, videostroboscopic examination revealed reappearance of vocal fold vibration in 8 cases; after 3 months this had risen to 15 cases. Perceptual voice quality (as assessed by the GRBAS scale) in patients with glottic insufficiency was improved. The most significant improvement was obtained 1 month after surgery (p=0.0002), and within the next months further statistically significant improvements (p=0.000002) were noted. Multidimensional voice analysis showed statistically significant and rapid improvement in frequency parameters, especially vFo. Other parameters were also improved 3 and 6 months after surgery. Injection of hyaluronic acid into the vocal fold improves phonatory functions of the larynx and the quality of voice in patients with glottic insufficiency. It may be a safe and conservative method for treatment of voice disorders. Hyaluronic acid injection to the vocal fold is an easy, effective, and fast method for restoration of good voice quality.
Speech coding, reconstruction and recognition using acoustics and electromagnetic waves

International Nuclear Information System (INIS)

Holzrichter, J.F.; Ng, L.C.

1998-01-01

The use of EM radiation in conjunction with simultaneously recorded acoustic speech information enables a complete mathematical coding of acoustic speech. The methods include the forming of a feature vector for each pitch period of voiced speech and the forming of feature vectors for each time frame of unvoiced, as well as for combined voiced and unvoiced speech. The methods include how to deconvolve the speech excitation function from the acoustic speech output to describe the transfer function each time frame. The formation of feature vectors defining all acoustic speech units over well defined time frames can be used for purposes of speech coding, speech compression, speaker identification, language-of-speech identification, speech recognition, speech synthesis, speech translation, speech telephony, and speech teaching. 35 figs

Speech coding, reconstruction and recognition using acoustics and electromagnetic waves

Science.gov (United States)

Holzrichter, John F.; Ng, Lawrence C.

1998-01-01

The use of EM radiation in conjunction with simultaneously recorded acoustic speech information enables a complete mathematical coding of acoustic speech. The methods include the forming of a feature vector for each pitch period of voiced speech and the forming of feature vectors for each time frame of unvoiced, as well as for combined voiced and unvoiced speech. The methods include how to deconvolve the speech excitation function from the acoustic speech output to describe the transfer function each time frame. The formation of feature vectors defining all acoustic speech units over well defined time frames can be used for purposes of speech coding, speech compression, speaker identification, language-of-speech identification, speech recognition, speech synthesis, speech translation, speech telephony, and speech teaching.
The Effect of Septoplasty on Voice Performance in Patients With Severe and Mild Nasal Septal Deviation.

Science.gov (United States)

Atan, Doğan; Özcan, Kürşat Murat; Gürbüz, Ayşe Betül Topak; Dere, Hüseyin

2016-07-01

The authors aimed to analyze the effect of septoplasty, performed in 2 groups with different grades of nasal septal deviation (NSD), on voice performance. A total of 43 patients who had septoplasty due to NSD and were included in the study. The study groups were divided into 2 groups as groups A and B. The patients in group A had severe NSD, and 1 of the nasal cavity was obstructed totally or near totally. In group B, the NSD narrowed the nasal passage, and the deviation was not severe. The voice performance was analyzed preoperatively, and 1 month after surgery with both objective and subjective methods. Objective analysis included acoustic voice analysis, and measurement of F0, jitter %, shimmer %. Preoperative and postoperative F0, jitter %, shimmer %, and Voice Handicap Index-30 (VHI-30) were compared in groups A and B. F0 showed a statistically significant improvement after surgery in group A (P performed for severe NSD obstructing nasal lumen totally or near totally results in significant improvements in the voice performance.
Using Innovative Acoustic Analysis to Predict the Postoperative Outcomes of Unilateral Vocal Fold Paralysis

Directory of Open Access Journals (Sweden)

Yung-An Tsou

2016-01-01

Full Text Available Objective. Autologous fat injection laryngoplasty is ineffective for some patients with iatrogenic vocal fold paralysis, and additional laryngeal framework surgery is often required. An acoustically measurable outcome predictor for lipoinjection laryngoplasty would assist phonosurgeons in formulating treatment strategies. Methods. Seventeen thyroid surgery patients with unilateral vocal fold paralysis participated in this study. All subjects underwent lipoinjection laryngoplasty to treat postsurgery vocal hoarseness. After treatment, patients were assigned to success and failure groups on the basis of voice improvement. Linear prediction analysis was used to construct a new voice quality indicator, the number of irregular peaks (NIrrP. It compared with the measures used in the Multi-Dimensional Voice Program (MDVP, such as jitter (frequency perturbation and shimmer (perturbation of amplitude. Results. By comparing the [i] vowel produced by patients before the lipoinjection laryngoplasty (AUC = 0.98, 95% CI = 0.78–0.99, NIrrP was shown to be a more accurate predictor of long-term surgical outcomes than jitter (AUC = 0.73, 95% CI = 0.47–0.91 and shimmer (AUC = 0.63, 95% CI = 0.37–0.85, as identified by the receiver operating characteristic curve. Conclusions. NIrrP measured using the LP model could be a more accurate outcome predictor than the parameters used in the MDVP.
Factors associated with voice disorders among teachers: a case-control study.

Science.gov (United States)

Giannini, Susana Pimentel Pinto; Latorre, Maria do Rosário Dias de Oliveira; Ferreira, Léslie Piccolotto

2013-01-01

We aimed at verifying an association between voice disorders/stress and loss of work ability among female teachers who work in São Paulo's public school system. This is a paired case- control study. The case group was composed offiteachers with alterations in speech and larynges assessments, and the control group was formed by teachers without alterations in these evaluations who work in the same schools. Both groups answered the following questionnaires: Conditions of Vocal Production-Teachers, Job Stress Scale, and Work Ability Index. The analysis was performed using the chi-square association test and logistic regression models with the purpose of estimating the association between independent variables and voice disorders. We found differences between the groups in relation to stress in the workplace under high demand, a situation that poses greater risks of adverse reactions to the workers' physical and mental health. Regarding the ability to work, the categories poor and moderate ability for work are associated with voice disorders, regardless of job stress factors, age, and the unsatisfactory acoustic properties of the classrooms. This study confirmed the association between voice disorders and job stress, as well as between voice disorders and loss of work ability.
Improving Speaker Recognition by Biometric Voice Deconstruction

Science.gov (United States)

Mazaira-Fernandez, Luis Miguel; Álvarez-Marquina, Agustín; Gómez-Vilda, Pedro

2015-01-01

Person identification, especially in critical environments, has always been a subject of great interest. However, it has gained a new dimension in a world threatened by a new kind of terrorism that uses social networks (e.g., YouTube) to broadcast its message. In this new scenario, classical identification methods (such as fingerprints or face recognition) have been forcedly replaced by alternative biometric characteristics such as voice, as sometimes this is the only feature available. The present study benefits from the advances achieved during last years in understanding and modeling voice production. The paper hypothesizes that a gender-dependent characterization of speakers combined with the use of a set of features derived from the components, resulting from the deconstruction of the voice into its glottal source and vocal tract estimates, will enhance recognition rates when compared to classical approaches. A general description about the main hypothesis and the methodology followed to extract the gender-dependent extended biometric parameters is given. Experimental validation is carried out both on a highly controlled acoustic condition database, and on a mobile phone network recorded under non-controlled acoustic conditions. PMID:26442245
The Sound of Voice: Voice-Based Categorization of Speakers' Sexual Orientation within and across Languages.

Directory of Open Access Journals (Sweden)

Simone Sulpizio

Full Text Available Empirical research had initially shown that English listeners are able to identify the speakers' sexual orientation based on voice cues alone. However, the accuracy of this voice-based categorization, as well as its generalizability to other languages (language-dependency and to non-native speakers (language-specificity, has been questioned recently. Consequently, we address these open issues in 5 experiments: First, we tested whether Italian and German listeners are able to correctly identify sexual orientation of same-language male speakers. Then, participants of both nationalities listened to voice samples and rated the sexual orientation of both Italian and German male speakers. We found that listeners were unable to identify the speakers' sexual orientation correctly. However, speakers were consistently categorized as either heterosexual or gay on the basis of how they sounded. Moreover, a similar pattern of results emerged when listeners judged the sexual orientation of speakers of their own and of the foreign language. Overall, this research suggests that voice-based categorization of sexual orientation reflects the listeners' expectations of how gay voices sound rather than being an accurate detector of the speakers' actual sexual identity. Results are discussed with regard to accuracy, acoustic features of voices, language dependency and language specificity.
Measurement of voice onset time in maxillectomy patients.

Science.gov (United States)

Hattori, Mariko; Sumita, Yuka I; Taniguchi, Hisashi

2014-01-01

Objective speech evaluation using acoustic measurement is needed for the proper rehabilitation of maxillectomy patients. For digital evaluation of consonants, measurement of voice onset time is one option. However, voice onset time has not been measured in maxillectomy patients as their consonant sound spectra exhibit unique characteristics that make the measurement of voice onset time challenging. In this study, we established criteria for measuring voice onset time in maxillectomy patients for objective speech evaluation. We examined voice onset time for /ka/ and /ta/ in 13 maxillectomy patients by calculating the number of valid measurements of voice onset time out of three trials for each syllable. Wilcoxon's signed rank test showed that voice onset time measurements were more successful for /ka/ and /ta/ when a prosthesis was used (Z = -2.232, P = 0.026 and Z = -2.401, P = 0.016, resp.) than when a prosthesis was not used. These results indicate a prosthesis affected voice onset measurement in these patients. Although more research in this area is needed, measurement of voice onset time has the potential to be used to evaluate consonant production in maxillectomy patients wearing a prosthesis.
Measurement of Voice Onset Time in Maxillectomy Patients

Directory of Open Access Journals (Sweden)

Mariko Hattori

2014-01-01

Full Text Available Objective speech evaluation using acoustic measurement is needed for the proper rehabilitation of maxillectomy patients. For digital evaluation of consonants, measurement of voice onset time is one option. However, voice onset time has not been measured in maxillectomy patients as their consonant sound spectra exhibit unique characteristics that make the measurement of voice onset time challenging. In this study, we established criteria for measuring voice onset time in maxillectomy patients for objective speech evaluation. We examined voice onset time for /ka/ and /ta/ in 13 maxillectomy patients by calculating the number of valid measurements of voice onset time out of three trials for each syllable. Wilcoxon’s signed rank test showed that voice onset time measurements were more successful for /ka/ and /ta/ when a prosthesis was used (Z=−2.232, P=0.026 and Z=−2.401, P=0.016, resp. than when a prosthesis was not used. These results indicate a prosthesis affected voice onset measurement in these patients. Although more research in this area is needed, measurement of voice onset time has the potential to be used to evaluate consonant production in maxillectomy patients wearing a prosthesis.
Amygdala and auditory cortex exhibit distinct sensitivity to relevant acoustic features of auditory emotions.

Science.gov (United States)

Pannese, Alessia; Grandjean, Didier; Frühholz, Sascha

2016-12-01

Discriminating between auditory signals of different affective value is critical to successful social interaction. It is commonly held that acoustic decoding of such signals occurs in the auditory system, whereas affective decoding occurs in the amygdala. However, given that the amygdala receives direct subcortical projections that bypass the auditory cortex, it is possible that some acoustic decoding occurs in the amygdala as well, when the acoustic features are relevant for affective discrimination. We tested this hypothesis by combining functional neuroimaging with the neurophysiological phenomena of repetition suppression (RS) and repetition enhancement (RE) in human listeners. Our results show that both amygdala and auditory cortex responded differentially to physical voice features, suggesting that the amygdala and auditory cortex decode the affective quality of the voice not only by processing the emotional content from previously processed acoustic features, but also by processing the acoustic features themselves, when these are relevant to the identification of the voice's affective value. Specifically, we found that the auditory cortex is sensitive to spectral high-frequency voice cues when discriminating vocal anger from vocal fear and joy, whereas the amygdala is sensitive to vocal pitch when discriminating between negative vocal emotions (i.e., anger and fear). Vocal pitch is an instantaneously recognized voice feature, which is potentially transferred to the amygdala by direct subcortical projections. These results together provide evidence that, besides the auditory cortex, the amygdala too processes acoustic information, when this is relevant to the discrimination of auditory emotions. Copyright Â© 2016 Elsevier Ltd. All rights reserved.
Multidimensional effects of voice therapy in patients affected by unilateral vocal fold paralysis due to cancer.

Science.gov (United States)

Barcelos, Camila Barbosa; Silveira, Paula Angélica Lorenzon; Guedes, Renata Lígia Vieira; Gonçalves, Aline Nogueira; Slobodticov, Luciana Dall'Agnol Siqueira; Angelis, Elisabete Carrara-de

2017-08-24

Patients with unilateral vocal fold paralysis may demonstrate different degrees of voice perturbation depending on the position of the paralyzed vocal fold. Understanding the effectiveness of voice therapy in this population may be an important coefficient to define the therapeutic approach. To evaluate the voice therapy effectiveness in the short, medium and long-term in patients with unilateral vocal fold paralysis and determine the risk factors for voice rehabilitation failure. Prospective study with 61 patients affected by unilateral vocal fold paralysis enrolled. Each subject had voice therapy with an experienced speech pathologist twice a week. A multidimensional assessment protocol was used pre-treatment and in three different times after voice treatment initiation: short-term (1-3 months), medium-term (4-6 months) and long-term (12 months); it included videoendoscopy, maximum phonation time, GRBASI scale, acoustic voice analysis and the portuguese version of the voice handicap index. Multiple comparisons for GRBASI scale and VHI revealed statistically significant differences, except between medium and long term (pvocal improvement over time with stabilization results after 6 months (medium term). From the 28 patients with permanent unilateral vocal fold paralysis, 18 (69.2%) reached complete glottal closure following vocal therapy (p=0.001). The logistic regression method indicated that the Jitter entered the final model as a risk factor for partial improvement. For every unit of increased jitter, there was an increase of 0.1% (1.001) of the chance for partial improvement, which means an increase on no full improvement chance during rehabilitation. Vocal rehabilitation improves perceptual and acoustic voice parameters and voice handicap index, besides favor glottal closure in patients with unilateral vocal fold paralysis. The results were also permanent during the period of 1 year. The Jitter value, when elevated, is a risk factor for the voice therapy
Particle Filter with Integrated Voice Activity Detection for Acoustic Source Tracking

Directory of Open Access Journals (Sweden)

Anders M. Johansson

2007-01-01

Full Text Available In noisy and reverberant environments, the problem of acoustic source localisation and tracking (ASLT using an array of microphones presents a number of challenging difficulties. One of the main issues when considering real-world situations involving human speakers is the temporally discontinuous nature of speech signals: the presence of silence gaps in the speech can easily misguide the tracking algorithm, even in practical environments with low to moderate noise and reverberation levels. A natural extension of currently available sound source tracking algorithms is the integration of a voice activity detection (VAD scheme. We describe a new ASLT algorithm based on a particle filtering (PF approach, where VAD measurements are fused within the statistical framework of the PF implementation. Tracking accuracy results for the proposed method is presented on the basis of synthetic audio samples generated with the image method, whereas performance results obtained with a real-time implementation of the algorithm, and using real audio data recorded in a reverberant room, are published elsewhere. Compared to a previously proposed PF algorithm, the experimental results demonstrate the improved robustness of the method described in this work when tracking sources emitting real-world speech signals, which typically involve significant silence gaps between utterances.
Occupational risk factors and voice disorders.

Science.gov (United States)

Vilkman, E

1996-01-01

From the point of view of occupational health, the field of voice disorders is very poorly developed as compared, for instance, to the prevention and diagnostics of occupational hearing disorders. In fact, voice disorders have not even been recognized in the field of occupational medicine. Hence, it is obviously very rare in most countries that the voice disorder of a professional voice user, e.g. a teacher, a singer or an actor, is accepted as an occupational disease by insurance companies. However, occupational voice problems do not lack significance from the point of view of the patient. We also know from questionnaires and clinical studies that voice complaints are very common. Another example of job-related health problems, which has proved more successful in terms of its occupational health status, is the repetition strain injury of the elbow, i.e. the "tennis elbow". Its textbook definition could be used as such to describe an occupational voice disorder ("dysphonia professional is"). In the present paper the effects of such risk factors as vocal loading itself, background noise and room acoustics and low relative humidity of the air are discussed. Due to individual factors underlying the development of professional voice disorders, recommendations rather than regulations are called for. There are many simple and even relatively low-cost methods available for the prevention of vocal problems as well as for supporting rehabilitation.
Pitch (F0) and formant profiles of human vowels and vowel-like baboon grunts: The role of vocalizer body size and voice-acoustic allometry

Science.gov (United States)

Rendall, Drew; Kollias, Sophie; Ney, Christina; Lloyd, Peter

2005-02-01

Key voice features-fundamental frequency (F0) and formant frequencies-can vary extensively between individuals. Much of the variation can be traced to differences in the size of the larynx and vocal-tract cavities, but whether these differences in turn simply reflect differences in speaker body size (i.e., neutral vocal allometry) remains unclear. Quantitative analyses were therefore undertaken to test the relationship between speaker body size and voice F0 and formant frequencies for human vowels. To test the taxonomic generality of the relationships, the same analyses were conducted on the vowel-like grunts of baboons, whose phylogenetic proximity to humans and similar vocal production biology and voice acoustic patterns recommend them for such comparative research. For adults of both species, males were larger than females and had lower mean voice F0 and formant frequencies. However, beyond this, F0 variation did not track body-size variation between the sexes in either species, nor within sexes in humans. In humans, formant variation correlated significantly with speaker height but only in males and not in females. Implications for general vocal allometry are discussed as are implications for speech origins theories, and challenges to them, related to laryngeal position and vocal tract length. .
Vocal Acoustic and Auditory-Perceptual Characteristics During Fluctuations in Estradiol Levels During the Menstrual Cycle: A Longitudinal Study.

Science.gov (United States)

Arruda, Polyanna; Diniz da Rosa, Marine Raquel; Almeida, Larissa Nadjara Alves; de Araujo Pernambuco, Leandro; Almeida, Anna Alice

2018-03-07

Estradiol production varies cyclically, changes in levels are hypothesized to affect the voice. The main objective of this study was to investigate vocal acoustic and auditory-perceptual characteristics during fluctuations in the levels of the hormone estradiol during the menstrual cycle. A total of 44 volunteers aged between 18 and 45 were selected. Of these, 27 women with regular menstrual cycles comprised the test group (TG) and 17 combined oral contraceptive users comprised the control group (CG). The study was performed in two phases. In phase 1, anamnesis was performed. Subsequently, the TG underwent blood sample collection for measurement of estradiol levels and voice recording for later acoustic and auditory-perceptual analysis. The CG underwent only voice recording. Phase 2 involved the same measurements as phase 1 for each group. Variables were evaluated using descriptive and inferential analysis to compare groups and phases and to determine relationships between variables. Voice changes were found during the menstrual cycle, and such changes were determined to be related to variations in estradiol levels. Impaired voice quality was observed to be associated with decreased levels of estradiol. The CG did not demonstrate significant vocal changes during phases 1 and 2. The TG showed significant increases in vocal parameters of roughness, tension, and instability during phase 2 (the period of low estradiol levels) when compared with the CG. Low estradiol levels were also found to be negatively correlated with the parameters of tension, instability, and jitter and positively correlated with fundamental voice frequency. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
[Environmental factors and vocal habits regarding pre-school teachers and functionaries suffering voice disorders].

Science.gov (United States)

Barrreto-Munévar, Deisy P; Cháux-Ramos, Oriana M; Estrada-Rangel, Mónica A; Sánchez-Morales, Jenifer; Moreno-Angarita, Marisol; Camargo-Mendoza, Maryluz

2011-06-01

Determining the relationship between vocal habits and environmental/ occupational conditions with the presence of vocal disturbance (dysphonia) in teachers and functionaries working at community-based, initial childhood education centres (kindergartens). This was a descriptive study which adopted across-sectional approach using 198 participants which was developed in three phases. Phase 1: consisted of identifying participants having the highest risk of presenting vocal disturbance. Phase 2consisted of observation-analysis concerning the voice use and vocal habits of participants who had been identified in phase 1. Phase 3consisted of perceptual and computational assessment of participants' voices using Wilson's vocal profile and the multidimensional voice program. Individuals having pitch breaks, throat clearing, increased voice intensity, and gastro-oesophageal reflux were found to present below standard fundamental frequency (FF). Subjects having altered breathing and increased voice intensity were identified as having above standard shimmer and jitter acoustic values. A high rate of inability to work was found due to vocal disturbance. It is thus suggested that there is a correlation between vocal habits and vocal disorders presented by preschool teachers in kindergarten settings.
Voice hearing within the context of hearers' social worlds: an interpretative phenomenological analysis.

Science.gov (United States)

Mawson, Amy; Berry, Katherine; Murray, Craig; Hayward, Mark

2011-09-01

Research has found relational qualities of power and intimacy to exist within hearer-voice interactions. The present study aimed to provide a deeper understanding of the interpersonal context of voice hearing by exploring participants' relationships with their voices and other people in their lives. This research was designed in consultation with service users and employed a qualitative, phenomenological, and idiographic design using semi-structured interviews. Ten participants, recruited via mental health services, and who reported hearing voices in the previous week, completed the interviews. These were transcribed verbatim and analysed using interpretative phenomenological analysis. Five themes resulted from the analysis. Theme 1: 'person and voice' demonstrated that participants' voices often reflected the identity, but not always the quality of social acquaintances. Theme 2: 'voices changing and confirming relationship with the self' explored the impact of voice hearing in producing an inferior sense-of-self in comparison to others. Theme 3: 'a battle for control' centred on issues of control and a dilemma of independence within voice relationships. Theme 4: 'friendships facilitating the ability to cope' and theme 5: 'voices creating distance in social relationships' explored experiences of social relationships within the context of voice hearing, and highlighted the impact of social isolation for voice hearers. The study demonstrated the potential role of qualitative research in developing theories of voice hearing. It extended previous research by highlighting the interface between voices and the social world of the hearer, including reciprocal influences of social relationships on voices and coping. Improving voice hearers' sense-of-self may be a key factor in reducing the distress caused by voices. ©2010 The British Psychological Society.
Improving Speaker Recognition by Biometric Voice Deconstruction

Directory of Open Access Journals (Sweden)

Luis Miguel eMazaira-Fernández

2015-09-01

Full Text Available Person identification, especially in critical environments, has always been a subject of great interest. However, it has gained a new dimension in a world threatened by a new kind of terrorism that uses social networks (e.g. YouTube to broadcast its message. In this new scenario, classical identification methods (such fingerprints or face recognition have been forcedly replaced by alternative biometric characteristics such as voice, as sometimes this is the only feature available. Through the present paper, a new methodology to characterize speakers will be shown. This methodology is benefiting from the advances achieved during the last years in understanding and modelling voice production. The paper hypothesizes that a gender dependent characterization of speakers combined with the use of a new set of biometric parameters extracted from the components resulting from the deconstruction of the voice into its glottal source and vocal tract estimates, will enhance recognition rates when compared to classical approaches. A general description about the main hypothesis and the methodology followed to extract gender-dependent extended biometric parameters are given. Experimental validation is carried out both on a highly controlled acoustic condition database, and on a mobile phone network recorded under non-controlled acoustic conditions.
Acoustic analysis of a piping system

International Nuclear Information System (INIS)

Misra, A.S.; Vijay, D.K.

1996-01-01

Acoustic pulsations in the Darlington Nuclear Generating Station, a 881 MW CANDU, primary heat transport piping system caused fuel bundle failures under short term operations. The problem was successfully analyzed using the steady-state acoustic analysis capability of the ABAQUS program. This paper describes in general, modelling of low amplitude acoustic pulsations in a liquid filled piping system using ABAQUS. The paper gives techniques for estimating the acoustic medium properties--bulk modulus, fluid density and acoustic damping--and modelling fluid-structure interactions at orifices and elbows. The formulations and techniques developed are benchmarked against the experiments given in 3 cited references. The benchmark analysis shows that the ABAQUS results are in excellent agreement with the experiments
Effects of Voice Rehabilitation After Radiation Therapy for Laryngeal Cancer: A Randomized Controlled Study

International Nuclear Information System (INIS)

Tuomi, Lisa; Andréll, Paulin; Finizia, Caterina

2014-01-01

Background: Patients treated with radiation therapy for laryngeal cancer often experience voice problems. The aim of this randomized controlled trial was to assess the efficacy of voice rehabilitation for laryngeal cancer patients after having undergone radiation therapy and to investigate whether differences between different tumor localizations with regard to rehabilitation outcomes exist. Methods and Materials: Sixty-nine male patients irradiated for laryngeal cancer participated. Voice recordings and self-assessments of communicative dysfunction were performed 1 and 6 months after radiation therapy. Thirty-three patients were randomized to structured voice rehabilitation with a speech-language pathologist and 36 to a control group. Furthermore, comparisons with 23 healthy control individuals were made. Acoustic analyses were performed for all patients, including the healthy control individuals. The Swedish version of the Self Evaluation of Communication Experiences after Laryngeal Cancer and self-ratings of voice function were used to assess vocal and communicative function. Results: The patients who received vocal rehabilitation experienced improved self-rated vocal function after rehabilitation. Patients with supraglottic tumors who received voice rehabilitation had statistically significant improvements in voice quality and self-rated vocal function, whereas the control group did not. Conclusion: Voice rehabilitation for male patients with laryngeal cancer is efficacious regarding patient-reported outcome measurements. The patients experienced better voice function after rehabilitation. Patients with supraglottic tumors also showed an improvement in terms of acoustic voice outcomes. Rehabilitation with a speech-language pathologist is recommended for laryngeal cancer patients after radiation therapy, particularly for patients with supraglottic tumors
Effects of Voice Rehabilitation After Radiation Therapy for Laryngeal Cancer: A Randomized Controlled Study

Energy Technology Data Exchange (ETDEWEB)

Tuomi, Lisa, E-mail: lisa.tuomi@vgregion.se [Department of Otorhinolaryngology, Head and Neck Surgery, Institute of Clinical Sciences, Sahlgrenska Academy at the University of Gothenburg, Sahlgrenska University Hospital, Gothenburg (Sweden); Andréll, Paulin [Department of Molecular and Clinical Medicine/Multidisciplinary Pain Center, Institute of Medicine, Sahlgrenska Academy at the University of Gothenburg, Sahlgrenska University Hospital, Gothenburg (Sweden); Finizia, Caterina [Department of Otorhinolaryngology, Head and Neck Surgery, Institute of Clinical Sciences, Sahlgrenska Academy at the University of Gothenburg, Sahlgrenska University Hospital, Gothenburg (Sweden)

2014-08-01

Background: Patients treated with radiation therapy for laryngeal cancer often experience voice problems. The aim of this randomized controlled trial was to assess the efficacy of voice rehabilitation for laryngeal cancer patients after having undergone radiation therapy and to investigate whether differences between different tumor localizations with regard to rehabilitation outcomes exist. Methods and Materials: Sixty-nine male patients irradiated for laryngeal cancer participated. Voice recordings and self-assessments of communicative dysfunction were performed 1 and 6 months after radiation therapy. Thirty-three patients were randomized to structured voice rehabilitation with a speech-language pathologist and 36 to a control group. Furthermore, comparisons with 23 healthy control individuals were made. Acoustic analyses were performed for all patients, including the healthy control individuals. The Swedish version of the Self Evaluation of Communication Experiences after Laryngeal Cancer and self-ratings of voice function were used to assess vocal and communicative function. Results: The patients who received vocal rehabilitation experienced improved self-rated vocal function after rehabilitation. Patients with supraglottic tumors who received voice rehabilitation had statistically significant improvements in voice quality and self-rated vocal function, whereas the control group did not. Conclusion: Voice rehabilitation for male patients with laryngeal cancer is efficacious regarding patient-reported outcome measurements. The patients experienced better voice function after rehabilitation. Patients with supraglottic tumors also showed an improvement in terms of acoustic voice outcomes. Rehabilitation with a speech-language pathologist is recommended for laryngeal cancer patients after radiation therapy, particularly for patients with supraglottic tumors.

Tipping point analysis of ocean acoustic noise

Science.gov (United States)

Livina, Valerie N.; Brouwer, Albert; Harris, Peter; Wang, Lian; Sotirakopoulos, Kostas; Robinson, Stephen

2018-02-01

We apply tipping point analysis to a large record of ocean acoustic data to identify the main components of the acoustic dynamical system and study possible bifurcations and transitions of the system. The analysis is based on a statistical physics framework with stochastic modelling, where we represent the observed data as a composition of deterministic and stochastic components estimated from the data using time-series techniques. We analyse long-term and seasonal trends, system states and acoustic fluctuations to reconstruct a one-dimensional stochastic equation to approximate the acoustic dynamical system. We apply potential analysis to acoustic fluctuations and detect several changes in the system states in the past 14 years. These are most likely caused by climatic phenomena. We analyse trends in sound pressure level within different frequency bands and hypothesize a possible anthropogenic impact on the acoustic environment. The tipping point analysis framework provides insight into the structure of the acoustic data and helps identify its dynamic phenomena, correctly reproducing the probability distribution and scaling properties (power-law correlations) of the time series.
Tipping point analysis of ocean acoustic noise

Directory of Open Access Journals (Sweden)

V. N. Livina

2018-02-01

Full Text Available We apply tipping point analysis to a large record of ocean acoustic data to identify the main components of the acoustic dynamical system and study possible bifurcations and transitions of the system. The analysis is based on a statistical physics framework with stochastic modelling, where we represent the observed data as a composition of deterministic and stochastic components estimated from the data using time-series techniques. We analyse long-term and seasonal trends, system states and acoustic fluctuations to reconstruct a one-dimensional stochastic equation to approximate the acoustic dynamical system. We apply potential analysis to acoustic fluctuations and detect several changes in the system states in the past 14 years. These are most likely caused by climatic phenomena. We analyse trends in sound pressure level within different frequency bands and hypothesize a possible anthropogenic impact on the acoustic environment. The tipping point analysis framework provides insight into the structure of the acoustic data and helps identify its dynamic phenomena, correctly reproducing the probability distribution and scaling properties (power-law correlations of the time series.
Clinical voice analysis of Carnatic singers.

Science.gov (United States)

Arunachalam, Ravikumar; Boominathan, Prakash; Mahalingam, Shenbagavalli

2014-01-01

Carnatic singing is a classical South Indian style of music that involves rigorous training to produce an "open throated" loud, predominantly low-pitched singing, embedded with vocal nuances in higher pitches. Voice problems in singers are not uncommon. The objective was to report the nature of voice problems and apply a routine protocol to assess the voice. Forty-five trained performing singers (females: 36 and males: 9) who reported to a tertiary care hospital with voice problems underwent voice assessment. The study analyzed their problems and the clinical findings. Voice change, difficulty in singing higher pitches, and voice fatigue were major complaints. Most of the singers suffered laryngopharyngeal reflux that coexisted with muscle tension dysphonia and chronic laryngitis. Speaking voices were rated predominantly as "moderate deviation" on GRBAS (Grade, Rough, Breathy, Asthenia, and Strain). Maximum phonation time ranged from 4 to 29 seconds (females: 10.2, standard deviation [SD]: 5.28 and males: 15.7, SD: 5.79). Singing frequency range was reduced (females: 21.3 Semitones and males: 23.99 Semitones). Dysphonia severity index (DSI) scores ranged from -3.5 to 4.91 (females: 0.075 and males: 0.64). Singing frequency range and DSI did not show significant difference between sex and across clinical diagnosis. Self-perception using voice disorder outcome profile revealed overall severity score of 5.1 (SD: 2.7). Findings are discussed from a clinical intervention perspective. Study highlighted the nature of voice problems (hyperfunctional) and required modifications in assessment protocol for Carnatic singers. Need for regular assessments and vocal hygiene education to maintain good vocal health are emphasized as outcomes. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Vocal problems among teachers: evaluation of a preventive voice program.

Science.gov (United States)

Bovo, Roberto; Galceran, Marta; Petruccelli, Joseph; Hatzopoulos, Stavros

2007-11-01

Vocal education programs for teachers may prevent the emergence of vocal disorders; however, only a few studies have tried to evaluate the effectiveness of these preventive programs, particularly in the long term. Two hundred and sixty-four subjects, mostly kindergarten and primary school female teachers, participated in a course on voice care, including a theoretical seminar (120 minutes) and a short voice group therapy (180 minutes, small groups of 20 subjects). For 3 months, they had to either attend the vocal ergonomics norms and, as psychological reinforcement, they had to make out a daily report of vocal abuse, or to follow the given exercises for a more efficient vocal technique, reporting on whether the time scheduled was respected or not. The effectiveness of the course was assessed in a group of 21 female teachers through a randomized controlled study. Evaluation comprehended stroboscopy, perceptual and electro-acoustical voice analysis, Voice Handicap Index, and a course benefit questionnaire. A group of 20 teachers matched for age, working years, hoarseness grade, and vocal demand served as a control group. At 3 months evaluation, participants demonstrated amelioration in the global dysphonia rates (P=0.0003), jitter (P=0.0001), shimmer (P=0.0001), MPT (P=0.0001), and VHI (P=0.0001). Twelve months after the course, the positive effects remained, although they were slightly reduced. In conclusion, a course inclusive of two lectures, a short group voice therapy, home-controlled voice exercises, and hygiene, represents a feasible and cost-effective primary prevention of voice disorders in a homogeneous and well-motivated population of teachers.
The acoustic correlates of valence depend on emotion family.

Science.gov (United States)

Belyk, Michel; Brown, Steven

2014-07-01

The voice expresses a wide range of emotions through modulations of acoustic parameters such as frequency and amplitude. Although the acoustics of individual emotions are well understood, attempts to describe the acoustic correlates of broad emotional categories such as valence have yielded mixed results. In the present study, we analyzed the acoustics of emotional valence for different families of emotion. We divided emotional vocalizations into "motivational," "moral," and "aesthetic" families as defined by the OCC (Ortony, Clore, and Collins) model of emotion. Subjects viewed emotional scenarios and were cued to vocalize congruent exclamations in response to them, for example, "Yay!" and "Damn!". Positive valence was weakly associated with high-pitched and loud vocalizations. However, valence interacted with emotion family for both pitch and amplitude. A general acoustic code for valence does not hold across families of emotion, whereas family-specific codes provide a more accurate description of vocal emotions. These findings are consolidated into a set of "rules of expression" relating vocal dimensions to emotion dimensions. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Comparison Between Vocal Function Exercises and Voice Amplification.

Science.gov (United States)

Teixeira, Letícia Caldas; Behlau, Mara

2015-11-01

To compare the effectiveness of vocal function exercises (VFEs) versus voice amplification (VA) after a 6-week therapy for teachers diagnosed with behavioral dysphonia. A total of 162 teachers with behavioral dysphonia were randomly allocated into two intervention groups and one control group (CG). Outcomes were assessed using auditory-perceptual evaluation of voice, laryngeal status assessment, self-ratings of the impact of dysphonia, and acoustic analysis. The VFE group showed effective changes across treatment outcome measures: overall severity of dysphonia relative to the CG, laryngeal evaluation, and self-perceived dysphonia. The VA group showed positive outcomes in some measures of self-rated dysphonia. The CG had poorer outcomes across self-assessment dimensions. The VFE method is effective in treating the behavioral dysphonia of teachers, can change the overall severity and the self-perception of the impact of dysphonia, and the laryngeal evaluation outcomes. The use of a voice amplifier is effective as a preventive measure because it results in an improved self-perception of dysphonia, especially in the work-related dimension. One case of dysphonia aggravation can be prevented in every three patients with behavioral dysphonia engaged in VFE, and one case in every five patients using VA. The lack of a therapeutic intervention worsens teachers' behavioral dysphonia in a period of 6 weeks. Copyright © 2015 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Parâmetros acústicos do contraste de sonoridade das plosivas no desenvolvimento fonológico típico e no desviante Acoustic parameters of the voicing contrast of plosives in typical phonological development and phonological disorder

Directory of Open Access Journals (Sweden)

Roberta Michelon Melo

2012-01-01

(['papa], ['baba], ['tata], ['dada], ['kaka] and ['gaga] inserted into carrier phrases, we measured voice onset time, vowel length, burst amplitude, and occlusion length of each plosive. The acoustic parameters of voiceless and voiced plosives were compared between and within groups through statistical analysis. RESULTS: The subjects within typical phonological development presented significant results mainly in distinguishing the parameters voice onset time, vowel length, and occlusion of voiceless and voiced stops, which was different from what was observed for children with phonological disorder. The comparison between groups showed differences related to the production of voice onset time and the occlusion length of voiced plosives. Regarding the other analyzed parameters, the values were similar between groups, with no statistical differences. CONCLUSION: The marking of the voicing contrast of the group with phonological disorder is different from the group with typical phonological development, especially regarding the voice onset time and the occlusion length of the voiced segments.
The Effect of Traditional Singing Warm-Up Versus Semioccluded Vocal Tract Exercises on the Acoustic Parameters of Singing Voice.

Science.gov (United States)

Duke, Emily; Plexico, Laura W; Sandage, Mary J; Hoch, Matthew

2015-11-01

This study investigated the effect of traditional vocal warm-up versus semioccluded vocal tract exercises on the acoustic parameters of voice through three questions: does vocal warm-up condition significantly alter the singing power ratio of the singing voice? Is singing power ratio dependent upon vowel? Is perceived phonatory effort affected by warm-up condition? Hypotheses were that vocal warm-up would alter the singing power ratio, and that semioccluded vocal tract warm-up would affect the singing power ratio more than no warm-up or traditional warm-up, that singing power ratio would vary across vowel, and that perceived phonatory effort would vary with warm-up condition. This study was a within-participant repeated measures design with counterbalanced conditions. Thirteen male singers were recorded under three different conditions: no warm-up, traditional warm-up, and semioccluded vocal tract exercise warm-up. Recordings were made of these singers performing the Star Spangled Banner, and singing power ratio (SPR) was calculated from four vowels. Singers rated their perceived phonatory effort (PPE) singing the Star Spangled Banner after each warm-up condition. Warm-up condition did not significantly affect SPR. SPR was significantly different for /i/ and /e/. PPE was not significantly different between warm-up conditions. The present study did not find significant differences in SPR between warm-up conditions. SPR differences for /i/, support previous findings. PPE did not differ significantly across warm-up condition despite the expectation that traditional or semioccluded warm-up would cause a decrease. Copyright © 2015 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Effects of muscle tension dysphonia on tone phonation: acoustic and perceptual studies in Vietnamese female teachers.

Science.gov (United States)

Nguyen, Duong Duy; Kenny, Dianna T

2009-07-01

Muscle tension dysphonia (MTD) is a hyperfunctional voice disorder commonly seen in professional voice users. To date, published acoustic studies of this disorder have mainly focused on nontonal language speakers, and no publication has documented its impact on lexical tone characteristics. In this study, we examined whether and how this voice disorder affected acoustically and perceptually the characteristics of tones in Vietnamese teachers. Voice data were obtained from 42 Vietnamese female primary school teachers diagnosed with MTD and 30 vocally healthy teachers. Tonal data were analyzed using Computerized Speech Lab (CSL-4300B) and Speech Analyzer. Parameters analyzed included the two most important acoustic cues in Vietnamese tones, that is, tonal fundamental frequency (F(0)) and laryngealization. Tonal F(0) was assessed using a factorial analysis of variance with group and career durations as independent variables. Tonal samples were also perceptually assessed by a panel of native speakers of the same dialect. The results showed that MTD lowered tonal F(0) in high tones and tones with extensive fundamental frequency variation. There was also a significant main effect for career duration; in MTD group, tonal F(0) was lower in teachers with longer career duration. The teachers with MTD showed different patterns of laryngealization compared with the control group. Tone perception was poorer for tones with extensive fundamental frequency variation and without a typical phonation type. The results in this group of teachers supported our hypothesis that MTD impairs lexical tone phonation.
Numerical analysis on acoustic impulse response for watermelon

International Nuclear Information System (INIS)

Kim, Yong Sul; Yang, Dong Hoon; Choi, Young Jae; Bae, Tas Joo; So, Chul Ho; Lee, Yun Ho

2002-01-01

In this study, we conducted both analysis on impact pulse signal and acoustic impulse response method using numerical analysistic finite element method. Considering its velocity, density, Young's Modulus, and Poisson's Ratio, we extracted featured parameters and compared both results of analysis on impact pulse signal and numerical analysis on acoustic impulse response then we found the feature of generated acoustic sound signal by way of numerical analysis varying featured parameters and consequently intended to extract feature indices influenced on its internal maturity through analysis of acoustic impulse response. As we analyzed impact pulse signal and extracted featured parameters concerned with evaluation of its ripeness, we found the plausibility of progress on nondestructive evaluation of ripeness and adoption of numerical analysis on acoustic impulse response.
The shouted voice: A pilot study of laryngeal physiology under extreme aerodynamic pressure.

Science.gov (United States)

Lagier, Aude; Legou, Thierry; Galant, Camille; Amy de La Bretèque, Benoit; Meynadier, Yohann; Giovanni, Antoine

2017-12-01

The objective was to study the behavior of the larynx during shouted voice production, when the larynx is exposed to extremely high subglottic pressure. The study involved electroglottographic, acoustic, and aerodynamic analyses of shouts produced at maximum effort by three male participants. Under a normal speaking voice, the voice sound pressure level (SPL) is proportional to the subglottic pressure. However, when the subglottic pressure reached high levels, the voice SPL reached a maximum value and then decreased as subglottic pressure increased further. Furthermore, the electroglottographic signal sometimes lost its periodicity during the shout, suggesting irregular vocal fold vibration.
Nonlinear dynamic mechanism of vocal tremor from voice analysis and model simulations

Science.gov (United States)

Zhang, Yu; Jiang, Jack J.

2008-09-01

Nonlinear dynamic analysis and model simulations are used to study the nonlinear dynamic characteristics of vocal folds with vocal tremor, which can typically be characterized by low-frequency modulation and aperiodicity. Tremor voices from patients with disorders such as paresis, Parkinson's disease, hyperfunction, and adductor spasmodic dysphonia show low-dimensional characteristics, differing from random noise. Correlation dimension analysis statistically distinguishes tremor voices from normal voices. Furthermore, a nonlinear tremor model is proposed to study the vibrations of the vocal folds with vocal tremor. Fractal dimensions and positive Lyapunov exponents demonstrate the evidence of chaos in the tremor model, where amplitude and frequency play important roles in governing vocal fold dynamics. Nonlinear dynamic voice analysis and vocal fold modeling may provide a useful set of tools for understanding the dynamic mechanism of vocal tremor in patients with laryngeal diseases.
Human voice perception.

Science.gov (United States)

Latinus, Marianne; Belin, Pascal

2011-02-22

We are all voice experts. First and foremost, we can produce and understand speech, and this makes us a unique species. But in addition to speech perception, we routinely extract from voices a wealth of socially-relevant information in what constitutes a more primitive, and probably more universal, non-linguistic mode of communication. Consider the following example: you are sitting in a plane, and you can hear a conversation in a foreign language in the row behind you. You do not see the speakers' faces, and you cannot understand the speech content because you do not know the language. Yet, an amazing amount of information is available to you. You can evaluate the physical characteristics of the different protagonists, including their gender, approximate age and size, and associate an identity to the different voices. You can form a good idea of the different speaker's mood and affective state, as well as more subtle cues as the perceived attractiveness or dominance of the protagonists. In brief, you can form a fairly detailed picture of the type of social interaction unfolding, which a brief glance backwards can on the occasion help refine - sometimes surprisingly so. What are the acoustical cues that carry these different types of vocal information? How does our brain process and analyse this information? Here we briefly review an emerging field and the main tools used in voice perception research. Copyright © 2011 Elsevier Ltd. All rights reserved.
Voice Disorders in Teachers: Clinical, Videolaryngoscopical, and Vocal Aspects.

Science.gov (United States)

Pereira, Eny Regina Bóia Neves; Tavares, Elaine Lara Mendes; Martins, Regina Helena Garcia

2015-09-01

Dysphonia is more prevalent in teachers than among the general population. The objective of this study was to analyze clinical, vocal, and videolaryngoscopical aspects in dysphonic teachers. Ninety dysphonic teachers were inquired about their voice, comorbidities, and work conditions. They underwent vocal auditory-perceptual evaluation (maximum phonation time and GRBASI scale), acoustic voice analysis, and videolaryngoscopy. The results were compared with a control group consisting of 90 dysphonic nonteachers, of similar gender and ages, and with professional activities excluding teaching and singing. In both groups, there were 85 women and five men (age range 31-50 years). In the controls, the majority of subjects worked in domestic activities, whereas the majority of teachers worked in primary (42.8%) and secondary school (37.7%). Teachers and controls reported, respectively: vocal abuse (76.7%; 37.8%), weekly hours of work between 21 and 40 years (72.2%; 80%), under 10 years of practice (36%; 23%), absenteeism (23%; 0%), sinonasal (66%; 20%) and gastroesophageal symptoms (44%; 22%), hoarseness (82%; 78%), throat clearing (70%; 62%), and phonatory effort (72%; 52%). In both groups, there were decreased values of maximum phonation time, impairment of the G parameter in the GRBASI scale (82%), decrease of F0 and increase of the rest of acoustic parameters. Nodules and laryngopharyngeal reflux were predominant in teachers; laryngopharyngeal reflux, polyps, and sulcus vocalis predominated in the controls. Vocal symptoms, comorbidities, and absenteeism were predominant among teachers. The vocal analyses were similar in both groups. Nodules and laryngopharyngeal reflux were predominant among teachers, whereas polyps, laryngopharyngeal reflux, and sulcus were predominant among controls. Copyright © 2015 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Vocal effectiveness of speech-language pathology students: Before and after voice use during service delivery

OpenAIRE

Couch, Stephanie; Zieba, Dominique; van der Linde, Jeannie; van der Merwe, Anita

2015-01-01

Background: As a professional voice user, it is imperative that a speech-language pathologist’s(SLP) vocal effectiveness remain consistent throughout the day. Many factors may contribute to reduced vocal effectiveness, including prolonged voice use, vocally abusive behaviours,poor vocal hygiene and environmental factors. Objectives: To determine the effect of service delivery on the perceptual and acoustic features of voice. Method: A quasi-experimental., pre-test–post-test research de...
Numerical analysis on acoustic impulse response for watermelon

Energy Technology Data Exchange (ETDEWEB)

Kim, Yong Sul; Yang, Dong Hoon; Choi, Young Jae; Bae, Tas Joo; So, Chul Ho [Dongshin University, Naju (Korea, Republic of); Lee, Yun Ho [Korea Inspection and Engineering CO.,LTD., Seoul (Korea, Republic of)

2002-11-15

In this study, we conducted both analysis on impact pulse signal and acoustic impulse response method using numerical analysistic finite element method. Considering its velocity, density, Young's Modulus, and Poisson's Ratio, we extracted featured parameters and compared both results of analysis on impact pulse signal and numerical analysis on acoustic impulse response then we found the feature of generated acoustic sound signal by way of numerical analysis varying featured parameters and consequently intended to extract feature indices influenced on its internal maturity through analysis of acoustic impulse response. As we analyzed impact pulse signal and extracted featured parameters concerned with evaluation of its ripeness, we found the plausibility of progress on nondestructive evaluation of ripeness and adoption of numerical analysis on acoustic impulse response.
Performance of wavelet analysis and neural networks for pathological voices identification

Science.gov (United States)

Salhi, Lotfi; Talbi, Mourad; Abid, Sabeur; Cherif, Adnane

2011-09-01

Within the medical environment, diverse techniques exist to assess the state of the voice of the patient. The inspection technique is inconvenient for a number of reasons, such as its high cost, the duration of the inspection, and above all, the fact that it is an invasive technique. This study focuses on a robust, rapid and accurate system for automatic identification of pathological voices. This system employs non-invasive, non-expensive and fully automated method based on hybrid approach: wavelet transform analysis and neural network classifier. First, we present the results obtained in our previous study while using classic feature parameters. These results allow visual identification of pathological voices. Second, quantified parameters drifting from the wavelet analysis are proposed to characterise the speech sample. On the other hand, a system of multilayer neural networks (MNNs) has been developed which carries out the automatic detection of pathological voices. The developed method was evaluated using voice database composed of recorded voice samples (continuous speech) from normophonic or dysphonic speakers. The dysphonic speakers were patients of a National Hospital 'RABTA' of Tunis Tunisia and a University Hospital in Brussels, Belgium. Experimental results indicate a success rate ranging between 75% and 98.61% for discrimination of normal and pathological voices using the proposed parameters and neural network classifier. We also compared the average classification rate based on the MNN, Gaussian mixture model and support vector machines.
Evaluation of the effectiveness of a voice training program for teachers.

Science.gov (United States)

Pizolato, Raquel Aparecida; Beltrati Cornacchioni Rehder, Maria Inês; dos Santos Dias, Carlos Tadeu; de Castro Meneghim, Marcelo; Bovi Ambrosano, Glaúcia Maria; Mialhe, Fábio Luiz; Pereira, Antonio Carlos

2013-09-01

To investigate the effects of a voice education program to teachers on vocal function exercise and voice hygiene and compare a pre- and post-vocal exercise for the teacher's voice quality. A random sample of 102 subjects was divided into two groups: experimental group (29 women and seven men) with vocal hygiene and training exercises and control group (52 women and 14 men) with vocal hygiene. Two sessions were held about voice hygiene for the control group and five sessions for the experimental group, one being with reference to the vocal hygiene habit and four vocal exercise sessions. Acoustic analysis of the vowel [i] was made pre- and post-vocal exercise and for the situations of initial and final evaluation of the educational program. Student t test (paired) and Proc MIXED (repeated measures) were used for analyses with level of significance (α = 0.05). The training exercises, posture and relaxation cervical, decreased the mean of fundamental frequency (f(0)) for men (P = 0.04), and for the phonation, intensity, and frequency exercises, there was a significant increase for f(0) in woman (P = 0.02) and glottal to noise excitation ratio (P = 0.04). There was no statistically significant difference intergroup evaluations after 3 months. The control group presented increased mean voice intensity in the final evaluation (P = 0.01). Voice training exercises showed a positive and immediate impact on the teacher's quality of voice, but it was not sustained longitudinally, suggesting that actions for this purpose should be continued at schools. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Vibro-acoustic analysis of composite plates

International Nuclear Information System (INIS)

Sarigül, A S; Karagözlü, E

2014-01-01

Vibro-acoustic analysis plays a vital role on the design of aircrafts, spacecrafts, land vehicles and ships produced from thin plates backed by closed cavities, with regard to human health and living comfort. For this type of structures, it is required a coupled solution that takes into account structural-acoustic interaction which is crucial for sensitive solutions. In this study, coupled vibro-acoustic analyses of plates produced from composite materials have been performed by using finite element analysis software. The study has been carried out for E-glass/Epoxy, Kevlar/Epoxy and Carbon/Epoxy plates with different ply angles and numbers of ply. The effects of composite material, ply orientation and number of layer on coupled vibro-acoustic characteristics of plates have been analysed for various combinations. The analysis results have been statistically examined and assessed
Vibro-acoustic analysis of composite plates

Science.gov (United States)

Sarigül, A. S.; Karagözlü, E.

2014-03-01

Vibro-acoustic analysis plays a vital role on the design of aircrafts, spacecrafts, land vehicles and ships produced from thin plates backed by closed cavities, with regard to human health and living comfort. For this type of structures, it is required a coupled solution that takes into account structural-acoustic interaction which is crucial for sensitive solutions. In this study, coupled vibro-acoustic analyses of plates produced from composite materials have been performed by using finite element analysis software. The study has been carried out for E-glass/Epoxy, Kevlar/Epoxy and Carbon/Epoxy plates with different ply angles and numbers of ply. The effects of composite material, ply orientation and number of layer on coupled vibro-acoustic characteristics of plates have been analysed for various combinations. The analysis results have been statistically examined and assessed.

ATC/pilot voice communications: A survey of the literature

Science.gov (United States)

Prinzo, O. Veronika; Britton, Thomas W.

1993-11-01

The first radio-equipped control tower in the United States opened at the Cleveland Municipal Airport in 1930. From that time to the present, voice radio communications have played a primary role in air safety. Verbal communications in air traffic control (ATC) operations have been frequently cited as causal factors in operational errors and pilot deviations in the FAA Operational Error and Deviation System, the NASA Aviation Safety Reporting System (ASRS), and reports derived from government sponsored research projects. Collectively, the data provided by these programs indicate that communications constitute a significant problem for pilots and controllers. Although the communications problem was well known the research literature was fragmented, making it difficult to appreciate the various types of verbal communications problems that existed and their unique influence on the quality of ATC/pilot communications. This is a survey of the voice radio communications literature. The 43 reports in the review represent survey data, field studies, laboratory studies, narrative reports, and reviews. The survey topics pertain to communications taxonomies, acoustical correlates and cognitive/psycholinguistic perspectives. Communications taxonomies were used to identify the frequency and types of information that constitute routine communications, as well as those communications involved in operational errors, pilot deviations, and other safety-related events. Acoustical correlate methodologies identified some qualities of a speaker's voice, such as loudness, pitch, and speech rate, which might be used potentially to monitor stress, mental workload, and other forms of psychological or physiological factors that affect performance. Cognitive/psycho-linguistic research offered an information processing perspective for understanding how pilots' and controllers' memory and language comprehension processes affect their ability to communicate effectively with one another. This
Acoustic correlate of vocal effort in spasmodic dysphonia.

Science.gov (United States)

Eadie, Tanya L; Stepp, Cara E

2013-03-01

This study characterized the relationship between relative fundamental frequency (RFF) and listeners' perceptions of vocal effort and overall spasmodic dysphonia severity in the voices of 19 individuals with adductor spasmodic dysphonia. Twenty inexperienced listeners evaluated the vocal effort and overall severity of voices using visual analog scales. The squared correlation coefficients (R2) between average vocal effort and overall severity and RFF measures were calculated as a function of the number of acoustic instances used for the RFF estimate (from 1 to 9, of a total of 9 voiced-voiceless-voiced instances). Increases in the number of acoustic instances used for the RFF average led to increases in the variance predicted by the RFF at the first cycle of voicing onset (onset RFF) in the perceptual measures; the use of 6 or more instances resulted in a stable estimate. The variance predicted by the onset RFF for vocal effort (R2 range, 0.06 to 0.43) was higher than that for overall severity (R2 range, 0.06 to 0.35). The offset RFF was not related to the perceptual measures, irrespective of the sample size. This study indicates that onset RFF measures are related to perceived vocal effort in patients with adductor spasmodic dysphonia. These results have implications for measuring outcomes in this population.
Protective Strategies Against Dysphonia in Teachers: Preliminary Results Comparing Voice Amplification and 0.9% NaCl Nebulization.

Science.gov (United States)

Masson, Maria Lúcia Vaz; de Araújo, Tânia Maria

2018-03-01

This study aimed to compare the effects of two protective strategies, voice amplification (VA) and 0.9% NaCl nebulization (NEB), on teachers' voice in the work setting. An interventional evaluator-blind study was conducted, assigning 53 teachers from two public high schools to one of the two protective strategy groups (VA or NEB). Vocal function was assessed in a sound-treated booth before and after a 4-week period. Assessment included the severity of voice impairment (Consensus Auditory-Perceptual Evaluation of Voice [CAPE-V]), acoustic analysis of fundamental frequency (f0), sound pressure level (SPL), jitter, shimmer, glottal-to-noise excitation ratio (GNE), noise (VoxMetria), and the self-rated Screening Index for Voice Disorder (SIVD). Data were statistically analyzed using SPSS Statistics (version 22) with a significance level of P ≤ 0.05. Effect size was calculated using Cohen's d coefficient. There were no statistical differences between groups at baseline in terms of age, sex, time of teaching, teaching workload, and voice outcomes, except for SPL. During postintervention between groups, NEB displayed lower SIVD scores (VA = 3; NEB = 0; P = 0.018) and VA had lower acoustic irregularity (VA = 3.19; NEB = 3.69; P = 0.027), with moderate to large effect size. Postintervention within-groups decreased CAPE-V for VA (pretest = 31.97; posttest = 28.24; P = 0.021) and SIVD for NEB (pretest = 3; posttest = 0; P = 0.001). SPL decreased in both groups, NEB decreased in men only, and VA decreased in both men and women. NEB increased f0 for female participants (P ≤ 0.001). Both VA and NEB may help mitigate dysphonia in different pathways, being potential interventions for protecting teachers' voices in the work setting. An ongoing study with a control group will further support these preliminary results. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Effects of the Interaction of Caffeine and Water on Voice Performance: A Pilot Study

Science.gov (United States)

Franca, Maria Claudia; Simpson, Kenneth O.

2013-01-01

The objective of this "pilot" investigation was to study the effects of the interaction of caffeine and water intake on voice as evidenced by acoustic and aerodynamic measures, to determine whether ingestion of 200 mg of caffeine and various levels of water intake have an impact on voice. The participants were 48 females ranging in age…
Emotionally conditioning the target-speech voice enhances recognition of the target speech under "cocktail-party" listening conditions.

Science.gov (United States)

Lu, Lingxi; Bao, Xiaohan; Chen, Jing; Qu, Tianshu; Wu, Xihong; Li, Liang

2018-05-01

Under a noisy "cocktail-party" listening condition with multiple people talking, listeners can use various perceptual/cognitive unmasking cues to improve recognition of the target speech against informational speech-on-speech masking. One potential unmasking cue is the emotion expressed in a speech voice, by means of certain acoustical features. However, it was unclear whether emotionally conditioning a target-speech voice that has none of the typical acoustical features of emotions (i.e., an emotionally neutral voice) can be used by listeners for enhancing target-speech recognition under speech-on-speech masking conditions. In this study we examined the recognition of target speech against a two-talker speech masker both before and after the emotionally neutral target voice was paired with a loud female screaming sound that has a marked negative emotional valence. The results showed that recognition of the target speech (especially the first keyword in a target sentence) was significantly improved by emotionally conditioning the target speaker's voice. Moreover, the emotional unmasking effect was independent of the unmasking effect of the perceived spatial separation between the target speech and the masker. Also, (skin conductance) electrodermal responses became stronger after emotional learning when the target speech and masker were perceptually co-located, suggesting an increase of listening efforts when the target speech was informationally masked. These results indicate that emotionally conditioning the target speaker's voice does not change the acoustical parameters of the target-speech stimuli, but the emotionally conditioned vocal features can be used as cues for unmasking target speech.
Performance of the phonatory deviation diagram in the evaluation of rough and breathy synthesized voices.

Science.gov (United States)

Lopes, Leonardo Wanderley; Freitas, Jonas Almeida de; Almeida, Anna Alice; Silva, Priscila Oliveira Costa; Alves, Giorvan Ânderson Dos Santos

2017-07-05

Voice disorders alter the sound signal in several ways, combining several types of vocal emission disturbances and noise. The Phonatory Deviation Diagram (PDD) is a two-dimensional chart that allows the evaluation of the vocal signal based on the combination of periodicity (jitter, shimmer, and correlation coefficient) and noise (Glottal to Noise Excitation - GNE) measurements. The use of synthesized signals, where one has a greater control and knowledge of the production conditions, may allow a better understanding of the physiological and acoustic mechanisms underlying the vocal emission and its main perceptual-auditory correlates regarding the intensity of the deviation and types of vocal quality. To analyze the performance of the PDD in the discrimination of the presence and degree of roughness and breathiness in synthesized voices. 871 synthesized vocal signals were used corresponding to the vowel /ɛ/. The perceptual-auditory analysis of the degree of roughness and breathiness of the synthesized signals was performed using Visual Analogue Scale (VAS). Subsequently, the signals were categorized regarding the presence/absence of these parameters based on the VAS cutoff values. Acoustic analysis was performed by assessing the distribution of vocal signals according to the PDD area, quadrant, shape, and density. The equality of proportions and the chi-square tests were performed to compare the variables. Rough and breathy vocal signals were located predominantly outside the normal range and in the lower right quadrant of the PDD. Voices with higher degrees of roughness and breathiness were located outside the area of normality in the lower right quadrant and had concentrated density. The normality area and the PDD quadrant can discriminate healthy voices from rough and breathy ones. Voices with higher degrees of roughness and breathiness are proportionally located outside the area of normality, in the lower right quadrant and with concentrated density. Copyright
The Plausibility of Tonal Evolution in the Malay Dialect Spoken in Thailand: Evidence from an Acoustic Study

Directory of Open Access Journals (Sweden)

Phanintra Teeranon

2007-12-01

Full Text Available The F0 values of vowels following voiceless consonants are higher than those of vowels following voiced consonants; high vowels have a higher F0 than low vowels. It has also been found that when high vowels follow voiced consonants, the F0 values decrease. In contrast, low vowels following voiceless consonants show increasing F0 values. In other words, the voicing of initial consonants has been found to counterbalance the intrinsic F0 values of high and low vowels (House and Fairbanks 1953, Lehiste and Peterson 1961, Lehiste 1970, Laver 1994, Teeranon 2006. To test whether these three findings are applicable to a disyllabic language, the F0 values of high and low vowels following voiceless and voiced consonants were studied in a Malay dialect of the Austronesian language family spoken in Pathumthani Province, Thailand. The data was collected from three male informants, aged 30-35. The Praat program was used for acoustic analysis. The findings revealed the influence of the voicing of initial consonants on the F0 of vowels to be greater than that of the influence of vowel height. Evidence from this acoustic study shows the plausibility for the Malay dialect spoken in Pathumthani to become a tonal language by the influence of initial consonants rather by the influence of the high-low vowel dimension.
Singing Voice Analysis, Synthesis, and Modeling

Science.gov (United States)

Kim, Youngmoo E.

The singing voice is the oldest musical instrument, but its versatility and emotional power are unmatched. Through the combination of music, lyrics, and expression, the voice is able to affect us in ways that no other instrument can. The fact that vocal music is prevalent in almost all cultures is indicative of its innate appeal to the human aesthetic. Singing also permeates most genres of music, attesting to the wide range of sounds the human voice is capable of producing. As listeners we are naturally drawn to the sound of the human voice, and, when present, it immediately becomes the focus of our attention.
Voice amplification for primary school teachers with voice disorders: a randomized clinical trial.

Science.gov (United States)

Bovo, Roberto; Trevisi, Patrizia; Emanuelli, Enzo; Martini, Alessandro

2013-06-01

Several studies have demonstrated a high prevalence of voice disorders in teachers, together with the personal, professional and economical consequences of the problem. Good primary prevention should be based on 3 aspects: 1) amelioration of classroom acoustics, 2) voice care programs for future professional voice users, including teachers and 3) classroom or portable amplification systems. The aim of the study was to assess the benefit obtained from the use of portable amplification systems by female primary school teachers in their occupational setting. Forty female primary school teachers attended a course about professional voice care, which comprised two theoretical lectures, each 60 min long. Thereafter, they were randomized into 2 groups: the teachers of the first group were asked to use a portable vocal amplifier for 3 months, till the end of school-year. The other 20 teachers were part of the control group, matched for age and years of employment. All subjects had a grade 1 of dysphonia with no significant organic lesion of the vocal folds. Most teachers of the experimental group used the amplifier consistently for the whole duration of the experiment and found it very useful in reducing the symptoms of vocal fatigue. In fact, after 3 months, Voice Handicap Index (VHI) scores in "course + amplifier" group demonstrated a significant amelioration (p = 0.003). The perceptual grade of dysphonia also improved significantly (p = 0.0005). The same parameters changed favourably also in the "course only" group, but the results were not statistically significant (p = 0.4 for VHI and p = 0.03 for perceptual grade). In teachers, and particularly in those with a constitutional weak voice and/or those who are prone to vocal fold pathology, vocal amplifiers may be an effective and low-cost intervention to decrease potentially damaging vocal loads and may represent a necessary form of prevention.
Do women's voices provide cues of the likelihood of ovulation? The importance of sampling regime.

Directory of Open Access Journals (Sweden)

Julia Fischer

Full Text Available The human voice provides a rich source of information about individual attributes such as body size, developmental stability and emotional state. Moreover, there is evidence that female voice characteristics change across the menstrual cycle. A previous study reported that women speak with higher fundamental frequency (F0 in the high-fertility compared to the low-fertility phase. To gain further insights into the mechanisms underlying this variation in perceived attractiveness and the relationship between vocal quality and the timing of ovulation, we combined hormone measurements and acoustic analyses, to characterize voice changes on a day-to-day basis throughout the menstrual cycle. Voice characteristics were measured from free speech as well as sustained vowels. In addition, we asked men to rate vocal attractiveness from selected samples. The free speech samples revealed marginally significant variation in F0 with an increase prior to and a distinct drop during ovulation. Overall variation throughout the cycle, however, precluded unequivocal identification of the period with the highest conception risk. The analysis of vowel samples revealed a significant increase in degree of unvoiceness and noise-to-harmonic ratio during menstruation, possibly related to an increase in tissue water content. Neither estrogen nor progestogen levels predicted the observed changes in acoustic characteristics. The perceptual experiments revealed a preference by males for voice samples recorded during the pre-ovulatory period compared to other periods in the cycle. While overall we confirm earlier findings in that women speak with a higher and more variable fundamental frequency just prior to ovulation, the present study highlights the importance of taking the full range of variation into account before drawing conclusions about the value of these cues for the detection of ovulation.
Voice reinstatement modulates neural indices of continuous word recognition.

Science.gov (United States)

Campeanu, Sandra; Craik, Fergus I M; Backer, Kristina C; Alain, Claude

2014-09-01

The present study was designed to examine listeners' ability to use voice information incidentally during spoken word recognition. We recorded event-related brain potentials (ERPs) during a continuous recognition paradigm in which participants indicated on each trial whether the spoken word was "new" or "old." Old items were presented at 2, 8 or 16 words following the first presentation. Context congruency was manipulated by having the same word repeated by either the same speaker or a different speaker. The different speaker could share the gender, accent or neither feature with the word presented the first time. Participants' accuracy was greatest when the old word was spoken by the same speaker than by a different speaker. In addition, accuracy decreased with increasing lag. The correct identification of old words was accompanied by an enhanced late positivity over parietal sites, with no difference found between voice congruency conditions. In contrast, an earlier voice reinstatement effect was observed over frontal sites, an index of priming that preceded recollection in this task. Our results provide further evidence that acoustic and semantic information are integrated into a unified trace and that acoustic information facilitates spoken word recollection. Copyright © 2014 Elsevier Ltd. All rights reserved.
Voice disorders in teachers: occupational risk factors and psycho-emotional factors.

Science.gov (United States)

van Houtte, Evelyne; Claeys, Sofie; Wuyts, Floris; van Lierde, Kristiane

2012-10-01

Teaching is a high-risk occupation for developing voice disorders. The purpose of this study was to investigate previously described vocal risk factors as well as to identify new risk factors related to both the personal life of the teacher (fluid intake, voice-demanding activities, family history of voice disorders, and children at home) and to environmental factors (temperature changes, chalk use, presence of curtains, carpet, or air-conditioning, acoustics in the classroom, and noise in and outside the classroom). The study group comprised 994 teachers (response rate 46.6%). All participants completed a questionnaire. Chi-square tests and logistic regression analyses were performed. A total of 51.2% (509/994) of the teachers presented with voice disorders. Women reported more voice disorders compared to men (56.4% versus 40.4%, P history of voice disorders (P = 0.005), temperature changes in the classroom (P = 0.017), the number of pupils per classroom (P = 0.001), and noise level inside the classroom (P = 0.001). Teachers with voice disorders presented a higher level of psychological distress (P < 0.001) compared to teachers without voice problems. Voice disorders are frequent among teachers, especially in female teachers. The results of this study emphasize that multiple factors are involved in the development of voice disorders.
The expression and recognition of emotions in the voice across five nations: A lens model analysis based on acoustic features.

Science.gov (United States)

Laukka, Petri; Elfenbein, Hillary Anger; Thingujam, Nutankumar S; Rockstuhl, Thomas; Iraki, Frederick K; Chui, Wanda; Althoff, Jean

2016-11-01

This study extends previous work on emotion communication across cultures with a large-scale investigation of the physical expression cues in vocal tone. In doing so, it provides the first direct test of a key proposition of dialect theory, namely that greater accuracy of detecting emotions from one's own cultural group-known as in-group advantage-results from a match between culturally specific schemas in emotional expression style and culturally specific schemas in emotion recognition. Study 1 used stimuli from 100 professional actors from five English-speaking nations vocally conveying 11 emotional states (anger, contempt, fear, happiness, interest, lust, neutral, pride, relief, sadness, and shame) using standard-content sentences. Detailed acoustic analyses showed many similarities across groups, and yet also systematic group differences. This provides evidence for cultural accents in expressive style at the level of acoustic cues. In Study 2, listeners evaluated these expressions in a 5 × 5 design balanced across groups. Cross-cultural accuracy was greater than expected by chance. However, there was also in-group advantage, which varied across emotions. A lens model analysis of fundamental acoustic properties examined patterns in emotional expression and perception within and across groups. Acoustic cues were used relatively similarly across groups both to produce and judge emotions, and yet there were also subtle cultural differences. Speakers appear to have a culturally nuanced schema for enacting vocal tones via acoustic cues, and perceivers have a culturally nuanced schema in judging them. Consistent with dialect theory's prediction, in-group judgments showed a greater match between these schemas used for emotional expression and perception. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Reliability in perceptual analysis of voice quality.

Science.gov (United States)

Bele, Irene Velsvik

2005-12-01

This study focuses on speaking voice quality in male teachers (n = 35) and male actors (n = 36), who represent untrained and trained voice users, because we wanted to investigate normal and supranormal voices. In this study, both substantial and methodologic aspects were considered. It includes a method for perceptual voice evaluation, and a basic issue was rater reliability. A listening group of 10 listeners, 7 experienced speech-language therapists, and 3 speech-language therapist students evaluated the voices by 15 vocal characteristics using VA scales. Two sets of voice signals were investigated: text reading (2 loudness levels) and sustained vowel (3 levels). The results indicated a high interrater reliability for most perceptual characteristics. Connected speech was evaluated more reliably, especially at the normal level, but both types of voice signals were evaluated reliably, although the reliability for connected speech was somewhat higher than for vowels. Experienced listeners tended to be more consistent in their ratings than did the student raters. Some vocal characteristics achieved acceptable reliability even with a smaller panel of listeners. The perceptual characteristics grouped in 4 factors reflected perceptual dimensions.
Obligatory and facultative brain regions for voice-identity recognition

Science.gov (United States)

Roswandowitz, Claudia; Kappes, Claudia; Obrig, Hellmuth; von Kriegstein, Katharina

2018-01-01

Abstract Recognizing the identity of others by their voice is an important skill for social interactions. To date, it remains controversial which parts of the brain are critical structures for this skill. Based on neuroimaging findings, standard models of person-identity recognition suggest that the right temporal lobe is the hub for voice-identity recognition. Neuropsychological case studies, however, reported selective deficits of voice-identity recognition in patients predominantly with right inferior parietal lobe lesions. Here, our aim was to work towards resolving the discrepancy between neuroimaging studies and neuropsychological case studies to find out which brain structures are critical for voice-identity recognition in humans. We performed a voxel-based lesion-behaviour mapping study in a cohort of patients (n = 58) with unilateral focal brain lesions. The study included a comprehensive behavioural test battery on voice-identity recognition of newly learned (voice-name, voice-face association learning) and familiar voices (famous voice recognition) as well as visual (face-identity recognition) and acoustic control tests (vocal-pitch and vocal-timbre discrimination). The study also comprised clinically established tests (neuropsychological assessment, audiometry) and high-resolution structural brain images. The three key findings were: (i) a strong association between voice-identity recognition performance and right posterior/mid temporal and right inferior parietal lobe lesions; (ii) a selective association between right posterior/mid temporal lobe lesions and voice-identity recognition performance when face-identity recognition performance was factored out; and (iii) an association of right inferior parietal lobe lesions with tasks requiring the association between voices and faces but not voices and names. The results imply that the right posterior/mid temporal lobe is an obligatory structure for voice-identity recognition, while the inferior parietal
Test-retest reliability for aerodynamic measures of voice.

Science.gov (United States)

Awan, Shaheen N; Novaleski, Carolyn K; Yingling, Julie R

2013-11-01

The purpose of this study was to investigate the intrasubject reliability of aerodynamic characteristics of the voice within typical/normal speakers across testing sessions using the Phonatory Aerodynamic System (PAS 6600; KayPENTAX, Montvale, NJ). Participants were 60 healthy young adults (30 males and 30 females) between the ages 18 and 31 years with perceptually typical voice. Participants were tested using the PAS 6600 (Phonatory Aerodynamic System) on two separate days with approximately 1 week between each session at approximately the same time of day. Four PAS protocols were conducted (vital capacity, maximum sustained phonation, comfortable sustained phonation, and voicing efficiency) and measures of expiratory volume, maximum phonation time, mean expiratory airflow (during vowel production) and target airflow (obtained via syllable repetition), peak air pressure, aerodynamic power, aerodynamic resistance, and aerodynamic efficiency were obtained during each testing session. Associated acoustic measures of vocal intensity and frequency were also collected. All phonations were elicited at comfortable pitch and loudness. All aerodynamic and associated variables evaluated in this study showed useable test-retest reliability (ie, intraclass correlation coefficients [ICCs] ≥ 0.60). A high degree of mean test-retest reliability was found across all subjects for aerodynamic and associated acoustic measurements of vital capacity, maximum sustained phonation, glottal resistance, and vocal intensity (all with ICCs > 0.75). Although strong ICCs were observed for measures of glottal power and mean expiratory airflow in males, weaker overall results for these measures (ICC range: 0.60-0.67) were observed in females subjects and sizable coefficients of variation were observed for measures of power, resistance, and efficiency in both men and women. Differences in degree of reliability from measure to measure were revealed in greater detail using methods such as ICCs and
A randomized controlled trial of stretch-and-flow voice therapy for muscle tension dysphonia.

Science.gov (United States)

Watts, Christopher R; Hamilton, Amy; Toles, Laura; Childs, Lesley; Mau, Ted

2015-06-01

To investigate the effect of stretch-and-flow voice therapy on vocal function and handicap. Randomized controlled trial. Participants with primary muscle tension dysphonia were randomly assigned to experimental or control groups. Experimental participants received vocal hygiene education followed by 6 weeks of stretch-and-flow voice therapy. Control participants received vocal hygiene education only. Outcome variables consisted of a measure of vocal handicap (Voice Handicap Index [VHI]), maximum phonation time, s/z ratio, and acoustic measures. All measures were obtained at baseline prior to treatment and within 2 weeks posttreatment or at the end of the control period. The pre- to posttreatment measurement change (delta Δ) was applied to statistical analyses. A multivariate analysis of variance revealed significant group differences in pre-to-post changes on measures of VHI, maximum phonation time, and cepstral peak prominence (CPP) in connected speech and vowels (P = 0.003, 0.013, 0.025, and 0.017 respectively), with a significant reduction of VHI (Cohen's d = 1.6), increase in maximum phonation time (Cohen's d = 1.2), increase of CPP in connected speech (Cohen's d = 1.2), and increase of CPP in vowels (Cohen's d = 1.1) in the experimental group compared to the control group. This preliminary small sample randomized controlled trial found significantly greater improvement in vocal handicap, maximum phonation time, and acoustic measures of vocal function after participants received stretch-and-flow voice therapy compared to participants receiving vocal hygiene education alone. Additional research incorporating larger samples will be needed to confirm and further investigate these findings. 1b. © 2015 The American Laryngological, Rhinological and Otological Society, Inc.
Condition Monitoring and Management from Acoustic Emissions

DEFF Research Database (Denmark)

Pontoppidan, Niels Henrik Bohl

2005-01-01

In the following, I will use technical terms without explanation as it gives the freedom to describe the project in a shorter form for those who already know. The thesis is about condition monitoring of large diesel engines from acoustic emission signals. The experiments have been focused...... is the analysis of the angular position changes of the engine related events such as fuel injection and valve openings, caused by operational load changes. With inspiration from speech recognition and voice effects the angular timing changes have been inverted with the event alignment framework. With the event...
Objective and subjective evaluation of the acoustic comfort in classrooms.

Science.gov (United States)

Zannin, Paulo Henrique Trombetta; Marcon, Carolina Reich

2007-09-01

The acoustic comfort of classrooms in a Brazilian public school has been evaluated through interviews with 62 teachers and 464 pupils, measurements of background noise, reverberation time, and sound insulation. Acoustic measurements have revealed the poor acoustic quality of the classrooms. Results have shown that teachers and pupils consider the noise generated and the voice of the teacher in neighboring classrooms as the main sources of annoyance inside the classroom. Acoustic simulations resulted in the suggestion of placement of perforated plywood on the ceiling, for reduction in reverberation time and increase in the acoustic comfort of the classrooms.
The stability of locus equation slopes across stop consonant voicing/aspiration

Science.gov (United States)

Sussman, Harvey M.; Modarresi, Golnaz

2004-05-01

The consistency of locus equation slopes as phonetic descriptors of stop place in CV sequences across voiced and voiceless aspirated stops was explored in the speech of five male speakers of American English and two male speakers of Persian. Using traditional locus equation measurement sites for F2 onsets, voiceless labial and coronal stops had significantly lower locus equation slopes relative to their voiced counterparts, whereas velars failed to show voicing differences. When locus equations were derived using F2 onsets for voiced stops that were measured closer to the stop release burst, comparable to the protocol for measuring voiceless aspirated stops, no significant effects of voicing/aspiration on locus equation slopes were observed. This methodological factor, rather than an underlying phonetic-based explanation, provides a reasonable account for the observed flatter locus equation slopes of voiceless labial and coronal stops relative to voiced cognates reported in previous studies [Molis et al., J. Acoust. Soc. Am. 95, 2925 (1994); O. Engstrand and B. Lindblom, PHONUM 4, 101-104]. [Work supported by NIH.

Natural variations of vocal effort and comfort in simulated acoustic environments

DEFF Research Database (Denmark)

Pelegrin Garcia, David; Brunskog, Jonas

2010-01-01

acoustic conditions, artificially generated by electroacoustic means. The vocal intensity decreased with the objective parameter support, which quantifies the amount of sound reflections provided by the room at the talker‟s ears,relative to the direct sound, at a rate of -0.21 dB/dB. The reading pace......Many teachers suffer from voice problems related to the use of their voices in the working environment. The noise generated by students and external sound sources (like traffic noise or neighboring classrooms) is a major problem, as it leads to an increased vocal effort. In the absence of high...... levels of background noise, the room has also an effect on the talker‟s voice. In order to quantify the relative importance of the acoustic environment on the vocal demands for teachers, a laboratory investigation was carried out. Thirteen teachers had to read a text aloud under ten different room...
Análise perceptivo-auditiva, acústica computadorizada e laringológica da voz de adultos jovens fumantes e não-fumantes Auditory perceptual, acoustic, computerized and laryngological analysis of young smokers' and nonsmokers' voice

Directory of Open Access Journals (Sweden)

Daniele C. de Figueiredo

2003-12-01

Full Text Available OBJETIVO: Realizar a avaliação laringológica, análise perceptivo-auditiva e acústica computadorizada das vozes de adultos jovens fumantes e não-fumantes, sem queixa vocal, compará-las e verificar a incidência de alterações laríngeas. FORMA DE ESTUDO: Caso-controle. MATERIAL E MÉTODO: Foram analisadas as vozes de 80 indivíduos com idades compreendidas entre 20 e 40 anos. Estes foram divididos em quatro grupos: 20 homens fumantes, 20 homens não-fumantes, 20 mulheres fumantes e 20 mulheres não-fumantes. Este estudo envolveu laringoscopia, realizada e interpretada por uma médica otorrinolaringologista, e gravação em fita cassete das vogais sustentadas /a/, /m/, /i/ e /u/, contagem dos números de 1 a 20, emissão dos dias da semana, dos meses do ano e da canção "Parabéns a você". A gravação em fita cassete foi editada para posterior análise espectrográfica e avaliação perceptiva auditiva por quatro avaliadores com experiência na área de voz. RESULTADOS: Após a análise, foi constatada uma discreta diminuição da freqüência fundamental da voz dos indivíduos fumantes de ambos os sexos, bem como maior incidência de rouquidão e de alterações laríngeas entre os tabagistas.AIM: The goal of this study was to make the laryngological, auditory perceptual and acoustic computer analyses of young adults' (smokers and non-smokers voices, without vocal complaint, compare them and verify the incidence of vocal alterations. STUDY DESIGN: Clinical comparative. MATERIAL AND METHOD: The voices of 80 individuals with age range from 20 to 40 years were analyzed. These individuals were divided in four groups: 20 male smokers, 20 male non-smokers, 20 female smokers and 20 female non-smokers. This analysis involved laryngoscopy, which was performed and interpreted by an otolaryngologist, and cassette tape recordings of the sustained vowels /a/, /m/, /i/ e /u/, number counting from 1 to 20, speech of the days of the week, months of
Marshall’s Voice

Directory of Open Access Journals (Sweden)

Halper Thomas

2017-12-01

Full Text Available Most judicial opinions, for a variety of reasons, do not speak with the voice of identifiable judges, but an analysis of several of John Marshall’s best known opinions reveals a distinctive voice, with its characteristic language and style of argumentation. The power of this voice helps to account for the influence of his views.
Analysis of failure of voice production by a sound-producing voice prosthesis

NARCIS (Netherlands)

van der Torn, M.; van Gogh, C.D.L.; Verdonck-de Leeuw, I M; Festen, J.M.; Mahieu, H.F.

OBJECTIVE: To analyse the cause of failing voice production by a sound-producing voice prosthesis (SPVP). METHODS: The functioning of a prototype SPVP is described in a female laryngectomee before and after its sound-producing mechanism was impeded by tracheal phlegm. This assessment included:
Occupational voice demands and their impact on the call-centre industry

Directory of Open Access Journals (Sweden)

Duffy OM

2009-04-01

Full Text Available Abstract Background Within the last decade there has been a growth in the call-centre industry in the UK, with a growing awareness of the voice as an important tool for successful communication. Occupational voice problems such as occupational dysphonia, in a business which relies on healthy, effective voice as the primary professional communication tool, may threaten working ability and occupational health and safety of workers. While previous studies of telephone call-agents have reported a range of voice symptoms and functional vocal health problems, there have been no studies investigating the use and impact of vocal performance in the communication industry within the UK. This study aims to address a significant gap in the evidence-base of occupational health and safety research. The objectives of the study are: 1. to investigate the work context and vocal communication demands for call-agents; 2. to evaluate call-agents' vocal health, awareness and performance; and 3. to identify key risks and training needs for employees and employers within call-centres. Methods and design This is an occupational epidemiological study, which plans to recruit call-centres throughout the UK and Ireland. Data collection will consist of three components: 1. interviews with managers from each participating call-centre to assess their communication and training needs; 2. an online biopsychosocial questionnaire will be administered to investigate the work environment and vocal demands of call-agents; and 3. voice acoustic measurements of a random sample of participants using the Multi-dimensional Voice Program (MDVP. Qualitative content analysis from the interviews will identify underlying themes and issues. A multivariate analysis approach will be adopted using Structural Equation Modelling (SEM, to develop voice measurement models in determining the construct validity of potential factors contributing to occupational dysphonia. Quantitative data will be
Voice Habits and Behaviors: Voice Care Among Flamenco Singers.

Science.gov (United States)

Garzón García, Marina; Muñoz López, Juana; Y Mendoza Lara, Elvira

2017-03-01

The purpose of this study is to analyze the vocal behavior of flamenco singers, as compared with classical music singers, to establish a differential vocal profile of voice habits and behaviors in flamenco music. Bibliographic review was conducted, and the Singer's Vocal Habits Questionnaire, an experimental tool designed by the authors to gather data regarding hygiene behavior, drinking and smoking habits, type of practice, voice care, and symptomatology perceived in both the singing and the speaking voice, was administered. We interviewed 94 singers, divided into two groups: the flamenco experimental group (FEG, n = 48) and the classical control group (CCG, n = 46). Frequency analysis, a Likert scale, and discriminant and exploratory factor analysis were used to obtain a differential profile for each group. The FEG scored higher than the CCG in speaking voice symptomatology. The FEG scored significantly higher than the CCG in use of "inadequate vocal technique" when singing. Regarding voice habits, the FEG scored higher in "lack of practice and warm-up" and "environmental habits." A total of 92.6% of the subjects classified themselves correctly in each group. The Singer's Vocal Habits Questionnaire has proven effective in differentiating flamenco and classical singers. Flamenco singers are exposed to numerous vocal risk factors that make them more prone to vocal fatigue, mucosa dehydration, phonotrauma, and muscle stiffness than classical singers. Further research is needed in voice training in flamenco music, as a means to strengthen the voice and enable it to meet the requirements of this musical genre. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Giving Voice to Emotion: Voice Analysis Technology Uncovering Mental States is Playing a Growing Role in Medicine, Business, and Law Enforcement.

Science.gov (United States)

Allen, Summer

2016-01-01

It's tough to imagine anything more frustrating than interacting with a call center. Generally, people don't reach out to call centers when they?re happy-they're usually trying to get help with a problem or gearing up to do battle over a billing error. Add in an automatic phone tree, and you have a recipe for annoyance. But what if that robotic voice offering you a smorgasbord of numbered choices could tell that you were frustrated and then funnel you to an actual human being? This type of voice analysis technology exists, and it's just one example of the many ways that computers can use your voice to extract information about your mental and emotional state-including information you may not think of as being accessible through your voice alone.
Perceptual adaptation of voice gender discrimination with spectrally shifted vowels.

Science.gov (United States)

Li, Tianhao; Fu, Qian-Jie

2011-08-01

To determine whether perceptual adaptation improves voice gender discrimination of spectrally shifted vowels and, if so, which acoustic cues contribute to the improvement. Voice gender discrimination was measured for 10 normal-hearing subjects, during 5 days of adaptation to spectrally shifted vowels, produced by processing the speech of 5 male and 5 female talkers with 16-channel sine-wave vocoders. The subjects were randomly divided into 2 groups; one subjected to 50-Hz, and the other to 200-Hz, temporal envelope cutoff frequencies. No preview or feedback was provided. There was significant adaptation in voice gender discrimination with the 200-Hz cutoff frequency, but significant improvement was observed only for 3 female talkers with F(0) > 180 Hz and 3 male talkers with F(0) gender discrimination under spectral shift conditions with perceptual adaptation, but spectral shift may limit the exclusive use of spectral information and/or the use of formant structure on voice gender discrimination. The results have implications for cochlear implant users and for understanding voice gender discrimination.
The voice of emotion across species: how do human listeners recognize animals' affective states?

Directory of Open Access Journals (Sweden)

Marina Scheumann

Full Text Available Voice-induced cross-taxa emotional recognition is the ability to understand the emotional state of another species based on its voice. In the past, induced affective states, experience-dependent higher cognitive processes or cross-taxa universal acoustic coding and processing mechanisms have been discussed to underlie this ability in humans. The present study sets out to distinguish the influence of familiarity and phylogeny on voice-induced cross-taxa emotional perception in humans. For the first time, two perspectives are taken into account: the self- (i.e. emotional valence induced in the listener versus the others-perspective (i.e. correct recognition of the emotional valence of the recording context. Twenty-eight male participants listened to 192 vocalizations of four different species (human infant, dog, chimpanzee and tree shrew. Stimuli were recorded either in an agonistic (negative emotional valence or affiliative (positive emotional valence context. Participants rated the emotional valence of the stimuli adopting self- and others-perspective by using a 5-point version of the Self-Assessment Manikin (SAM. Familiarity was assessed based on subjective rating, objective labelling of the respective stimuli and interaction time with the respective species. Participants reliably recognized the emotional valence of human voices, whereas the results for animal voices were mixed. The correct classification of animal voices depended on the listener's familiarity with the species and the call type/recording context, whereas there was less influence of induced emotional states and phylogeny. Our results provide first evidence that explicit voice-induced cross-taxa emotional recognition in humans is shaped more by experience-dependent cognitive mechanisms than by induced affective states or cross-taxa universal acoustic coding and processing mechanisms.
Nonlinear dynamic-based analysis of severe dysphonia in patients with vocal fold scar and sulcus vocalis

Science.gov (United States)

Choi, Seong Hee; Zhang, Yu; Jiang, Jack J.; Bless, Diane M.; Welham, Nathan V.

2011-01-01

Objective The primary goal of this study was to evaluate a nonlinear dynamic approach to the acoustic analysis of dysphonia associated with vocal fold scar and sulcus vocalis. Study Design Case-control study. Methods Acoustic voice samples from scar/sulcus patients and age/sex-matched controls were analyzed using correlation dimension (D2) and phase plots, time-domain based perturbation indices (jitter, shimmer, signal-to-noise ratio [SNR]), and an auditory-perceptual rating scheme. Signal typing was performed to identify samples with bifurcations and aperiodicity. Results Type 2 and 3 acoustic signals were highly represented in the scar/sulcus patient group. When data were analyzed irrespective of signal type, all perceptual and acoustic indices successfully distinguished scar/sulcus patients from controls. Removal of type 2 and 3 signals eliminated the previously identified differences between experimental groups for all acoustic indices except D2. The strongest perceptual-acoustic correlation in our dataset was observed for SNR; the weakest correlation was observed for D2. Conclusions These findings suggest that D2 is inferior to time-domain based perturbation measures for the analysis of dysphonia associated with scar/sulcus; however, time-domain based algorithms are inherently susceptible to inflation under highly aperiodic (i.e., type 2 and 3) signal conditions. Auditory-perceptual analysis, unhindered by signal aperiodicity, is therefore a robust strategy for distinguishing scar/sulcus patient voices from normal voices. Future acoustic analysis research in this area should consider alternative (e.g., frequency- and quefrency-domain based) measures alongside additional nonlinear approaches. PMID:22516315
Voice disorders in mucosal leishmaniasis.

Directory of Open Access Journals (Sweden)

Ana Cristina Nunes Ruas

Full Text Available INTRODUCTION: Leishmaniasis is considered as one of the six most important infectious diseases because of its high detection coefficient and ability to produce deformities. In most cases, mucosal leishmaniasis (ML occurs as a consequence of cutaneous leishmaniasis. If left untreated, mucosal lesions can leave sequelae, interfering in the swallowing, breathing, voice and speech processes and requiring rehabilitation. OBJECTIVE: To describe the anatomical characteristics and voice quality of ML patients. MATERIALS AND METHODS: A descriptive transversal study was conducted in a cohort of ML patients treated at the Laboratory for Leishmaniasis Surveillance of the Evandro Chagas National Institute of Infectious Diseases-Fiocruz, between 2010 and 2013. The patients were submitted to otorhinolaryngologic clinical examination by endoscopy of the upper airways and digestive tract and to speech-language assessment through directed anamnesis, auditory perception, phonation times and vocal acoustic analysis. The variables of interest were epidemiologic (sex and age and clinic (lesion location, associated symptoms and voice quality. RESULTS: 26 patients under ML treatment and monitored by speech therapists were studied. 21 (81% were male and five (19% female, with ages ranging from 15 to 78 years (54.5+15.0 years. The lesions were distributed in the following structures 88.5% nasal, 38.5% oral, 34.6% pharyngeal and 19.2% laryngeal, with some patients presenting lesions in more than one anatomic site. The main complaint was nasal obstruction (73.1%, followed by dysphonia (38.5%, odynophagia (30.8% and dysphagia (26.9%. 23 patients (84.6% presented voice quality perturbations. Dysphonia was significantly associated to lesions in the larynx, pharynx and oral cavity. CONCLUSION: We observed that vocal quality perturbations are frequent in patients with mucosal leishmaniasis, even without laryngeal lesions; they are probably associated to disorders of some
Objective voice and speech analysis of persons with chronic hoarseness by prosodic analysis of speech samples.

Science.gov (United States)

Haderlein, Tino; Döllinger, Michael; Matoušek, Václav; Nöth, Elmar

2016-10-01

Automatic voice assessment is often performed using sustained vowels. In contrast, speech analysis of read-out texts can be applied to voice and speech assessment. Automatic speech recognition and prosodic analysis were used to find regression formulae between automatic and perceptual assessment of four voice and four speech criteria. The regression was trained with 21 men and 62 women (average age 49.2 years) and tested with another set of 24 men and 49 women (48.3 years), all suffering from chronic hoarseness. They read the text 'Der Nordwind und die Sonne' ('The North Wind and the Sun'). Five voice and speech therapists evaluated the data on 5-point Likert scales. Ten prosodic and recognition accuracy measures (features) were identified which describe all the examined criteria. Inter-rater correlation within the expert group was between r = 0.63 for the criterion 'match of breath and sense units' and r = 0.87 for the overall voice quality. Human-machine correlation was between r = 0.40 for the match of breath and sense units and r = 0.82 for intelligibility. The perceptual ratings of different criteria were highly correlated with each other. Likewise, the feature sets modeling the criteria were very similar. The automatic method is suitable for assessing chronic hoarseness in general and for subgroups of functional and organic dysphonia. In its current version, it is almost as reliable as a randomly picked rater from a group of voice and speech therapists.
Examining the Impact of Video Modeling Techniques on the Efficacy of Clinical Voice Assessment.

Science.gov (United States)

Werner, Cara; Bowyer, Samantha; Weinrich, Barbara; Gottliebson, Renee; Brehm, Susan Baker

2017-01-01

The purpose of the current study was to determine whether or not presenting patients with a video model improves efficacy of the assessment as defined by efficiency and decreased variability in trials during the acoustic component of voice evaluations. Twenty pediatric participants with a mean age of 7.6 years (SD = 1.50; range = 6-11 years), 32 college-age participants with a mean age of 21.32 years (SD = 1.61; range = 18-30 years), and 17 adult participants with a mean age of 54.29 years (SD = 2.78; range = 50-70 years) were included in the study and divided into experimental and control groups. The experimental group viewed a training video prior to receiving verbal instructions and performing acoustic assessment tasks, whereas the control group received verbal instruction only prior to completing the acoustic assessment. Primary measures included the number of clinician cues required and instructional time. Standard deviations of acoustic measurements (eg, minimum and maximum frequency) were also examined to determine effects on stability. Individuals in the experimental group required significantly less cues, P = 0.012, compared to the control group. Although some trends were observed in instructional time and stability of measurements, no significant differences were observed. The findings of this study may be useful for speech-language pathologists in regard to improving assessment of patients' voice disorders with the use of video modeling. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Obligatory and facultative brain regions for voice-identity recognition.

Science.gov (United States)

Roswandowitz, Claudia; Kappes, Claudia; Obrig, Hellmuth; von Kriegstein, Katharina

2018-01-01

Recognizing the identity of others by their voice is an important skill for social interactions. To date, it remains controversial which parts of the brain are critical structures for this skill. Based on neuroimaging findings, standard models of person-identity recognition suggest that the right temporal lobe is the hub for voice-identity recognition. Neuropsychological case studies, however, reported selective deficits of voice-identity recognition in patients predominantly with right inferior parietal lobe lesions. Here, our aim was to work towards resolving the discrepancy between neuroimaging studies and neuropsychological case studies to find out which brain structures are critical for voice-identity recognition in humans. We performed a voxel-based lesion-behaviour mapping study in a cohort of patients (n = 58) with unilateral focal brain lesions. The study included a comprehensive behavioural test battery on voice-identity recognition of newly learned (voice-name, voice-face association learning) and familiar voices (famous voice recognition) as well as visual (face-identity recognition) and acoustic control tests (vocal-pitch and vocal-timbre discrimination). The study also comprised clinically established tests (neuropsychological assessment, audiometry) and high-resolution structural brain images. The three key findings were: (i) a strong association between voice-identity recognition performance and right posterior/mid temporal and right inferior parietal lobe lesions; (ii) a selective association between right posterior/mid temporal lobe lesions and voice-identity recognition performance when face-identity recognition performance was factored out; and (iii) an association of right inferior parietal lobe lesions with tasks requiring the association between voices and faces but not voices and names. The results imply that the right posterior/mid temporal lobe is an obligatory structure for voice-identity recognition, while the inferior parietal lobe is
Variation in stop consonant voicing in two regional varieties of American English

Science.gov (United States)

Jacewicz, Ewa; Fox, Robert Allen; Lyle, Samantha

2010-01-01

This study is an acoustic investigation of the nature and extent of consonant voicing of the stop /b/ in two dialectal varieties of American English spoken in south-central Wisconsin and western North Carolina. The stop /b/ occurred at the juncture of two words such as small bids, in a position between two voiced sonorants, i.e. the liquid /l/ and a vowel. Twenty women participated, ten representing the Wisconsin and ten the North Carolina variety, respectively. Significant dialectal differences were found in the voicing patterns. The Wisconsin stop closures were usually not fully voiced and terminated in a complete silence followed by a closure release whereas North Carolina speakers produced mostly fully voiced closures. Further dialectal differences included the proportion of closure voicing as a function of word emphasis. For Wisconsin speakers, the proportion of closure voicing was smallest when the word was emphasized and it was greatest in non-emphatic positions. For North Carolina speakers, the degree of word emphasis did not have an effect on the proportion of closure voicing. The results suggest different mechanisms by which closure voicing is maintained in these two dialects, pointing to active articulatory maneuvers in North Carolina speakers and passive in Wisconsin speakers. PMID:20198112
Voice amplification for primary school teachers with voice disorders: A randomized clinical trial

Directory of Open Access Journals (Sweden)

Roberto Bovo

2013-06-01

Full Text Available Objectives: Several studies have demonstrated a high prevalence of voice disorders in teachers, together with the personal, professional and economical consequences of the problem. Good primary prevention should be based on 3 aspects: 1 amelioration of classroom acoustics, 2 voice care programs for future professional voice users, including teachers and 3 classroom or portable amplification systems. The aim of the study was to assess the benefit obtained from the use of portable amplification systems by female primary school teachers in their occupational setting. Materials and Methods: Forty female primary school teachers attended a course about professional voice care, which comprised two theoretical lectures, each 60 min long. Thereafter, they were randomized into 2 groups: the teachers of the first group were asked to use a portable vocal amplifier for 3 months, till the end of school-year. The other 20 teachers were part of the control group, matched for age and years of employment. All subjects had a grade 1 of dysphonia with no significant organic lesion of the vocal folds. Results: Most teachers of the experimental group used the amplifier consistently for the whole duration of the experiment and found it very useful in reducing the symptoms of vocal fatigue. In fact, after 3 months, Voice Handicap Index (VHI scores in "course + amplifier" group demonstrated a significant amelioration (p = 0.003. The perceptual grade of dysphonia also improved significantly (p = 0.0005. The same parameters changed favourably also in the "course only" group, but the results were not statistically significant (p = 0.4 for VHI and p = 0.03 for perceptual grade. Conclusions: In teachers, and particularly in those with a constitutional weak voice and/or those who are prone to vocal fold pathology, vocal amplifiers may be an effective and low-cost intervention to decrease potentially damaging vocal loads and may represent a necessary form of prevention.
Detecting Abnormal Word Utterances in Children With Autism Spectrum Disorders: Machine-Learning-Based Voice Analysis Versus Speech Therapists.

Science.gov (United States)

Nakai, Yasushi; Takiguchi, Tetsuya; Matsui, Gakuyo; Yamaoka, Noriko; Takada, Satoshi

2017-10-01

Abnormal prosody is often evident in the voice intonations of individuals with autism spectrum disorders. We compared a machine-learning-based voice analysis with human hearing judgments made by 10 speech therapists for classifying children with autism spectrum disorders ( n = 30) and typical development ( n = 51). Using stimuli limited to single-word utterances, machine-learning-based voice analysis was superior to speech therapist judgments. There was a significantly higher true-positive than false-negative rate for machine-learning-based voice analysis but not for speech therapists. Results are discussed in terms of some artificiality of clinician judgments based on single-word utterances, and the objectivity machine-learning-based voice analysis adds to judging abnormal prosody.
Dominant distortion classification for pre-processing of vowels in remote biomedical voice analysis

DEFF Research Database (Denmark)

Poorjam, Amir Hossein; Jensen, Jesper Rindom; Little, Max A

2017-01-01

for pathological voice assessments and investigate the impact of four major types of distortion that are commonly present during recording or transmission in voice analysis, namely: background noise, reverberation, clipping and compression, on Mel-frequency cepstral coefficients (MFCCs) – the most widely...
Response Analysis Of Payload Fairing Due To Acoustic Excitation

Directory of Open Access Journals (Sweden)

Annu Cherian

2015-08-01

Full Text Available Abstract During flight missions launch vehicles are subjected to a severe dynamic pressure loading aero-acoustic and structure-borne excitations of various circumstances which can endanger the survivability of the payload and the vehicles electronic equipment and consequently the success of the mission. The purpose of the fairing is to protect the satellite from damage during launch until deployment in space. Both the structural and acoustic loads are significant during the first few minutes of a launch and have the potential to damage the payload. This paper describes the analysis of mechanical structure and the inner acoustic cavity of the payload fairing subjected to acoustic field. The vibro-acoustic behaviour of the fairing is analyzed using Statistical Energy Analysis SEA Model. The software VA One is used for the statistical energy analysis of launch vehicle payload fairing due to acoustic excitation.
Effects of Early Smoking Habits on Young Adult Female Voices in Greece.

Science.gov (United States)

Tafiadis, Dionysios; Toki, Eugenia I; Miller, Kevin J; Ziavra, Nausica

2017-11-01

Cigarette use is a preventable cause of mortality and diseases. The World Health Organization states that Europe and especially Greece has the highest occurrence of smoking among adults. The prevalence of smoking among women in Greece was estimated to be over 30% in 2012. Smoking is a risk factor for many diseases. Studies have demonstrated the association between smoking and laryngeal pathologies as well as changes in voice characteristics. The purpose of this study was to estimate the effect of early smoking habit on young adult female voices and if they perceive any vocal changes using two assessment methods. The Voice Handicap Index and the acoustic analyses of voice measurements were used, with both serving as mini-assessment protocols. Two hundred and ten young females (110 smokers and 100 nonsmokers) attending the Technological Educational Institute of Epirus in the School of Health and Welfare were included. Statistically significant increases for physical and total scores of the Voice Handicap Index were found in the smokers group (P smoking habits. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

Voice deviation, dysphonia risk screening and quality of life in individuals with various laryngeal diagnoses

Science.gov (United States)

Nemr, Katia; Cota, Ariane; Tsuji, Domingos; Simões-Zenari, Marcia

2018-01-01

OBJECTIVES: To characterize the voice quality of individuals with dysphonia and to investigate possible correlations between the degree of voice deviation (D) and scores on the Dysphonia Risk Screening Protocol-General (DRSP), the Voice-Related Quality of Life (V-RQOL) measure and the Voice Handicap Index, short version (VHI-10). METHODS: The sample included 200 individuals with dysphonia. Following laryngoscopy, the participants completed the DRSP, the V-RQOL measure, and the VHI-10; subsequently, voice samples were recorded for auditory-perceptual and acoustic analyses. The correlation between the score for each questionnaire and the overall degree of vocal deviation was analyzed, as was the correlation among the scores for the three questionnaires. RESULTS: Most of the participants (62%) were female, and the mean age of the sample was 49 years. The most common laryngeal diagnosis was organic dysphonia (79.5%). The mean D was 59.54, and the predominance of roughness had a mean of 54.74. All the participants exhibited at least one abnormal acoustic aspect. The mean questionnaire scores were DRSP, 44.7; V-RQOL, 57.1; and VHI-10, 16. An inverse correlation was found between the V-RQOL score and D; however, a positive correlation was found between both the VHI-10 and DRSP scores and D. CONCLUSION: A predominance of adult women, organic dysphonia, moderate voice deviation, high dysphonia risk, and low to moderate quality of life impact characterized our sample. There were correlations between the scores of each of the three questionnaires and the degree of voice deviation. It should be noted that the DRSP monitored the degree of dysphonia severity, which reinforces its applicability for patients with different laryngeal diagnoses. PMID:29538494
Measurement and prediction of voice support and room gain

DEFF Research Database (Denmark)

Pelegrin Garcia, David; Brunskog, Jonas; Lyberg-Åhlander, Viveka

2012-01-01

and good acoustical quality lies in the range between 14 and 9 dB, whereas the room gain is in the range between 0.2 and 0.5 dB. The prediction model for voice support describes the measurements in the classrooms with a coefficient of determination of 0.84 and a standard deviation of 1.2 dB....
The Acoustic Correlates of Breathy Voice: a Study of Source-Vowel INTERACTION{00}{00}{00}{00}{00}{00}{00} {00}{00}{00}{00}{00}{00}{00}{00}{00}{00}{00}{00}{00}{00} {00}{00}{00}{00}{00}{00}{00}{00}{00}{00}{00}{00}{00}{00} {00}{00}{00}{00}{00}{00}{00}{00}{00}{00}{00}{00}{00}{00}.

Science.gov (United States)

Lin, Yeong-Fen Emily

This thesis is the result of an investigation of the source-vowel interaction from the point of view of perception. Major objectives include the identification of the acoustic correlates of breathy voice and the disclosure of the interdependent relationship between the perception of vowel identity and breathiness. Two experiments were conducted to achieve these objectives. In the first experiment, voice samples from one control group and seven patient groups were compared. The control group consisted of five female and five male adults. The ten normals were recruited to perform a sustained vowel phonation task with constant pitch and loudness. The voice samples of seventy patients were retrieved from a hospital data base, with vowels extracted from sentences repeated by patients at their habitual pitch and loudness. The seven patient groups were divided, based on a unique combination of patients' measures on mean flow rate and glottal resistance. Eighteen acoustic variables were treated with a three-way (Gender x Group x Vowel) ANOVA. Parameters showing a significant female-male difference as well as group differences, especially those between the presumed breathy group and the other groups, were identified as relevant to the distinction of breathy voice. As a result, F1-F3 amplitude difference and slope were found to be most effective in distinguishing breathy voice. Other acoustic correlates of breathy voice included F1 bandwidth, RMS-H1 amplitude difference, and F1-F2 amplitude difference and slope. In the second experiment, a formant synthesizer was used to generate vowel stimuli with varying spectral tilt and F1 bandwidth. Thirteen native American English speakers made dissimilarity judgements on paired stimuli in terms of vowel identity and breathiness. Listeners' perceptual vowel spaces were found to be affected by changes in the acoustic correlates of breathy voice. The threshold of detecting a change of vocal quality in the breathiness domain was also
Contemporary Commercial Music Singing Students-Voice Quality and Vocal Function at the Beginning of Singing Training.

Science.gov (United States)

Sielska-Badurek, Ewelina M; Sobol, Maria; Olszowska, Katarzyna; Niemczyk, Kazimierz

2017-10-03

The purpose of this study was to assess the voice quality and the vocal tract function in popular singing students at the beginning of their singing training at the High School of Music. This is a retrospective cross-sectional study. The study consisted of 45 popular singing students (35 females and 10 males, mean age: 19.9 ± 2.8 years). They were assessed in the first 2 months of their 4-year singing training at the High School of Music, between 2013 and 2016. Voice quality and vocal tract function were evaluated using videolaryngostroboscopy, palpation of the vocal tract structures, the perceptual speaking and singing voice assessment, acoustic analysis, maximal phonation time, the Voice Handicap Index, and the Singing Voice Handicap Index (SVHI). Twenty-two percent of Contemporary Commercial Music singing students began their education in the High School, with vocal nodules. Palpation of the vocal tract structure showed in 50% correct motions and tension in speaking and in 39.3% in singing. Perceptual voice assessment showed in 80% proper speaking voice quality and in 82.4% proper singing voice quality. The mean vocal fundamental frequency while speaking in females was 214 Hz and in males was 116 Hz. Dysphonia Severity Index was at the level of 2, and maximum phonation time was 17.7 seconds. The Voice Handicap Index and the SVHI remained within the normal range: 7.5 and 19, respectively. Perceptual singing voice assessment correlated with the SVHI (P = 0.006). Twenty-two percent of the Contemporary Commercial Music singing students began their education in the High School, with organic vocal fold lesions. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Validity and reliability of acoustic analysis of respiratory sounds in infants

Science.gov (United States)

Elphick, H; Lancaster, G; Solis, A; Majumdar, A; Gupta, R; Smyth, R

2004-01-01

Objective: To investigate the validity and reliability of computerised acoustic analysis in the detection of abnormal respiratory noises in infants. Methods: Blinded, prospective comparison of acoustic analysis with stethoscope examination. Validity and reliability of acoustic analysis were assessed by calculating the degree of observer agreement using the κ statistic with 95% confidence intervals (CI). Results: 102 infants under 18 months were recruited. Convergent validity for agreement between stethoscope examination and acoustic analysis was poor for wheeze (κ = 0.07 (95% CI, –0.13 to 0.26)) and rattles (κ = 0.11 (–0.05 to 0.27)) and fair for crackles (κ = 0.36 (0.18 to 0.54)). Both the stethoscope and acoustic analysis distinguished well between sounds (discriminant validity). Agreement between observers for the presence of wheeze was poor for both stethoscope examination and acoustic analysis. Agreement for rattles was moderate for the stethoscope but poor for acoustic analysis. Agreement for crackles was moderate using both techniques. Within-observer reliability for all sounds using acoustic analysis was moderate to good. Conclusions: The stethoscope is unreliable for assessing respiratory sounds in infants. This has important implications for its use as a diagnostic tool for lung disorders in infants, and confirms that it cannot be used as a gold standard. Because of the unreliability of the stethoscope, the validity of acoustic analysis could not be demonstrated, although it could discriminate between sounds well and showed good within-observer reliability. For acoustic analysis, targeted training and the development of computerised pattern recognition systems may improve reliability so that it can be used in clinical practice. PMID:15499065
Deviant vocal fold vibration as observed during videokymography : the effect on voice quality

NARCIS (Netherlands)

Verdonck-de Leeuw, I M; Festen, J.M.; Mahieu, H.F.

Videokymographic images of deviant or irregular vocal fold vibration, including diplophonia, the transition from falsetto to modal voice, irregular vibration onset and offset, and phonation following partial laryngectomy were compared with the synchronously recorded acoustic speech signals. A clear
A hybrid approach to the computational aeroacoustics of human voice production

Czech Academy of Sciences Publication Activity Database

Šidlof, Petr; Zörner, S.; Huppe, A.

2015-01-01

Roč. 14, č. 3 (2015), s. 473-488 ISSN 1617-7959 R&D Projects: GA ČR(CZ) GAP101/11/0207 Institutional support: RVO:61388998 Keywords : computational aeroacoustics * parallel CFD * human voice * vocal folds * ventricular folds Subject RIV: BI - Acoustics Impact factor: 3.032, year: 2015
Who's your neighbor? Acoustic cues to individual identity in red squirrel Tamiasciurus hudsonicus rattle calls

Directory of Open Access Journals (Sweden)

Shannon M. DIGWEED, Drew RENDALL, Teana IMBEAU

2012-10-01

Full Text Available North American red squirrels Tamiasciurus hudsonicus often produce a loud territorial rattle call when conspecifics enter or invade a territory. Previous playback experiments suggest that the territorial rattle call may indicate an invader's identity as squirrels responded more intensely to calls played from strangers than to calls played from neighbors. This dear-enemy effect is well known in a variety of bird and mammal species and functions to reduce aggressive interactions between known neighbors. However, although previous experiments on red squirrels suggest some form of individual differentiation and thus recognition, detailed acoustic analysis of potential acoustic cues in rattle calls have not been conducted. If calls function to aid in conspecific identification in order to mitigate aggressive territorial interactions, we would expect that individual recognition cues would be acoustically represented. Our work provides a detailed analysis of acoustic cues to identity within rattle calls. A total of 225 calls across 32 individual squirrels from Sheep River Provincial Park, Kananaskis, AB, Canada, were analyzed with discriminant function analysis for potential acoustic cues to individual identity. Initial analysis of all individuals revealed a reliable acoustic differentiation across individuals. A more detailed analysis of clusters of neighboring squirrels was performed and results again indicated a statistically significant likelihood that calls were assigned correctly to specific squirrels (55%-75% correctly assigned; in other words squirrels have distinct voices that should allow for individual identification and discrimination by conspecifics [Current Zoology 58 (5: 758–764, 2012].
Integrating acoustic analysis in the architectural design process using parametric modelling

DEFF Research Database (Denmark)

Peters, Brady

2011-01-01

This paper discusses how parametric modeling techniques can be used to provide architectural designers with a better understanding of the acoustic performance of their designs and provide acoustic engineers with models that can be analyzed using computational acoustic analysis software. Architects......, acoustic performance can inform the geometry and material logic of the design. In this way, the architectural design and the acoustic analysis model become linked....
Numerical simulation of flow-induced sound in human voice production

Czech Academy of Sciences Publication Activity Database

Šidlof, Petr; Zörner, S.; Huppe, A.

2013-01-01

Roč. 61, č. 2013 (2013), s. 333-340 E-ISSN 1877-7058. [ParCFD 2013 International conference /25./. Changsha, 20.05.2013-24.05.2013] R&D Projects: GA ČR(CZ) GAP101/11/0207 Institutional support: RVO:61388998 Keywords : aeroacoustics * parallel CFD * human voice * biomechanics * vocal folds Subject RIV: BI - Acoustics
Dimensionality in voice quality.

Science.gov (United States)

Bele, Irene Velsvik

2007-05-01

This study concerns speaking voice quality in a group of male teachers (n = 35) and male actors (n = 36), as the purpose was to investigate normal and supranormal voices. The goal was the development of a method of valid perceptual evaluation for normal to supranormal and resonant voices. The voices (text reading at two loudness levels) had been evaluated by 10 listeners, for 15 vocal characteristics using VA scales. In this investigation, the results of an exploratory factor analysis of the vocal characteristics used in this method are presented, reflecting four dimensions of major importance for normal and supranormal voices. Special emphasis is placed on the effects on voice quality of a change in the loudness variable, as two loudness levels are studied. Furthermore, the vocal characteristics Sonority and Ringing voice quality are paid special attention, as the essence of the term "resonant voice" was a basic issue throughout a doctoral dissertation where this study was included.
Trends in musical theatre voice: an analysis of audition requirements for singers.

Science.gov (United States)

Green, Kathryn; Freeman, Warren; Edwards, Matthew; Meyer, David

2014-05-01

The American musical theatre industry is a multibillion dollar business in which the requirements for singers are varied and complex. This study identifies the musical genres and voice requirements that are currently most requested at professional auditions to help voice teachers, pedagogues, and physicians who work with musical theatre singers understand the demands of their clients' business. Frequency count. One thousand two thirty-eight professional musical theatre audition listings were gathered over a 6-month period, and information from each listing was categorized and entered into a spreadsheet for analysis. The results indicate that four main genres of music were requested over a wide variety of styles, with more than half of auditions requesting genre categories that may not be served by traditional or classical voice technique alone. To adequately prepare young musical theatre performers for the current job market and keep the performers healthily making the sounds required by the industry, new singing styles may need to be studied and integrated into voice training that only teaches classical styles. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Speaker-Oriented Classroom Acoustics Design Guidelines in the Context of Current Regulations in European Countries

DEFF Research Database (Denmark)

Pelegrin Garcia, David; Brunskog, Jonas; Rasmussen, Birgit

2014-01-01

Most European countries have regulatory requirements or guidelines for reverberation time in classrooms which have the goal of enhancing speech intelligibility and reducing noise levels in schools. At the same time, school teachers suffer frequently from voice problems due to high vocal load...... experienced at work. With the aim of improving working conditions for teachers, this article presents guidelines for classroom acoustics design that meet simultaneously criteria of vocal comfort and speech intelligibility, which may be of use in future discussions for updating regulatory requirements...... in classroom acoustics. Two room acoustic parameters are shown relevant for a speaker: the voice support, linked to vocal effort, and the decay time derived from an oral-binaural impulse response, linked to vocal comfort. Theoretical prediction models for room-averaged values of these parameters are combined...
Detection of Pathological Voice Using Cepstrum Vectors: A Deep Learning Approach.

Science.gov (United States)

Fang, Shih-Hau; Tsao, Yu; Hsiao, Min-Jing; Chen, Ji-Ying; Lai, Ying-Hui; Lin, Feng-Chuan; Wang, Chi-Te

2018-03-19

Computerized detection of voice disorders has attracted considerable academic and clinical interest in the hope of providing an effective screening method for voice diseases before endoscopic confirmation. This study proposes a deep-learning-based approach to detect pathological voice and examines its performance and utility compared with other automatic classification algorithms. This study retrospectively collected 60 normal voice samples and 402 pathological voice samples of 8 common clinical voice disorders in a voice clinic of a tertiary teaching hospital. We extracted Mel frequency cepstral coefficients from 3-second samples of a sustained vowel. The performances of three machine learning algorithms, namely, deep neural network (DNN), support vector machine, and Gaussian mixture model, were evaluated based on a fivefold cross-validation. Collective cases from the voice disorder database of MEEI (Massachusetts Eye and Ear Infirmary) were used to verify the performance of the classification mechanisms. The experimental results demonstrated that DNN outperforms Gaussian mixture model and support vector machine. Its accuracy in detecting voice pathologies reached 94.26% and 90.52% in male and female subjects, based on three representative Mel frequency cepstral coefficient features. When applied to the MEEI database for validation, the DNN also achieved a higher accuracy (99.32%) than the other two classification algorithms. By stacking several layers of neurons with optimized weights, the proposed DNN algorithm can fully utilize the acoustic features and efficiently differentiate between normal and pathological voice samples. Based on this pilot study, future research may proceed to explore more application of DNN from laboratory and clinical perspectives. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Brain 'talks over' boring quotes: top-down activation of voice-selective areas while listening to monotonous direct speech quotations.

Science.gov (United States)

Yao, Bo; Belin, Pascal; Scheepers, Christoph

2012-04-15

In human communication, direct speech (e.g., Mary said, "I'm hungry") is perceived as more vivid than indirect speech (e.g., Mary said that she was hungry). This vividness distinction has previously been found to underlie silent reading of quotations: Using functional magnetic resonance imaging (fMRI), we found that direct speech elicited higher brain activity in the temporal voice areas (TVA) of the auditory cortex than indirect speech, consistent with an "inner voice" experience in reading direct speech. Here we show that listening to monotonously spoken direct versus indirect speech quotations also engenders differential TVA activity. This suggests that individuals engage in top-down simulations or imagery of enriched supra-segmental acoustic representations while listening to monotonous direct speech. The findings shed new light on the acoustic nature of the "inner voice" in understanding direct speech. Copyright Â© 2012 Elsevier Inc. All rights reserved.
Distributed acoustic cues for caller identity in macaque vocalization.

Science.gov (United States)

Fukushima, Makoto; Doyle, Alex M; Mullarkey, Matthew P; Mishkin, Mortimer; Averbeck, Bruno B

2015-12-01

Individual primates can be identified by the sound of their voice. Macaques have demonstrated an ability to discern conspecific identity from a harmonically structured 'coo' call. Voice recognition presumably requires the integrated perception of multiple acoustic features. However, it is unclear how this is achieved, given considerable variability across utterances. Specifically, the extent to which information about caller identity is distributed across multiple features remains elusive. We examined these issues by recording and analysing a large sample of calls from eight macaques. Single acoustic features, including fundamental frequency, duration and Weiner entropy, were informative but unreliable for the statistical classification of caller identity. A combination of multiple features, however, allowed for highly accurate caller identification. A regularized classifier that learned to identify callers from the modulation power spectrum of calls found that specific regions of spectral-temporal modulation were informative for caller identification. These ranges are related to acoustic features such as the call's fundamental frequency and FM sweep direction. We further found that the low-frequency spectrotemporal modulation component contained an indexical cue of the caller body size. Thus, cues for caller identity are distributed across identifiable spectrotemporal components corresponding to laryngeal and supralaryngeal components of vocalizations, and the integration of those cues can enable highly reliable caller identification. Our results demonstrate a clear acoustic basis by which individual macaque vocalizations can be recognized.
The Traditional/Acoustic Music Project: a study of vocal demands and vocal health.

Science.gov (United States)

Erickson, Molly L

2012-09-01

The Traditional/Acoustic Music Project seeks to identify the musical and performance characteristics of traditional/acoustic musicians and determine the vocal demands they face with the goals of (1) providing information and outreach to this important group of singers and (2) providing information to physicians, speech-language pathologists, and singing teachers who will enable them to provide appropriate services. Descriptive cross-sectional study. Data have been collected through administration of a 53-item questionnaire. The questionnaire was administered to artists performing at local venues in Knoxville, Tennessee and also to musicians attending the 2008 Folk Alliance Festival in Memphis, Tennessee. Approximately 41% of the respondents have had no vocal training, whereas approximately 34% of the respondents have had some form of formal vocal training (private lessons or group instruction). About 41% of the participants had experienced a tired voice, whereas about 30% of the participants had experienced either a loss of the top range of the voice or a total loss of voice at least once in their careers. Approximately 31% of the respondents had no health insurance. Approximately 69% of the respondents reported that they get their information about healthy singing practices solely from fellow musicians or that they do not get any information at all. Traditional/acoustic musicians are a poorly studied population at risk for the development of voice disorders. Continued research is necessary with the goal of a large sample that can be analyzed for associations, identification of subpopulations, and formulation of specific hypotheses that lend themselves to experimental research. Appropriate models of information and service delivery tailored for the singer-instrumentalist are needed. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Prevalence of Voice Disorders in Singers: Systematic Review and Meta-Analysis.

Science.gov (United States)

Pestana, Pedro Melo; Vaz-Freitas, Susana; Manso, Maria Conceição

2017-11-01

The study aimed to review the prevalence of self-reported voice disorders in singers. The study is a systematic review and meta-analysis. A systematic review of five major scientific databases was conducted. An extensive search strategy was used considering the rules of each database. Original articles were included only if they had data related to self-perception of dysphonia in the past. Furthermore, heterogeneity and its relative significance were assessed. There were 2371 articles identified; duplicates were deleted, screenings were conducted, and inclusion and exclusion criteria were applied. The final analysis was conducted on 11 studies. The most implemented instruments for the study were customized questionnaires. The findings about singing styles, voice use, and age were found to be different among subjects. The overall prevalence of self-reported dysphonia in singers was 46.09% (95% confidence interval: 38.16-54.12). The heterogeneity was considerable among the studied samples (I 2 = 90.59%). Four groups were then established-students, teachers, classical, and nonclassical-and compared regarding overall prevalence (21.76% in students, and significantly higher and nondifferent in the other three groups, 55.15%, 40.53%, and 46.96%, respectively) and heterogeneity (low only for the students' studies). Although with low homogeneity, singers present a high prevalence of self-perceived dysphonia over their careers. Singing students were the group with a lower prevalence. On the other hand, traditional and popular music singers, as well as singing teachers, revealed significantly higher prevalence of self-perceived dysphonia. Overall, singers are likely to report voice disorders, no matter their singing style or skills. This highlights the need of a preventive approach to address voice disorders in traditional and untrained singers. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Acoustic cue weighting in the singleton vs geminate contrast in Lebanese Arabic: The case of fricative consonants.

Science.gov (United States)

Al-Tamimi, Jalal; Khattab, Ghada

2015-07-01

This paper is the first reported investigation of the role of non-temporal acoustic cues in the singleton-geminate contrast in Lebanese Arabic, alongside the more frequently reported temporal cues. The aim is to explore the extent to which singleton and geminate consonants show qualitative differences in a language where phonological length is prominent and where moraic structure governs segment timing and syllable weight. Twenty speakers (ten male, ten female) were recorded producing trochaic disyllables with medial singleton and geminate fricatives preceded by phonologically short and long vowels. The following acoustic measures were applied on the medial fricative and surrounding vowels: absolute duration; intensity; fundamental frequency; spectral peak and shape, dynamic amplitude, and voicing patterns of medial fricatives; and vowel quality and voice quality correlates of surrounding vowels. Discriminant analysis and receiver operating characteristics (ROC) curves were used to assess each acoustic cue's contribution to the singleton-geminate contrast. Classification rates of 89% and ROC curves with an area under the curve rate of 96% confirmed the major role played by temporal cues, with non-temporal cues contributing to the contrast but to a much lesser extent. These results confirm that the underlying contrast for gemination in Arabic is temporal, but highlight [+tense] (fortis) as a secondary feature.
Lower Vocal Tract Morphologic Adjustments Are Relevant for Voice Timbre in Singing.

Science.gov (United States)

Mainka, Alexander; Poznyakovskiy, Anton; Platzek, Ivan; Fleischer, Mario; Sundberg, Johan; Mürbe, Dirk

2015-01-01

The vocal tract shape is crucial to voice production. Its lower part seems particularly relevant for voice timbre. This study analyzes the detailed morphology of parts of the epilaryngeal tube and the hypopharynx for the sustained German vowels /a/, /e/, /i/, /o/, and /u/ by thirteen male singer subjects who were at the beginning of their academic singing studies. Analysis was based on two different phonatory conditions: a natural, speech-like phonation and a singing phonation, like in classical singing. 3D models of the vocal tract were derived from magnetic resonance imaging and compared with long-term average spectrum analysis of audio recordings from the same subjects. Comparison of singing to the speech-like phonation, which served as reference, showed significant adjustments of the lower vocal tract: an average lowering of the larynx by 8 mm and an increase of the hypopharyngeal cross-sectional area (+ 21:9%) and volume (+ 16:8%). Changes in the analyzed epilaryngeal portion of the vocal tract were not significant. Consequently, lower larynx-to-hypopharynx area and volume ratios were found in singing compared to the speech-like phonation. All evaluated measures of the lower vocal tract varied significantly with vowel quality. Acoustically, an increase of high frequency energy in singing correlated with a wider hypopharyngeal area. The findings offer an explanation how classical male singers might succeed in producing a voice timbre with increased high frequency energy, creating a singer`s formant cluster.

Back-and-Forth Methodology for Objective Voice Quality Assessment: From/to Expert Knowledge to/from Automatic Classification of Dysphonia

Science.gov (United States)

Fredouille, Corinne; Pouchoulin, Gilles; Ghio, Alain; Revis, Joana; Bonastre, Jean-François; Giovanni, Antoine

2009-12-01

This paper addresses voice disorder assessment. It proposes an original back-and-forth methodology involving an automatic classification system as well as knowledge of the human experts (machine learning experts, phoneticians, and pathologists). The goal of this methodology is to bring a better understanding of acoustic phenomena related to dysphonia. The automatic system was validated on a dysphonic corpus (80 female voices), rated according to the GRBAS perceptual scale by an expert jury. Firstly, focused on the frequency domain, the classification system showed the interest of 0-3000 Hz frequency band for the classification task based on the GRBAS scale. Later, an automatic phonemic analysis underlined the significance of consonants and more surprisingly of unvoiced consonants for the same classification task. Submitted to the human experts, these observations led to a manual analysis of unvoiced plosives, which highlighted a lengthening of VOT according to the dysphonia severity validated by a preliminary statistical analysis.
Back-and-Forth Methodology for Objective Voice Quality Assessment: From/to Expert Knowledge to/from Automatic Classification of Dysphonia

Directory of Open Access Journals (Sweden)

Corinne Fredouille

2009-01-01

Full Text Available This paper addresses voice disorder assessment. It proposes an original back-and-forth methodology involving an automatic classification system as well as knowledge of the human experts (machine learning experts, phoneticians, and pathologists. The goal of this methodology is to bring a better understanding of acoustic phenomena related to dysphonia. The automatic system was validated on a dysphonic corpus (80 female voices, rated according to the GRBAS perceptual scale by an expert jury. Firstly, focused on the frequency domain, the classification system showed the interest of 0–3000 Hz frequency band for the classification task based on the GRBAS scale. Later, an automatic phonemic analysis underlined the significance of consonants and more surprisingly of unvoiced consonants for the same classification task. Submitted to the human experts, these observations led to a manual analysis of unvoiced plosives, which highlighted a lengthening of VOT according to the dysphonia severity validated by a preliminary statistical analysis.
Influence of consonant voicing characteristics on sentence production in abductor versus adductor spasmodic dysphonia.

Science.gov (United States)

Cannito, Michael P; Chorna, Lesya B; Kahane, Joel C; Dworkin, James P

2014-05-01

This study evaluated the hypotheses that sentence production by speakers with adductor (AD) and abductor (AB) spasmodic dysphonia (SD) may be differentially influenced by consonant voicing and manner features, in comparison with healthy, matched, nondysphonic controls. This was a prospective, single blind study, using a between-groups, repeated measures design for the independent variables of perceived voice quality and sentence duration. Sixteen subjects with ADSD and 10 subjects with ABSD, as well as 26 matched healthy controls produced four short, simple sentences that were systematically loaded with voiced or voiceless consonants of either obstruant or continuant manner categories. Experienced voice clinicians, who were "blind" as to speakers' group affixations, used visual analog scaling to judge the overall voice quality of each sentence. Acoustic sentence durations were also measured. Speakers with ABSD or ADSD demonstrated significantly poorer than normal voice quality on all sentences. Speakers with ABSD exhibited longer than normal duration for voiceless consonant sentences. Speakers with ADSD had poorer voice quality for voiced than for voiceless consonant sentences. Speakers with ABSD had longer durations for voiceless than for voiced consonant sentences. The two subtypes of SD exhibit differential performance on the basis of consonant voicing in short, simple sentences; however, each subgroup manifested voicing-related differences on a different variable (voice quality vs sentence duration). Findings suggest different underlying pathophysiological mechanisms for ABSD and ADSD. Findings also support inclusion of short, simple sentences containing voiced or voiceless consonants as part of the diagnostic protocol for SD, with measurement of sentence duration in addition to judments of voice quality severity. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Análise acústica da voz captada na faringe próximo à fonte glótica através de microfone acoplado ao fibrolaringoscópio Acoustic analysis of voice captured in the pharynx above the glottic source through a microphone on a laryngo-fiberscope

Directory of Open Access Journals (Sweden)

Erica E. Fukuyama

2001-01-01

Full Text Available Objetivo: O objetivo deste trabalho foi estudar a voz próximo à sua fonte produtora, as pregas vocais, através de um microfone miniaturizado de aparelho auditivo que foi adaptado para ser acoplado à extremidade de um fibrolaringoscópio, permitindo a captação da voz durante a laringoscopia direta. Forma de estudo: Experimental. Material e Método: A voz foi estudada em um grupo de 50 indivíduos, 25 homens e 25 mulheres sem doenças, através de um programa de análise acústica MDVP (Multi-Dimensional Voice Program do laboratório de voz Computerized Speech Lab, Model 4300B, da Kay Elemetrics. Amostras de vogais sustentadas /a/, /i/ e /u/ foram captadas de três formas diferentes, primeiramente com um microfone comum externo a 15 cm da boca, em segundo lugar com o microfone especial na faringe a 1,5 cm acima das pregas vocais e por último com o microfone especial externamente a 2 cm da boca. Doze parâmetros acústicos relacionados a freqüência fundamental, amplitude e ruído de cada uma das vogais foram comparadas estatisticamente conforme à sua forma de captação. Resultados: Os resultados mostraram diferenças estatisticamente significativas entre a voz captada pelo microfone comum externo e o microfone especial, em relação à freqüência fundamental, aos parâmetros de variação de periodicidade de freqüência, amplitude e ruído. Conclusão: A diferença do som da fonte glótica do som da voz externa pode mostrar as modificações sofridas pela voz no decorrer da passagem pelo trato vocal.Aim: The aim of the present study is to examine the voice to its acoustic source - the vocal folds - with a miniature hearing-aid microphone coupled to the extremity of a laryngo-fiberscope allowing the voice to be captured during direct laryngoscopy. Study design: Experimental. Material and Method: The voice of 50 individuals - 25 males and 25 females bearing no pathologies - was collected by the Multi-Dimensional Voice Program (MDVP by
Acoustics and the Performance of Music Manual for Acousticians, Audio Engineers, Musicians, Architects and Musical Instrument Makers

CERN Document Server

Meyer, Jürgen

2009-01-01

Acoustics and the Performance of Music connects scientific understandings of acoustics with practical applications to musical performance. Of central importance are the tonal characteristics of musical instruments and the singing voice including detailed representations of directional characteristics. Furthermore, room acoustical concerns related to concert halls and opera houses are considered. Based on this, suggestions are made for musical performance. Included are seating arrangements within the orchestra and adaptations of performance techniques to the performance environment. In the presentation we dispense with complicated mathematical connections and deliberately aim for conceptual explanations accessible to musicians, particularly for conductors. The graphical representations of the directional dependence of sound radiation by musical instruments and the singing voice are unique. Since the first edition was published in 1978, this book has been completely revised and rewritten to include current rese...
Validity and Reliability Study of Bahasa Malaysia Version of Voice Handicap Index-10.

Science.gov (United States)

Ong, Fei Ming; Husna Nik Hassan, Nik Fariza; Azman, Mawaddah; Sani, Abdullah; Mat Baki, Marina

2018-05-21

This study aimed to determine the validity and reliability of Bahasa Malaysia version of Voice Handicap Index-10 (mVHI-10). This cross-sectional study was carried out in the Otorhinolaryngology, Head and Neck Surgery Department of Universiti Kebangsaan Malaysia Medical Centre (UKMMC) from June 2015 to May 2016. The mVHI-10 was produced following a rigorous forward and backward translation. One hundred participants, including 50 healthy volunteers (17 male, 33 female) and 50 patients with voice disorders (26 male, 24 female), were recruited to complete the mVHI-10 before flexible laryngoscopic examinations and acoustic analysis. The mVHI-10 was repeated in 2 weeks via telephone interview or clinic visit. Its reliability and validity were assessed using interclass correlation. The test-retest reliability for total mVHI-10 and each item score was high, with the Cronbach alpha of >0.90. The total mVHI-10 score and domain scores were significantly higher (P Kaiser-Meyer-Olkin measure was 0.92, which depicted excellent construct validity. There was a significant positive correlation between the mVHI-10 score and jitter and shimmer result (P < 0.001). The present study showed good reliability and validity of the mVHI-10 when applied to both healthy volunteers and patients with voice disorders. We recommend the use of the mVHI-10 in daily clinical practice among Bahasa Malaysia-speaking population. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Acoustic analysis of warp potential of green ponderosa pine lumber

Science.gov (United States)

Xiping Wang; William T. Simpson

2005-01-01

This study evaluated the potential of acoustic analysis as presorting criteria to identify warp-prone boards before kiln drying. Dimension lumber, 38 by 89 mm (nominal 2 by 4 in.) and 2.44 m (8 ft) long, sawn from open-grown small-diameter ponderosa pine trees, was acoustically tested lengthwise at green condition. Three acoustic properties (acoustic speed, rate of...
Connections between voice ergonomic risk factors and voice symptoms, voice handicap, and respiratory tract diseases.

Science.gov (United States)

Rantala, Leena M; Hakala, Suvi J; Holmqvist, Sofia; Sala, Eeva

2012-11-01

The aim of the study was to investigate the connections between voice ergonomic risk factors found in classrooms and voice-related problems in teachers. Voice ergonomic assessment was performed in 39 classrooms in 14 elementary schools by means of a Voice Ergonomic Assessment in Work Environment--Handbook and Checklist. The voice ergonomic risk factors assessed included working culture, noise, indoor air quality, working posture, stress, and access to a sound amplifier. Teachers from the above-mentioned classrooms reported their voice symptoms, respiratory tract diseases, and completed a Voice Handicap Index (VHI). The more voice ergonomic risk factors found in the classroom the higher were the teachers' total scores on voice symptoms and VHI. Stress was the factor that correlated most strongly with voice symptoms. Poor indoor air quality increased the occurrence of laryngitis. Voice ergonomics were poor in the classrooms studied and voice ergonomic risk factors affected the voice. It is important to convey information on voice ergonomics to education administrators and those responsible for school planning and taking care of school buildings. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
The pattern of educator voice in clinical counseling in an educational hospital in Shiraz, Iran: a conversation analysis.

Science.gov (United States)

Kalateh Sadati, Ahmad; Bagheri Lankarani, Kamran

2017-01-01

Doctor-patient interaction (DPI) includes different voices, of which the educator voice is of considerable importance. Physicians employ this voice to educate patients and their caregivers by providing them with information in order to change the patients' behavior and improve their health status. The subject has not yet been fully understood, and therefore the present study was conducted to explore the pattern of educator voice. For this purpose, conversation analysis (CA) of 33 recorded clinical consultations was performed in outpatient educational clinics in Shiraz, Iran between April 2014 and September 2014. In this qualitative study, all utterances, repetitions, lexical forms, chuckles and speech particles were considered and interpreted as social actions. Interpretations were based on inductive data-driven analysis with the aim to find recurring patterns of educator voice. The results showed educator voice to have two general features: descriptive and prescriptive. However, the pattern of educator voice comprised characteristics such as superficiality, marginalization of patients, one-dimensional approach, ignoring a healthy lifestyle, and robotic nature. The findings of this study clearly demonstrated a deficiency in the educator voice and inadequacy in patient-centered dialogue. In this setting, the educator voice was related to a distortion of DPI through the physicians' dominance, leading them to ignore their professional obligation to educate patients. Therefore, policies in this regard should take more account of enriching the educator voice through training medical students and faculty members in communication skills.
Muscular tension and body posture in relation to voice handicap and voice quality in teachers with persistent voice complaints.

Science.gov (United States)

Kooijman, P G C; de Jong, F I C R S; Oudes, M J; Huinck, W; van Acht, H; Graamans, K

2005-01-01

The aim of this study was to investigate the relationship between extrinsic laryngeal muscular hypertonicity and deviant body posture on the one hand and voice handicap and voice quality on the other hand in teachers with persistent voice complaints and a history of voice-related absenteeism. The study group consisted of 25 female teachers. A voice therapist assessed extrinsic laryngeal muscular tension and a physical therapist assessed body posture. The assessed parameters were clustered in categories. The parameters in the different categories represent the same function. Further a tension/posture index was created, which is the summation of the different parameters. The different parameters and the index were related to the Voice Handicap Index (VHI) and the Dysphonia Severity Index (DSI). The scores of the VHI and the individual parameters differ significantly except for the posterior weight bearing and tension of the sternocleidomastoid muscle. There was also a significant difference between the individual parameters and the DSI, except for tension of the cricothyroid muscle and posterior weight bearing. The score of the tension/posture index correlates significantly with both the VHI and the DSI. In a linear regression analysis, the combination of hypertonicity of the sternocleidomastoid, the geniohyoid muscles and posterior weight bearing is the most important predictor for a high voice handicap. The combination of hypertonicity of the geniohyoid muscle, posterior weight bearing, high position of the hyoid bone, hypertonicity of the cricothyroid muscle and anteroposition of the head is the most important predictor for a low DSI score. The results of this study show the higher the score of the index, the higher the score of the voice handicap and the worse the voice quality is. Moreover, the results are indicative for the importance of assessment of muscular tension and body posture in the diagnosis of voice disorders.
Neural effects of environmental advertising: An fMRI analysis of voice age and temporal framing.

Science.gov (United States)

Casado-Aranda, Luis-Alberto; Martínez-Fiestas, Myriam; Sánchez-Fernández, Juan

2018-01-15

Ecological information offered to society through advertising enhances awareness of environmental issues, encourages development of sustainable attitudes and intentions, and can even alter behavior. This paper, by means of functional Magnetic Resonance Imaging (fMRI) and self-reports, explores the underlying mechanisms of processing ecological messages. The study specifically examines brain and behavioral responses to persuasive ecological messages that differ in temporal framing and in the age of the voice pronouncing them. The findings reveal that attitudes are more positive toward future-framed messages presented by young voices. The whole-brain analysis reveals that future-framed (FF) ecological messages trigger activation in brain areas related to imagery, prospective memories and episodic events, thus reflecting the involvement of past behaviors in future ecological actions. Past-framed messages (PF), in turn, elicit brain activations within the episodic system. Young voices (YV), in addition to triggering stronger activation in areas involved with the processing of high-timbre, high-pitched and high-intensity voices, are perceived as more emotional and motivational than old voices (OV) as activations in anterior cingulate cortex and amygdala. Messages expressed by older voices, in turn, exhibit stronger activation in areas formerly linked to low-pitched voices and voice gender perception. Interestingly, a link is identified between neural and self-report responses indicating that certain brain activations in response to future-framed messages and young voices predicted higher attitudes toward future-framed and young voice advertisements, respectively. The results of this study provide invaluable insight into the unconscious origin of attitudes toward environmental messages and indicate which voice and temporal frame of a message generate the greatest subconscious value. Copyright © 2017 Elsevier Ltd. All rights reserved.
Status reports on developments and applications of acoustic emission analysis. Lectures

International Nuclear Information System (INIS)

1994-01-01

The 25 lectures give a survey of the field of acoustic emission analysis. After a state-of-the-art report, the 4 new SE regulations of the DGZfP are presented which will standardize the applications of acoustic emission analysis. Acoustic emission sources and signal processing, including 3D detection, are discussed in 5 papers. After this, the practical applications of acoustic emission analysis are discussed in detail: Testing of gas tanks, inspection of storage container bottoms, reinforced concrete (3D analysis during extension testing), polymer-impregnated concrete, glass (testing up to 90 MHz), ceramics (thermoshock behaviour), fibre-reinforced plastics (4 contributions), PVD films, rock (analysis of workability and structure), glued joints between metals, monitoring of laser beam welding, metal cutting, and drying of cut wood. (orig.) [de
Research Paper: Investigation of Acoustic Characteristics of Speech Motor Control in Children Who Stutter and Children Who Do Not Stutter

Directory of Open Access Journals (Sweden)

Fatemeh Fakar Gharamaleki

2016-11-01

Full Text Available Objective Stuttering is a developmental disorder of speech fluency with unknown causes. One of the proposed theories in this field is deficits in speech motor control that is associated with damaged control, timing, and coordination of the speech muscles. Fundamental frequency, fundamental frequency range, intensity, intensity range, and voice onset time are the most important acoustic components that are often used for indirect evaluation of physiological functions underlying the mechanisms of speech motor control. The purpose of this investigation was to compare some of the acoustic characteristics of speech motor control in children who stutter and children who do not stutter. Materials & Methods This research is a descriptive-analytic and cross-sectional comparative study. A total of 25 Azari-Persian bilingual boys who stutter (stutters group and 23 Azari-Persian bilinguals and 21 Persian monolingual boys who do not stutter (non-stutters group in the age range of 6 to 10 years participated in this study. Children participated in /a/ and /i/ vowels prolongation and carrier phrase repetition tasks for the analysis of some of their acoustic characteristics including fundamental frequency, fundamental frequency range, intensity, intensity range, and voice onset time. The PRAAT software was used for acoustic analysis. SPSS software (version 17, one-way ANOVA, and Kruskal-Wallis test were used for analyzing the data. Results The results indicated that there were no significant differences between the stutters and non-stutters groups (P>0.05 with respect to the acoustic features of speech motor control . Conclusion No significant group differences were observed in all of the dependent variables reported in this study. Thus, the results of this research do not support the notion of aberrant speech motor control in children who stutter.
Objective Voice Parameters in Colombian School Workers with Healthy Voices

Directory of Open Access Journals (Sweden)

Lady Catherine Cantor Cutiva

2015-09-01

Full Text Available Objectives: To characterize the objective voice parameters among school workers, and to identify associated factors of three objective voice parameters, namely fundamental frequency, sound pressure level and maximum phonation time. Materials and methods: We conducted a cross-sectional study among 116 Colombian teachers and 20 Colombian non-teachers. After signing the informed consent form, participants filled out a questionnaire. Then, a voice sample was recorded and evaluated perceptually by a speech therapist and by objective voice analysis with praat software. Short-term environmental measurements of sound level, temperature, humidity, and reverberation time were conducted during visits at the workplaces, such as classrooms and offices. Linear regression analysis was used to determine associations between individual and work-related factors and objective voice parameters. Results: Compared with men, women had higher fundamental frequency (201 Hz for teachers and 209 for non-teachers vs. 120 Hz for teachers and 127 for non-teachers and sound pressure level (82 dB vs. 80 dB, and shorter maximum phonation time (around 14 seconds vs. around 16 seconds. Female teachers younger than 50 years of age evidenced a significant tendency to speak with lower fundamental frequency and shorter mpt compared with female teachers older than 50 years of age. Female teachers had significantly higher fundamental frequency (66 Hz, higher sound pressure level (2 dB and short phonation time (2 seconds than male teachers. Conclusion: Female teachers younger than 50 years of age had significantly lower F0 and shorter mpt compared with those older than 50 years of age. The multivariate analysis showed that gender was a much more important determinant of variations in F0, spl and mpt than age and teaching occupation. Objectively measured temperature also contributed to the changes on spl among school workers.
Site-Specific Soundscape Design for the Creation of Sonic Architectures and the Emergent Voices of Buildings

Directory of Open Access Journals (Sweden)

Jordan Lacey

2014-01-01

Full Text Available Does a building contain its own Voice? And if so, can that Voice be discovered, transformed and augmented by soundscape design? Barry Blesser’s writings on acoustic space, discuss reverberation and resonant frequencies as providing architectural spaces with characteristic listening conditions related to the architectural space’s dimensions and materiality. The paper argues that Blesser and Salter expand such discussion into pantheistic speculation when suggesting that humanity contains the imaginative capacity to experience spaces as “living spirits”. This argument is achieved by building on the speculation through the discussion of a soundscape design methodology that considers space as containing pantheistic qualities. Sonic architectures are created with electroacoustic sound installations that recompose existing architectural soundscapes, to create the conditions for the emergence of the Voices of buildings. This paper describes two soundscape designs, Revoicing the Striated Soundscape and Subterranean Voices, which transformed existing architectural soundscapes for the emergence of Voices in a laneway and a building located in the City of Melbourne, Australia.
Evaluation of vocal acoustic and efficiency analysis parameters in medical students and academic teachers with use of iris and diagnoscope specialist software.

Science.gov (United States)

Zielińska-Bliźniewska, Hanna; Sułkowski, Wiesław J; Pietkiewicz, Piotr; Miłoński, Jarosław; Mazurek, Agnieszka; Olszewski, Jurek

2012-06-01

The aim of this study was to compare the parameters of vocal acoustic and vocal efficiency analyses in medical students and academic teachers with use of the IRIS and DiagnoScope Specialist software and to evaluate their usefulness in prevention and certification of occupational disease. The study group comprised 40 women, including students and employees of the Military Medical Faculty, Medical University of Łodź. After informed consent had been obtained from the participant women, the primary medical history was taken, videolaryngoscopic and stroboscopic examinations were performed and diagnostic vocal acoustic analysis was carried out with the use of the IRIS and Diagno-Scope Specialist software. Based on the results of the performed measurements, the statistical analysis evidenced the compatibility between two software programs, IRIS and DiagnoScope Specialist, with the only exception of the F4 formant. The mean values of vocal acoustic parameters in medical students and academic teachers, obtained by means of the IRIS software, can be used as standards for the female population not yet developed by the producer. When using the DiagnoScope Specialist software, some mean values were higher and some lower than the standards specified by the producer. The study evidenced the compatibility between two measurement software programs, IRIS and DiagnoScope Specialist, except for the F4 formant. It should be noted that the later has advantage over the former since the standard values of vocal acoustic parameters have been worked out by the producer. Moreover, they only slightly departed from the values obtained in our study and may be useful in diagnostics of occupational voice disorders.
Prospective clinical study on long-term swallowing function and voice quality in advanced head and neck cancer patients treated with concurrent chemoradiotherapy and preventive swallowing exercises.

Science.gov (United States)

Kraaijenga, Sophie A C; van der Molen, Lisette; Jacobi, Irene; Hamming-Vrieze, Olga; Hilgers, Frans J M; van den Brekel, Michiel W M

2015-11-01

Concurrent chemoradiotherapy (CCRT) for advanced head and neck cancer (HNC) is associated with substantial early and late side effects, most notably regarding swallowing function, but also regarding voice quality and quality of life (QoL). Despite increased awareness/knowledge on acute dysphagia in HNC survivors, long-term (i.e., beyond 5 years) prospectively collected data on objective and subjective treatment-induced functional outcomes (and their impact on QoL) still are scarce. The objective of this study was the assessment of long-term CCRT-induced results on swallowing function and voice quality in advanced HNC patients. The study was conducted as a randomized controlled trial on preventive swallowing rehabilitation (2006-2008) in a tertiary comprehensive HNC center with twenty-two disease-free and evaluable HNC patients as participants. Multidimensional assessment of functional sequels was performed with videofluoroscopy, mouth opening measurements, Functional Oral Intake Scale, acoustic voice parameters, and (study specific, SWAL-QoL, and VHI) questionnaires. Outcome measures at 6 years post-treatment were compared with results at baseline and at 2 years post-treatment. At a mean follow-up of 6.1 years most initial tumor-, and treatment-related problems remained similarly low to those observed after 2 years follow-up, except increased xerostomia (68%) and increased (mild) pain (32%). Acoustic voice analysis showed less voicedness, increased fundamental frequency, and more vocal effort for the tumors located below the hyoid bone (n = 12), without recovery to baseline values. Patients' subjective vocal function (VHI score) was good. Functional swallowing and voice problems at 6 years post-treatment are minimal in this patient cohort, originating from preventive and continued post-treatment rehabilitation programs.
Acoustic analysis in Mudejar-Gothic churches: experimental results.

Science.gov (United States)

Galindo, Miguel; Zamarreño, Teófilo; Girón, Sara

2005-05-01

This paper describes the preliminary results of research work in acoustics, conducted in a set of 12 Mudejar-Gothic churches in the city of Seville in the south of Spain. Despite common architectural style, the churches feature individual characteristics and have volumes ranging from 3947 to 10 708 m3. Acoustic parameters were measured in unoccupied churches according to the ISO-3382 standard. An extensive experimental study was carried out using impulse response analysis through a maximum length sequence measurement system in each church. It covered aspects such as reverberation (reverberation times, early decay times), distribution of sound levels (sound strength); early to late sound energy parameters derived from the impulse responses (center time, clarity for speech, clarity, definition, lateral energy fraction), and speech intelligibility (rapid speech transmission index), which all take both spectral and spatial distribution into account. Background noise was also measured to obtain the NR indices. The study describes the acoustic field inside each temple and establishes a discussion for each one of the acoustic descriptors mentioned by using the theoretical models available and the principles of architectural acoustics. Analysis of the quality of the spaces for music and speech is carried out according to the most widespread criteria for auditoria.
Acoustic analysis in Mudejar-Gothic churches: Experimental results

Science.gov (United States)

Galindo, Miguel; Zamarreño, Teófilo; Girón, Sara

2005-05-01

This paper describes the preliminary results of research work in acoustics, conducted in a set of 12 Mudejar-Gothic churches in the city of Seville in the south of Spain. Despite common architectural style, the churches feature individual characteristics and have volumes ranging from 3947 to 10 708 m3. Acoustic parameters were measured in unoccupied churches according to the ISO-3382 standard. An extensive experimental study was carried out using impulse response analysis through a maximum length sequence measurement system in each church. It covered aspects such as reverberation (reverberation times, early decay times), distribution of sound levels (sound strength); early to late sound energy parameters derived from the impulse responses (center time, clarity for speech, clarity, definition, lateral energy fraction), and speech intelligibility (rapid speech transmission index), which all take both spectral and spatial distribution into account. Background noise was also measured to obtain the NR indices. The study describes the acoustic field inside each temple and establishes a discussion for each one of the acoustic descriptors mentioned by using the theoretical models available and the principles of architectural acoustics. Analysis of the quality of the spaces for music and speech is carried out according to the most widespread criteria for auditoria. .
Injection laryngoplasty as miniinvasive office-based surgery in patients with unilateral vocal fold paralysis - voice quality outcomes.

Science.gov (United States)

Sielska-Badurek, Ewelina M; Sobol, Maria; Jędra, Katarzyna; Rzepakowska, Anna; Osuch-Wójcikiewicz, Ewa; Niemczyk, Kazimierz

2017-09-01

Injection laryngoplasty (glottis augmentation) is the preferred method in surgical management of unilateral vocal fold paralysis (UVFP). Traditionally, these procedures are performed in the operating room. Nowadays, however, these procedures have moved into the office. To evaluate the voice quality after transoral injection laryngoplasty under local anaesthesia in patients with unilateral vocal fold paralysis. Fourteen subjects (5 women and 9 men) with unilateral vocal fold paresis (9 with right vocal fold paresis and 5 with left vocal fold paresis) were included in the study. The mean age of the group was 57.8 ±19.0 years (32-83 years). All of the injection laryngoplasties were performed transorally, under local anaesthesia. The injection material was calcium hydroxylapatite. Before and 1, 3 and 6 months after the procedure the following variables were evaluated: voice perception, videostroboscopy, acoustic analysis, aerodynamic evaluation, and the subjective rating of the voice quality by the patient. After injection laryngoplasty, complete glottal closure was achieved or there was a significant improvement in the glottal closure of each subject. We noted great improvement in the post-injection objective and subjective voice outcomes and patients reported improvement in the voice-related quality of life. The transoral approach for injection laryngoplasty under local anaesthesia is an effective and safe way to treat incomplete glottal closure in patients with UVFP. The transoral approach is an efficient alternative to other surgical techniques used for vocal fold injection.

Robust signal selection for lineair prediction analysis of voiced speech

NARCIS (Netherlands)

Ma, C.; Kamp, Y.; Willems, L.F.

1993-01-01

This paper investigates a weighted LPC analysis of voiced speech. In view of the speech production model, the weighting function is either chosen to be the short-time energy function of the preemphasized speech sample sequence with certain delays or is obtained by thresholding the short-time energy
Analyzing the mediated voice - a datasession

DEFF Research Database (Denmark)

Lawaetz, Anna

Broadcasted voices are technologically manipulated. In order to achieve a certain autencity or sound of “reality” paradoxically the voices are filtered and trained in order to reach the listeners. This “mis-en-scene” is important knowledge when it comes to the development of a consistent method o...... of analysis of the mediated voice...
Acoustic Emission Analysis Applet (AEAA) Software

Science.gov (United States)

Nichols, Charles T.; Roth, Don J.

2013-01-01

NASA Glenn Research and NASA White Sands Test Facility have developed software supporting an automated pressure vessel structural health monitoring (SHM) system based on acoustic emissions (AE). The software, referred to as the Acoustic Emission Analysis Applet (AEAA), provides analysts with a tool that can interrogate data collected on Digital Wave Corp. and Physical Acoustics Corp. software using a wide spectrum of powerful filters and charts. This software can be made to work with any data once the data format is known. The applet will compute basic AE statistics, and statistics as a function of time and pressure (see figure). AEAA provides value added beyond the analysis provided by the respective vendors' analysis software. The software can handle data sets of unlimited size. A wide variety of government and commercial applications could benefit from this technology, notably requalification and usage tests for compressed gas and hydrogen-fueled vehicles. Future enhancements will add features similar to a "check engine" light on a vehicle. Once installed, the system will ultimately be used to alert International Space Station crewmembers to critical structural instabilities, but will have little impact to missions otherwise. Diagnostic information could then be transmitted to experienced technicians on the ground in a timely manner to determine whether pressure vessels have been impacted, are structurally unsound, or can be safely used to complete the mission.
Modification of computational auditory scene analysis (CASA) for noise-robust acoustic feature

Science.gov (United States)

Kwon, Minseok

While there have been many attempts to mitigate interferences of background noise, the performance of automatic speech recognition (ASR) still can be deteriorated by various factors with ease. However, normal hearing listeners can accurately perceive sounds of their interests, which is believed to be a result of Auditory Scene Analysis (ASA). As a first attempt, the simulation of the human auditory processing, called computational auditory scene analysis (CASA), was fulfilled through physiological and psychological investigations of ASA. CASA comprised of Zilany-Bruce auditory model, followed by tracking fundamental frequency for voice segmentation and detecting pairs of onset/offset at each characteristic frequency (CF) for unvoiced segmentation. The resulting Time-Frequency (T-F) representation of acoustic stimulation was converted into acoustic feature, gammachirp-tone frequency cepstral coefficients (GFCC). 11 keywords with various environmental conditions are used and the robustness of GFCC was evaluated by spectral distance (SD) and dynamic time warping distance (DTW). In "clean" and "noisy" conditions, the application of CASA generally improved noise robustness of the acoustic feature compared to a conventional method with or without noise suppression using MMSE estimator. The intial study, however, not only showed the noise-type dependency at low SNR, but also called the evaluation methods in question. Some modifications were made to capture better spectral continuity from an acoustic feature matrix, to obtain faster processing speed, and to describe the human auditory system more precisely. The proposed framework includes: 1) multi-scale integration to capture more accurate continuity in feature extraction, 2) contrast enhancement (CE) of each CF by competition with neighboring frequency bands, and 3) auditory model modifications. The model modifications contain the introduction of higher Q factor, middle ear filter more analogous to human auditory system
Effects of flow gradients on directional radiation of human voice.

Science.gov (United States)

Pulkki, Ville; Lähivaara, Timo; Huhtakallio, Ilkka

2018-02-01

In voice communication in windy outdoor conditions, complex velocity gradients appear in the flow field around the source, the receiver, and also in the atmosphere. It is commonly known that voice emanates stronger towards the downstream direction when compared with the upstream direction. In literature, the atmospheric effects are used to explain the stronger emanation in the downstream direction. This work shows that the wind also has an effect to the directivity of voice also favouring the downstream direction. The effect is addressed by measurements and simulations. Laboratory measurements are conducted by using a large pendulum with a loudspeaker mimicking the human head, whereas practical measurements utilizing the human voice are realized by placing a subject through the roof window of a moving car. The measurements and a simulation indicate congruent results in the speech frequency range: When the source faces the downstream direction, stronger radiation coinciding with the wind direction is observed, and when it faces the upstream direction, radiation is not affected notably. The simulated flow gradients show a wake region in the downstream direction, and the simulated acoustic field in the flow show that the region causes a wave-guide effect focusing the sound in the direction.
Vocal therapy of hyperkinetic dysphonia.

Science.gov (United States)

Mumović, Gordana; Veselinović, Mila; Arbutina, Tanja; Škrbić, Renata

2014-01-01

Hyperkinetic (hyperfunctional) dysphonia is a common pathology. The disorder is often found in vocal professionals faced with high vocal requirements. The objective of this study was to evaluate the effects of vocal therapy on voice condition characterized by hyperkinetic dysphonia with prenodular lesions and soft nodules. The study included 100 adult patients and 27 children aged 4-16 years with prenodular lesions and soft nodules. A subjective acoustic analysis using the GIRBAS scale was performed prior to and after vocal therapy. Twenty adult patients and 10 children underwent objective acoustic analysis including several acoustic parameters. Pathological vocal qualities (hoarse, harsh and breathy voice) were also obtained by computer analysis. The subjective acoustic analysis revealed a significant (pvocal treatment in adults and children. After treatment, all levels of dysphonia were lowered in 85% (85/100) of adult patients and 29% (29/100) had a normal voice. Before vocal therapy 9 children had severe, 13 had moderate and 8 slight dysphonia. After vocal therapy only 1 child had severe dysphonia, 7 had moderate, 10 had slight levels of dysphonia and 9 were without voice disorder. The objective acoustic analysis in adults revealed a significant improvement (p≤0.025) in all dysphonia parameters except SD FO and jitter %. In children, the acoustic parameters SD FO, jitter % and NNE (normal noise energy) were significantly improved (p=0.003-0.03). Pathological voice qualities were also improved in adults and children (pVocal therapy effectively improves the voice in hyperkinetic dysphonia with prenodular lesions and soft nodules in both adults and children, affectinq diverse acoustic parameters.
The Role of Occupational Voice Demand and Patient-Rated Impairment in Predicting Voice Therapy Adherence.

Science.gov (United States)

Ebersole, Barbara; Soni, Resha S; Moran, Kathleen; Lango, Miriam; Devarajan, Karthik; Jamal, Nausheen

2018-05-01

Examine the relationship among the severity of patient-perceived voice impairment, perceptual dysphonia severity, occupational voice demand, and voice therapy adherence. Identify clinical predictors of increased risk for therapy nonadherence. A retrospective cohort study of patients presenting with a chief complaint of persistent dysphonia at an interdisciplinary voice center was done. The Voice Handicap Index-10 (VHI-10) and the Voice-Related Quality of Life (V-RQOL) survey scores, clinician rating of dysphonia severity using the Grade score from the Grade, Roughness Breathiness, Asthenia, and Strain scale, occupational voice demand, and patient demographics were tested for associations with therapy adherence, defined as completion of the treatment plan. Classification and Regression Tree (CART) analysis was performed to establish thresholds for nonadherence risk. Of 166 patients evaluated, 111 were recommended for voice therapy. The therapy nonadherence rate was 56%. Occupational voice demand category, VHI-10, and V-RQOL scores were the only factors significantly correlated with therapy adherence (P demand are significantly more likely to be nonadherent with therapy than those with high occupational voice demand (P 40 is a significant cutoff point for predicting therapy nonadherence (P demand and patient perception of impairment are significantly and independently correlated with therapy adherence. A VHI-10 score of ≤9 or a V-RQOL score of >40 is a significant cutoff point for predicting nonadherence risk. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Phonomicrosurgery in Vocal Fold Nodules: Quantification of Outcomes in Professional and Non-Professional Voice Users.

Science.gov (United States)

Caffier, Philipp P; Salmen, Tatjana; Ermakova, Tatiana; Forbes, Eleanor; Ko, Seo-Rin; Song, Wen; Gross, Manfred; Nawka, Tadeus

2017-12-01

There are few data demonstrating the specific extent to which surgical intervention for vocal fold nodules (VFN) improves vocal function in professional (PVU) and non-professional voice users (NVU). The objective of this study was to compare and quantify results after phonomicrosurgery for VFN in these patient groups. In a prospective clinical study, surgery was performed via microlaryngoscopy in 37 female patients with chronic VFN manifestations (38±12 yrs, mean±SD). Pre- and postoperative evaluations of treatment efficacy comprised videolaryngostroboscopy, auditory-perceptual voice assessment, voice range profile (VRP), acoustic-aerodynamic analysis, and voice handicap index (VHI-9i). The dysphonia severity index (DSI) was compared with the vocal extent measure (VEM). PVU (n=24) and NVU (n=13) showed comparable laryngeal findings and levels of suffering (VHI-9i 16±7 vs 17±8), but PVU had a better pretherapeutic vocal range (26.8±7.4 vs 17.7±5.1 semitones, p<0.001) and vocal capacity (VEM 106±18 vs 74±29, p<0.01). Three months postoperatively, all patients had straight vocal fold edges, complete glottal closure, and recovered mucosal wave propagation. The mean VHI-9i score decreased by 8±6 points. DSI increased from 4.0±2.4 to 5.5±2.4, and VEM from 95±27 to 108±23 (p<0.001). Both parameters correlated significantly (rs=0.82). The average vocal range increased by 4.1±5.3 semitones, and the mean speaking pitch lowered by 0.5±1.4 semitones. These results confirm that phonomicrosurgery for VFN is a safe therapy for voice improvement in both PVU and NVU who do not respond to voice therapy alone. Top-level artistic capabilities in PVU were restored, but numeric changes of most vocal parameters were considerably larger in NVU.
Foetal response to music and voice.

Science.gov (United States)

Al-Qahtani, Noura H

2005-10-01

To examine whether prenatal exposure to music and voice alters foetal behaviour and whether foetal response to music differs from human voice. A prospective observational study was conducted in 20 normal term pregnant mothers. Ten foetuses were exposed to music and voice for 15 s at different sound pressure levels to find out the optimal setting for the auditory stimulation. Music, voice and sham were played to another 10 foetuses via a headphone on the maternal abdomen. The sound pressure level was 105 db and 94 db for music and voice, respectively. Computerised assessment of foetal heart rate and activity were recorded. 90 actocardiograms were obtained for the whole group. One way anova followed by posthoc (Student-Newman-Keuls method) analysis was used to find if there is significant difference in foetal response to music and voice versus sham. Foetuses responded with heart rate acceleration and motor response to both music and voice. This was statistically significant compared to sham. There was no significant difference between the foetal heart rate acceleration to music and voice. Prenatal exposure to music and voice alters the foetal behaviour. No difference was detected in foetal response to music and voice.
Effects of pain on vowel production – Towards a new way of pain-level estimation based on acoustic speech-signal analyses

DEFF Research Database (Denmark)

Salinas-Ranneberg, Melissa; Niebuhr, Oliver; Kunz, Miriam

2017-01-01

, particularly in those vowels that are associated with stereotypical pain groaning. Moreover, inspections of the acoustic data beyond the measured parameters suggest that the scope of our analysis is worth being extended in future studies to include voice-quality and formant parameters. Our research has...... the potential to create new opportunities in electrical engineering and provides a basis for developing various applications in healthcare and welfare technology....
A model for treating voice disorders in school-age children within a video gaming environment.

Science.gov (United States)

King, Suzanne N; Davis, Larry; Lehman, Jeffrey J; Ruddy, Bari Hoffman

2012-09-01

Clinicians use a variety of approaches to motivate children with hyperfunctional voice disorders to comply with voice therapy in a therapeutic session and improve the motivation of children to practice home-based exercises. Utilization of current entertainment technology in such approaches may improve participation and motivation in voice therapy. The purpose of this study is to test the feasibility of using an entertainment video game as a therapy device. Prospective cohort and case-control study. Three levels of game testing were conducted to an existing entertainment video game for use as a voice therapy protocol. The game was tested by two computer programmers and five normal participants. The third level of testing was a case study with a child diagnosed with a hyperfunctional voice disorder. Modifications to the game were made after each feasibility test. Errors with the video game performance were modified, including the addition of a time stamp directory and game controller. Resonance voice exercises were modified to accommodate the gaming environment and unique competitive situation, including speech rate, acoustic parameters, game speed, and point allocations. The development of video games for voice therapeutic purposes attempt to replicate the high levels of engagement and motivation attained with entertainment video games, stimulating a more productive means of learning while doing. This case study found that a purely entertainment video game can be implemented as a voice therapeutic protocol based on information obtained from the case study. Copyright © 2012 The Voice Foundation. All rights reserved.
Phonological experience modulates voice discrimination: Evidence from functional brain networks analysis.

Science.gov (United States)

Hu, Xueping; Wang, Xiangpeng; Gu, Yan; Luo, Pei; Yin, Shouhang; Wang, Lijun; Fu, Chao; Qiao, Lei; Du, Yi; Chen, Antao

2017-10-01

Numerous behavioral studies have found a modulation effect of phonological experience on voice discrimination. However, the neural substrates underpinning this phenomenon are poorly understood. Here we manipulated language familiarity to test the hypothesis that phonological experience affects voice discrimination via mediating the engagement of multiple perceptual and cognitive resources. The results showed that during voice discrimination, the activation of several prefrontal regions was modulated by language familiarity. More importantly, the same effect was observed concerning the functional connectivity from the fronto-parietal network to the voice-identity network (VIN), and from the default mode network to the VIN. Our findings indicate that phonological experience could bias the recruitment of cognitive control and information retrieval/comparison processes during voice discrimination. Therefore, the study unravels the neural substrates subserving the modulation effect of phonological experience on voice discrimination, and provides new insights into studying voice discrimination from the perspective of network interactions. Copyright © 2017. Published by Elsevier Inc.
Synchronous visualization of multimodal measurements on lips and glottis: comparison between brass instruments and the human voice production system.

OpenAIRE

Hézard , Thomas; FREOUR , Vincent; Causse , René; Hélie , Thomas; Scavone , Gary P.

2013-01-01

cote interne IRCAM: Hezard13a; None / None; National audience; Brass instruments and the human voice production system are both composed of a vibrating "human valve" (constriction in a pipe) coupled to an acoustic resonator: lips coupled to the brass instrument or vocal folds coupled to the vocal tract. In both cases, the aeroacoustic coupling is responsible for the self-oscillations and a large variety of regimes. Additionally, brass instruments and voice share difficulties for the...
Colour and texture associations in voice-induced synaesthesia

Directory of Open Access Journals (Sweden)

Anja eMoos

2013-09-01

Full Text Available Voice-induced synaesthesia, a form of synaesthesia in which synaesthetic perceptions are induced by the sounds of people’s voices, appears to be relatively rare and has not been systematically studied. In this study we investigated the synaesthetic colour and visual texture perceptions experienced in response to different types of voice quality (e.g. nasal, whisper, falsetto. Experiences of three different groups – self-reported voice synaesthetes, phoneticians and controls – were compared using both qualitative and quantitative analysis in a study conducted online. Whilst, in the qualitative analysis, synaesthetes used more colour and texture terms to describe voices than either phoneticians or controls, only weak differences, and many similarities, between groups were found in the quantitative analysis. Notable consistent results between groups were the matching of higher speech fundamental frequencies with lighter and redder colours, the matching of whispery voices with smoke-like textures and the matching of harsh and creaky voices with textures resembling dry cracked soil. These data are discussed in the light of current thinking about definitions and categorizations of synaesthesia, especially in cases where individuals apparently have a range of different synaesthetic inducers.
Color and texture associations in voice-induced synesthesia

Science.gov (United States)

Moos, Anja; Simmons, David; Simner, Julia; Smith, Rachel

2013-01-01

Voice-induced synesthesia, a form of synesthesia in which synesthetic perceptions are induced by the sounds of people's voices, appears to be relatively rare and has not been systematically studied. In this study we investigated the synesthetic color and visual texture perceptions experienced in response to different types of “voice quality” (e.g., nasal, whisper, falsetto). Experiences of three different groups—self-reported voice synesthetes, phoneticians, and controls—were compared using both qualitative and quantitative analysis in a study conducted online. Whilst, in the qualitative analysis, synesthetes used more color and texture terms to describe voices than either phoneticians or controls, only weak differences, and many similarities, between groups were found in the quantitative analysis. Notable consistent results between groups were the matching of higher speech fundamental frequencies with lighter and redder colors, the matching of “whispery” voices with smoke-like textures, and the matching of “harsh” and “creaky” voices with textures resembling dry cracked soil. These data are discussed in the light of current thinking about definitions and categorizations of synesthesia, especially in cases where individuals apparently have a range of different synesthetic inducers. PMID:24032023
Influences of Fundamental Frequency, Formant Frequencies, Aperiodicity, and Spectrum Level on the Perception of Voice Gender

Science.gov (United States)

Skuk, Verena G.; Schweinberger, Stefan R.

2014-01-01

Purpose: To determine the relative importance of acoustic parameters (fundamental frequency [F0], formant frequencies [FFs], aperiodicity, and spectrum level [SL]) on voice gender perception, the authors used a novel parameter-morphing approach that, unlike spectral envelope shifting, allows the application of nonuniform scale factors to transform…
Acoustic and Perceptual Analyses of Adductor Spasmodic Dysphonia in Mandarin-speaking Chinese.

Science.gov (United States)

Chen, Zhipeng; Li, Jingyuan; Ren, Qingyi; Ge, Pingjiang

2018-02-12

The objective of this study was to examine the perceptual structure and acoustic characteristics of speech of patients with adductor spasmodic dysphonia (ADSD) in Mandarin. Case-Control Study MATERIALS AND METHODS: For the estimation of dysphonia level, perceptual and acoustic analysis were used for patients with ADSD (N = 20) and the control group (N = 20) that are Mandarin-Chinese speakers. For both subgroups, a sustained vowel and connected speech samples were obtained. The difference of perceptual and acoustic parameters between the two subgroups was assessed and analyzed. For acoustic assessment, the percentage of phonatory breaks (PBs) of connected reading and the percentage of aperiodic segments and frequency shifts (FS) of vowel and reading in patients with ADSD were significantly worse than controls, the mean harmonics-to-noise ratio and the fundamental frequency standard deviation of vowel as well. For perceptual evaluation, the rating of speech and vowel in patients with ADSD are significantly higher than controls. The percentage of aberrant acoustic events (PB, frequency shift, and aperiodic segment) and the fundamental frequency standard deviation and mean harmonics-to-noise ratio were significantly correlated with the perceptual rating in the vowel and reading productions. The perceptual and acoustic parameters of connected vowel and reading in patients with ADSD are worse than those in normal controls, and could validly and reliably estimate dysphonia of ADSD in Mandarin-speaking Chinese. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Anti-voice adaptation suggests prototype-based coding of voice identity

Directory of Open Access Journals (Sweden)

Marianne eLatinus

2011-07-01

Full Text Available We used perceptual aftereffects induced by adaptation with anti-voice stimuli to investigate voice identity representations. Participants learned a set of voices then were tested on a voice identification task with vowel stimuli morphed between identities, after different conditions of adaptation. In Experiment 1, participants chose the identity opposite to the adapting anti-voice significantly more often than the other two identities (e.g., after being adapted to anti-A, they identified the average voice as A. In Experiment 2, participants showed a bias for identities opposite to the adaptor specifically for anti-voice, but not for non anti-voice adaptors. These results are strikingly similar to adaptation aftereffects observed for facial identity. They are compatible with a representation of individual voice identities in a multidimensional perceptual voice space referenced on a voice prototype.
Acoustic analysis of trill sounds.

Science.gov (United States)

Dhananjaya, N; Yegnanarayana, B; Bhaskararao, Peri

2012-04-01

In this paper, the acoustic-phonetic characteristics of steady apical trills--trill sounds produced by the periodic vibration of the apex of the tongue--are studied. Signal processing methods, namely, zero-frequency filtering and zero-time liftering of speech signals, are used to analyze the excitation source and the resonance characteristics of the vocal tract system, respectively. Although it is natural to expect the effect of trilling on the resonances of the vocal tract system, it is interesting to note that trilling influences the glottal source of excitation as well. The excitation characteristics derived using zero-frequency filtering of speech signals are glottal epochs, strength of impulses at the glottal epochs, and instantaneous fundamental frequency of the glottal vibration. Analysis based on zero-time liftering of speech signals is used to study the dynamic resonance characteristics of vocal tract system during the production of trill sounds. Qualitative analysis of trill sounds in different vowel contexts, and the acoustic cues that may help spotting trills in continuous speech are discussed.
Voice - How humans communicate?

Science.gov (United States)

Tiwari, Manjul; Tiwari, Maneesha

2012-01-01

Voices are important things for humans. They are the medium through which we do a lot of communicating with the outside world: our ideas, of course, and also our emotions and our personality. The voice is the very emblem of the speaker, indelibly woven into the fabric of speech. In this sense, each of our utterances of spoken language carries not only its own message but also, through accent, tone of voice and habitual voice quality it is at the same time an audible declaration of our membership of particular social regional groups, of our individual physical and psychological identity, and of our momentary mood. Voices are also one of the media through which we (successfully, most of the time) recognize other humans who are important to us-members of our family, media personalities, our friends, and enemies. Although evidence from DNA analysis is potentially vastly more eloquent in its power than evidence from voices, DNA cannot talk. It cannot be recorded planning, carrying out or confessing to a crime. It cannot be so apparently directly incriminating. As will quickly become evident, voices are extremely complex things, and some of the inherent limitations of the forensic-phonetic method are in part a consequence of the interaction between their complexity and the real world in which they are used. It is one of the aims of this article to explain how this comes about. This subject have unsolved questions, but there is no direct way to present the information that is necessary to understand how voices can be related, or not, to their owners.

A Meta-Analysis: Acoustic Measurement of Roughness and Breathiness

Science.gov (United States)

v. Latoszek, Ben Barsties; Maryn, Youri; Gerrits, Ellen; De Bodt, Marc

2018-01-01

Purpose: Over the last 5 decades, many acoustic measures have been created to measure roughness and breathiness. The aim of this study is to present a meta-analysis of correlation coefficients (r) between auditory-perceptual judgment of roughness and breathiness and various acoustic measures in both sustained vowels and continuous speech. Method:…
Sensitivity of Acoustic Resonance Properties to a Change in Volume of Piriform Sinuses

Czech Academy of Sciences Publication Activity Database

Radolf, Vojtěch

2016-01-01

Roč. 821, č. 2016 (2016), s. 671-676 ISSN 1662-7482 R&D Projects: GA ČR GPP101/12/P579 Institutional support: RVO:61388998 Keywords : piriform sinus * vocal tract model * biomechanics of voice * formant frequency Subject RIV: BI - Acoustics
Voice, Schooling, Inequality, and Scale

Science.gov (United States)

Collins, James

2013-01-01

The rich studies in this collection show that the investigation of voice requires analysis of "recognition" across layered spatial-temporal and sociolinguistic scales. I argue that the concepts of voice, recognition, and scale provide insight into contemporary educational inequality and that their study benefits, in turn, from paying attention to…
Eletromiografia laríngea e análise vocal em pacientes com Mal de Parkinson: estudo comparativo Laryngeal electromyography and acoustic voice analysis in Parkinson's disease: a comparative study

Directory of Open Access Journals (Sweden)

Ana Paula Zarzur

2010-02-01

. RESULTS: The main electromyographic pattern observed in the PD group was rest hypertonicity meaning that patients with PD presented with spontaneous intrinsic laryngeal muscle activity during voice rest, which occurred in 73% of the individuals. Not a case of laryngeal tremor was detected by electromyography, although vocal tremor was detected by VOXMETRIA in 69.5% of the individuals and in 61% of them by perceptive-auditive analysis. CONCLUSION: Vocal tremor was the main acoustic change in the PD group, with no correlation to LEMG findings.
Auditory vocal analysis and factors associated with voice disorders among teachers.

Science.gov (United States)

de Ceballos, Albanita Gomes da Costa; Carvalho, Fernando Martins; de Araújo, Tânia Maria; Dos Reis, Eduardo José Farias Borges

2011-06-01

Teachers are professionals who demand much of their voices and, consequently, present a high risk of developing vocal disorders during the course of employment. To identify factors associated with vocal disorders among teachers. An exploratory cross-sectional study, which investigated 476 teachers in primary and secondary schools in the city of Salvador, Bahia. Teachers answered a questionnaire and were submitted to auditory vocal analysis. The GRBAS was used for the diagnosis of vocal disorders. The study population comprised 82.8% women, teachers with an average age of 40.7 years, teachers with higher education (88.4%), with an average workday of 38 hours per week, average 11.5 years of professional practice and average monthly income of R$1.817.18. The prevalence of voice disorders was 53.6%. (255 teachers). The bivariate analysis showed statistically significant associations between vocal disorders and age above 40 years (PR = 1.83; 95% CI; 1.27-2.64), family history of dysphonia (PR = 1.72; 95% CI; 1.06-2.80), over 20 hours of weekly working hours (PR = 1.66; 95% CI; 1.09-2.52) and presence of chalk dust in the classroom (PR = 1.70; 95% CI; 1.14-2.53). The study concluded that teachers, 40 years old and over, with a family history of dysphonia, working over 20 hours weekly, and teaching in classrooms with chalk dust are more likely to develop voice disorders than others.
Spatio-Temporal Analysis of Urban Acoustic Environments with Binaural Psycho-Acoustical Considerations for IoT-Based Applications.

Science.gov (United States)

Segura-Garcia, Jaume; Navarro-Ruiz, Juan Miguel; Perez-Solano, Juan J; Montoya-Belmonte, Jose; Felici-Castell, Santiago; Cobos, Maximo; Torres-Aranda, Ana M

2018-02-26

Sound pleasantness or annoyance perceived in urban soundscapes is a major concern in environmental acoustics. Binaural psychoacoustic parameters are helpful to describe generic acoustic environments, as it is stated within the ISO 12913 framework. In this paper, the application of a Wireless Acoustic Sensor Network (WASN) to evaluate the spatial distribution and the evolution of urban acoustic environments is described. Two experiments are presented using an indoor and an outdoor deployment of a WASN with several nodes using an Internet of Things (IoT) environment to collect audio data and calculate meaningful parameters such as the sound pressure level, binaural loudness and binaural sharpness. A chunk of audio is recorded in each node periodically with a microphone array and the binaural rendering is conducted by exploiting the estimated directional characteristics of the incoming sound by means of DOA estimation. Each node computes the parameters in a different location and sends the values to a cloud-based broker structure that allows spatial statistical analysis through Kriging techniques. A cross-validation analysis is also performed to confirm the usefulness of the proposed system.
The impact of preventive voice care programs for training teachers: a longitudinal study.

Science.gov (United States)

Duffy, Orla M; Hazlett, Diane E

2004-03-01

The teaching profession puts vocal health at a higher risk than other professions, causing what is referred to as "occupational dysphonia." There is a need for primary prevention of "occupational dysphonia" among the teaching profession, where good vocal health is promoted before a problem occurs. To investigate the primary prevention of occupational dysphonia among teachers, this study uses a sample population of 55 training teachers, in the postgraduate certificate of education (PGCE) course at the University of Ulster, Northern Ireland, who were randomly assigned to three training groups: control, indirect, and direct. The vocal performance of the three groups was measured at two points over the year of the PGCE course: first before any teaching or training began, and again after the first teaching practice. The training for the indirect and direct groups was provided before the teaching practices. Acoustic and self-perceptual measurements were used to assess the multidimensional outcomes. The results demonstrate interesting trends, that although not found to be significant, are approaching significance. Their voices will be reevaluated at a third point of measurement. The acoustic measurement reflects deterioration from time 1 to time 2 for the control group, improvement for the direct group, and no change for the indirect group, indicating that the training has proved beneficial. The self-rating scores vary in agreement with the acoustic results, presenting interesting findings. The findings of this study will be of benefit to teachers, their educators, voice therapists, health promoters, and human resource personnel.
Comparação entre as análises auditiva e acústica nas disartrias Comparison between auditory-perceptual and acoustic analyses in dysarthrias

Directory of Open Access Journals (Sweden)

Karin Zazo Ortiz

2008-01-01

, loudness (adequate, decreased or increased, pitch (adequate, low or high, vocal attack (isochronic, sudden or breathy, and voice stability (stable or unstable. Acoustic analyses were made with GRAM 5.1.7 Program that considered voice quality and spectrographic tracing, and Vox Metria Program to obtain objective measures. RESULTS: The comparison between auditory-perceptual and acoustic data showed no correlation for all the parameters analyzed. It was found a significant difference between breathiness and shimmer alteration (p=0.048, and between breathiness and harmonics definition (p=0.040, evidencing correlation between noise presence during emission and breathiness. CONCLUSION: Acoustic analysis associated to auditory-perceptual analysis provided different but complementary data, helping the clinical diagnosis of dysarthias.
Perceptual and Acoustic Reliability Estimates for the Speech Disorders Classification System (SDCS)

Science.gov (United States)

Shriberg, Lawrence D.; Fourakis, Marios; Hall, Sheryl D.; Karlsson, Heather B.; Lohmeier, Heather L.; McSweeny, Jane L.; Potter, Nancy L.; Scheer-Cohen, Alison R.; Strand, Edythe A.; Tilkens, Christie M.; Wilson, David L.

2010-01-01

A companion paper describes three extensions to a classification system for paediatric speech sound disorders termed the Speech Disorders Classification System (SDCS). The SDCS uses perceptual and acoustic data reduction methods to obtain information on a speaker's speech, prosody, and voice. The present paper provides reliability estimates for…
Atypical cry acoustics in 6-month-old infants at risk for autism spectrum disorder.

Science.gov (United States)

Sheinkopf, Stephen J; Iverson, Jana M; Rinaldi, Melissa L; Lester, Barry M

2012-10-01

This study examined differences in acoustic characteristics of infant cries in a sample of babies at risk for autism and a low-risk comparison group. Cry samples derived from vocal recordings of 6-month-old infants at risk for autism spectrum disorder (ASD; n = 21) and low-risk infants (n = 18) were subjected to acoustic analyses using analysis software designed for this purpose. Cries were categorized as either pain-related or non-pain-related based on videotape coding. At-risk infants produced pain-related cries with higher and more variable fundamental frequency (F (0) ) than low-risk infants. At-risk infants later classified with ASD at 36 months had among the highest F (0) values for both types of cries and produced cries that were more poorly phonated than those of nonautistic infants, reflecting cries that were less likely to be produced in a voiced mode. These results provide preliminary evidence that disruptions in cry acoustics may be part of an atypical vocal signature of autism in early life. © 2012 International Society for Autism Research, Wiley Periodicals, Inc.
How do you say ‘hello’? Personality impressions from brief novel voices

OpenAIRE

McAleer, Phil; Todorov, Alexander; Belin, Pascal

2014-01-01

On hearing a novel voice, listeners readily form personality impressions of that speaker. Accurate or not, these impressions are known to affect subsequent interactions; yet the underlying psychological and acoustical bases remain poorly understood. Furthermore, hitherto studies have focussed on extended speech as opposed to analysing the instantaneous impressions we obtain from first experience. In this paper, through a mass online rating experiment, 320 participants rated 64 sub-second voca...
Analysis of aspects of quality of life in teachers' voice after discharged: longitudinal study.

Science.gov (United States)

Ferreira, Josiane Mendes; Campos, Nathália Ferreira; Bassi, Iara Barreto; Santos, Marco Aurélio Rocha; Teixeira, Letícia Caldas; Gama, Ana Cristina Côrtes

2013-01-01

To evaluate the long-term effects of voice therapy on the life quality of teachers who were discharged or abandoned the voice therapy for dysphonia. This was a longitudinal study based on analysis of assessments with teachers of municipal schools in Belo Horizonte, who were referred to voice therapy and were discharged or abandoned the speech-language therapy for more than six months. A total of 33 teachers in the discharged group and 20 teachers in the abandoned group were contacted by phone and invited to participate in the study by answering the Voice activity and participation profile, which was forwarded to the researchers and sent via letter. At the moment of the pre speech therapy, the discharged and abandoned groups were homogeneous, except in relation to daily communication parameter. Comparing the discharged group in the pre and post speech-language therapy, it was showed improvements in social communication parameter as well as in the total score. The discharged group presented worsening in self-perception parameter when comparing the average values in the post therapy and current moments, and the group abandoned presented worsening in work, social communication and total score when comparing to the average values in the pre therapy and current moments. The discharged and abandoned groups differ in the present moment in all investigated parameters. Speech-language therapy for dysphonia have long term positive effects on life quality and voice of teachers who were soon discharged from the therapy and in a period of two years on average. Teachers who have abandoned treatment and did not obtain improvement in the voice showed negative impact in life quality and voice in a time of 2 years and 2 months on average.
Audio-visual identification of place of articulation and voicing in white and babble noise.

Science.gov (United States)

Alm, Magnus; Behne, Dawn M; Wang, Yue; Eg, Ragnhild

2009-07-01

Research shows that noise and phonetic attributes influence the degree to which auditory and visual modalities are used in audio-visual speech perception (AVSP). Research has, however, mainly focused on white noise and single phonetic attributes, thus neglecting the more common babble noise and possible interactions between phonetic attributes. This study explores whether white and babble noise differentially influence AVSP and whether these differences depend on phonetic attributes. White and babble noise of 0 and -12 dB signal-to-noise ratio were added to congruent and incongruent audio-visual stop consonant-vowel stimuli. The audio (A) and video (V) of incongruent stimuli differed either in place of articulation (POA) or voicing. Responses from 15 young adults show that, compared to white noise, babble resulted in more audio responses for POA stimuli, and fewer for voicing stimuli. Voiced syllables received more audio responses than voiceless syllables. Results can be attributed to discrepancies in the acoustic spectra of both the noise and speech target. Voiced consonants may be more auditorily salient than voiceless consonants which are more spectrally similar to white noise. Visual cues contribute to identification of voicing, but only if the POA is visually salient and auditorily susceptible to the noise type.
Vocal therapy of hyperkinetic dysphonia

Directory of Open Access Journals (Sweden)

Mumović Gordana

2014-01-01

Full Text Available Introduction. Hyperkinetic (hyperfunctional dysphonia is a common pathology. The disorder is often found in vocal professionals faced with high vocal requirements. Objective. The objective of this study was to evaluate the effects of vocal therapy on voice condition characterized by hyperkinetic dysphonia with prenodular lesions and soft nodules. Methods. The study included 100 adult patients and 27 children aged 4-16 years with prenodular lesions and soft nodules. A subjective acoustic analysis using the GIRBAS scale was performed prior to and after vocal therapy. Twenty adult patients and 10 children underwent objective acoustic analysis including several acoustic parameters. Pathological vocal qualities (hoarse, harsh and breathy voice were also obtained by computer analysis. Results. The subjective acoustic analysis revealed a significant (p<0.01 reduction in all dysphonia parameters after vocal treatment in adults and children. After treatment, all levels of dysphonia were lowered in 85% (85/100 of adult patients and 29% (29/100 had a normal voice. Before vocal therapy 9 children had severe, 13 had moderate and 8 slight dysphonia. After vocal therapy only 1 child had severe dysphonia, 7 had moderate, 10 had slight levels of dysphonia and 9 were without voice disorder. The objective acoustic analysis in adults revealed a significant improvement (p≤0.025 in all dysphonia parameters except SD F0 and jitter %. In children, the acoustic parameters SD F0, jitter % and NNE (normal noise energy were significantly improved (p=0.003-0.03. Pathological voice qualities were also improved in adults and children (p<0.05. Conclusion. Vocal therapy effectively improves the voice in hyperkinetic dysphonia with prenodular lesions and soft nodules in both adults and children, affecting diverse acoustic parameters.
The Voice of Emotion: Acoustic Properties of Six Emotional Expressions.

Science.gov (United States)

Baldwin, Carol May

Studies in the perceptual identification of emotional states suggested that listeners seemed to depend on a limited set of vocal cues to distinguish among emotions. Linguistics and speech science literatures have indicated that this small set of cues included intensity, fundamental frequency, and temporal properties such as speech rate and duration. Little research has been done, however, to validate these cues in the production of emotional speech, or to determine if specific dimensions of each cue are associated with the production of a particular emotion for a variety of speakers. This study addressed deficiencies in understanding of the acoustical properties of duration and intensity as components of emotional speech by means of speech science instrumentation. Acoustic data were conveyed in a brief sentence spoken by twelve English speaking adult male and female subjects, half with dramatic training, and half without such training. Simulated expressions included: happiness, surprise, sadness, fear, anger, and disgust. The study demonstrated that the acoustic property of mean intensity served as an important cue for a vocal taxonomy. Overall duration was rejected as an element for a general taxonomy due to interactions involving gender and role. Findings suggested a gender-related taxonomy, however, based on differences in the ways in which men and women use the duration cue in their emotional expressions. Results also indicated that speaker training may influence greater use of the duration cue in expressions of emotion, particularly for male actors. Discussion of these results provided linkages to (1) practical management of emotional interactions in clinical and interpersonal environments, (2) implications for differences in the ways in which males and females may be socialized to express emotions, and (3) guidelines for future perceptual studies of emotional sensitivity.
Dynamic response analysis of an aircraft structure under thermal-acoustic loads

International Nuclear Information System (INIS)

Cheng, H; Li, H B; Zhang, W; Wu, Z Q; Liu, B R

2016-01-01

Future hypersonic aircraft will be exposed to extreme combined environments includes large magnitude thermal and acoustic loads. It presents a significant challenge for the integrity of these vehicles. Thermal-acoustic test is used to test structures for dynamic response and sonic fatigue due to combined loads. In this research, the numerical simulation process for the thermal acoustic test is presented, and the effects of thermal loads on vibro-acoustic response are investigated. To simulate the radiation heating system, Monte Carlo theory and thermal network theory was used to calculate the temperature distribution. Considering the thermal stress, the high temperature modal parameters are obtained with structural finite element methods. Based on acoustic finite element, modal-based vibro-acoustic analysis is carried out to compute structural responses. These researches are very vital to optimum thermal-acoustic test and structure designs for future hypersonic vehicles structure (paper)
Spatio-Temporal Analysis of Urban Acoustic Environments with Binaural Psycho-Acoustical Considerations for IoT-Based Applications

Directory of Open Access Journals (Sweden)

Jaume Segura-Garcia

2018-02-01

Full Text Available Sound pleasantness or annoyance perceived in urban soundscapes is a major concern in environmental acoustics. Binaural psychoacoustic parameters are helpful to describe generic acoustic environments, as it is stated within the ISO 12913 framework. In this paper, the application of a Wireless Acoustic Sensor Network (WASN to evaluate the spatial distribution and the evolution of urban acoustic environments is described. Two experiments are presented using an indoor and an outdoor deployment of a WASN with several nodes using an Internet of Things (IoT environment to collect audio data and calculate meaningful parameters such as the sound pressure level, binaural loudness and binaural sharpness. A chunk of audio is recorded in each node periodically with a microphone array and the binaural rendering is conducted by exploiting the estimated directional characteristics of the incoming sound by means of DOA estimation. Each node computes the parameters in a different location and sends the values to a cloud-based broker structure that allows spatial statistical analysis through Kriging techniques. A cross-validation analysis is also performed to confirm the usefulness of the proposed system.
Relationship between acoustic voice onset and offset and selected instances of oscillatory onset and offset in young healthy males and females

Science.gov (United States)

Patel, Rita; Forrest, Karen; Hedges, Drew

2016-01-01

Objective To investigate the relationship between (1) onset of the acoustic signal and pre-phonatory phases associated with oscillatory onset and (2) offset of the acoustic signal with the post-phonatory events associated with oscillatory offset across vocally healthy adults. Subjects and Methods High-speed videoendoscopy was captured simultaneously with the acoustic signal during repeated production of /hi.hi.hi/ at typical pitch and loudness from 56 vocally healthy adults (age 20–42 years; 21 male, 35 female). The relationship between the acoustic sound pressure signal and oscillatory onset /offset events from the glottal area waveforms (GAW), were statistically investigated using a multivariate linear regression analysis. Results The onset of the acoustic signal (X1a) is a significant predictor of the onset of first oscillations (X1g) and onset of sustained oscillations (X2g). X1a as well as gender are significant predictors of the first instance of medial contact (X1.5g). The offset of the acoustic signal (X2a) is a significant predictor of the first instance of oscillatory offset (X3g), first instance of incomplete glottal closure (X3.5g), and cessation of vocal fold motion (X4g). Conclusions The acoustic signal onset is closely related to the first medial contact of the vocal folds but the latency between these events is longer for females compared to males. The offset of the acoustic signal occurs immediately after incomplete glottal adduction. The emerging normative group latencies between the onset/offset of the acoustic and the GAW from this study appear promising for future investigations. PMID:27769696
Listening to Schneiderian Voices: A Novel Phenomenological Analysis.

Science.gov (United States)

Rosen, Cherise; Chase, Kayla A; Jones, Nev; Grossman, Linda S; Gin, Hannah; Sharma, Rajiv P

This paper reports on analyses designed to elucidate phenomenological characteristics, content and experience specifically targeting participants with Schneiderian voices conversing/commenting (VC) while exploring differences in clinical presentation and quality of life compared to those with voices not conversing (VNC). This mixed-method investigation of Schneiderian voices included standardized clinical metrics and exploratory phenomenological interviews designed to elicit in-depth information about the characteristics, content, meaning, and personification of auditory verbal hallucinations. The subjective experience shows a striking pattern of VC, as they are experienced as internal at initial onset and during the longer-term course of illness when compared to VNC. Participants in the VC group were more likely to attribute the origin of their voices to an external source such as God, telepathic communication, or mediumistic sources. VC and VNC were described as characterological entities that were distinct from self (I/we vs. you). We also found an association between VC and the positive, cognitive, and depression symptom profile. However, we did not find a significant group difference in overall quality of life. The clinical portrait of VC is complex, multisensory, and distinct, and suggests a need for further research into the biopsychosocial interface between subjective experience, socioenvironmental constraints, individual psychology, and the biological architecture of intersecting symptoms. © 2016 S. Karger AG, Basel.
A Voice Processing Technology for Rural Specific Context

Science.gov (United States)

He, Zhiyong; Zhang, Zhengguang; Zhao, Chunshen

Durian the promotion and applications of rural information, different geographical dialect voice interaction is a very complex issue. Through in-depth analysis of TTS core technologies, this paper presents the methods of intelligent segmentation, word segmentation algorithm and intelligent voice thesaurus construction in the different dialects context. And then COM based development methodology for specific context voice processing system implementation and programming method. The method has a certain reference value for the rural dialect and voice processing applications.

Assessment of vocal intensity in lecturers depending on acoustic properties of lecture rooms

Directory of Open Access Journals (Sweden)

Witold Mikulski

2015-08-01

Full Text Available Background: Lombard’s effect increases the level of vocal intensity in the environment, in which noise occurs. This article presents the results of the author’s own study of vocal intensity level and A-weighted sound pressure level of background noise during normal lectures. The aim of the study was to define whether above-mentioned parameters depend on acoustic properties of rooms (classrooms or lecture rooms and to define how many lectors speak with raised voice. Material and Methods: The study was performed in a group of 50 teachers and lecturers in 10 classrooms with cubature of 160–430 m3 and reverberation time of 0.37–1.3 s (group A consisted of 3 rooms which fulfilled, group B consisted of 3 rooms which almost fulfilled and group C consisted of 4 rooms which did not fulfill criteria based on reverberation time (maximum permissible value is 0.6–0.8 s according to PN-B-02151-4:2015. Criteria of raising voice were based on vocal intensity level (maximum value: 65 dB according to EN ISO 9921:2003. The values of above-mentioned parameters were determined from modes of A-weighted sound pressure level distributions during lectures. Results: Great differentiation of vocal intensity level between lectors was found. In classrooms of group A lectors were not using raised voice, in group B – 21%, and in group C – 60% of lectors were using raised voice. Conclusions: It was observed that acoustic properties of classrooms (defined by reverberation time exert their effect on lecturer’s vocal intensity level (i.e., raising voice, which may contribute to the increased risk of vocal tract illnesses. The occurrence of Lombard’s effect in groups of teachers and lecturers, conducting lectures in rooms, was evidenced. Med Pr 2015;66(4:487–496
Acoustic Correlates of Compensatory Adjustments to the Glottic and Supraglottic Structures in Patients with Unilateral Vocal Fold Paralysis

Directory of Open Access Journals (Sweden)

Luis M. T. Jesus

2015-01-01

Full Text Available The goal of this study was to analyse perceptually and acoustically the voices of patients with Unilateral Vocal Fold Paralysis (UVFP and compare them to the voices of normal subjects. These voices were analysed perceptually with the GRBAS scale and acoustically using the following parameters: mean fundamental frequency (F0, standard-deviation of F0, jitter (ppq5, shimmer (apq11, mean harmonics-to-noise ratio (HNR, mean first (F1 and second (F2 formants frequency, and standard-deviation of F1 and F2 frequencies. Statistically significant differences were found in all of the perceptual parameters. Also the jitter, shimmer, HNR, standard-deviation of F0, and standard-deviation of the frequency of F2 were statistically different between groups, for both genders. In the male data differences were also found in F1 and F2 frequencies values and in the standard-deviation of the frequency of F1. This study allowed the documentation of the alterations resulting from UVFP and addressed the exploration of parameters with limited information for this pathology.
Voice Onset Time for the Word-Initial Voiceless Consonant /t/ in Japanese Spasmodic Dysphonia-A Comparison With Normal Controls.

Science.gov (United States)

Yanagida, Saori; Nishizawa, Noriko; Mizoguchi, Kenji; Hatakeyama, Hiromitsu; Fukuda, Satoshi

2015-07-01

Voice onset time (VOT) for word-initial voiceless consonants in adductor spasmodic dysphonia (ADSD) and abductor spasmodic dysphonia (ABSD) patients were measured to determine (1) which acoustic measures differed from the controls and (2) whether acoustic measures were related to the pause or silence between the test word and the preceding word. Forty-eight patients with ADSD and nine patients with ABSD, as well as 20 matched normal controls read a story in which the word "taiyo" (the sun) was repeated three times, each differentiated by the position of the word in the sentence. The target of measurement was the VOT for the word-initial voiceless consonant /t/. When the target syllable appeared in a sentence following a comma, or at the beginning of a sentence following a period, the ABSD patients' VOTs were significantly longer than those of the ADSD patients and controls. Abnormal prolongation of the VOTs was related to the pause or silence between the test word and the preceding word. VOTs in spasmodic dysphonia (SD) may vary according to the SD subtype or speaking conditions. VOT measurement was suggested to be a useful method for quantifying voice symptoms in SD. Copyright © 2015 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Acoustic and wind speed data analysis as an environmental issue

International Nuclear Information System (INIS)

Whitson, R.J.; MacKinnon, A.

1995-01-01

This paper examines how the output from a cup anemometer, used for wind speed measurement, can be recorded on magnetic tape and analysed using instrumentation normally employed to measure acoustic data. The purpose of this being to allow true simultaneous analysis of acoustic and wind speed data. NEL's NWTC (National Wind Turbine Centre) Anemometer Calibration Facility is used to compare pulsed and analogue outputs from a typical anemometer to the data obtained from a pitot/static tube for a range of different wind speeds. The usefulness of 1/24- and 1/12-octave analysis is examined and accuracy limits are derived for the 'acoustic' approach to wind speed measurement. The allowable positions for anemometer locations are also discussed with reference to currently available standards and recommended practices. (Author)
Psychological effects of dysphonia in voice professionals.

Science.gov (United States)

Salturk, Ziya; Kumral, Tolgar Lutfi; Aydoğdu, Imran; Arslanoğlu, Ahmet; Berkiten, Güler; Yildirim, Güven; Uyar, Yavuz

2015-08-01

To evaluate the psychological effects of dysphonia in voice professionals compared to non-voice professionals and in both genders. Cross-sectional analysis. Forty-eight 48 voice professionals and 52 non-voice professionals with dysphonia were included in this study. All participants underwent a complete ear, nose, and throat examination and an evaluation for pathologies that might affect vocal quality. Participants were asked to complete the Turkish versions of the Voice Handicap Index-30 (VHI-30), Perceived Stress Scale (PSS), and the Hospital Anxiety and Depression Scale (HADS). HADS scores were evaluated as HADS-A (anxiety) and HADS-D (depression). Dysphonia status was evaluated by grade, roughness, breathiness, asthenia, and strain (GRBAS) scale perceptually. The results were compared statistically. Significant differences between the two groups were evident when the VHI-30 and PSS data were compared (P = .00001 and P = .00001, respectively). However, neither HADS score (HADS-A and HADS-D) differed between groups. An analysis of the scores in terms of sex revealed that females had significantly higher PSS scores (P = .006). The GRBAS scale revealed no difference between groups (P = .819, .931, .803, .655, and .803, respectively). No between-sex differences in the VHI-30 or HADS scores were evident We found that voice professionals and females experienced more stress and were more dissatisfied with their voices. 4. © 2015 The American Laryngological, Rhinological and Otological Society, Inc.
Disability: a voice in Australian bioethics?

Science.gov (United States)

Newell, Christopher

2003-06-01

The rise of research and advocacy over the years to establish a disability voice in Australia with regard to bioethical issues is explored. This includes an analysis of some of the political processes and engagement in mainstream bioethical debate. An understanding of the politics of rejected knowledge is vital in understanding the muted disability voices in Australian bioethics and public policy. It is also suggested that the voices of those who are marginalised or oppressed in society, such as people with disability, have particular contribution to make in fostering critical bioethics.
Auditory-Perceptual and Acoustic Methods in Measuring Dysphonia Severity of Korean Speech.

Science.gov (United States)

Maryn, Youri; Kim, Hyung-Tae; Kim, Jaeock

2016-09-01

The purpose of this study was to explore the criterion-related concurrent validity of two standardized auditory-perceptual rating protocols and the Acoustic Voice Quality Index (AVQI) for measuring dysphonia severity in Korean speech. Sixty native Korean subjects with various voice disorders were asked to sustain the vowel [a:] and to read aloud the Korean text "Walk." A 3-second midvowel portion of the sustained vowel and two sentences (with 25 syllables) were edited, concatenated, and analyzed according to methods described elsewhere. From 56 participants, both continuous speech and sustained vowel recordings had sufficiently high signal-to-noise ratios (35.5 dB and 37 dB on average, respectively) and were therefore subjected to further dysphonia severity analysis with (1) "G" or Grade from the GRBAS protocol, (2) "OS" or Overall Severity from the Consensus Auditory-Perceptual Evaluation of Voice protocol, and (3) AVQI. First, high correlations were found between G and OS (rS = 0.955 for sustained vowels; rS = 0.965 for continuous speech). Second, the AVQI showed a strong correlation with G (rS = 0.911) as well as OS (rP = 0.924). These findings are in agreement with similar studies dealing with continuous speech in other languages. The present study highlights the criterion-related concurrent validity of these methods in Korean speech. Furthermore, it supports the cross-linguistic robustness of the AVQI as a valid and objective marker of overall dysphonia severity. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
[Role of aerodynamic parameters in voice function assessment].

Science.gov (United States)

Guo, Yong-qing; Lin, Sheng-zhi; Xu, Xin-lin; Zhou, Li; Zhuang, Pei-yun; Jiang, Jack J

2012-10-01

To investigate the application and significance of aerodynamic parameters in voice function assessment. The phonatory aerodynamic system (PAS) was used to collect aerodynamic parameters from subjects with normal voice, vocal fold polyp, vocal fold cyst, and vocal fold immobility. Multivariate statistical analysis was used to compare measurements across groups. Phonation threshold flow (PTF), mean flow rate (MFR), maximum phonation time (MPT), and glottal resistance (GR) in one hundred normal subjects were significantly affected by sex (P efficiency (VE) were not (P > 0.05). PTP, PTF, MFR, SGP, and MPT were significantly different between normal voice and voice disorders (P 0.05). Receiver operating characteristic (ROC) analysis found that PTP, PTF, SGP, MFR, MPT, and VE in one hundred thirteen voice dis orders had similar diagnostic utility (P aerodynamic parameters of the three degrees of voice dysfunction due to vocal cord polyps were compared and found to have no significant differences (P > 0.05). PTP, PTF, MFR, SGP and MPT in forty one patients with vocal polyps were significantly different after surgical resection of vocal cord polyps (P aerodynamic parameters can objectively and effectively evaluate the variations of vocal function, and have good auxiliary diagnostic value.
[Voice disorders in female teachers assessed by Voice Handicap Index].

Science.gov (United States)

Niebudek-Bogusz, Ewa; Kuzańska, Anna; Woźnicka, Ewelina; Sliwińska-Kowalska, Mariola

2007-01-01

The aim of this study was to assess the application of Voice Handicap Index (VHI) in the diagnosis of occupational voice disorders in female teachers. The subjective assessment of voice by VHI was performed in fifty subjects with dysphonia diagnosed in laryngovideostroboscopic examination. The control group comprised 30 women whose jobs did not involve vocal effort. The results of the total VHI score and each of its subscales: functional, emotional and physical was significantly worse in the study group than in controls (p teachers estimated their own voice problems as a moderate disability, while 12% of them reported severe voice disability. However, all non-teachers assessed their voice problems as slight, their results ranged at the lowest level of VHI score. This study confirmed that VHI as a tool for self-assessment of voice can be a significant contribution to the diagnosis of occupational dysphonia.
Vibro-acoustic modeling and analysis of a coupled acoustic system comprising a partially opened cavity coupled with a flexible plate

Science.gov (United States)

Shi, Shuangxia; Su, Zhu; Jin, Guoyong; Liu, Zhigang

2018-01-01

This paper is concerned with the modeling and solution method of a three-dimensional (3D) coupled acoustic system comprising a partially opened cavity coupled with a flexible plate and an exterior field of semi-infinite size, which is ubiquitously encountered in architectural acoustics and is a reasonable representation of many engineering occasions. A general solution method is presented to predict the dynamic behaviors of the three-dimensional (3D) acoustic coupled system, in which the displacement of the plate and the sound pressure in the cavity are respectively constructed in the form of the two-dimensional and three-dimensional modified Fourier series with several auxiliary functions introduced to ensure the uniform convergence of the solution over the entire solution domain. The effect of the opening is taken into account via the work done by the sound pressure acting at the coupling aperture that is contributed from the vibration of particles on the acoustic coupling interface and on the structural-acoustic coupling interface. Both the acoustic coupling between finite cavity and exterior field and the structural-acoustic coupling between flexible plate and interior acoustic field are considered in the vibro-acoustic modeling of the three-dimensional acoustic coupled acoustic system. The dynamic responses of the coupled structural-acoustic system are obtained using the Rayleigh-Ritz procedure based on the energy expressions for the coupled system. The accuracy and effectiveness of the proposed method are validated through numerical examples and comparison with results obtained by the boundary element analysis. Furthermore, the influence of the opening and the cavity volume on the acoustic behaviors of opened cavity system is studied.
A Wireless LAN and Voice Information System for Underground Coal Mine

Directory of Open Access Journals (Sweden)

Yu Zhang

2014-06-01

Full Text Available In this paper we constructed a wireless information system, and developed a wireless voice communication subsystem based on Wireless Local Area Networks (WLAN for underground coal mine, which employs Voice over IP (VoIP technology and Session Initiation Protocol (SIP to achieve wireless voice dispatching communications. The master control voice dispatching interface and call terminal software are also developed on the WLAN ground server side to manage and implement the voice dispatching communication. A testing system for voice communication was constructed in tunnels of an underground coal mine, which was used to actually test the wireless voice communication subsystem via a network analysis tool, named Clear Sight Analyzer. In tests, the actual flow charts of registration, call establishment and call removal were analyzed by capturing call signaling of SIP terminals, and the key performance indicators were evaluated in coal mine, including average subjective value of voice quality, packet loss rate, delay jitter, disorder packet transmission and end-to- end delay. Experimental results and analysis demonstrate that the wireless voice communication subsystem developed communicates well in underground coal mine environment, achieving the designed function of voice dispatching communication.
A numerical study on acoustic behavior in gas turbine combustor with acoustic resonator

International Nuclear Information System (INIS)

Park, I Sun; Sohn, Chae Hoon

2005-01-01

Acoustic behavior in gas turbine combustor with acoustic resonator is investigated numerically by adopting linear acoustic analysis. Helmholtz-type resonator is employed as acoustic resonator to suppress acoustic instability passively. The tuning frequency of acoustic resonator is adjusted by varying its length. Through harmonic analysis, acoustic-pressure responses of chamber to acoustic excitation are obtained and the resonant acoustic modes are identified. Acoustic damping effect of acoustic resonator is quantified by damping factor. As the tuning frequency of acoustic resonator approaches the target frequency of the resonant mode to be suppressed, mode split from the original resonant mode to lower and upper modes appears and thereby complex patterns of acoustic responses show up. Considering mode split and damping effect as a function of tuning frequency, it is desirable to make acoustic resonator tuned to broad-band frequencies near the maximum frequency of those of the possible upper modes
Perception of acoustic scale and size in musical instrument sounds.

Science.gov (United States)

van Dinther, Ralph; Patterson, Roy D

2006-10-01

There is size information in natural sounds. For example, as humans grow in height, their vocal tracts increase in length, producing a predictable decrease in the formant frequencies of speech sounds. Recent studies have shown that listeners can make fine discriminations about which of two speakers has the longer vocal tract, supporting the view that the auditory system discriminates changes on the acoustic-scale dimension. Listeners can also recognize vowels scaled well beyond the range of vocal tracts normally experienced, indicating that perception is robust to changes in acoustic scale. This paper reports two perceptual experiments designed to extend research on acoustic scale and size perception to the domain of musical sounds: The first study shows that listeners can discriminate the scale of musical instrument sounds reliably, although not quite as well as for voices. The second experiment shows that listeners can recognize the family of an instrument sound which has been modified in pitch and scale beyond the range of normal experience. We conclude that processing of acoustic scale in music perception is very similar to processing of acoustic scale in speech perception.
[Fundamental frequency analysis - a contribution to the objective examination of the speaking and singing voice (author's transl)].

Science.gov (United States)

Schultz-Coulon, H J

1975-07-01

The applicability of a newly developed fundamental frequency analyzer to diagnosis in phoniatrics is reviewed. During routine voice examination, the analyzer allows a quick and accurate measurement of fundamental frequency and sound level of the speaking voice, and of vocal range and maximum phonation time. By computing fundamental frequency histograms, the median fundamental frequency and the total pitch range can be better determined and compared. Objective studies of certain technical faculties of the singing voice, which usually are estimated subjectively by the speech therapist, may now be done by means of this analyzer. Several examples demonstrate the differences between correct and incorrect phonation. These studies compare the pitch perturbations during the crescendo and decrescendo of a swell-tone, and show typical traces of staccato, thrill and yodel. Conclusions of the study indicate that fundamental frequency analysis is a valuable supplemental method for objective voice examination.
Singing voice outcomes following singing voice therapy.

Science.gov (United States)

Dastolfo-Hromack, Christina; Thomas, Tracey L; Rosen, Clark A; Gartner-Schmidt, Jackie

2016-11-01

The objectives of this study were to describe singing voice therapy (SVT), describe referred patient characteristics, and document the outcomes of SVT. Retrospective. Records of patients receiving SVT between June 2008 and June 2013 were reviewed (n = 51). All diagnoses were included. Demographic information, number of SVT sessions, and symptom severity were retrieved from the medical record. Symptom severity was measured via the 10-item Singing Voice Handicap Index (SVHI-10). Treatment outcome was analyzed by diagnosis, history of previous training, and SVHI-10. SVHI-10 scores decreased following SVT (mean change = 11, 40% decrease) (P singing lessons (n = 10) also completed an average of three SVT sessions. Primary muscle tension dysphonia (MTD1) and benign vocal fold lesion (lesion) were the most common diagnoses. Most patients (60%) had previous vocal training. SVHI-10 decrease was not significantly different between MTD and lesion. This is the first outcome-based study of SVT in a disordered population. Diagnosis of MTD or lesion did not influence treatment outcomes. Duration of SVT was short (approximately three sessions). Voice care providers are encouraged to partner with a singing voice therapist to provide optimal care for the singing voice. This study supports the use of SVT as a tool for the treatment of singing voice disorders. 4 Laryngoscope, 126:2546-2551, 2016. © 2016 The American Laryngological, Rhinological and Otological Society, Inc.
Employee voice and engagement : Connections and consequences

NARCIS (Netherlands)

Rees, C.; Alfes, K.; Gatenby, M.

2013-01-01

This paper considers the relationship between employee voice and employee engagement. Employee perceptions of voice behaviour aimed at improving the functioning of the work group are found to have both a direct impact and an indirect impact on levels of employee engagement. Analysis of data from two
Identifying hidden voice and video streams

Science.gov (United States)

Fan, Jieyan; Wu, Dapeng; Nucci, Antonio; Keralapura, Ram; Gao, Lixin

2009-04-01

Given the rising popularity of voice and video services over the Internet, accurately identifying voice and video traffic that traverse their networks has become a critical task for Internet service providers (ISPs). As the number of proprietary applications that deliver voice and video services to end users increases over time, the search for the one methodology that can accurately detect such services while being application independent still remains open. This problem becomes even more complicated when voice and video service providers like Skype, Microsoft, and Google bundle their voice and video services with other services like file transfer and chat. For example, a bundled Skype session can contain both voice stream and file transfer stream in the same layer-3/layer-4 flow. In this context, traditional techniques to identify voice and video streams do not work. In this paper, we propose a novel self-learning classifier, called VVS-I , that detects the presence of voice and video streams in flows with minimum manual intervention. Our classifier works in two phases: training phase and detection phase. In the training phase, VVS-I first extracts the relevant features, and subsequently constructs a fingerprint of a flow using the power spectral density (PSD) analysis. In the detection phase, it compares the fingerprint of a flow to the existing fingerprints learned during the training phase, and subsequently classifies the flow. Our classifier is not only capable of detecting voice and video streams that are hidden in different flows, but is also capable of detecting different applications (like Skype, MSN, etc.) that generate these voice/video streams. We show that our classifier can achieve close to 100% detection rate while keeping the false positive rate to less that 1%.
Electrical circuit modeling and analysis of microwave acoustic interaction with biological tissues.

Science.gov (United States)

Gao, Fei; Zheng, Qian; Zheng, Yuanjin

2014-05-01

Numerical study of microwave imaging and microwave-induced thermoacoustic imaging utilizes finite difference time domain (FDTD) analysis for simulation of microwave and acoustic interaction with biological tissues, which is time consuming due to complex grid-segmentation and numerous calculations, not straightforward due to no analytical solution and physical explanation, and incompatible with hardware development requiring circuit simulator such as SPICE. In this paper, instead of conventional FDTD numerical simulation, an equivalent electrical circuit model is proposed to model the microwave acoustic interaction with biological tissues for fast simulation and quantitative analysis in both one and two dimensions (2D). The equivalent circuit of ideal point-like tissue for microwave-acoustic interaction is proposed including transmission line, voltage-controlled current source, envelop detector, and resistor-inductor-capacitor (RLC) network, to model the microwave scattering, thermal expansion, and acoustic generation. Based on which, two-port network of the point-like tissue is built and characterized using pseudo S-parameters and transducer gain. Two dimensional circuit network including acoustic scatterer and acoustic channel is also constructed to model the 2D spatial information and acoustic scattering effect in heterogeneous medium. Both FDTD simulation, circuit simulation, and experimental measurement are performed to compare the results in terms of time domain, frequency domain, and pseudo S-parameters characterization. 2D circuit network simulation is also performed under different scenarios including different sizes of tumors and the effect of acoustic scatterer. The proposed circuit model of microwave acoustic interaction with biological tissue could give good agreement with FDTD simulated and experimental measured results. The pseudo S-parameters and characteristic gain could globally evaluate the performance of tumor detection. The 2D circuit network
Interventions for preventing voice disorders in adults.

Science.gov (United States)

Ruotsalainen, J H; Sellman, J; Lehto, L; Jauhiainen, M; Verbeek, J H

2007-10-17

Poor voice quality due to a voice disorder can lead to a reduced quality of life. In occupations where voice use is substantial it can lead to periods of absence from work. To evaluate the effectiveness of interventions to prevent voice disorders in adults. We searched MEDLINE (PubMed, 1950 to 2006), EMBASE (1974 to 2006), CENTRAL (The Cochrane Library, Issue 2 2006), CINAHL (1983 to 2006), PsychINFO (1967 to 2006), Science Citation Index (1986 to 2006) and the Occupational Health databases OSH-ROM (to 2006). The date of the last search was 05/04/06. Randomised controlled clinical trials (RCTs) of interventions evaluating the effectiveness of treatments to prevent voice disorders in adults. For work-directed interventions interrupted time series and prospective cohort studies were also eligible. Two authors independently extracted data and assessed trial quality. Meta-analysis was performed where appropriate. We identified two randomised controlled trials including a total of 53 participants in intervention groups and 43 controls. One study was conducted with teachers and the other with student teachers. Both trials were poor quality. Interventions were grouped into 1) direct voice training, 2) indirect voice training and 3) direct and indirect voice training combined.1) Direct voice training: One study did not find a significant decrease of the Voice Handicap Index for direct voice training compared to no intervention.2) Indirect voice training: One study did not find a significant decrease of the Voice Handicap Index for indirect voice training when compared to no intervention.3) Direct and indirect voice training combined: One study did not find a decrease of the Voice Handicap Index for direct and indirect voice training combined when compared to no intervention. The same study did however find an improvement in maximum phonation time (Mean Difference -3.18 sec; 95 % CI -4.43 to -1.93) for direct and indirect voice training combined when compared to no
Idiopathic Parkinson's disease: vocal and quality of life analysis

Directory of Open Access Journals (Sweden)

Luiza Furtado e Silva

2012-09-01

Full Text Available OBJECTIVE: To compare voice and life quality of male patients with idiopathic Parkinson's disease, with individuals without disease (Control Group. METHODS: A cross-sectional study that evaluated the voice of individuals with Parkinson's disease, the group was composed of 27 subjects, aged from 39 to 79 years-old (average 59.96. The Control Group was matched on sex and age. Participants underwent voice recording. Perceptual evaluation was made using GRBASI scale, which considers G as the overall degree of dysphonia, R as roughness, B as breathiness, A as asthenia, S as strain and I as instability. The acoustic parameters analyzed were: fundamental frequency, jitter, shimmer, and harmonic to noise ratio (NHR. For vocal self-perception analysis, we used the Voice Related Quality of Life protocol. RESULTS: Fundamental frequency and jitter presented higher values in the Parkinson's group. NHR values were higher in the Control Group. Perceptual analysis showed a deviation ranging. The vocal disorder self-perception demonstrated a worse impact on quality of life. CONCLUSIONS: Individuals with Parkinson's disease have an altered voice quality and a negative impact on quality of life.

Acoustic Analysis of Nasal Vowels in Monguor Language

Science.gov (United States)

Zhang, Hanbin

2017-09-01

The purpose of the study is to analyze the spectrum characteristics and acoustic features for the nasal vowels [ɑ˜] and [ɔ˜] in Monguor language. On the base of acoustic parameter database of the Monguor speech, the study finds out that there are five main zero-pole pairs appearing for the nasal vowel [ɔ˜] and two zero-pole pairs appear for the nasal vowel [ɔ˜]. The results of regression analysis demonstrate that the duration of the nasal vowel [ɔ˜] or the nasal vowel [ɔ˜] can be predicted by its F1, F2 and F3 respectively.
Applicability of the Arabic version of Vocal Tract Discomfort Scale (VTDS) with student singers as professional voice users.

Science.gov (United States)

Darawsheh, Wesam B; Natour, Yaser S; Sada, Eve G

2018-07-01

This pilot study aimed to evaluate the internal consistency, convergent construct validity and criterion validity of Arabic version of the Vocal Tract Discomfort Scale (VTDS), and to investigate the correlation between the scores of the VTDS, the VHI and the acoustic measures of fundamental frequency (F0), shimmer, jitter and signal-to-noise ratio (SNR). A cross-sectional study where 97 participants participated (47 males and 50 females) (mean age 20.5 ± 2.1 years) (31 student singers and 66 other non-professional voice user students). Participants were without self-perceived voice disorders who completed the VTDS-Arab scale and the Voice Handicap Index (VHI-Arab), and recorded a vocal sample of/a:/at a comfortable level. A positive internal consistency that signifies reliability was confirmed by Cronbach's α = .884 and 0.874 for the VTDS-Arab frequency and severity subscales, respectively. A moderate positive correlation was found between the VTDS-Arab (frequency, severity, total) and the VHI-Arab total where values of Pearson's correlation coefficient were r= 0.459, 0.430 and 0.451, respectively. Weak correlations were found between all of the acoustic measures and the scores of the VTDS-Arab and VHI-Arab (total and subscales). The area under curve for the VTDS was AUC= 0.824, 0.804 and 0.817 for the VTDS frequency, VTDS severity and VTDS total, respectively. The VTDS-Arab is a valid and reliable tool in measuring vocal tract sensations and predicting the perception of vocal handicap in student singers and can be used to predict the vocal load among professional voice users.
Work-related voice disorder

Directory of Open Access Journals (Sweden)

Paulo Eduardo Przysiezny

2015-04-01

Full Text Available INTRODUCTION: Dysphonia is the main symptom of the disorders of oral communication. However, voice disorders also present with other symptoms such as difficulty in maintaining the voice (asthenia, vocal fatigue, variation in habitual vocal fundamental frequency, hoarseness, lack of vocal volume and projection, loss of vocal efficiency, and weakness when speaking. There are several proposals for the etiologic classification of dysphonia: functional, organofunctional, organic, and work-related voice disorder (WRVD.OBJECTIVE: To conduct a literature review on WRVD and on the current Brazilian labor legislation.METHODS: This was a review article with bibliographical research conducted on the PubMed and Bireme databases, using the terms "work-related voice disorder", "occupational dysphonia", "dysphonia and labor legislation", and a review of labor and social security relevant laws.CONCLUSION: WRVD is a situation that frequently is listed as a reason for work absenteeism, functional rehabilitation, or for prolonged absence from work. Currently, forensic physicians have no comparative parameters to help with the analysis of vocal disorders. In certain situations WRVD may cause, work disability. This disorder may be labor-related, or be an adjuvant factor to work-related diseases.
Distinguishing between forensic science and forensic pseudoscience: testing of validity and reliability, and approaches to forensic voice comparison.

Science.gov (United States)

Morrison, Geoffrey Stewart

2014-05-01

In this paper it is argued that one should not attempt to directly assess whether a forensic analysis technique is scientifically acceptable. Rather one should first specify what one considers to be appropriate principles governing acceptable practice, then consider any particular approach in light of those principles. This paper focuses on one principle: the validity and reliability of an approach should be empirically tested under conditions reflecting those of the case under investigation using test data drawn from the relevant population. Versions of this principle have been key elements in several reports on forensic science, including forensic voice comparison, published over the last four-and-a-half decades. The aural-spectrographic approach to forensic voice comparison (also known as "voiceprint" or "voicegram" examination) and the currently widely practiced auditory-acoustic-phonetic approach are considered in light of this principle (these two approaches do not appear to be mutually exclusive). Approaches based on data, quantitative measurements, and statistical models are also considered in light of this principle. © 2013.
Measuring positive and negative affect in the voiced sounds of African elephants (Loxodonta africana).

Science.gov (United States)

Soltis, Joseph; Blowers, Tracy E; Savage, Anne

2011-02-01

As in other mammals, there is evidence that the African elephant voice reflects affect intensity, but it is less clear if positive and negative affective states are differentially reflected in the voice. An acoustic comparison was made between African elephant "rumble" vocalizations produced in negative social contexts (dominance interactions), neutral social contexts (minimal social activity), and positive social contexts (affiliative interactions) by four adult females housed at Disney's Animal Kingdom®. Rumbles produced in the negative social context exhibited higher and more variable fundamental frequencies (F(0)) and amplitudes, longer durations, increased voice roughness, and higher first formant locations (F1), compared to the neutral social context. Rumbles produced in the positive social context exhibited similar shifts in most variables (F(0 )variation, amplitude, amplitude variation, duration, and F1), but the magnitude of response was generally less than that observed in the negative context. Voice roughness and F(0) observed in the positive social context remained similar to that observed in the neutral context. These results are most consistent with the vocal expression of affect intensity, in which the negative social context elicited higher intensity levels than the positive context, but differential vocal expression of positive and negative affect cannot be ruled out.
Toward the Development of an Objective Index of Dysphonia Severity: A Four-Factor Acoustic Model

Science.gov (United States)

Awan, Shaheen N.; Roy, Nelson

2006-01-01

During assessment and management of individuals with voice disorders, clinicians routinely attempt to describe or quantify the severity of a patient's dysphonia. This investigation used acoustic measures derived from sustained vowel samples to predict dysphonia severity (as determined by auditory-perceptual ratings), for a diverse set of voice…
Coating adherence in galvanized steel assessed by acoustic emission wavelet analysis

International Nuclear Information System (INIS)

Gallego, Antolino; Gil, Jose F.; Vico, Juan M.; Ruzzante, Jose E.; Piotrkowski, Rosa

2005-01-01

Coating-substrate adherence in galvanized steel is evaluated by acoustic emission wavelet analysis in scratch tests on hot-dip galvanized samples. The acoustic emission results are compared with optical and electron microscopy observations in order to understand coating features related to adherence and to establish criteria aimed at improving the manufacture process
I like my voice better: self-enhancement bias in perceptions of voice attractiveness.

Science.gov (United States)

Hughes, Susan M; Harrison, Marissa A

2013-01-01

Previous research shows that the human voice can communicate a wealth of nonsemantic information; preferences for voices can predict health, fertility, and genetic quality of the speaker, and people often use voice attractiveness, in particular, to make these assessments of others. But it is not known what we think of the attractiveness of our own voices as others hear them. In this study eighty men and women rated the attractiveness of an array of voice recordings of different individuals and were not told that their own recorded voices were included in the presentation. Results showed that participants rated their own voices as sounding more attractive than others had rated their voices, and participants also rated their own voices as sounding more attractive than they had rated the voices of others. These findings suggest that people may engage in vocal implicit egotism, a form of self-enhancement.
Modeling and analysis of voice and data in cognitive radio networks

CERN Document Server

Gunawardena, Subodha

2014-01-01

This Springer Brief investigates the voice and elastic/interactive data service support over cognitive radio networks (CRNs), in terms of their delay requirements. The increased demand for wireless communication conflicts with the scarcity of the radio spectrum, but CRNS allow for more efficient use of the networks. The authors review packet level delay requirements of the voice service and session level delay requirements of the elastic/interactive data services, particularly constant-rate and on-o? voice tra?c capacities in CRNs with centralized and distributed network coordination. Some gen
Linear Stability Analysis of an Acoustically Vaporized Droplet

Science.gov (United States)

Siddiqui, Junaid; Qamar, Adnan; Samtaney, Ravi

2015-11-01

Acoustic droplet vaporization (ADV) is a phase transition phenomena of a superheat liquid (Dodecafluoropentane, C5F12) droplet to a gaseous bubble, instigated by a high-intensity acoustic pulse. This approach was first studied in imaging applications, and applicable in several therapeutic areas such as gas embolotherapy, thrombus dissolution, and drug delivery. High-speed imaging and theoretical modeling of ADV has elucidated several physical aspects, ranging from bubble nucleation to its subsequent growth. Surface instabilities are known to exist and considered responsible for evolving bubble shapes (non-spherical growth, bubble splitting and bubble droplet encapsulation). We present a linear stability analysis of the dynamically evolving interfaces of an acoustically vaporized micro-droplet (liquid A) in an infinite pool of a second liquid (liquid B). We propose a thermal ADV model for the base state. The linear analysis utilizes spherical harmonics (Ynm, of degree m and order n) and under various physical assumptions results in a time-dependent ODE of the perturbed interface amplitudes (one at the vapor/liquid A interface and the other at the liquid A/liquid B interface). The perturbation amplitudes are found to grow exponentially and do not depend on m. Supported by KAUST Baseline Research Funds.
Lactobacilli : Important in biofilm formation on voice prostheses

NARCIS (Netherlands)

Buijssen, Kevin J. D. A.; Harmsen, Hermie J. M.; van der Mei, Henny C.; Busscher, Henk J.; van der Laan, Bernard F. A. M.

OBJECTIVE: We sought to identify bacterial strains responsible for biofilm formation on silicone rubber voice prostheses. STUDY DESIGN: We conducted an analysis of the bacterial population in biofilms on used silicone rubber voice prostheses by using new microbiological methods. METHODS: Two
Self-perception, complaints and vocal quality among undergraduate students enrolled in a Pedagogy course.

Science.gov (United States)

Fabron, Eliana Maria Gradim; Regaçone, Simone Fiuza; Marino, Viviane Cristina de Castro; Mastria, Marina Ludovico; Motonaga, Suely Mayumi; Sebastião, Luciana Tavares

2015-01-01

To compare the vocal self-perception and vocal complaints reported by two groups of students of the pedagogy course (freshmen and graduates); to relate the vocal self-perception to the vocal complaints for these groups; and to compare the voice quality of the students from these groups through perceptual auditory assessment and acoustic analysis. Initially, 89 students from the pedagogy course answered a questionnaire about self-perceived voice quality and vocal complaints. In a second phase, auditory-perceptual evaluation and acoustic analyses of 48 participants were made through voice recordings of sustained vowel emission and poem reading. The most reported vocal complaints were fatigue while using the voice, sore throat, effort to speak, irritation or burning in the throat, hoarseness, tightness in the neck, and variations of voice throughout the day. There was a higher occurrence of complaints from graduates than from freshmen, with significant differences for four of the nine complaints. It was also possible to observe the relationship between vocal self-perception and complaints reported by these students. No significant differences were observed in the results of auditory-perceptual evaluation; however, some graduates had their voices evaluated with higher severity of deviation of normalcy. During acoustic analysis no difference was observed between groups. The increase in vocal demand by the graduates may have caused the greatest number and diversity of vocal complaints, and several of them are related to the self-assessment of voice quality. The auditory-perceptual evaluation and acoustic analysis showed no deviations in their voice.
Acoustical conditions for speech communication in active elementary school classrooms

Science.gov (United States)

Sato, Hiroshi; Bradley, John

2005-04-01

Detailed acoustical measurements were made in 34 active elementary school classrooms with typical rectangular room shape in schools near Ottawa, Canada. There was an average of 21 students in classrooms. The measurements were made to obtain accurate indications of the acoustical quality of conditions for speech communication during actual teaching activities. Mean speech and noise levels were determined from the distribution of recorded sound levels and the average speech-to-noise ratio was 11 dBA. Measured mid-frequency reverberation times (RT) during the same occupied conditions varied from 0.3 to 0.6 s, and were a little less than for the unoccupied rooms. RT values were not related to noise levels. Octave band speech and noise levels, useful-to-detrimental ratios, and Speech Transmission Index values were also determined. Key results included: (1) The average vocal effort of teachers corresponded to louder than Pearsons Raised voice level; (2) teachers increase their voice level to overcome ambient noise; (3) effective speech levels can be enhanced by up to 5 dB by early reflection energy; and (4) student activity is seen to be the dominant noise source, increasing average noise levels by up to 10 dBA during teaching activities. [Work supported by CLLRnet.
Voice analysis as an objective state marker in bipolar disorder

DEFF Research Database (Denmark)

Faurholt-Jepsen, M.; Busk, Jonas; Frost, M.

2016-01-01

Changes in speech have been suggested as sensitive and valid measures of depression and mania in bipolar disorder. The present study aimed at investigating (1) voice features collected during phone calls as objective markers of affective states in bipolar disorder and (2) if combining voice...... features, automatically generated objective smartphone data on behavioral activities and electronic self-monitored data were collected from 28 outpatients with bipolar disorder in naturalistic settings on a daily basis during a period of 12 weeks. Depressive and manic symptoms were assessed using...... and electronic self-monitored data increased the accuracy, sensitivity and specificity of classification of affective states slightly. Voice features collected in naturalistic settings using smartphones may be used as objective state markers in patients with bipolar disorder....
A survey analysis of acoustic trauma related to MR scans

International Nuclear Information System (INIS)

Nakai, Toshiharu; Kamiya, Naoki; Sone, Michihiko; Muranaka, Hiroyuki; Tsuchihashi, Toshio; Yamada, Naoki; Yamaguchi, Sachiko

2012-01-01

The maximum limit of MR scanner noise and necessity of ear protection is defined in the IEC standard (IEC60601-2-33) of MR safety. With improvements in MR scanner performance, pulse sequences generating higher scanning noise have been used clinically. In this study, we investigated the factors significantly related to potential acoustic trauma cases (PATC) after MR examinations. To consider the future direction for MR safety and prevention of acoustic trauma, issues related to noise generation by MR scanners and acoustic trauma were systematically reviewed. A statistical analysis was performed using the data set from a survey (n=974) conducted in 2010 by the Japanese Society for Magnetic Resonance in Medicine (JSMRM) safety committee. Hierarchical clustering analysis was used to extract the characteristics of the responders. With this classification as a reference, tests of independence and a residual analysis were employed to evaluate the factors related to PATC. No significant relationship was observed between the ear protection policy and the incidence or the reported outcome of PATC. While the two main clusters out of the six clusters extracted were associated with who reported the PATC and the confirmation process of the acoustic noise level of MR scanners, no cluster was associated with the frequency of PATC. An absence of PATC was significantly less reported (p=0.03) and more PATC was reported (p=0.04) by facilities with 3T MR systems. Although the total frequency was 4 cases, it should be noted that persistent hearing disturbances are a possible consequence of MR examinations. Neither the condition of the subjects nor the ear protection method was significantly related to the probability of PATC, suggesting the difficulty of predicting the potential risk of acoustic trauma. It is recommended to more systematically follow up PATC cases and clarify the risk factors. (author)
The Effect of Intertalker Variations on Acoustic-Perceptual Mapping in Cantonese and Mandarin Tone Systems

Science.gov (United States)

Peng, Gang; Zhang, Caicai; Zheng, Hong-Ying; Minett, James W.; Wang, William S.-Y.

2012-01-01

Purpose: This study investigates the impact of intertalker variations on the process of mapping acoustic variations on tone categories in two different tone languages. Method: Pitch stimuli manipulated from four voice ranges were presented in isolation through a blocked-talker design. Listeners were instructed to identify the stimuli that they…
[Voice assessment and demographic data of applicants for a school of speech therapists].

Science.gov (United States)

Reiter, R; Brosch, S

2008-05-01

Demographic data, subjective und objective voice analysis as well as self-assessment of voice quality from applicants for a school of speech therapists were investigated. Demographic data from 116 applicants were collected and their voice quality assessed by three independent judges. An objective evaluation was done by maximum phonation time, average fundamental frequency, dynamic range and percent of jitter and shimmer by means of Goettinger Hoarseness diagram. Self-assessment of voice quality was done by "voice handicap index questionnaire". The twenty successful applicants had a physiological voice in 95 %, they were all musical and had university entrance qualifications. Subjective voice assessment showed in 16 % of the applicants a hoarse voice. In this subgroup an unphysiological vocal use was observed in 72 % and a reduced articulation in 45 %. The objective voice parameters did not show a significant difference between the 3 groups. Self-assessment of the voice was inconspicuous in all applicants. Applicants with general qualification for university entrance, musicality and a physiological voice were more likely to be successful. There were main differences between self assessment of voice and quantitative analysis or subjective assessment by three independent judges.
Finding your mate at a cocktail party: frequency separation promotes auditory stream segregation of concurrent voices in multi-species frog choruses.

Directory of Open Access Journals (Sweden)

Vivek Nityananda

Full Text Available Vocal communication in crowded social environments is a difficult problem for both humans and nonhuman animals. Yet many important social behaviors require listeners to detect, recognize, and discriminate among signals in a complex acoustic milieu comprising the overlapping signals of multiple individuals, often of multiple species. Humans exploit a relatively small number of acoustic cues to segregate overlapping voices (as well as other mixtures of concurrent sounds, like polyphonic music. By comparison, we know little about how nonhuman animals are adapted to solve similar communication problems. One important cue enabling source segregation in human speech communication is that of frequency separation between concurrent voices: differences in frequency promote perceptual segregation of overlapping voices into separate "auditory streams" that can be followed through time. In this study, we show that frequency separation (ΔF also enables frogs to segregate concurrent vocalizations, such as those routinely encountered in mixed-species breeding choruses. We presented female gray treefrogs (Hyla chrysoscelis with a pulsed target signal (simulating an attractive conspecific call in the presence of a continuous stream of distractor pulses (simulating an overlapping, unattractive heterospecific call. When the ΔF between target and distractor was small (e.g., ≤3 semitones, females exhibited low levels of responsiveness, indicating a failure to recognize the target as an attractive signal when the distractor had a similar frequency. Subjects became increasingly more responsive to the target, as indicated by shorter latencies for phonotaxis, as the ΔF between target and distractor increased (e.g., ΔF = 6-12 semitones. These results support the conclusion that gray treefrogs, like humans, can exploit frequency separation as a perceptual cue to segregate concurrent voices in noisy social environments. The ability of these frogs to segregate
Formation of the Actor's/Speaker's Formant: A Study Applying Spectrum Analysis and Computer Modeling

Czech Academy of Sciences Publication Activity Database

Leino, T.; Laukkanen, A. M.; Radolf, Vojtěch

2011-01-01

Roč. 25, č. 2 (2011), s. 150-158 ISSN 0892-1997 R&D Projects: GA ČR GA101/08/1155 Institutional research plan: CEZ:AV0Z20760514 Keywords : vocal exercising * voice quality * spectrum analysis * mathematical modeling Subject RIV: BI - Acoustics Impact factor: 1.390, year: 2011
Sound induced activity in voice sensitive cortex predicts voice memory ability

Directory of Open Access Journals (Sweden)

Rebecca eWatson

2012-04-01

Full Text Available The ‘temporal voice areas’ (TVAs (Belin et al., 2000 of the human brain show greater neuronal activity in response to human voices than to other categories of nonvocal sounds. However, a direct link between TVA activity and voice perceptionbehaviour has not yet been established. Here we show that a functional magnetic resonance imaging (fMRI measure of activity in the TVAs predicts individual performance at a separately administered voice memory test. This relation holds whengeneral sound memory ability is taken into account. These findings provide the first evidence that the TVAs are specifically involved in voice cognition.

Integrating cues of social interest and voice pitch in men's preferences for women's voices.

Science.gov (United States)

Jones, Benedict C; Feinberg, David R; Debruine, Lisa M; Little, Anthony C; Vukovic, Jovana

2008-04-23

Most previous studies of vocal attractiveness have focused on preferences for physical characteristics of voices such as pitch. Here we examine the content of vocalizations in interaction with such physical traits, finding that vocal cues of social interest modulate the strength of men's preferences for raised pitch in women's voices. Men showed stronger preferences for raised pitch when judging the voices of women who appeared interested in the listener than when judging the voices of women who appeared relatively disinterested in the listener. These findings show that voice preferences are not determined solely by physical properties of voices and that men integrate information about voice pitch and the degree of social interest expressed by women when forming voice preferences. Women's preferences for raised pitch in women's voices were not modulated by cues of social interest, suggesting that the integration of cues of social interest and voice pitch when men judge the attractiveness of women's voices may reflect adaptations that promote efficient allocation of men's mating effort.
Voice Therapy Practices and Techniques: A Survey of Voice Clinicians.

Science.gov (United States)

Mueller, Peter B.; Larson, George W.

1992-01-01

Eighty-three voice disorder therapists' ratings of statements regarding voice therapy practices indicated that vocal nodules are the most frequent disorder treated; vocal abuse and hard glottal attack elimination, counseling, and relaxation were preferred treatment approaches; and voice therapy is more effective with adults than with children.…
An acoustic analysis of laughter produced by congenitally deaf and normally hearing college students1

Science.gov (United States)

Makagon, Maja M.; Funayama, E. Sumie; Owren, Michael J.

2008-01-01

Relatively few empirical data are available concerning the role of auditory experience in nonverbal human vocal behavior, such as laughter production. This study compared the acoustic properties of laughter in 19 congenitally, bilaterally, and profoundly deaf college students and in 23 normally hearing control participants. Analyses focused on degree of voicing, mouth position, air-flow direction, temporal features, relative amplitude, fundamental frequency, and formant frequencies. Results showed that laughter produced by the deaf participants was fundamentally similar to that produced by the normally hearing individuals, which in turn was consistent with previously reported findings. Finding comparable acoustic properties in the sounds produced by deaf and hearing vocalizers confirms the presumption that laughter is importantly grounded in human biology, and that auditory experience with this vocalization is not necessary for it to emerge in species-typical form. Some differences were found between the laughter of deaf and hearing groups; the most important being that the deaf participants produced lower-amplitude and longer-duration laughs. These discrepancies are likely due to a combination of the physiological and social factors that routinely affect profoundly deaf individuals, including low overall rates of vocal fold use and pressure from the hearing world to suppress spontaneous vocalizations. PMID:18646991
Pattern recognition methods for acoustic emission analysis

International Nuclear Information System (INIS)

Doctor, P.G.; Harrington, T.P.; Hutton, P.H.

1979-07-01

Models have been developed that relate the rate of acoustic emissions to structural integrity. The implementation of these techniques in the field has been hindered by the noisy environment in which the data must be taken. Acoustic emissions from noncritical sources are recorded in addition to those produced by critical sources, such as flaws. A technique is discussed for prescreening acoustic events and filtering out those that are produced by noncritical sources. The methodology that was investigated is pattern recognition. Three different pattern recognition techniques were applied to a data set that consisted of acoustic emissions caused by crack growth and acoustic signals caused by extraneous noise sources. Examination of the acoustic emission data presented has uncovered several features of the data that can provide a reasonable filter. Two of the most valuable features are the frequency of maximum response and the autocorrelation coefficient at Lag 13. When these two features and several others were combined with a least squares decision algorithm, 90% of the acoustic emissions in the data set were correctly classified. It appears possible to design filters that eliminate extraneous noise sources from flaw-growth acoustic emissions using pattern recognition techniques
Measurement errors in voice-key naming latency for Hiragana.

Science.gov (United States)

Yamada, Jun; Tamaoka, Katsuo

2003-12-01

This study makes explicit the limitations and possibilities of voice-key naming latency research on single hiragana symbols (a Japanese syllabic script) by examining three sets of voice-key naming data against Sakuma, Fushimi, and Tatsumi's 1997 speech-analyzer voice-waveform data. Analysis showed that voice-key measurement errors can be substantial in standard procedures as they may conceal the true effects of significant variables involved in hiragana-naming behavior. While one can avoid voice-key measurement errors to some extent by applying Sakuma, et al.'s deltas and by excluding initial phonemes which induce measurement errors, such errors may be ignored when test items are words and other higher-level linguistic materials.
A Framework for Music-Speech Segregation using Music Fingerprinting and Acoustic Echo Cancellation Principle

International Nuclear Information System (INIS)

Hussain, F.; Habib, H. A.; Khan, M. J.

2015-01-01

Background interference creates voice intelligibility issue for listerner. This research work considers background music as interference for communication through smart phone in areas with loud background music. This paper proposes a novel framework for background music segregation from human speech using music fingerprinting and acoustic echo cancellation. Initially, background music is searched in the database by music fingerprinting. Identified background music is registered and segregated using acoustic echo cancellation. Proposed approach generates better quality music speech segregation than existing algorithms. The research work is novel and segregates background music completely in comparison to existing approaches where single instruments are segregated successfully. (author)
Violence in schools and the voice of teachers.

Science.gov (United States)

Dornelas, Rodrigo; Santos, Thaynara Alves Dos; Oliveira, Daniela Sena de; Irineu, Roxane de Alencar; Brito, Aline; Silva, Kelly

2017-08-10

To correlate self-reporting of voice disorders with habits that impact voice production and situations of violence experienced by teachers. The study involved 41 elementary-school teachers of rural and urban areas. Two instruments were used for data collection: The Vocal Production Condition - Teacher (CPV-P) questionnaire and the Screening Index for Voice Disorders - ITDV. The chi-square test was used to verify association among variables with a significance level of 5%. The sample consisted of 8 men and 33 women aged 25-66 years with a median of 39 years. Regarding vocal habits, 33 people (80.5%) mentioned the screaming as usual practice, 40 people (97.5%) declared they talk a lot. As for voice care, 31 people (73.1%) reported drinking water while using their voice. As for the ITDV total score, 30 teachers (73.1%) were above the score threshold set for predisposition to vocal disorders. Statistical analysis revealed a significant association between female participants and complaint of graffiti writings as a type of violence. No significant correlation between the ITDV results with gender and the ITDV with forms of violence evaluated in the study was indicated. Self-reporting of voice disorders showed no significant relationship with acts of violence. However, analysis of the context of violence in schools and vocal problems are issues worthy of attention, particularly the observed naturalization of gender inssues, which is seldom problematized.
Acoustical contribution calculation and analysis of compressor shell based on acoustic transfer vector method

Science.gov (United States)

Chen, Xiaol; Guo, Bei; Tuo, Jinliang; Zhou, Ruixin; Lu, Yang

2017-08-01

Nowadays, people are paying more and more attention to the noise reduction of household refrigerator compressor. This paper established a sound field bounded by compressor shell and ISO3744 standard field points. The Acoustic Transfer Vector (ATV) in the sound field radiated by a refrigerator compressor shell were calculated which fits the test result preferably. Then the compressor shell surface is divided into several parts. Based on Acoustic Transfer Vector approach, the sound pressure contribution to the field points and the sound power contribution to the sound field of each part were calculated. To obtain the noise radiation in the sound field, the sound pressure cloud charts were analyzed, and the contribution curves in different frequency of each part were acquired. Meanwhile, the sound power contribution of each part in different frequency was analyzed, to ensure those parts where contributes larger sound power. Through the analysis of acoustic contribution, those parts where radiate larger noise on the compressor shell were determined. This paper provides a credible and effective approach on the structure optimal design of refrigerator compressor shell, which is meaningful in the noise and vibration reduction.
Acoustic Changes in the Speech of Children with Cerebral Palsy Following an Intensive Program of Dysarthria Therapy

Science.gov (United States)

Pennington, Lindsay; Lombardo, Eftychia; Steen, Nick; Miller, Nick

2018-01-01

Background: The speech intelligibility of children with dysarthria and cerebral palsy has been observed to increase following therapy focusing on respiration and phonation. Aims: To determine if speech intelligibility change following intervention is associated with change in acoustic measures of voice. Methods & Procedures: We recorded 16…
FE Modeling of Human Vocal Tract Acoustics. Part II. Influence of Velopharyngeal Insufficiency on Phonation of Vowels

Czech Academy of Sciences Publication Activity Database

Vampola, T.; Horáček, Jaromír; Vokřál, J.

2008-01-01

Roč. 94, č. 3 (2008), s. 448-460 ISSN 1610-1928 R&D Projects: GA ČR GA106/04/1025 Institutional research plan: CEZ:AV0Z20760514 Keywords : biomechanics of voice * numerical simulations * nasality Subject RIV: BI - Acoustics Impact factor: 0.538, year: 2008
Acoustic signal analysis in the creeping discharge

International Nuclear Information System (INIS)

Nakamiya, T; Sonoda, Y; Tsuda, R; Ebihara, K; Ikegami, T

2008-01-01

We have previously succeeded in measuring the acoustic signal due to the dielectric barrier discharge and discriminating the dominant frequency components of the acoustic signal. The dominant frequency components appear over 20kHz of acoustic signal by the dielectric barrier discharge. Recently surface discharge control technology has been focused from practical applications such as ozonizer, NO X reactors, light source or display. The fundamental experiments are carried to examine the creeping discharge using the acoustic signal. When the high voltage (6kV, f = 10kHz) is applied to the electrode, the discharge current flows and the acoustic sound is generated. The current, voltage waveforms of creeping discharge and the sound signal detected by the condenser microphone are stored in the digital memory scope. In this scheme, Continuous Wavelet Transform (CWT) is applied to discriminate the acoustic sound of the micro discharge and the dominant frequency components are studied. CWT results of sound signal show the frequency spectrum of wideband up to 100kHz. In addition, the energy distributions of acoustic signal are examined by CWT
Air-pressure, vocal fold vibration and acoustic characteristics of phonation during vocal exercising. - Part 1: Measurement in vivo

Czech Academy of Sciences Publication Activity Database

Radolf, Vojtěch; Laukkanen, A. M.; Horáček, Jaromír; Liu, D.

2014-01-01

Roč. 21, č. 1 (2014), s. 53-59 ISSN 1802-1484 R&D Projects: GA ČR GPP101/12/P579 Institutional support: RVO:61388998 Keywords : biomechanics of voice * subglottal * oral and transglottal pressure * electroglottography * phonation into tubes Subject RIV: BI - Acoustics
Voice following radiotherapy

International Nuclear Information System (INIS)

Stoicheff, M.L.

1975-01-01

This study was undertaken to provide information on the voice of patients following radiotherapy for glottic cancer. Part I presents findings from questionnaires returned by 227 of 235 patients successfully irradiated for glottic cancer from 1960 through 1971. Part II presents preliminary findings on the speaking fundamental frequencies of 22 irradiated patients. Normal to near-normal voice was reported by 83 percent of the 227 patients; however, 80 percent did indicate persisting vocal difficulties such as fatiguing of voice with much usage, inability to sing, reduced loudness, hoarse voice quality and inability to shout. Amount of talking during treatments appeared to affect length of time for voice to recover following treatments in those cases where it took from nine to 26 weeks; also, with increasing years since treatment, patients rated their voices more favorably. Smoking habits following treatments improved significantly with only 27 percent smoking heavily as compared with 65 percent prior to radiation therapy. No correlation was found between smoking (during or after treatments) and vocal ratings or between smoking and length of time for voice to recover. There was no relationship found between reported vocal ratings and stage of the disease
Voices Not Heard: Voice-Use Profiles of Elementary Music Teachers, the Effects of Voice Amplification on Vocal Load, and Perceptions of Issues Surrounding Voice Use

Science.gov (United States)

Morrow, Sharon L.

2009-01-01

Teachers represent the largest group of occupational voice users and have voice-related problems at a rate of over twice that found in the general population. Among teachers, music teachers are roughly four times more likely than classroom teachers to develop voice-related problems. Although it has been established that music teachers use their…
The role of spectral and temporal cues in voice gender discrimination by normal-hearing listeners and cochlear implant users.

Science.gov (United States)

Fu, Qian-Jie; Chinchilla, Sherol; Galvin, John J

2004-09-01

The present study investigated the relative importance of temporal and spectral cues in voice gender discrimination and vowel recognition by normal-hearing subjects listening to an acoustic simulation of cochlear implant speech processing and by cochlear implant users. In the simulation, the number of speech processing channels ranged from 4 to 32, thereby varying the spectral resolution; the cutoff frequencies of the channels' envelope filters ranged from 20 to 320 Hz, thereby manipulating the available temporal cues. For normal-hearing subjects, results showed that both voice gender discrimination and vowel recognition scores improved as the number of spectral channels was increased. When only 4 spectral channels were available, voice gender discrimination significantly improved as the envelope filter cutoff frequency was increased from 20 to 320 Hz. For all spectral conditions, increasing the amount of temporal information had no significant effect on vowel recognition. Both voice gender discrimination and vowel recognition scores were highly variable among implant users. The performance of cochlear implant listeners was similar to that of normal-hearing subjects listening to comparable speech processing (4-8 spectral channels). The results suggest that both spectral and temporal cues contribute to voice gender discrimination and that temporal cues are especially important for cochlear implant users to identify the voice gender when there is reduced spectral resolution.
Air-pressure, vocal folds vibration and acoustic characteristics of phonation during vocal exercising. - Part 2: Measurement on a physical model

Czech Academy of Sciences Publication Activity Database

Horáček, Jaromír; Radolf, Vojtěch; Bula, Vítězslav; Laukkanen, A. M.

2014-01-01

Roč. 21, č. 3 (2014), s. 193-200 ISSN 1802-1484 R&D Projects: GA ČR GAP101/12/1306 Institutional support: RVO:61388998 Keywords : biomechanics of voice * subglottal * oral and transglottal pressure * flow resistance Subject RIV: BI - Acoustics
Predicting word-recognition performance in noise by young listeners with normal hearing using acoustic, phonetic, and lexical variables.

Science.gov (United States)

McArdle, Rachel; Wilson, Richard H

2008-06-01

To analyze the 50% correct recognition data that were from the Wilson et al (this issue) study and that were obtained from 24 listeners with normal hearing; also to examine whether acoustic, phonetic, or lexical variables can predict recognition performance for monosyllabic words presented in speech-spectrum noise. The specific variables are as follows: (a) acoustic variables (i.e., effective root-mean-square sound pressure level, duration), (b) phonetic variables (i.e., consonant features such as manner, place, and voicing for initial and final phonemes; vowel phonemes), and (c) lexical variables (i.e., word frequency, word familiarity, neighborhood density, neighborhood frequency). The descriptive, correlational study will examine the influence of acoustic, phonetic, and lexical variables on speech recognition in noise performance. Regression analysis demonstrated that 45% of the variance in the 50% point was accounted for by acoustic and phonetic variables whereas only 3% of the variance was accounted for by lexical variables. These findings suggest that monosyllabic word-recognition-in-noise is more dependent on bottom-up processing than on top-down processing. The results suggest that when speech-in-noise testing is used in a pre- and post-hearing-aid-fitting format, the use of monosyllabic words may be sensitive to changes in audibility resulting from amplification.
Mindfulness of voices, self-compassion, and secure attachment in relation to the experience of hearing voices.

Science.gov (United States)

Dudley, James; Eames, Catrin; Mulligan, John; Fisher, Naomi

2018-03-01

Developing compassion towards oneself has been linked to improvement in many areas of psychological well-being, including psychosis. Furthermore, developing a non-judgemental, accepting way of relating to voices is associated with lower levels of distress for people who hear voices. These factors have also been associated with secure attachment. This study explores associations between the constructs of mindfulness of voices, self-compassion, and distress from hearing voices and how secure attachment style related to each of these variables. Cross-sectional online. One hundred and twenty-eight people (73% female; M age = 37.5; 87.5% Caucasian) who currently hear voices completed the Self-Compassion Scale, Southampton Mindfulness of Voices Questionnaire, Relationships Questionnaire, and Hamilton Programme for Schizophrenia Voices Questionnaire. Results showed that mindfulness of voices mediated the relationship between self-compassion and severity of voices, and self-compassion mediated the relationship between mindfulness of voices and severity of voices. Self-compassion and mindfulness of voices were significantly positively correlated with each other and negatively correlated with distress and severity of voices. Mindful relation to voices and self-compassion are associated with reduced distress and severity of voices, which supports the proposed potential benefits of mindful relating to voices and self-compassion as therapeutic skills for people experiencing distress by voice hearing. Greater self-compassion and mindfulness of voices were significantly associated with less distress from voices. These findings support theory underlining compassionate mind training. Mindfulness of voices mediated the relationship between self-compassion and distress from voices, indicating a synergistic relationship between the constructs. Although the current findings do not give a direction of causation, consideration is given to the potential impact of mindful and
The singer's voice range profile: female professional opera soloists.

Science.gov (United States)

Lamarche, Anick; Ternström, Sten; Pabon, Peter

2010-07-01

This work concerns the collection of 30 voice range profiles (VRPs) of female operatic voice. We address the questions: Is there a need for a singer's protocol in VRP acquisition? Are physiological measurements sufficient or should the measurement of performance capabilities also be included? Can we address the female singing voice in general or is there a case for categorizing voices when studying phonetographic data? Subjects performed a series of structured tasks involving both standard speech voice protocols and additional singing tasks. Singers also completed an extensive questionnaire. Physiological VRPs differ from performance VRPs. Two new VRP metrics, the voice area above a defined level threshold and the dynamic range independent from the fundamental frequency (F(0)), were found to be useful in the analysis of singer VRPs. Task design had no effect on performance VRP outcomes. Voice category differences were mainly attributable to phonation frequency-based information. Results support the clinical importance of addressing the vocal instrument as it is used in performance. Equally important is the elaboration of a protocol suitable for the singing voice. The given context and instructions can be more important than task design for performance VRPs. Yet, for physiological VRP recordings, task design remains critical. Both types of VRPs are suggested for a singer's voice evaluation. Copyright (c) 2010 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Analysis of the acoustic sound in MRI

Energy Technology Data Exchange (ETDEWEB)

Wada, Tetsuro; Hara, Akira; Kusakari, Jun; Yoshioka, Hiroshi; Niitsu, Mamoru; Itai, Yuji [Tsukuba Univ., Ibaraki (Japan). Inst. of Clinical Medicine; Ase, Yuji

1999-04-01

The noise level and power spectra of the acoustic sound exposed during the examination of Magnetic Resonance Imaging (MRI) using a MRI scanner (Philips Gyroscan 1.5 T) were measured at the position of the human auricle. The overall noise levels on T1-weighted images and T2-weighted images with Spin Echo were 105 dB and 98 dB, respectively. The overall noise level on T2-weighted images with Turbo Spin Echo was 110 dB. Fourier analysis revealed energy peaks ranging from 225 to 325 Hz and a steep high frequency cutoff for each pulse sequence. The MRI noise was not likely to cause permanent threshold shift. However, because of the inter-subject variation in susceptibility to acoustic trauma and to exclude the anxiety in patients, ear protectors were recommended for all patients during MRI testing. (author)

Normative Values and Interrelationship of MDVP Voice Analysis Parameters Before and After Endotracheal Intubation

DEFF Research Database (Denmark)

Sørensen, Martin Kryspin; Durck, Tina Trier; Bork, Kristian

2016-01-01

normative values for adults and investigates the correlation between these MDVP parameters in relation to the "standardized" trauma of endotracheal intubation. METHODS: Preoperative and postoperative assessments of vocal fold pathology with flexible videolaryngoscopy and voice analysis with MDVP using...... the best-of-three standardized recording were performed in 121 patients with normal voices included consecutively in the RCT. The procedures of anesthesia were standardized. RESULTS: The normative MDVP values of this study are consistently lower compared with most normative values presented in other...... studies. The preoperative to postoperative differences in jitter values (jitter and relative average perturbation) were closely correlated to the shimmer values for patients with postoperative vocal fold edemas. In the patients with edema, the preoperative to postoperative differences in jitter had...
Voice Disorders in Occupations with Vocal Load in Slovenia.

Science.gov (United States)

Boltežar, Lučka; Šereg Bahar, Maja

2014-12-01

The aim of this paper is to compare the prevalence of voice disorders and the risk factors for them in different occupations with a vocal load in Slovenia. A meta-analysis of six different Slovenian studies involving teachers, physicians, salespeople, catholic priests, nurses and speech-and-language therapists (SLTs) was performed. In all six studies, similar questions about the prevalence of voice disorders and the causes for them were included. The comparison of the six studies showed that more than 82% of the 2347 included subjects had voice problems at some time during their career. The teachers were the most affected by voice problems. The prevalent cause of voice problems was the vocal load in teachers and salespeople and respiratory-tract infections in all the other occupational groups. When the occupational groups were compared, it was stated that the teachers had more voice problems and showed less care for their voices than the priests. The physicians had more voice problems and showed better consideration of vocal hygiene rules than the SLTs. The majority of all the included subjects did not receive instructions about voice care during education. In order to decrease the prevalence of voice disorders in vocal professionals, a screening program is recommended before the beginning of their studies. Regular courses on voice care and proper vocal technique should be obligatory for all professional voice users during their career. The inclusion of dysphonia in the list of occupational diseases should be considered in Slovenia as it is in some European countries.
Compact Acoustic Models for Embedded Speech Recognition

Directory of Open Access Journals (Sweden)

Lévy Christophe

2009-01-01

Full Text Available Speech recognition applications are known to require a significant amount of resources. However, embedded speech recognition only authorizes few KB of memory, few MIPS, and small amount of training data. In order to fit the resource constraints of embedded applications, an approach based on a semicontinuous HMM system using state-independent acoustic modelling is proposed. A transformation is computed and applied to the global model in order to obtain each HMM state-dependent probability density functions, authorizing to store only the transformation parameters. This approach is evaluated on two tasks: digit and voice-command recognition. A fast adaptation technique of acoustic models is also proposed. In order to significantly reduce computational costs, the adaptation is performed only on the global model (using related speaker recognition adaptation techniques with no need for state-dependent data. The whole approach results in a relative gain of more than 20% compared to a basic HMM-based system fitting the constraints.
Responsive acoustic surfaces

DEFF Research Database (Denmark)

Peters, Brady; Tamke, Martin; Nielsen, Stig Anton

2011-01-01

Acoustic performance is defined by the parameter of reverberation time; however, this does not capture the acoustic experience in some types of open plan spaces. As many working and learning activities now take place in open plan spaces, it is important to be able to understand and design...... for the acoustic conditions of these spaces. This paper describes an experimental research project that studied the design processes necessary to design for sound. A responsive acoustic surface was designed, fabricated and tested. This acoustic surface was designed to create specific sonic effects. The design...... was simulated using custom integrated acoustic software and also using Odeon acoustic analysis software. The research demonstrates a method for designing space- and sound-defining surfaces, defines the concept of acoustic subspace, and suggests some new parameters for defining acoustic subspaces....
Modal analysis and cut-off conditions of multichannel surface-acoustic-waveguide structures.

Science.gov (United States)

Griffel, G; Golan, G; Ruschin, S; Seidman, A; Croitoru, N

1988-01-01

Multichannel guides for surface acoustic waves can improve the efficiency of SAW (surface acoustic-wave) devices significantly. Focusing, steering, and modulating the propagating acoustical modes can be achieved similarly to optical waveguided devices. A general formulation is presented for the analysis of the lateral waveguiding properties of Rayleigh modes in surfaces loaded with deposited strips of different materials. General expressions are obtained for the number of modes and cutoff conditions in these structures. As examples of applications, a simple directional coupler and an electrically controlled coupler are proposed.
Face the voice

DEFF Research Database (Denmark)

Lønstrup, Ansa

2014-01-01

will be based on a reception aesthetic and phenomenological approach, the latter as presented by Don Ihde in his book Listening and Voice. Phenomenologies of Sound , and my analytical sketches will be related to theoretical statements concerning the understanding of voice and media (Cavarero, Dolar, La......Belle, Neumark). Finally, the article will discuss the specific artistic combination and our auditory experience of mediated human voices and sculpturally projected faces in an art museum context under the general conditions of the societal panophonia of disembodied and mediated voices, as promoted by Steven...
"Voice Forum" The Human Voice as Primary Instrument in Music Therapy

DEFF Research Database (Denmark)

Pedersen, Inge Nygaard; Storm, Sanne

2009-01-01

Aspects will be drawn on the human voice as tool for embodying our psychological and physiological state, and attempting integration of feelings. Presentations and dialogues on different methods and techniques in "Therapy related body-and voice work.", as well as the human voice as a tool for non...
Pressure potential and stability analysis in an acoustical noncontact transportation

Science.gov (United States)

Li, J.; Liu, C. J.; Zhang, W. J.

2017-01-01

Near field acoustic traveling wave is one of the most popular principles in noncontact manipulations and transportations. The stability behavior is a key factor in the industrial applications of acoustical noncontact transportation. We present here an in-depth analysis of the transportation stability of a planar object levitated in near field acoustic traveling waves. To more accurately describe the pressure distributions on the radiation surface, a 3D nonlinear traveling wave model is presented. A closed form solution is derived based on the pressure potential to quantitatively calculate the restoring forces and moments under small disturbances. The physical explanations of the effects of fluid inertia and the effects of non-uniform pressure distributions are provided in detail. It is found that a vibration rail with tapered cross section provides more stable transportation than a rail with rectangular cross section. The present study sheds light on the issue of quantitative evaluation of stability in acoustic traveling waves and proposes three main factors that influence the stability: (a) vibration shape, (b) pressure distribution and (c) restoring force/moment. It helps to provide a better understanding of the physics behind the near field acoustic transportation and provide useful design and optimization tools for industrial applications.
Monitoring treatment of vocal fold paralysis by biomechanical analysis of voice

OpenAIRE

Gómez Vilda, Pedro; Martínez de Arellano, Ana; Nieto Lluis, Victor; Rodellar Biarge, M. Victoria; Álvarez Marquina, Agustin; Mazaira Fernández, Luis Miguel

2013-01-01

A case study of vocal fold paralysis treatment is described with the help of the voice quality analysis application BioMet®Phon. The case corresponds to a description of a 40 - year old female patient who was diagnosed of vocal fold paralysis following a cardio - pulmonar intervention which required intubation for 8 days and posterior tracheotomy for 15 days. The patient presented breathy and asthenic phon ation, and dysphagia. Six main examinations were conducted during a full year period th...
Voice Use Among Music Theory Teachers: A Voice Dosimetry and Self-Assessment Study.

Science.gov (United States)

Schiller, Isabel S; Morsomme, Dominique; Remacle, Angélique

2017-07-25

This study aimed (1) to investigate music theory teachers' professional and extra-professional vocal loading and background noise exposure, (2) to determine the correlation between vocal loading and background noise, and (3) to determine the correlation between vocal loading and self-evaluation data. Using voice dosimetry, 13 music theory teachers were monitored for one workweek. The parameters analyzed were voice sound pressure level (SPL), fundamental frequency (F0), phonation time, vocal loading index (VLI), and noise SPL. Spearman correlation was used to correlate vocal loading parameters (voice SPL, F0, and phonation time) and noise SPL. Each day, the subjects self-assessed their voice using visual analog scales. VLI and self-evaluation data were correlated using Spearman correlation. Vocal loading parameters and noise SPL were significantly higher in the professional than in the extra-professional environment. Voice SPL, phonation time, and female subjects' F0 correlated positively with noise SPL. VLI correlated with self-assessed voice quality, vocal fatigue, and amount of singing and speaking voice produced. Teaching music theory is a profession with high vocal demands. More background noise is associated with increased vocal loading and may indirectly increase the risk for voice disorders. Correlations between VLI and self-assessments suggest that these teachers are well aware of their vocal demands and feel their effect on voice quality and vocal fatigue. Visual analog scales seem to represent a useful tool for subjective vocal loading assessment and associated symptoms in these professional voice users. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Psychosocial risk factors which may differentiate between women with Functional Voice Disorder, Organic Voice Disorder and a Control group.

Science.gov (United States)

Baker, Janet; Ben-Tovim, David; Butcher, Andrew; Esterman, Adrian; McLaughlin, Kristin

2013-12-01

This study aimed to explore psychosocial factors contributing to the development of functional voice disorders (FVD) and those differentiating between organic voice disorders (OVD) and a non-voice-disordered control group. A case-control study was undertaken of 194 women aged 18-80 years diagnosed with FVD (n = 73), OVD (n = 55), and controls (n = 66). FVD women were allocated into psychogenic voice disorder (PVD) (n = 37) and muscle tension voice disorder (MTVD) (n = 36) for sub-group analysis. Dependent variables included biographical and voice assessment data, the number and severity of life events and difficulties and conflict over speaking out (COSO) situations derived from the Life Events and Difficulties Schedule (LEDS), and psychological traits including emotional expressiveness scales. Four psychosocial components differentiated between the FVD and control group accounting for 84.9% of the variance: severe events, moderate events, severe COSO, and mild COSO difficulties. Severe events, severe and mild COSO difficulties differentiated between FVD and OVD groups, accounting for 80.5% of the variance. Moderate events differentiated between PVD and MTVD sub-groups, accounting for 58.9% of the variance. Psychological traits did not differentiate between groups. Stressful life events and COSO situations best differentiated FVD from OVD and control groups. More refined aetiological studies are needed to differentiate between PVD and MTVD.
Voice recognition through phonetic features with Punjabi utterances

Science.gov (United States)

Kaur, Jasdeep; Juglan, K. C.; Sharma, Vishal; Upadhyay, R. K.

2017-07-01

This paper deals with perception and disorders of speech in view of Punjabi language. Visualizing the importance of voice identification, various parameters of speaker identification has been studied. The speech material was recorded with a tape recorder in their normal and disguised mode of utterances. Out of the recorded speech materials, the utterances free from noise, etc were selected for their auditory and acoustic spectrographic analysis. The comparison of normal and disguised speech of seven subjects is reported. The fundamental frequency (F0) at similar places, Plosive duration at certain phoneme, Amplitude ratio (A1:A2) etc. were compared in normal and disguised speech. It was found that the formant frequency of normal and disguised speech remains almost similar only if it is compared at the position of same vowel quality and quantity. If the vowel is more closed or more open in the disguised utterance the formant frequency will be changed in comparison to normal utterance. The ratio of the amplitude (A1: A2) is found to be speaker dependent. It remains unchanged in the disguised utterance. However, this value may shift in disguised utterance if cross sectioning is not done at the same location.
Writing with Voice

Science.gov (United States)

Kesler, Ted

2012-01-01

In this Teaching Tips article, the author argues for a dialogic conception of voice, based in the work of Mikhail Bakhtin. He demonstrates a dialogic view of voice in action, using two writing examples about the same topic from his daughter, a fifth-grade student. He then provides five practical tips for teaching a dialogic conception of voice in…
Analysis of enhanced modal damping ratio in porous materials using an acoustic-structure interaction model

DEFF Research Database (Denmark)

Kook, Junghwan; Jensen, Jakob Søndergaard

2014-01-01

The aim of this paper is to investigate the enhancement of the damping ratio of a structure with embedded microbeam resonators in air-filled internal cavities. In this context, we discuss theoretical aspects in the framework of the effective modal damping ratio (MDR) and derive an approximate...... relation expressing how an increased damping due to the acoustic medium surrounding the microbeam affect the MDR of the macrobeam. We further analyze the effect of including dissipation of the acoustic medium by using finite element (FE) analysis with acoustic-structure interaction (ASI) using a simple...... phenomenological acoustic loss model. An eigenvalue analysis is carried out to demonstrate the improvement of the damping characteristic of the macrobeam with the resonating microbeam in the lossy air and the results are compared to a forced vibration analysis for a macrobeam with one or multiple embedded...
High transmission acoustic focusing by impedance-matched acoustic meta-surfaces

KAUST Repository

Al Jahdali, Rasha

2016-01-19

Impedance is an important issue in the design of acoustic lenses because mismatched impedance is detrimental to real focusing applications. Here, we report two designs of acoustic lenses that focus acoustic waves in water and air, respectively. They are tailored by acoustic meta-surfaces, which are rigid thin plates decorated with periodically distributed sub-wavelength slits. Their respective building blocks are constructed from the coiling-up spaces in water and the layered structures in air. Analytic analysis based on coupled-mode theory and transfer matrix reveals that the impedances of the lenses are matched to those of the background media. With these impedance-matched acoustic lenses, we demonstrate the acoustic focusing effect by finite-element simulations.
High transmission acoustic focusing by impedance-matched acoustic meta-surfaces

KAUST Repository

Al Jahdali, Rasha; Wu, Ying

2016-01-01

Impedance is an important issue in the design of acoustic lenses because mismatched impedance is detrimental to real focusing applications. Here, we report two designs of acoustic lenses that focus acoustic waves in water and air, respectively. They are tailored by acoustic meta-surfaces, which are rigid thin plates decorated with periodically distributed sub-wavelength slits. Their respective building blocks are constructed from the coiling-up spaces in water and the layered structures in air. Analytic analysis based on coupled-mode theory and transfer matrix reveals that the impedances of the lenses are matched to those of the background media. With these impedance-matched acoustic lenses, we demonstrate the acoustic focusing effect by finite-element simulations.
Statistical analysis of acoustic characteristics of Tibetan Lhasa dialect speech emotion

Directory of Open Access Journals (Sweden)

Guo Dandan

2016-01-01

Full Text Available The paper makes a quantitative analysis and comparison on the continuous speech emotion of Lhasa Tibetan in the four basic emotional patterns (happy, surprise, sad, neutral pitch, energy and time length by experimental phonetics and the linear statistical research methods, found that there is a positive correlation between the Lhasa Tibetan emotional speech and pitch, energy and duration, etc. And the pitch, energy and duration of negative emotion acoustic parameters are bigger than positive emotion, on this basis, drawing the Lhasa Tibetan speech emotion acoustic feature patterns. Compared with the Chinese language and the Tibetan, even though both have the tone prosodic features, they also have significant differences in the acoustic characteristics of the speech emotion.
Using acoustic analysis to presort warp-prone ponderosa pine 2 by 4s before kiln-drying

Science.gov (United States)

Xiping Wang; William T. Simpson

2006-01-01

This study evaluated the potential of acoustic analysis as presorting criteria to identify warp-prone boards before kiln-drying. Dimension lumber, 38 by 89 mm (nominal 2 by 4 in.) and 2.44 m (8 it) long, sawn from open-grown small-diameter ponderosa pine trees, was acoustically tested lengthwise at green condition. Three acoustic properties (acoustic speed, rate of...
Stated product formulation preferences for HIV pre-exposure prophylaxis among women in the VOICE-D (MTN-003D) study.

Science.gov (United States)

Luecke, Ellen H; Cheng, Helen; Woeber, Kubashni; Nakyanzi, Teopista; Mudekunye-Mahaka, Imelda C; van der Straten, Ariane

2016-01-01

The effectiveness of HIV pre-exposure prophylaxis (PrEP) requires consistent and correct product use, thus a deeper understanding of women's stated product formulation preferences, and the correlates of those preferences, can help guide future research. VOICE-D (MTN-003D), a qualitative ancillary study conducted after the VOICE trial, retrospectively explored participants' tablet and gel use, as well as their preferences for other potential PrEP product formulations. We conducted an analysis of quantitative and qualitative data from VOICE-D participants. During in-depth interviews, women were presented with pictures and descriptions of eight potential PrEP product formulations, including the oral tablet and vaginal gel tested in VOICE, and asked to discuss which product formulations they would prefer to use and why. Seven of the original product formulations displayed were combined into preferred product formulation categories based on exploratory factor and latent class analyses. We examined demographic and behavioural correlates of these preferred product formulation categories. In-depth interviews with participants were conducted, coded, and analysed for themes related to product preference. Of the 68 female participants who completed in-depth interviews (22 South Africa, 24 Zimbabwe, 22 Uganda), median age was 28 (range 21-41), 81% were HIV negative, and 49% were married or living with a partner. Four preferred product formulation categories were identified via exploratory factor analysis: 1) oral tablets; 2) vaginal gel; 3) injectable, implant, or vaginal ring; and 4) vaginal film or suppository. A majority of women (81%) expressed a preference for product formulations included in category 3. Characteristics significantly associated with each preferred product category differed. Attributes described by participants as being important in a preferred product formulation included duration of activity, ease of use, route of administration, clinic- versus self
"Ring" in the solo child singing voice.

Science.gov (United States)

Howard, David M; Williams, Jenevora; Herbst, Christian T

2014-03-01

Listeners often describe the voices of solo child singers as being "pure" or "clear"; these terms would suggest that the voice is not only pleasant but also clearly audible. The audibility or clarity could be attributed to the presence of high-frequency partials in the sound: a "brightness" or "ring." This article aims to investigate spectrally the acoustic nature of this ring phenomenon in children's solo voices, and in particular, relating it to their "nonring" production. Additionally, this is set in the context of establishing to what extent, if any, the spectral characteristics of ring are shared with those of the singer's formant cluster associated with professional adult opera singers in the 2.5-3.5kHz region. A group of child solo singers, acknowledged as outstanding by a singing teacher who specializes in teaching professional child singers, were recorded in a major UK concert hall performing Come unto him, all ye that labour, from the aria He shall feed his flock from The Messiah by GF Handel. Their singing was accompanied by a recording of a piano played through in-ear headphones. Sound pressure recordings were made from well within the critical distance in the hall. The singers were observed to produce notes with and without ring, and these recordings were analyzed in the frequency domain to investigate their spectra. The results indicate that there is evidence to suggest that ring in child solo singers is carried in two areas of the output spectrum: first in the singer's formant cluster region, centered around 4kHz, which is more than 1000Hz higher than what is observed in adults; and second in the region around 7.5-11kHz where a significant strengthening of harmonic presence is observed. A perceptual test has been carried out demonstrating that 94% of 62 listeners label a synthesized version of the calculated overall average ring spectrum for all subjects as having ring when compared with a synthesized version of the calculated overall average nonring

Intra-oral pressure-based voicing control of electrolaryngeal speech with intra-oral vibrator.

Science.gov (United States)

Takahashi, Hirokazu; Nakao, Masayuki; Kikuchi, Yataro; Kaga, Kimitaka

2008-07-01

In normal speech, coordinated activities of intrinsic laryngeal muscles suspend a glottal sound at utterance of voiceless consonants, automatically realizing a voicing control. In electrolaryngeal speech, however, the lack of voicing control is one of the causes of unclear voice, voiceless consonants tending to be misheard as the corresponding voiced consonants. In the present work, we developed an intra-oral vibrator with an intra-oral pressure sensor that detected utterance of voiceless phonemes during the intra-oral electrolaryngeal speech, and demonstrated that an intra-oral pressure-based voicing control could improve the intelligibility of the speech. The test voices were obtained from one electrolaryngeal speaker and one normal speaker. We first investigated on the speech analysis software how a voice onset time (VOT) and first formant (F1) transition of the test consonant-vowel syllables contributed to voiceless/voiced contrasts, and developed an adequate voicing control strategy. We then compared the intelligibility of consonant-vowel syllables among the intra-oral electrolaryngeal speech with and without online voicing control. The increase of intra-oral pressure, typically with a peak ranging from 10 to 50 gf/cm2, could reliably identify utterance of voiceless consonants. The speech analysis and intelligibility test then demonstrated that a short VOT caused the misidentification of the voiced consonants due to a clear F1 transition. Finally, taking these results together, the online voicing control, which suspended the prosthetic tone while the intra-oral pressure exceeded 2.5 gf/cm2 and during the 35 milliseconds that followed, proved efficient to improve the voiceless/voiced contrast.
A pneumatic Bionic Voice prosthesis-Pre-clinical trials of controlling the voice onset and offset.

Science.gov (United States)

Ahmadi, Farzaneh; Noorian, Farzad; Novakovic, Daniel; van Schaik, André

2018-01-01

Despite emergent progress in many fields of bionics, a functional Bionic Voice prosthesis for laryngectomy patients (larynx amputees) has not yet been achieved, leading to a lifetime of vocal disability for these patients. This study introduces a novel framework of Pneumatic Bionic Voice Prostheses as an electronic adaptation of the Pneumatic Artificial Larynx (PAL) device. The PAL is a non-invasive mechanical voice source, driven exclusively by respiration with an exceptionally high voice quality, comparable to the existing gold standard of Tracheoesophageal (TE) voice prosthesis. Following PAL design closely as the reference, Pneumatic Bionic Voice Prostheses seem to have a strong potential to substitute the existing gold standard by generating a similar voice quality while remaining non-invasive and non-surgical. This paper designs the first Pneumatic Bionic Voice prosthesis and evaluates its onset and offset control against the PAL device through pre-clinical trials on one laryngectomy patient. The evaluation on a database of more than five hours of continuous/isolated speech recordings shows a close match between the onset/offset control of the Pneumatic Bionic Voice and the PAL with an accuracy of 98.45 ±0.54%. When implemented in real-time, the Pneumatic Bionic Voice prosthesis controller has an average onset/offset delay of 10 milliseconds compared to the PAL. Hence it addresses a major disadvantage of previous electronic voice prostheses, including myoelectric Bionic Voice, in meeting the short time-frames of controlling the onset/offset of the voice in continuous speech.
Analysis of acoustic data from the PFR SGU condition monitor

International Nuclear Information System (INIS)

Rowley, R.; Airey, J.

1990-01-01

This paper gives an outline description of an acoustic monitoring system which has been installed on the SGU of the Prototype Fast Reactor (PFR) at Dounreay with the objective of giving early warning of any change in noise output which could be related to potentially damaging vibrations within the units. Data obtained from this PFR monitoring system is playing an important part in the development of acoustic instrumentation for leak detection although this had not been the primary objective of this particular installation. The PFR has three secondary circuits each containing an evaporator, a superheater and a reheater giving a total of nine SGUs. Although the design of the units is different from that intended for EFR, the measurements provide a valuable source of information on the character and amplitude of acoustic background noise in operational steam generator units. The vibration monitoring system uses the waveguides originally installed during reactor commissioning for leak detection studies. Twelve acoustic waveguides are fitted to the shell of each of the units. The superheaters and reheaters have three waveguides at each of four axial levels, while the evaporators have four waveguides at each of three axial levels. In addition the evaporators have a small number of waveguides attached to the top flange of the unit. Each waveguide is fitted with an accelerometer to record the acoustic signal from the SGU. Tape recordings of the acoustic noise from each unit are made on a regular basis and the tapes analysed on an automated analysis system which has been developed to extract and store in a database about 20 characteristic features from the data. The paper gives examples of the background noise from the SGU. The data demonstrates the use of location techniques to identify prominent acoustic source. 8 figs
Status reports on the development and application of acoustic emission analysis. Proceedings; Statusberichte zur Entwicklung und Anwendung der Schallemissionsanalyse. Beitraege

Energy Technology Data Exchange (ETDEWEB)

NONE

2009-07-01

This proceedings-CD comprises 20 papers presented at the 17. Kolloquium Schallemission (Acoustic Emission Colloquium) at Bad Schandau. The following subjects were discussed: 1. Acoustic emission analysis of tensile tests on standard test specimens of different wood materials; 2. Application of pattern recognition methods for damage analyses of fibre-reinforced plastics; 3. Acoustic emission analysis for measuring crack growth in finned armor steel under dynamic load; 4. Acoustic emission analysis with zonal sound location: Test objects, test results and evaluation of acoustic emission signals; 5. Acoustic emission analysis in overall fatigue testing of a wind rotor blade; 6. Laboratory methods for assessing the sensitivity of acoustic emission sensors; 7. Acoustic emission analysis in burst tests of cast aluminium casings; 8. Visualization of acoustic emission localizations; 9. Threshold-independent and complete recording of characteristics and wave forms of transient and continuous acoustic emission; 10. Characterization of wide-band acoustic emission sensors; 11. Handling of large data volumes in acoustic emission analysis, a contribution to the development of algorithms; 12. Acoustic emission analysis and ultrasonic analysis for the characterization of crack networks in saline rock. One of the papers is available as a separate record in this database. [German] Diese Tagungs-CD enthaelt 20 Vortraege, die auf dem 17. Kolloquium Schallemission in Bad Schandau gehalten wurden. Die Themen waren: 1. Schallemissionsanalyse von Zugversuchen an Standardpruefkoerpern aus unterschiedlichen Holzwerkstoffen; 2. Anwendung von Mustererkennungsverfahren zur Schadensanalyse in faserverstaerkten Kunststoffen; 3. Anwendung der Schallemissionsanalyse zur Ermittlung des Risswachstums bei schwingender Beanspruchung von geripptem Bewehrungsstahl; 4. Schallemissionspruefung mit zonaler Ortung Pruefobjekte, Pruefergebnisse und Nachbewertung von Schallemissionssignalen; 5
A Robust Multimodal Bio metric Authentication Scheme with Voice and Face Recognition

International Nuclear Information System (INIS)

Kasban, H.

2017-01-01

This paper proposes a multimodal biometric scheme for human authentication based on fusion of voice and face recognition. For voice recognition, three categories of features (statistical coefficients, cepstral coefficients and voice timbre) are used and compared. The voice identification modality is carried out using Gaussian Mixture Model (GMM). For face recognition, three recognition methods (Eigenface, Linear Discriminate Analysis (LDA), and Gabor filter) are used and compared. The combination of voice and face biometrics systems into a single multimodal biometrics system is performed using features fusion and scores fusion. This study shows that the best results are obtained using all the features (cepstral coefficients, statistical coefficients and voice timbre features) for voice recognition, LDA face recognition method and scores fusion for the multimodal biometrics system
Surface Acoustic Wave Monitor for Deposition and Analysis of Ultra-Thin Films

Science.gov (United States)

Hines, Jacqueline H. (Inventor)

2015-01-01

A surface acoustic wave (SAW) based thin film deposition monitor device and system for monitoring the deposition of ultra-thin films and nanomaterials and the analysis thereof is characterized by acoustic wave device embodiments that include differential delay line device designs, and which can optionally have integral reference devices fabricated on the same substrate as the sensing device, or on a separate device in thermal contact with the film monitoring/analysis device, in order to provide inherently temperature compensated measurements. These deposition monitor and analysis devices can include inherent temperature compensation, higher sensitivity to surface interactions than quartz crystal microbalance (QCM) devices, and the ability to operate at extreme temperatures.
Tridimensional assessment of adductor spasmodic dysphonia pre- and post-treatment with Botulinum toxin.

Science.gov (United States)

Dejonckere, P H; Neumann, K J; Moerman, M B J; Martens, J P; Giordano, A; Manfredi, C

2012-04-01

Spasmodic dysphonia voices form, in the same way as substitution voices, a particular category of dysphonia that seems not suited for a standardized basic multidimensional assessment protocol, like the one proposed by the European Laryngological Society. Thirty-three exhaustive analyses were performed on voices of 19 patients diagnosed with adductor spasmodic dysphonia (SD), before and after treatment with Botulinum toxin. The speech material consisted of 40 short sentences phonetically selected for constant voicing. Seven perceptual parameters (traditional and dedicated) were blindly rated by a panel of experienced clinicians. Nine acoustic measures (mainly based on voicing evidence and periodicity) were achieved by a special analysis program suited for strongly irregular signals and validated with synthesized deviant voices. Patients also filled in a VHI-questionnaire. Significant improvement is shown by all three approaches. The traditional GRB perceptual parameters appear to be adequate for these patients. Conversely, the special acoustic analysis program is successful in objectivating the improved regularity of vocal fold vibration: the basic jitter remains the most valuable parameter, when reliably quantified. The VHI is well suited for the voice-related quality of life. Nevertheless, when considering pre-therapy and post-therapy changes, the current study illustrates a complete lack of correlation between the perceptual, acoustic, and self-assessment dimensions. Assessment of SD-voices needs to be tridimensional.
Tips for Healthy Voices

Science.gov (United States)

... prevent voice problems and maintain a healthy voice: Drink water (stay well hydrated): Keeping your body well hydrated by drinking plenty of water each day (6-8 glasses) is essential to maintaining a healthy voice. The ...
An analysis of beam parameters on proton-acoustic waves through an analytic approach.

Science.gov (United States)

Kipergil, Esra Aytac; Erkol, Hakan; Kaya, Serhat; Gulsen, Gultekin; Unlu, Mehmet Burcin

2017-06-21

It has been reported that acoustic waves are generated when a high-energy pulsed proton beam is deposited in a small volume within tissue. One possible application of proton-induced acoustics is to get real-time feedback for intra-treatment adjustments by monitoring such acoustic waves. A high spatial resolution in ultrasound imaging may reduce proton range uncertainty. Thus, it is crucial to understand the dependence of the acoustic waves on the proton beam characteristics. In this manuscript, firstly, an analytic solution for the proton-induced acoustic wave is presented to reveal the dependence of the signal on the beam parameters; then it is combined with an analytic approximation of the Bragg curve. The influence of the beam energy, pulse duration and beam diameter variation on the acoustic waveform are investigated. Further analysis is performed regarding the Fourier decomposition of the proton-acoustic signals. Our results show that the smaller spill time of the proton beam upsurges the amplitude of the acoustic wave for a constant number of protons, which is hence beneficial for dose monitoring. The increase in the energy of each individual proton in the beam leads to the spatial broadening of the Bragg curve, which also yields acoustic waves of greater amplitude. The pulse duration and the beam width of the proton beam do not affect the central frequency of the acoustic wave, but they change the amplitude of the spectral components.
Quantifying Dysphonia Severity Using a Spectralcepstral-Based Acoustic Index: Comparisons with Auditory-Perceptual Judgements from the CAPE-V

Science.gov (United States)

Awan, Shaheen N.; Roy, Nelson; JettE, Marie E.; Meltzner, Geoffrey S.; Hillman, Robert E.

2010-01-01

This study investigated the relationship between acoustic spectral/cepstral measures and listener severity ratings in normal and disordered voice samples. CAPE-V sentence samples and the vowel /[script]/were elicited from eight normal speakers and 24 patients with varying degrees of dysphonia severity. Samples were analysed for measures of the…
Perceiving a stranger's voice as being one's own: a 'rubber voice' illusion?

Directory of Open Access Journals (Sweden)

Zane Z Zheng

2011-04-01

Full Text Available We describe an illusion in which a stranger's voice, when presented as the auditory concomitant of a participant's own speech, is perceived as a modified version of their own voice. When the congruence between utterance and feedback breaks down, the illusion is also broken. Compared to a baseline condition in which participants heard their own voice as feedback, hearing a stranger's voice induced robust changes in the fundamental frequency (F0 of their production. Moreover, the shift in F0 appears to be feedback dependent, since shift patterns depended reliably on the relationship between the participant's own F0 and the stranger-voice F0. The shift in F0 was evident both when the illusion was present and after it was broken, suggesting that auditory feedback from production may be used separately for self-recognition and for vocal motor control. Our findings indicate that self-recognition of voices, like other body attributes, is malleable and context dependent.
Unfamiliar voice identification: Effect of post-event information on accuracy and voice ratings

Directory of Open Access Journals (Sweden)

Harriet Mary Jessica Smith

2014-04-01

Full Text Available This study addressed the effect of misleading post-event information (PEI on voice ratings, identification accuracy, and confidence, as well as the link between verbal recall and accuracy. Participants listened to a dialogue between male and female targets, then read misleading information about voice pitch. Participants engaged in verbal recall, rated voices on a feature checklist, and made a lineup decision. Accuracy rates were low, especially on target-absent lineups. Confidence and accuracy were unrelated, but the number of facts recalled about the voice predicted later lineup accuracy. There was a main effect of misinformation on ratings of target voice pitch, but there was no effect on identification accuracy or confidence ratings. As voice lineup evidence from earwitnesses is used in courts, the findings have potential applied relevance.
A pneumatic Bionic Voice prosthesis-Pre-clinical trials of controlling the voice onset and offset.

Directory of Open Access Journals (Sweden)

Farzaneh Ahmadi

Full Text Available Despite emergent progress in many fields of bionics, a functional Bionic Voice prosthesis for laryngectomy patients (larynx amputees has not yet been achieved, leading to a lifetime of vocal disability for these patients. This study introduces a novel framework of Pneumatic Bionic Voice Prostheses as an electronic adaptation of the Pneumatic Artificial Larynx (PAL device. The PAL is a non-invasive mechanical voice source, driven exclusively by respiration with an exceptionally high voice quality, comparable to the existing gold standard of Tracheoesophageal (TE voice prosthesis. Following PAL design closely as the reference, Pneumatic Bionic Voice Prostheses seem to have a strong potential to substitute the existing gold standard by generating a similar voice quality while remaining non-invasive and non-surgical. This paper designs the first Pneumatic Bionic Voice prosthesis and evaluates its onset and offset control against the PAL device through pre-clinical trials on one laryngectomy patient. The evaluation on a database of more than five hours of continuous/isolated speech recordings shows a close match between the onset/offset control of the Pneumatic Bionic Voice and the PAL with an accuracy of 98.45 ±0.54%. When implemented in real-time, the Pneumatic Bionic Voice prosthesis controller has an average onset/offset delay of 10 milliseconds compared to the PAL. Hence it addresses a major disadvantage of previous electronic voice prostheses, including myoelectric Bionic Voice, in meeting the short time-frames of controlling the onset/offset of the voice in continuous speech.
A pneumatic Bionic Voice prosthesis—Pre-clinical trials of controlling the voice onset and offset

Science.gov (United States)

Noorian, Farzad; Novakovic, Daniel; van Schaik, André

2018-01-01

Despite emergent progress in many fields of bionics, a functional Bionic Voice prosthesis for laryngectomy patients (larynx amputees) has not yet been achieved, leading to a lifetime of vocal disability for these patients. This study introduces a novel framework of Pneumatic Bionic Voice Prostheses as an electronic adaptation of the Pneumatic Artificial Larynx (PAL) device. The PAL is a non-invasive mechanical voice source, driven exclusively by respiration with an exceptionally high voice quality, comparable to the existing gold standard of Tracheoesophageal (TE) voice prosthesis. Following PAL design closely as the reference, Pneumatic Bionic Voice Prostheses seem to have a strong potential to substitute the existing gold standard by generating a similar voice quality while remaining non-invasive and non-surgical. This paper designs the first Pneumatic Bionic Voice prosthesis and evaluates its onset and offset control against the PAL device through pre-clinical trials on one laryngectomy patient. The evaluation on a database of more than five hours of continuous/isolated speech recordings shows a close match between the onset/offset control of the Pneumatic Bionic Voice and the PAL with an accuracy of 98.45 ±0.54%. When implemented in real-time, the Pneumatic Bionic Voice prosthesis controller has an average onset/offset delay of 10 milliseconds compared to the PAL. Hence it addresses a major disadvantage of previous electronic voice prostheses, including myoelectric Bionic Voice, in meeting the short time-frames of controlling the onset/offset of the voice in continuous speech. PMID:29466455
Comparison of voice quality in patients with GERD-related dysphonia or chronic cough.

Science.gov (United States)

Domeracka-Kołodziej, Anna; Grabczak, Elżbieta M; Dąbrowska, Marta; Arcimowicz, Magdalena; Lachowska, Magdalena; Osuch-Wójcikiewicz, Ewa; Niemczyk, Kazimierz

2014-01-01

The aim was to compare a voice quality in patients with GERD-related dysphonia or chronic cough and to determine whether there is a relationship between the main symptom reported and voice quality. 249 consecutive patients diagnosed with GERD-related chronic cough or dysphonia were involved in this retrospective study and were divided into two main groups of men and women, and furthermore into groups of chronic cough and dysphonia. Laryngeal lesions were evaluated with videolaryngostroboscopy using Reflux Finding Score. Voice quality was assessed using GRBAS scale, sonograms, and multidimensional voice program (MDVP). All subjects were found to have vocal abnormalities both in subjective and objective voice analysis. Perceptual assessment of voice (GRBAS) did not reveal any differences between analyzed groups depending on the reported symptom. In MDVP analysis, the group of women with cough as the main symptom demonstrated significantly less abnormalities in VTI value. In men with cough as their main complaint, significantly less MDVP abnormalities were found in Jita, Jitt, RAP, PPQ, and sPPQ parameters. The comparison of voice perceptual assessment in patients with GERD-related dysphonia or chronic cough revealed no differences between analyzed groups. In objective voice analysis, the latter group presented lower degree of hoarseness in Yanagihara's scale. In objective MDVP analysis, the chronic cough group presented lower degree of abnormalities only in one of the noise related parameters in females and five frequency perturbation parameters in males. Copyright © 2013 Polish Otorhinolaryngology - Head and Neck Surgery Society. Published by Elsevier Urban & Partner Sp. z.o.o. All rights reserved.
Advanced Time-Frequency Representation in Voice Signal Analysis

Directory of Open Access Journals (Sweden)

Dariusz Mika

2018-03-01

Full Text Available The most commonly used time-frequency representation of the analysis in voice signal is spectrogram. This representation belongs in general to Cohen's class, the class of time-frequency energy distributions. From the standpoint of properties of the resolution spectrogram representation is not optimal. In Cohen class representations are known which have a better resolution properties. All of them are created by smoothing the Wigner-Ville'a (WVD distribution characterized by the best resolution, however, the biggest harmful interference. Used smoothing functions decide about a compromise between the properties of resolution and eliminating harmful interference term. Another class of time-frequency energy distributions is the affine class of distributions. From the point of view of readability of analysis the best properties are known so called Redistribution of energy caused by the use of a general methodology referred to as reassignment to any time-frequency representation. Reassigned distributions efficiently combine a reduction of the interference terms provided by a well adapted smoothing kernel and an increased concentration of the signal components.
AUTHORIAL VOICE IN ISLAMIC COLLEGE ENGLISH DEPARTMENT STUDENTS’ ARGUMENTATIVE WRITING

Directory of Open Access Journals (Sweden)

Nur Afifi

2014-11-01

Full Text Available While considered elusive and abstract, authorial voice is paramount in English writing. Unfortunately, many of Indonesian EFL learners found it is highly challeging to show their voice in their writing. The importance of voice is even exaggerated in argumentative writing, since this kind of writing needs obvious stance of the writer. This study investigates the authorial voice students made in their argumentative writing. The purpose of this study is to gain the picture of students‟ writing ability especially in authorial voice to map the road in guiding the next writing classes. The object of the study is the argumentative writing made by English department students at one Indonesian State College of Islamic Studies in their writing III course. Using Hyland‟s interactional model of voice (2008 the data analysis results the authorial presence in the essays is in position 2 at 0 – 4 scale which means the reader feels somehow weak presence of the authorial voice in the essay. This result confirms the findings of some previous studies that EFL learners especially from „interdependent‟ cultural background tend to find this authorial voice difficult in writing English essay.
Vibro-Acoustic Numerical Analysis for the Chain Cover of a Car Engine

Directory of Open Access Journals (Sweden)

Enrico Armentani

2017-06-01

Full Text Available In this work, a vibro-acoustic numerical and experimental analysis was carried out for the chain cover of a low powered four-cylinder four-stroke diesel engine, belonging to the FPT (FCA Power Train family called SDE (Small Diesel Engine. By applying a methodology used in the acoustic optimization of new FPT engine components, firstly a finite element model (FEM of the engine was defined, then a vibration analysis was performed for the whole engine (modal analysis, and finally a forced response analysis was developed for the only chain cover (separated from the overall engine. The boundary conditions applied to the chain cover were the accelerations experimentally measured by accelerometers located at the points of connection among chain cover, head cover, and crankcase. Subsequently, a boundary element (BE model of the only chain cover was realized to determine the chain cover noise emission, starting from the previously calculated structural vibrations. The numerical vibro-acoustic outcomes were compared with those experimentally observed, obtaining a good correlation. All the information thus obtained allowed the identification of those critical areas, in terms of noise generation, in which to undertake necessary improvements.
Dissident Voices in Theorising Europe: Another Theory is Possible

DEFF Research Database (Denmark)

Manners, Ian James; Whitman, Richard

The paper argues that dissident voices which attempt to theorise Europe differently and advocate another European trajectory have been largely excluded and left unheard in mainstream discussions over the past decade of scholarship and analysis. Dissident voices in European Studies are those that ...... Europe, and another theory, is possible indeed probable....
ACOUSTIC SPEECH RECOGNITION FOR MARATHI LANGUAGE USING SPHINX

Directory of Open Access Journals (Sweden)

Aman Ankit

2016-09-01

Full Text Available Speech recognition or speech to text processing, is a process of recognizing human speech by the computer and converting into text. In speech recognition, transcripts are created by taking recordings of speech as audio and their text transcriptions. Speech based applications which include Natural Language Processing (NLP techniques are popular and an active area of research. Input to such applications is in natural language and output is obtained in natural language. Speech recognition mostly revolves around three approaches namely Acoustic phonetic approach, Pattern recognition approach and Artificial intelligence approach. Creation of acoustic model requires a large database of speech and training algorithms. The output of an ASR system is recognition and translation of spoken language into text by computers and computerized devices. ASR today finds enormous application in tasks that require human machine interfaces like, voice dialing, and etc. Our key contribution in this paper is to create corpora for Marathi language and explore the use of Sphinx engine for automatic speech recognition

Influence of phonetic context on the dysphonic event: contribution of new methodologies for the analysis of pathological voice.

Science.gov (United States)

Revis, J; Galant, C; Fredouille, C; Ghio, A; Giovanni, A

2012-01-01

Widely studied in terms of perception, acoustics or aerodynamics, dysphonia stays nevertheless a speech phenomenon, closely related to the phonetic composition of the message conveyed by the voice. In this paper, we present a series of three works with the aim to understand the implications of the phonetic manifestation of dysphonia. Our first study proposes a new approach to the perceptual analysis of dysphonia (the phonetic labeling), which principle is to listen and evaluate each phoneme in a sentence separately. This study confirms the hypothesis of Laver that the dysphonia is not a constant noise added to the speech signal, but a discontinuous phenomenon, occurring on certain phonemes, based on the phonetic context. However, the burden of executing the task has led us to look to the techniques of automatic speaker recognition (ASR) to automate the procedure. With the collaboration of the LIA, we have developed a system for automatic classification of dysphonia from the techniques of ASR. This is the subject of our second study. The first results obtained with this system suggest that the unvoiced consonants show predominant performance in the task of automatic classification of dysphonia. This result is surprising since it is often assumed that dysphonia occurs only on laryngeal vibration. We started looking for explanations of this phenomenon and we present our assumptions and experiences in the third work we present.
Voice- and swallow-related quality of life in idiopathic Parkinson's disease.

Science.gov (United States)

van Hooren, Michel R A; Baijens, Laura W J; Vos, Rein; Pilz, Walmari; Kuijpers, Laura M F; Kremer, Bernd; Michou, Emilia

2016-02-01

This study explores whether changes in voice- and swallow-related QoL are associated with progression of idiopathic Parkinson's disease (IPD). Furthermore, it examines the relationship between patients' perception of both voice and swallowing disorders in IPD. Prospective clinical study, quality of life (QoL). One-hundred mentally competent IPD patients with voice and swallowing complaints were asked to answer four QoL questionnaires (Voice Handicap Index, MD Anderson Dysphagia Inventory, Visual Analog Scale [VAS] voice, and Dysphagia Severity Scale [DSS]). Differences in means for the QoL questionnaires and their subscales within Hoehn and Yahr stage groups were calculated using one-way analysis of variance. The relationship between voice- and swallow-related QoL questionnaires was determined with the Spearman correlation coefficient. Scores on both voice and swallow questionnaires suggest an overall decrease in QoL with progression of IPD. A plateau in QoL for VAS voice and the DSS was seen in the early Hoehn and Yahr stages. Finally, scores on voice-related QoL questionnaires were significantly correlated with swallow-related QoL outcomes. Voice- and swallow-related QoL decreases with progression of IPD. A significant association was found between voice- and swallow-related QoL questionnaires. Healthcare professionals can benefit from voice- and swallow-related QoL questionnaires in a multidimensional voice- or swallow-assessment protocol. The patient's perception of his/her voice and swallowing disorders and its impact on QoL in IPD should not be disregarded. 2b. © 2015 The American Laryngological, Rhinological and Otological Society, Inc.
Voice-activated intelligent radiologic image display

International Nuclear Information System (INIS)

Fisher, P.

1989-01-01

The authors present a computer-based expert computer system called Mammo-Icon, which automatically assists the radiologist's case analysis by reviewing the trigger phrase output of a commercially available voice transcription system in he domain of mammography. A commercially available PC-based voice dictation system is coupled to an expert system implemented on a microcomputer. Software employs the LISP and C computer languages. Mammo-Icon responds to the trigger phrase output of a voice dictation system with a textual discussion of the potential significance of the findings that have been described and a display of reference images that may help the radiologist to confirm a suspected diagnosis or consider additional diagnoses. This results in automatic availability of potentially useful computer-based expert advice, making such systems much more likely to be used in routine clinical practice
Emotional state and its impact on voice authentication accuracy

Science.gov (United States)

Voznak, Miroslav; Partila, Pavol; Penhaker, Marek; Peterek, Tomas; Tomala, Karel; Rezac, Filip; Safarik, Jakub

2013-05-01

The paper deals with the increasing accuracy of voice authentication methods. The developed algorithm first extracts segmental parameters, such as Zero Crossing Rate, the Fundamental Frequency and Mel-frequency cepstral coefficients from voice. Based on these parameters, the neural network classifier detects the speaker's emotional state. These parameters shape the distribution of neurons in Kohonen maps, forming clusters of neurons on the map characterizing a particular emotional state. Using regression analysis, we can calculate the function of the parameters of individual emotional states. This relationship increases voice authentication accuracy and prevents unjust rejection.
METHODS FOR QUALITY ENHANCEMENT OF USER VOICE SIGNAL IN VOICE AUTHENTICATION SYSTEMS

Directory of Open Access Journals (Sweden)

O. N. Faizulaieva

2014-03-01

Full Text Available The reasonability for the usage of computer systems user voice in the authentication process is proved. The scientific task for improving the signal/noise ratio of the user voice signal in the authentication system is considered. The object of study is the process of input and output of the voice signal of authentication system user in computer systems and networks. Methods and means for input and extraction of voice signal against external interference signals are researched. Methods for quality enhancement of user voice signal in voice authentication systems are suggested. As modern computer facilities, including mobile ones, have two-channel audio card, the usage of two microphones is proposed in the voice signal input system of authentication system. Meanwhile, the task of forming a lobe of microphone array in a desired area of voice signal registration (100 Hz to 8 kHz is solved. The usage of directional properties of the proposed microphone array gives the possibility to have the influence of external interference signals two or three times less in the frequency range from 4 to 8 kHz. The possibilities for implementation of space-time processing of the recorded signals using constant and adaptive weighting factors are investigated. The simulation results of the proposed system for input and extraction of signals during digital processing of narrowband signals are presented. The proposed solutions make it possible to improve the value of the signal/noise ratio of the useful signals recorded up to 10, ..., 20 dB under the influence of external interference signals in the frequency range from 4 to 8 kHz. The results may be useful to specialists working in the field of voice recognition and speaker’s discrimination.
Bottom-up influences of voice continuity in focusing selective auditory attention.

Science.gov (United States)

Bressler, Scott; Masud, Salwa; Bharadwaj, Hari; Shinn-Cunningham, Barbara

2014-01-01

Selective auditory attention causes a relative enhancement of the neural representation of important information and suppression of the neural representation of distracting sound, which enables a listener to analyze and interpret information of interest. Some studies suggest that in both vision and in audition, the "unit" on which attention operates is an object: an estimate of the information coming from a particular external source out in the world. In this view, which object ends up in the attentional foreground depends on the interplay of top-down, volitional attention and stimulus-driven, involuntary attention. Here, we test the idea that auditory attention is object based by exploring whether continuity of a non-spatial feature (talker identity, a feature that helps acoustic elements bind into one perceptual object) also influences selective attention performance. In Experiment 1, we show that perceptual continuity of target talker voice helps listeners report a sequence of spoken target digits embedded in competing reversed digits spoken by different talkers. In Experiment 2, we provide evidence that this benefit of voice continuity is obligatory and automatic, as if voice continuity biases listeners by making it easier to focus on a subsequent target digit when it is perceptually linked to what was already in the attentional foreground. Our results support the idea that feature continuity enhances streaming automatically, thereby influencing the dynamic processes that allow listeners to successfully attend to objects through time in the cacophony that assails our ears in many everyday settings.
Vibro-acoustics

CERN Document Server

Nilsson, Anders

2015-01-01

This three-volume book gives a thorough and comprehensive presentation of vibration and acoustic theories. Different from traditional textbooks which typically deal with some aspects of either acoustic or vibration problems, it is unique of this book to combine those two correlated subjects together. Moreover, it provides fundamental analysis and mathematical descriptions for several crucial phenomena of Vibro-Acoustics which are quite useful in noise reduction, including how structures are excited, energy flows from an excitation point to a sound radiating surface, and finally how a structure radiates noise to a surrounding fluid. Many measurement results included in the text make the reading interesting and informative. Problems/questions are listed at the end of each chapter and the solutions are provided. This will help the readers to understand the topics of Vibro-Acoustics more deeply. The book should be of interest to anyone interested in sound and vibration, vehicle acoustics, ship acoustics and inter...
Airborne chemistry: acoustic levitation in chemical analysis.

Science.gov (United States)

Santesson, Sabina; Nilsson, Staffan

2004-04-01

This review with 60 references describes a unique path to miniaturisation, that is, the use of acoustic levitation in analytical and bioanalytical chemistry applications. Levitation of small volumes of sample by means of a levitation technique can be used as a way to avoid solid walls around the sample, thus circumventing the main problem of miniaturisation, the unfavourable surface-to-volume ratio. Different techniques for sample levitation have been developed and improved. Of the levitation techniques described, acoustic or ultrasonic levitation fulfils all requirements for analytical chemistry applications. This technique has previously been used to study properties of molten materials and the equilibrium shape()and stability of liquid drops. Temperature and mass transfer in levitated drops have also been described, as have crystallisation and microgravity applications. The airborne analytical system described here is equipped with different and exchangeable remote detection systems. The levitated drops are normally in the 100 nL-2 microL volume range and additions to the levitated drop can be made in the pL-volume range. The use of levitated drops in analytical and bioanalytical chemistry offers several benefits. Several remote detection systems are compatible with acoustic levitation, including fluorescence imaging detection, right angle light scattering, Raman spectroscopy, and X-ray diffraction. Applications include liquid/liquid extractions, solvent exchange, analyte enrichment, single-cell analysis, cell-cell communication studies, precipitation screening of proteins to establish nucleation conditions, and crystallisation of proteins and pharmaceuticals.
Enlargement of the supraglottal cavity and its relation to stop consonant voicing.

Science.gov (United States)

Westbury, J R

1983-04-01

Measurements were made of saggital plane movements of the larynx, soft palate, and portions of the tongue, from a high-speed cinefluorographic film of utterances produced by one adult male speaker of American English. These measures were then used to approximate the temporal variations in supraglottal cavity volume during the closures of voiced and voiceless stop consonants. All data were subsequently related to a synchronous acoustic recording of the utterances. Instances of /p,t,k/ were always accompanied by silent closures, and sometimes accompanied by decreases in supraglottal volume. In contrast, instances of /b,d,g/ were always accompanied both by significant intervals of vocal fold vibration during closure, and relatively large increases in supraglottal volume. However, the magnitudes of volume increments during the voiced stops, and the means by which those increments were achieved, differed considerably across place of articulation and phonetic environment. These results are discussed in the context of a well-known model of the breath-stream control mechanism, and their relevance for a general theory of speech motor control is considered.
Speakers’ comfort and voice level variation in classrooms: Laboratory research

DEFF Research Database (Denmark)

Pelegrin Garcia, David; Brunskog, Jonas

2012-01-01

from 0.93 dB/dB, with free speech, to 0.1 dB/dB with other less demanding communication tasks as reading and talking at short distances. The room effect for some individuals can be as strong as 1.7 dB/dB. A questionnaire investigation showed that the acoustic comfort for talking in classrooms......, in the absence of background noise, is correlated to the decay times derived from an impulse response measured from the mouth to the ears of a talker, and that there is a maximum of preference for decay times between 0.4 and 0.5 s. Teachers with self-reported voice problems prefer higher decay times to speak...
Understanding the mechanisms of familiar voice-identity recognition in the human brain.

Science.gov (United States)

Maguinness, Corrina; Roswandowitz, Claudia; von Kriegstein, Katharina

2018-03-31

Humans have a remarkable skill for voice-identity recognition: most of us can remember many voices that surround us as 'unique'. In this review, we explore the computational and neural mechanisms which may support our ability to represent and recognise a unique voice-identity. We examine the functional architecture of voice-sensitive regions in the superior temporal gyrus/sulcus, and bring together findings on how these regions may interact with each other, and additional face-sensitive regions, to support voice-identity processing. We also contrast findings from studies on neurotypicals and clinical populations which have examined the processing of familiar and unfamiliar voices. Taken together, the findings suggest that representations of familiar and unfamiliar voices might dissociate in the human brain. Such an observation does not fit well with current models for voice-identity processing, which by-and-large assume a common sequential analysis of the incoming voice signal, regardless of voice familiarity. We provide a revised audio-visual integrative model of voice-identity processing which brings together traditional and prototype models of identity processing. This revised model includes a mechanism of how voice-identity representations are established and provides a novel framework for understanding and examining the potential differences in familiar and unfamiliar voice processing in the human brain. Copyright © 2018 Elsevier Ltd. All rights reserved.
Objective voice parameters in Colombian school workers with healthy voices

NARCIS (Netherlands)

L.C. Cantor Cutiva (Lady Catherine); A. Burdorf (Alex)

2015-01-01

textabstractObjectives: To characterize the objective voice parameters among school workers, and to identify associated factors of three objective voice parameters, namely fundamental frequency, sound pressure level and maximum phonation time. Materials and methods: We conducted a cross-sectional
Analysis of acoustic emission during abrasive waterjet machining of sheet metals

Science.gov (United States)

Mokhtar, Nazrin; Gebremariam, MA; Zohari, H.; Azhari, Azmir

2018-04-01

The present paper reports on the analysis of acoustic emission (AE) produced during abrasive waterjet (AWJ) machining process. This paper focuses on the relationship of AE and surface quality of sheet metals. The changes in acoustic emission signals recorded by the mean of power spectral density (PSD) via covariance method in relation to the surface quality of the cut are discussed. The test was made using two materials for comparison namely aluminium 6061 and stainless steel 304 with five different feed rates. The acoustic emission data were captured by Labview and later processed using MATLAB software. The results show that the AE spectrums correlated with different feed rates and surface qualities. It can be concluded that the AE is capable of monitoring the changes of feed rate and surface quality.
Internal stress analysis by acoustic polarimetry

International Nuclear Information System (INIS)

Rouge, Jean; Robert, Andre

The associated improvements of acoustics and electronics allow the field of applications relative to the ultrasonic methods to be extended to the non destructive control of materials and structures. Thus, the acoustical polarimetry is a new method allowing the measurement in orientation and intensity of residual or induced internal stresses in metals or other materials [fr
Real-time adaptive concepts in acoustics blind signal separation and multichannel echo cancellation

CERN Document Server

Schobben, Daniel W E

2001-01-01

Blind Signal Separation (BSS) deals with recovering (filtered versions of) source signals from an observed mixture thereof. The term `blind' relates to the fact that there are no reference signals for the source signals and also that the mixing system is unknown. This book presents a new method for blind signal separation, which is developed to work on microphone signals. Acoustic Echo Cancellation (AEC) is a well-known technique to suppress the echo that a microphone picks up from a loudspeaker in the same room. Such acoustic feedback occurs for example in hands-free telephony and can lead to a perceived loud tone. For an application such as a voice-controlled television, a stereo AEC is required to suppress the contribution of the stereo loudspeaker setup. A generalized AEC is presented that is suited for multi-channel operation. New algorithms for Blind Signal Separation and multi-channel Acoustic Echo Cancellation are presented. A background is given in array signal processing methods, adaptive filter the...
Voice, stress, work and quality of life of soccer coaches and physical trainers.

Science.gov (United States)

Penteado, Regina Zanella; Silva, Noelle Bernardi da; Montebello, Maria Imaculada de Lima

2015-01-01

To assess aspects related to work, stress and quality of life related to voice in soccer coaches (C) and physical trainers (T), comparing the categories. Qualitative and quantitative studies with 13 C and 13 T of teams competing in Phase One of the highest level (Série A ) of the 2012 Campeonato Paulista (São Paulo State Soccer Championship). The questions were open ended and related to complaints, difficulties, and/or problems regarding voice use during work and to the relations between voice, work, stress, and quality of life. Stress at work was analyzed by the Job Stress Scale (JSS) questionnaire. The perception of the impact of the voice on quality of life was evaluated by the Voice-Related Quality of Life (V-RQOL) protocol. The answers to the questions were transcribed and submitted to content analysis, and regarding the questionnaire, descriptive data and analytical statistics were used. Content analysis showed lack of preparation for voice care; voice complaints; and intense vocal use demand under stressful work, in addition to the absence of healthy habits and social/family support. The JSS dimensions showed that the Active Work situation and the high V-RQOL scores are compatible with vocal health without complaints. There were no statistical differences between the categories. Both categories reported complaints/problems linked to professional voice use and stressful workload. However, the perception of vocal impact on the quality of life was positive, and the analysis of stress at work resulted in "good" and favorable conditions. The relationship between voice, work, stress, and quality of life in both the categories require further investigations.
Pedagogic Voice: Student Voice in Teaching and Engagement Pedagogies

Science.gov (United States)

Baroutsis, Aspa; McGregor, Glenda; Mills, Martin

2016-01-01

In this paper, we are concerned with the notion of "pedagogic voice" as it relates to the presence of student "voice" in teaching, learning and curriculum matters at an alternative, or second chance, school in Australia. This school draws upon many of the principles of democratic schooling via its utilisation of student voice…
The role of the medial temporal limbic system in processing emotions in voice and music.

Science.gov (United States)

Frühholz, Sascha; Trost, Wiebke; Grandjean, Didier

2014-12-01

Subcortical brain structures of the limbic system, such as the amygdala, are thought to decode the emotional value of sensory information. Recent neuroimaging studies, as well as lesion studies in patients, have shown that the amygdala is sensitive to emotions in voice and music. Similarly, the hippocampus, another part of the temporal limbic system (TLS), is responsive to vocal and musical emotions, but its specific roles in emotional processing from music and especially from voices have been largely neglected. Here we review recent research on vocal and musical emotions, and outline commonalities and differences in the neural processing of emotions in the TLS in terms of emotional valence, emotional intensity and arousal, as well as in terms of acoustic and structural features of voices and music. We summarize the findings in a neural framework including several subcortical and cortical functional pathways between the auditory system and the TLS. This framework proposes that some vocal expressions might already receive a fast emotional evaluation via a subcortical pathway to the amygdala, whereas cortical pathways to the TLS are thought to be equally used for vocal and musical emotions. While the amygdala might be specifically involved in a coarse decoding of the emotional value of voices and music, the hippocampus might process more complex vocal and musical emotions, and might have an important role especially for the decoding of musical emotions by providing memory-based and contextual associations. Copyright © 2014 Elsevier Ltd. All rights reserved.
Brain Maturation, Cognition and Voice Pattern in a Gender Dysphoria Case under Pubertal Suppression.

Science.gov (United States)

Schneider, Maiko A; Spritzer, Poli M; Soll, Bianca Machado Borba; Fontanari, Anna M V; Carneiro, Marina; Tovar-Moll, Fernanda; Costa, Angelo B; da Silva, Dhiordan C; Schwarz, Karine; Anes, Maurício; Tramontina, Silza; Lobato, Maria I R

2017-01-01

Introduction: Gender dysphoria (GD) (DMS-5) is a condition marked by increasing psychological suffering that accompanies the incongruence between one's experienced or expressed gender and one's assigned gender. Manifestation of GD can be seen early on during childhood and adolescence. During this period, the development of undesirable sexual characteristics marks an acute suffering of being opposite to the sex of birth. Pubertal suppression with gonadotropin releasing hormone analogs (GnRHa) has been proposed for these individuals as a reversible treatment for postponing the pubertal development and attenuating psychological suffering. Recently, increased interest has been observed on the impact of this treatment on brain maturation, cognition and psychological performance. Objectives: The aim of this clinical report is to review the effects of puberty suppression on the brain white matter (WM) during adolescence. WM Fractional anisotropy, voice and cognitive functions were assessed before and during the treatment. MRI scans were acquired before, and after 22 and 28 months of hormonal suppression. Methods: We performed a longitudinal evaluation of a pubertal transgender girl undergoing hormonal treatment with GnRH analog. Three longitudinal magnetic resonance imaging (MRI) scans were performed for diffusion tensor imaging (DTI), regarding Fractional Anisotropy (FA) for regions of interest analysis. In parallel, voice samples for acoustic analysis as well as executive functioning with the Wechsler Intelligence Scale (WISC-IV) were performed. Results: During the follow-up, white matter fractional anisotropy did not increase, compared to normal male puberty effects on the brain. After 22 months of pubertal suppression, operational memory dropped 9 points and remained stable after 28 months of follow-up. The fundamental frequency of voice varied during the first year; however, it remained in the female range. Conclusion: Brain white matter fractional anisotropy
Voice Savers for Music Teachers

Science.gov (United States)

Cookman, Starr

2012-01-01

Music teachers are in a class all their own when it comes to voice use. These elite vocal athletes require stamina, strength, and flexibility from their voices day in, day out for hours at a time. Voice rehabilitation clinics and research show that music education ranks high among the professionals most commonly affected by voice problems.…

Classroom acoustics design for speakers’ comfort and speech intelligibility: a European perspective

DEFF Research Database (Denmark)

Garcia, David Pelegrin; Rasmussen, Birgit; Brunskog, Jonas

2014-01-01

. The recommended values of reverberation time in fully occupied classrooms for exible teaching methods are between 0.45 s and 0.6 s (between 0.6 and 0.7 s in an unoccupied but furnished condition) for classrooms with less than 40 students and volumes below 210 m 3 . When designing larger classrooms, a dedicated......Current European regulatory requirements or guidelines for reverberation time in classrooms have the goal of enhancing speech intelligibility for students and reducing noise levels in classrooms. At the same time, school teachers suffer frequently from voice problems due to high vocal load...... intelligibility for students. Two room acoustic parameters are shown relevant for a speaker: the voice support, linked to vocal effort, and the decay time derived from an oral-binaural impulse response, linked to vocal comfort. Theoretical prediction models for room-averaged values of these parameters...
[Coenzyme Q10 (Q-ter) in treatment of functional voice disorders].

Science.gov (United States)

Sensini, M; Corvino, A; Passeri, L; Gallone, G O; Landolfo, V; Raimondo, L; Giordano, C

2011-01-01

Aim of this study was to evaluate the effectivness of Coenzyme Q-Ter and Vitamin A in functional voice disorders. Twenty two patients were treated with CoQ10-ter and vitamin A twice a day for ten days. A general otolaryngological/foniatric and logopedic examination were performed. Videolaringostroboscopy, GIRBAS, Voice Handicap Index questionnaire and Multi-Dimensional Voice analysis were carried out before and after treatment. In all patients an improvement was observed in almost all parameters considered after treatment. CoQ10-ter and Vitamin A risulted effective in treatment of patients with functional voice disorders (caused by vocal "malmenage" or "surmenage").
Effect of subthalamic stimulation on voice and speech in Parkinson´s disease: for the better or worse ?

Directory of Open Access Journals (Sweden)

Sabine eSkodda

2014-01-01

Full Text Available Background: Deep brain stimulation of the subthalamic nucleus, although highly effective for the treatment of motor impairment in Parkinson´s disease, can induce speech deterioration in a subgroup of patients. The aim of the current study was to survey 1 if there are distinctive stimulation effects on the different parameters of voice and speech and 2 if there is a special pattern of preexisting speech abnormalities indicating a risk for further worsening under stimulation. Methods: N = 38 patients with Parkinson´s disease had to perform a speech test without medication with stimulation ON and OFF. Speech samples were analysed: 1 according to a four-dimensional perceptual speech score and 2 by acoustic analysis to obtain quantifiable measures of distinctive speech parameters.Results: Quality of voice was ameliorated with stimulation ON, and there were trends to increased loudness and better pitch variability. N = 8 patients featured a deterioration of speech with stimulation ON, caused by worsening of articulation or/and fluency. These patients had more severe overall speech impairment with characteristic features of articulatory slurring and articulatory acceleration already under StimOFF condition.Conclusion: The influence of subthalamic stimulation on Parkinsonian speech differs considerably between individual patients, however, there is a trend to amelioration of voice quality and prosody. Patients with stimulation-associated speech deterioration featured higher overall speech impairment and showed a distinctive pattern of articulatory abnormalities at baseline. Further investigations to confirm these preliminary findings are necessary to allow neurologists to pre-surgically estimate the individual risk of deterioration of speech under stimulation.
Detecting vocal fatigue in student singers using acoustic measures of mean fundamental frequency, jitter, shimmer, and harmonics-to-noise ratio

Science.gov (United States)

Sisakun, Siphan

2000-12-01

The purpose of this study is to explore the ability of four acoustic parameters, mean fundamental frequency, jitter, shimmer, and harmonics-to-noise ratio, to detect vocal fatigue in student singers. The participants are 15 voice students, who perform two distinct tasks, data collection task and vocal fatiguing task. The data collection task includes the sustained vowel /a/, reading a standard passage, and self-rate on a vocal fatigue form. The vocal fatiguing task is the vocal practice of musical scores for a total of 45 minutes. The four acoustic parameters are extracted using the software EZVoicePlus. The data analyses are performed to answer eight research questions. The first four questions relate to correlations of the self-rating scale and each of the four parameters. The next four research questions relate to differences in the parameters over time using one-factor repeated measures analysis of variance (ANOVA). The result yields a proposed acoustic profile of vocal fatigue in student singers. This profile is characterized by increased fundamental frequency; slightly decreased jitter; slightly decreased shimmer; and slightly increased harmonics-to-noise ratio. The proposed profile requires further investigation.
CONVERSATIONS -- AND NEGOTIATED INTERACTION -- IN TEXT AND VOICE CHAT ROOMS

Directory of Open Access Journals (Sweden)

Kevin Jepson

2005-09-01

Full Text Available Despite the expanded use of the Internet for language learning and practice, little attention if any has been given to the quality of interaction among English L2 speakers in conversational text or voice chat rooms. This study explored the patterns of repair moves in synchronous non-native speaker (NNS text chat rooms in comparison to voice chat rooms on the Internet. The following questions were posed: (a Which types of repair moves occur in text and voice chats; and (b what are the differences, if any, between the repair moves in text chats and voice chats when time is held constant? Repair moves made by anonymous NNSs in 10, 5-minute, synchronous chat room sessions (5 text-chat sessions, 5 voice-chat sessions were counted and analyzed using chi-square with alpha set at .05. Significant differences were found between the higher number of total repair moves made in voice chats and the smaller number in text chats. Qualitative data analysis showed that repair work in voice chats was often pronunciation-related. The study includes discussion that may affect teachers' and learners' considerations of the value of NNS chat room interaction for second language development.
Mechanics of human voice production and control.

Science.gov (United States)

Zhang, Zhaoyan

2016-10-01

As the primary means of communication, voice plays an important role in daily life. Voice also conveys personal information such as social status, personal traits, and the emotional state of the speaker. Mechanically, voice production involves complex fluid-structure interaction within the glottis and its control by laryngeal muscle activation. An important goal of voice research is to establish a causal theory linking voice physiology and biomechanics to how speakers use and control voice to communicate meaning and personal information. Establishing such a causal theory has important implications for clinical voice management, voice training, and many speech technology applications. This paper provides a review of voice physiology and biomechanics, the physics of vocal fold vibration and sound production, and laryngeal muscular control of the fundamental frequency of voice, vocal intensity, and voice quality. Current efforts to develop mechanical and computational models of voice production are also critically reviewed. Finally, issues and future challenges in developing a causal theory of voice production and perception are discussed.
Analysis of room acoustics in Danish Hospitals

DEFF Research Database (Denmark)

Hoffmann, Ida Ørduk; Zapata Rodriguez, Valentina; Jeong, Cheol-Ho

2018-01-01

time (EDT) and T20, and the sound pressure level metrics, namely the equivalent level and peak level. In addition, the staff at the hospitals is asked about their personal perception of the acoustic and noise conditions and the correlation between their subjective disturbances......This project aims to compare room acoustic parameters and noise levels in various Danish hospitals: Odense, Gentofte, Bispebjerg, Hillerød and Aarhus Hospitals. Room acoustic conditions are measured in audiometric rooms at Odense, Gentofte, Bispebjerg and Aarhus hospitals. The noise levels...
You're a What? Voice Actor

Science.gov (United States)

Liming, Drew

2009-01-01

This article talks about voice actors and features Tony Oliver, a professional voice actor. Voice actors help to bring one's favorite cartoon and video game characters to life. They also do voice-overs for radio and television commercials and movie trailers. These actors use the sound of their voice to sell a character's emotions--or an advertised…
Speaker's voice as a memory cue.

Science.gov (United States)

Campeanu, Sandra; Craik, Fergus I M; Alain, Claude

2015-02-01

Speaker's voice occupies a central role as the cornerstone of auditory social interaction. Here, we review the evidence suggesting that speaker's voice constitutes an integral context cue in auditory memory. Investigation into the nature of voice representation as a memory cue is essential to understanding auditory memory and the neural correlates which underlie it. Evidence from behavioral and electrophysiological studies suggest that while specific voice reinstatement (i.e., same speaker) often appears to facilitate word memory even without attention to voice at study, the presence of a partial benefit of similar voices between study and test is less clear. In terms of explicit memory experiments utilizing unfamiliar voices, encoding methods appear to play a pivotal role. Voice congruency effects have been found when voice is specifically attended at study (i.e., when relatively shallow, perceptual encoding takes place). These behavioral findings coincide with neural indices of memory performance such as the parietal old/new recollection effect and the late right frontal effect. The former distinguishes between correctly identified old words and correctly identified new words, and reflects voice congruency only when voice is attended at study. Characterization of the latter likely depends upon voice memory, rather than word memory. There is also evidence to suggest that voice effects can be found in implicit memory paradigms. However, the presence of voice effects appears to depend greatly on the task employed. Using a word identification task, perceptual similarity between study and test conditions is, like for explicit memory tests, crucial. In addition, the type of noise employed appears to have a differential effect. While voice effects have been observed when white noise is used at both study and test, using multi-talker babble does not confer the same results. In terms of neuroimaging research modulations, characterization of an implicit memory effect
Acoustic Analyses of the Singing Vibrato in Traditional Peking Opera.

Science.gov (United States)

Han, Qichao; Zhang, Ruifeng

2017-07-01

China's traditional Peking Opera has four standard categories of roles: Sheng, Dan, Jing, and Chou, the singing vibrato of each displaying a different auditory effect. The audio and respiratory signals were recorded for two performers of the Qing Yi role, one of the Jing role, one of the Chou role, one of the Lao Sheng role, one of the Xiao Sheng role, and one of the Lao Dan role. The recordings gained eventually consisted of 24 representative songs from six roles. The rates and extents of vibrato, fundamental frequency, and rib cage signals were analyzed. Two findings were obtained: (1) the classical opera singing vibratos of China and Western countries are acoustically different from each other; and (2) in Peking Opera, the singing vibratos of different roles show significant acoustic differences. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Department of Cybernetic Acoustics

Science.gov (United States)

The development of the theory, instrumentation and applications of methods and systems for the measurement, analysis, processing and synthesis of acoustic signals within the audio frequency range, particularly of the speech signal and the vibro-acoustic signal emitted by technical and industrial equipments treated as noise and vibration sources was discussed. The research work, both theoretical and experimental, aims at applications in various branches of science, and medicine, such as: acoustical diagnostics and phoniatric rehabilitation of pathological and postoperative states of the speech organ; bilateral ""man-machine'' speech communication based on the analysis, recognition and synthesis of the speech signal; vibro-acoustical diagnostics and continuous monitoring of the state of machines, technical equipments and technological processes.
Integrating cues of social interest and voice pitch in men's preferences for women's voices

OpenAIRE

Jones, Benedict C; Feinberg, David R; DeBruine, Lisa M; Little, Anthony C; Vukovic, Jovana

2008-01-01

Most previous studies of vocal attractiveness have focused on preferences for physical characteristics of voices such as pitch. Here we examine the content of vocalizations in interaction with such physical traits, finding that vocal cues of social interest modulate the strength of men's preferences for raised pitch in women's voices. Men showed stronger preferences for raised pitch when judging the voices of women who appeared interested in the listener than when judging the voices of women ...
Acoustic emission analysis coupled with thermogravimetric experiments dedicated to high temperature corrosion studies on metallic alloys

International Nuclear Information System (INIS)

Serris, Eric; Al Haj, Omar; Peres, Veronique; Cournil, Michel; Kittel, Jean; Grosjean, Francois; Ropital, Francois

2014-01-01

High temperature corrosion of metallic alloys (like iron, nickel, zirconium alloys) can damage equipment of many industrial fields (refinery, petrochemical, nuclear..). Acoustic emission (AE) is an interesting method owing to its sensitivity and its non-destructive aspect to quantify the level of damage in use of these alloys under various environmental conditions. High temperature corrosive phenomena create stresses in the materials; the relaxation by cracks of these stresses can be recorded and analyzed using the AE system. The goal of our study is to establish an acoustic signals database which assigns the acoustic signals to the specific corrosion phenomena. For this purpose, thermogravimetric analysis (TGA) is coupled with acoustic emission (AE) devices. The oxidation of a zirconium alloy, zircaloy-4, is first studied using thermogravimetric experiment coupled to acoustic emission analysis at 900 C. An inward zirconium oxide scale, preliminary dense, then porous, grow during the isothermal isobaric step. The kinetic rate increases significantly after a kinetic transition (breakaway). This acceleration occurs with an increase of acoustic emission activity. Most of the acoustic emission bursts are recorded after the kinetic transition. Acoustic emission signals are also observed during the cooling of the sample. AE numerical treatments (using wavelet transform) completed by SEM microscopy characterizations allows us to distinguish the different populations of cracks. Metal dusting represents also a severe form of corrosive degradation of metal alloy. Iron metal dusting corrosion is studied by AE coupled with TGA at 650 C under C 4 H 10 + H 2 + He atmosphere. Acoustic emission signals are detected after a significant increase of the sample mass.
Effectiveness of Transoral Laser Microsurgery for Precancerous Lesions and Early Glottic Cancer Guided by Analysis of Voice Quality

Czech Academy of Sciences Publication Activity Database

Bahannan, A.; Slavíček, A.; Černý, L.; Vokřál, J.; Valenta, Zdeněk; Lohynská, R.; Chovanec, M.; Betka, J.

2014-01-01

Roč. 36, č. 6 (2014), s. 763-767 ISSN 1043-3074 Institutional support: RVO:67985807 Keywords : cordectomy * voice analysis * glottic cancer * precancerous lesion * larynx Subject RIV: FF - HEENT, Dentistry Impact factor: 2.641, year: 2014
MOBILE DEVICE OF VOIP VOICE TRAFFIC ANALYSIS FOR RED5 SERVICE

OpenAIRE

Jeanne Chen; Tung-Shou Chen; Jhe-Wei Syu

2015-01-01

Due to the increase in popularity of mobile devices and mobile networks, the usage rate of VoIP has also increased. The flow consumption of types of VoIP has become very important from the limited mobile network data program. In recent years, the voice streaming server of Adobe Flash, Red5, has been increasingly used to realize the function of VoIP. Red5 is an open source media server and has good voice compression and video compression. However, it can only be executed on a PC computer. This...
Understanding the 'Anorexic Voice' in Anorexia Nervosa.

Science.gov (United States)

Pugh, Matthew; Waller, Glenn

2017-05-01

In common with individuals experiencing a number of disorders, people with anorexia nervosa report experiencing an internal 'voice'. The anorexic voice comments on the individual's eating, weight and shape and instructs the individual to restrict or compensate. However, the core characteristics of the anorexic voice are not known. This study aimed to develop a parsimonious model of the voice characteristics that are related to key features of eating disorder pathology and to determine whether patients with anorexia nervosa fall into groups with different voice experiences. The participants were 49 women with full diagnoses of anorexia nervosa. Each completed validated measures of the power and nature of their voice experience and of their responses to the voice. Different voice characteristics were associated with current body mass index, duration of disorder and eating cognitions. Two subgroups emerged, with 'weaker' and 'stronger' voice experiences. Those with stronger voices were characterized by having more negative eating attitudes, more severe compensatory behaviours, a longer duration of illness and a greater likelihood of having the binge-purge subtype of anorexia nervosa. The findings indicate that the anorexic voice is an important element of the psychopathology of anorexia nervosa. Addressing the anorexic voice might be helpful in enhancing outcomes of treatments for anorexia nervosa, but that conclusion might apply only to patients with more severe eating psychopathology. Copyright © 2016 John Wiley & Sons, Ltd. Experiences of an internal 'anorexic voice' are common in anorexia nervosa. Clinicians should consider the role of the voice when formulating eating pathology in anorexia nervosa, including how individuals perceive and relate to that voice. Addressing the voice may be beneficial, particularly in more severe and enduring forms of anorexia nervosa. When working with the voice, clinicians should aim to address both the content of the voice and how
Voice application development for Android

CERN Document Server

McTear, Michael

2013-01-01

This book will give beginners an introduction to building voice-based applications on Android. It will begin by covering the basic concepts and will build up to creating a voice-based personal assistant. By the end of this book, you should be in a position to create your own voice-based applications on Android from scratch in next to no time.Voice Application Development for Android is for all those who are interested in speech technology and for those who, as owners of Android devices, are keen to experiment with developing voice apps for their devices. It will also be useful as a starting po
ACEMAN (II): a PDP-11 software package for acoustic emission analysis

International Nuclear Information System (INIS)

Tobias, A.

1976-01-01

A powerful, but easy-to-use, software package (ACEMAN) for acoustic emission analysis has been developed at Berkeley Nuclear Laboratories. The system is based on a PDP-11 minicomputer with 24 K of memory, an RK05 DISK Drive and a Tektronix 4010 Graphics terminal. The operation of the system is described in detail in terms of the functions performed in response to the various command mnemonics. The ACEMAN software package offers many useful facilities not found on other acoustic emission monitoring systems. Its main features, many of which are unique, are summarised. The ACEMAN system automatically handles arrays of up to 12 sensors in real-time operation during which data are acquired, analysed, stored on the computer disk for future analysis and displayed on the terminal if required. (author)
DolphinAtack: Inaudible Voice Commands

OpenAIRE

Zhang, Guoming; Yan, Chen; Ji, Xiaoyu; Zhang, Taimin; Zhang, Tianchen; Xu, Wenyuan

2017-01-01

Speech recognition (SR) systems such as Siri or Google Now have become an increasingly popular human-computer interaction method, and have turned various systems into voice controllable systems(VCS). Prior work on attacking VCS shows that the hidden voice commands that are incomprehensible to people can control the systems. Hidden voice commands, though hidden, are nonetheless audible. In this work, we design a completely inaudible attack, DolphinAttack, that modulates voice commands on ultra...
Spectral analysis methods for vehicle interior vibro-acoustics identification

Science.gov (United States)

Hosseini Fouladi, Mohammad; Nor, Mohd. Jailani Mohd.; Ariffin, Ahmad Kamal

2009-02-01

Noise has various effects on comfort, performance and health of human. Sound are analysed by human brain based on the frequencies and amplitudes. In a dynamic system, transmission of sound and vibrations depend on frequency and direction of the input motion and characteristics of the output. It is imperative that automotive manufacturers invest a lot of effort and money to improve and enhance the vibro-acoustics performance of their products. The enhancement effort may be very difficult and time-consuming if one relies only on 'trial and error' method without prior knowledge about the sources itself. Complex noise inside a vehicle cabin originated from various sources and travel through many pathways. First stage of sound quality refinement is to find the source. It is vital for automotive engineers to identify the dominant noise sources such as engine noise, exhaust noise and noise due to vibration transmission inside of vehicle. The purpose of this paper is to find the vibro-acoustical sources of noise in a passenger vehicle compartment. The implementation of spectral analysis method is much faster than the 'trial and error' methods in which, parts should be separated to measure the transfer functions. Also by using spectral analysis method, signals can be recorded in real operational conditions which conduce to more consistent results. A multi-channel analyser is utilised to measure and record the vibro-acoustical signals. Computational algorithms are also employed to identify contribution of various sources towards the measured interior signal. These achievements can be utilised to detect, control and optimise interior noise performance of road transport vehicles.

Sinusoidal Representation of Acoustic Signals

Science.gov (United States)

Honda, Masaaki

Sinusoidal representation of acoustic signals has been an important tool in speech and music processing like signal analysis, synthesis and time scale or pitch modifications. It can be applicable to arbitrary signals, which is an important advantage over other signal representations like physical modeling of acoustic signals. In sinusoidal representation, acoustic signals are composed as sums of sinusoid (sine wave) with different amplitudes, frequencies and phases, which is based on the timedependent short-time Fourier transform (STFT). This article describes the principles of acoustic signal analysis/synthesis based on a sinusoid representation with focus on sine waves with rapidly varying frequency.
Risk factors for voice problems in teachers.

NARCIS (Netherlands)

Kooijman, P.G.C.; Jong, F.I.C.R.S. de; Thomas, G.; Huinck, W.J.; Donders, A.R.T.; Graamans, K.; Schutte, H.K.

2006-01-01

In order to identify factors that are associated with voice problems and voice-related absenteeism in teachers, 1,878 questionnaires were analysed. The questionnaires inquired about personal data, voice complaints, voice-related absenteeism from work and conditions that may lead to voice complaints
Risk factors for voice problems in teachers

NARCIS (Netherlands)

Kooijman, P. G. C.; de Jong, F. I. C. R. S.; Thomas, G.; Huinck, W.; Donders, R.; Graamans, K.; Schutte, H. K.

2006-01-01

In order to identify factors that are associated with voice problems and voice-related absenteeism in teachers, 1,878 questionnaires were analysed. The questionnaires inquired about personal data, voice complaints, voice-related absenteeism from work and conditions that may lead to voice complaints
Acoustic assessment of speech privacy curtains in two nursing units

Science.gov (United States)

Pope, Diana S.; Miller-Klein, Erik T.

2016-01-01

Hospitals have complex soundscapes that create challenges to patient care. Extraneous noise and high reverberation rates impair speech intelligibility, which leads to raised voices. In an unintended spiral, the increasing noise may result in diminished speech privacy, as people speak loudly to be heard over the din. The products available to improve hospital soundscapes include construction materials that absorb sound (acoustic ceiling tiles, carpet, wall insulation) and reduce reverberation rates. Enhanced privacy curtains are now available and offer potential for a relatively simple way to improve speech privacy and speech intelligibility by absorbing sound at the hospital patient's bedside. Acoustic assessments were performed over 2 days on two nursing units with a similar design in the same hospital. One unit was built with the 1970s’ standard hospital construction and the other was newly refurbished (2013) with sound-absorbing features. In addition, we determined the effect of an enhanced privacy curtain versus standard privacy curtains using acoustic measures of speech privacy and speech intelligibility indexes. Privacy curtains provided auditory protection for the patients. In general, that protection was increased by the use of enhanced privacy curtains. On an average, the enhanced curtain improved sound absorption from 20% to 30%; however, there was considerable variability, depending on the configuration of the rooms tested. Enhanced privacy curtains provide measureable improvement to the acoustics of patient rooms but cannot overcome larger acoustic design issues. To shorten reverberation time, additional absorption, and compact and more fragmented nursing unit floor plate shapes should be considered. PMID:26780959
Acoustic assessment of speech privacy curtains in two nursing units.

Science.gov (United States)

Pope, Diana S; Miller-Klein, Erik T

2016-01-01

Hospitals have complex soundscapes that create challenges to patient care. Extraneous noise and high reverberation rates impair speech intelligibility, which leads to raised voices. In an unintended spiral, the increasing noise may result in diminished speech privacy, as people speak loudly to be heard over the din. The products available to improve hospital soundscapes include construction materials that absorb sound (acoustic ceiling tiles, carpet, wall insulation) and reduce reverberation rates. Enhanced privacy curtains are now available and offer potential for a relatively simple way to improve speech privacy and speech intelligibility by absorbing sound at the hospital patient's bedside. Acoustic assessments were performed over 2 days on two nursing units with a similar design in the same hospital. One unit was built with the 1970s' standard hospital construction and the other was newly refurbished (2013) with sound-absorbing features. In addition, we determined the effect of an enhanced privacy curtain versus standard privacy curtains using acoustic measures of speech privacy and speech intelligibility indexes. Privacy curtains provided auditory protection for the patients. In general, that protection was increased by the use of enhanced privacy curtains. On an average, the enhanced curtain improved sound absorption from 20% to 30%; however, there was considerable variability, depending on the configuration of the rooms tested. Enhanced privacy curtains provide measureable improvement to the acoustics of patient rooms but cannot overcome larger acoustic design issues. To shorten reverberation time, additional absorption, and compact and more fragmented nursing unit floor plate shapes should be considered.
Acoustic assessment of speech privacy curtains in two nursing units

Directory of Open Access Journals (Sweden)

Diana S Pope

2016-01-01

Full Text Available Hospitals have complex soundscapes that create challenges to patient care. Extraneous noise and high reverberation rates impair speech intelligibility, which leads to raised voices. In an unintended spiral, the increasing noise may result in diminished speech privacy, as people speak loudly to be heard over the din. The products available to improve hospital soundscapes include construction materials that absorb sound (acoustic ceiling tiles, carpet, wall insulation and reduce reverberation rates. Enhanced privacy curtains are now available and offer potential for a relatively simple way to improve speech privacy and speech intelligibility by absorbing sound at the hospital patient′s bedside. Acoustic assessments were performed over 2 days on two nursing units with a similar design in the same hospital. One unit was built with the 1970s′ standard hospital construction and the other was newly refurbished (2013 with sound-absorbing features. In addition, we determined the effect of an enhanced privacy curtain versus standard privacy curtains using acoustic measures of speech privacy and speech intelligibility indexes. Privacy curtains provided auditory protection for the patients. In general, that protection was increased by the use of enhanced privacy curtains. On an average, the enhanced curtain improved sound absorption from 20% to 30%; however, there was considerable variability, depending on the configuration of the rooms tested. Enhanced privacy curtains provide measureable improvement to the acoustics of patient rooms but cannot overcome larger acoustic design issues. To shorten reverberation time, additional absorption, and compact and more fragmented nursing unit floor plate shapes should be considered.
Throughput Analysis on 3-Dimensional Underwater Acoustic Network with One-Hop Mobile Relay

Science.gov (United States)

Zhong, Xuefeng; Fan, Jiasheng; Guan, Quansheng; Ji, Fei; Yu, Hua

2018-01-01

Underwater acoustic communication network (UACN) has been considered as an essential infrastructure for ocean exploitation. Performance analysis of UACN is important in underwater acoustic network deployment and management. In this paper, we analyze the network throughput of three-dimensional randomly deployed transmitter–receiver pairs. Due to the long delay of acoustic channels, complicated networking protocols with heavy signaling overhead may not be appropriate. In this paper, we consider only one-hop or two-hop transmission, to save the signaling cost. That is, we assume the transmitter sends the data packet to the receiver by one-hop direct transmission, or by two-hop transmission via mobile relays. We derive the closed-form formulation of packet delivery rate with respect to the transmission delay and the number of transmitter–receiver pairs. The correctness of the derivation results are verified by computer simulations. Our analysis indicates how to obtain a precise tradeoff between the delay constraint and the network capacity. PMID:29337911
Throughput Analysis on 3-Dimensional Underwater Acoustic Network with One-Hop Mobile Relay.

Science.gov (United States)

Zhong, Xuefeng; Chen, Fangjiong; Fan, Jiasheng; Guan, Quansheng; Ji, Fei; Yu, Hua

2018-01-16

Underwater acoustic communication network (UACN) has been considered as an essential infrastructure for ocean exploitation. Performance analysis of UACN is important in underwater acoustic network deployment and management. In this paper, we analyze the network throughput of three-dimensional randomly deployed transmitter-receiver pairs. Due to the long delay of acoustic channels, complicated networking protocols with heavy signaling overhead may not be appropriate. In this paper, we consider only one-hop or two-hop transmission, to save the signaling cost. That is, we assume the transmitter sends the data packet to the receiver by one-hop direct transmission, or by two-hop transmission via mobile relays. We derive the closed-form formulation of packet delivery rate with respect to the transmission delay and the number of transmitter-receiver pairs. The correctness of the derivation results are verified by computer simulations. Our analysis indicates how to obtain a precise tradeoff between the delay constraint and the network capacity.
Numerical analysis of the resonance mechanism of the lumped parameter system model for acoustic mine detection

International Nuclear Information System (INIS)

Wang Chi; Zhou Yu-Qiu; Shen Gao-Wei; Wu Wen-Wen; Ding Wei

2013-01-01

The method of numerical analysis is employed to study the resonance mechanism of the lumped parameter system model for acoustic mine detection. Based on the basic principle of the acoustic resonance technique for mine detection and the characteristics of low-frequency acoustics, the ''soil-mine'' system could be equivalent to a damping ''mass-spring'' resonance model with a lumped parameter analysis method. The dynamic simulation software, Adams, is adopted to analyze the lumped parameter system model numerically. The simulated resonance frequency and anti-resonance frequency are 151 Hz and 512 Hz respectively, basically in agreement with the published resonance frequency of 155 Hz and anti-resonance frequency of 513 Hz, which were measured in the experiment. Therefore, the technique of numerical simulation is validated to have the potential for analyzing the acoustic mine detection model quantitatively. The influences of the soil and mine parameters on the resonance characteristics of the soil—mine system could be investigated by changing the parameter setup in a flexible manner. (electromagnetism, optics, acoustics, heat transfer, classical mechanics, and fluid dynamics)
Analysis of acoustic emission data for bearings subject to unbalance

Directory of Open Access Journals (Sweden)

Rapinder Sawhney

2013-01-01

Full Text Available Acoustic Emission (AE is an effective nondestructive method for investigating the behavior of materials under stress. In recent decades, AE applications in structural health monitoring have been extended to other areas such as rotating machineries and cutting tools. This research investigates the application of acoustic emission data for unbalance analysis and detection in rotary systems. The AE parameter of interest in this study is a discrete variable that covers the significance of count, duration and amplitude of AE signals. A statistical model based on Zero-Inflated Poisson (ZIP regression is proposed to handle over-dispersion and excess zeros of the counting data. The ZIP model indicates that faulty bearings can generate more transient wave in the AE waveform. Control charts can easily detect the faulty bearing using the parameters of the ZIP model. Categorical data analysis based on generalized linear models (GLM is also presented. The results demonstrate the significance of the couple unbalance.
Acoustic and temporal analysis of speech: A potential biomarker for schizophrenia.

LENUS (Irish Health Repository)

Rapcan, Viliam

2010-11-01

Currently, there are no established objective biomarkers for the diagnosis or monitoring of schizophrenia. It has been previously reported that there are notable qualitative differences in the speech of schizophrenics. The objective of this study was to determine whether a quantitative acoustic and temporal analysis of speech may be a potential biomarker for schizophrenia. In this study, 39 schizophrenic patients and 18 controls were digitally recorded reading aloud an emotionally neutral text passage from a children\\'s story. Temporal, energy and vocal pitch features were automatically extracted from the recordings. A classifier based on linear discriminant analysis was employed to differentiate between controls and schizophrenic subjects. Processing the recordings with the algorithm developed demonstrated that it is possible to differentiate schizophrenic patients and controls with a classification accuracy of 79.4% (specificity=83.6%, sensitivity=75.2%) based on speech pause related parameters extracted from recordings carried out in standard office (non-studio) environments. Acoustic and temporal analysis of speech may represent a potential tool for the objective analysis in schizophrenia.
Voice quality in relation to voice complaints and vocal fold condition during the screening of female student teachers.

Science.gov (United States)

Meulenbroek, Leo F P; de Jong, Felix I C R S

2011-07-01

The purpose of this study was to compare the perceptual examination of voice quality with the condition of the vocal folds and voice complaints during voice screening in female student teachers. This research was a cross-sectional study in 214 starting student teachers using the four-point grade scale of the GRBAS and laryngostroboscopic assessment of the vocal folds. The voice quality was assessed by speech pathologists using the ordinal 4-point G-scale (overall dysphonia) of the GRBAS method in a running speech sample. Glottal closure and vocal fold lesions were recorded. A questionnaire was used for assessing voice complaints. More students with an insufficient glottal closure (89%) were rated dysphonic compared with students with sufficient glottal closure (80%). Students with sufficient glottal closure had a significantly lower mean G-score (1.21) compared with the group with insufficient glottal closure (1.52) (P = 0.038). This study showed a larger percentage of students with vocal fold lesions (96%) labeled a dysphonic voice compared to students with no vocal fold problems (81%). Students with no vocal fold lesions had a significantly lower mean G-score (1.20) compared with the group with vocal fold lesions (2.05) (P=0.002). A dysphonic voice (G≥1) was rated in 76% of the students without voice complaints compared with 86% of the students with voice complaints. Students with no voice complaints had a lower mean G-score (1.07) compared with the group with voice complaints (1.41) (P=0.090). The present study showed that perceptual assessment of the voice and voice complaints is not sufficient to check if the future professional is at risk. Therefore, preventive measures are needed to detect students at risk early in their education and this depends on broader assessment: on the one hand, assessing voice quality and voice complaints and on the other hand, examination of the vocal folds of all starting students. Copyright © 2011 The Voice Foundation
Parametric Room Acoustic Workflows

DEFF Research Database (Denmark)

Parigi, Dario; Svidt, Kjeld; Molin, Erik

2017-01-01

The paper investigates and assesses different room acoustics software and the opportunities they offer to engage in parametric acoustics workflow and to influence architectural designs. The first step consists in the testing and benchmarking of different tools on the basis of accuracy, speed...... and interoperability with Grasshopper 3d. The focus will be placed to the benchmarking of three different acoustic analysis tools based on raytracing. To compare the accuracy and speed of the acoustic evaluation across different tools, a homogeneous set of acoustic parameters is chosen. The room acoustics parameters...... included in the set are reverberation time (EDT, RT30), clarity (C50), loudness (G), and definition (D50). Scenarios are discussed for determining at different design stages the most suitable acoustic tool. Those scenarios are characterized, by the use of less accurate but fast evaluation tools to be used...
DARHT Multi-intelligence Seismic and Acoustic Data Analysis

Energy Technology Data Exchange (ETDEWEB)

Stevens, Garrison Nicole [Los Alamos National Lab. (LANL), Los Alamos, NM (United States); Van Buren, Kendra Lu [Los Alamos National Lab. (LANL), Los Alamos, NM (United States); Hemez, Francois M. [Los Alamos National Lab. (LANL), Los Alamos, NM (United States)

2016-07-21

The purpose of this report is to document the analysis of seismic and acoustic data collected at the Dual-Axis Radiographic Hydrodynamic Test (DARHT) facility at Los Alamos National Laboratory for robust, multi-intelligence decision making. The data utilized herein is obtained from two tri-axial seismic sensors and three acoustic sensors, resulting in a total of nine data channels. The goal of this analysis is to develop a generalized, automated framework to determine internal operations at DARHT using informative features extracted from measurements collected external of the facility. Our framework involves four components: (1) feature extraction, (2) data fusion, (3) classification, and finally (4) robustness analysis. Two approaches are taken for extracting features from the data. The first of these, generic feature extraction, involves extraction of statistical features from the nine data channels. The second approach, event detection, identifies specific events relevant to traffic entering and leaving the facility as well as explosive activities at DARHT and nearby explosive testing sites. Event detection is completed using a two stage method, first utilizing signatures in the frequency domain to identify outliers and second extracting short duration events of interest among these outliers by evaluating residuals of an autoregressive exogenous time series model. Features extracted from each data set are then fused to perform analysis with a multi-intelligence paradigm, where information from multiple data sets are combined to generate more information than available through analysis of each independently. The fused feature set is used to train a statistical classifier and predict the state of operations to inform a decision maker. We demonstrate this classification using both generic statistical features and event detection and provide a comparison of the two methods. Finally, the concept of decision robustness is presented through a preliminary analysis where
Error analysis by means of acoustic holography

International Nuclear Information System (INIS)

Kutzner, J.; Wuestenberg, H.

1976-01-01

The possilbilities to use the acoustical holography in nondestructive testing are discussed. Although compared to optical holography the image quality of acoustical holography is reduced this technique can give important informations about the shape of defects. Especially in nondestructive testing of thick walled components no alternative exists until now. (orig.) [de
When the face fits: recognition of celebrities from matching and mismatching faces and voices.

Science.gov (United States)

Stevenage, Sarah V; Neil, Greg J; Hamlin, Iain

2014-01-01

The results of two experiments are presented in which participants engaged in a face-recognition or a voice-recognition task. The stimuli were face-voice pairs in which the face and voice were co-presented and were either "matched" (same person), "related" (two highly associated people), or "mismatched" (two unrelated people). Analysis in both experiments confirmed that accuracy and confidence in face recognition was consistently high regardless of the identity of the accompanying voice. However accuracy of voice recognition was increasingly affected as the relationship between voice and accompanying face declined. Moreover, when considering self-reported confidence in voice recognition, confidence remained high for correct responses despite the proportion of these responses declining across conditions. These results converged with existing evidence indicating the vulnerability of voice recognition as a relatively weak signaller of identity, and results are discussed in the context of a person-recognition framework.
Classroom acoustics design guidelines based on the optimization of speaker conditions

DEFF Research Database (Denmark)

Pelegrin Garcia, David; Brunskog, Jonas

2012-01-01

School teachers suffer frequently from voice problems due to the high vocal load that they experience and the not-always-ideal conditions under which they have to teach. Traditionally, the purpose of the acoustic design of classrooms has been to optimize speech intelligibility. New guidelines...... and noise level measurements in classrooms. Requirements of optimum vocal comfort, average A-weighted speech levels across the audience higher than 50 dB, and a physical volume higher than 6 m3/student are combined to extract optimum acoustic conditions, which depend on the number of students....... These conditions, which are independent on the position of the speaker, cannot be optimum for more than 50 students. For classrooms with 10 students, the reverberation time in occupied conditions shall be between 0.5 and 0.65 s, and the volume between 60 and 170 m3. For classrooms with 40 students...
Correlation of the Voice Handicap Index-10 (VHI-10) and Voice-Related Quality of Life (V-RQOL) in patients with dysphonia.

Science.gov (United States)

Romak, Jonathan J; Orbelo, Diana M; Maragos, Nicolas E; Ekbom, Dale C

2014-03-01

This study examines the correlation between two voice-specific patient-reported outcome measures: the Voice Handicap Index-10 (VHI-10) and Voice-Related Quality of Life (V-RQOL). Retrospective chart review. Eight hundred four patients presenting to our voice clinic between May 2009 and August 2011. All patients completed the VHI-10 and V-RQOL in a single sitting. Correlation between the two scales was examined using Spearman rank analysis. Calculated VHI-10 score was derived from V-RQOL score by direct conversion equation and compared with measured VHI-10 score. Receiver Operating Characteristic (ROC) curves were derived for diagnostic groups. Spearman correlation coefficient between the VHI-10 and V-RQOL was -0.91 (P dysphonia (V-RQOL AUC = 0.536 [SE ± 0.026]; VHI-10 AUC = 0.508 [SE ± 0.26]; P = 0.018) groups, with the V-RQOL showing relatively greater sensitivity. The VHI-10 and V-RQOL are highly correlated. However, VHI-10 score cannot be calculated from V-RQOL score using the tested equation. The V-RQOL may be more sensitive than the VHI-10 in detecting the impact of presbyphonia and muscle tension dysphonia. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Aerodynamic and sound intensity measurements in tracheoesophageal voice

NARCIS (Netherlands)

Grolman, Wilko; Eerenstein, Simone E. J.; Tan, Frédérique M. L.; Tange, Rinze A.; Schouwenburg, Paul F.

2007-01-01

BACKGROUND: In laryngectomized patients, tracheoesophageal voice generally provides a better voice quality than esophageal voice. Understanding the aerodynamics of voice production in patients with a voice prosthesis is important for optimizing prosthetic designs and successful voice rehabilitation.
Crossing Cultures with Multi-Voiced Journals

Science.gov (United States)

Styslinger, Mary E.; Whisenant, Alison

2004-01-01

In this article, the authors discuss the benefits of using multi-voiced journals as a teaching strategy in reading instruction. Multi-voiced journals, an adaptation of dual-voiced journals, encourage responses to reading in varied, cultured voices of characters. It is similar to reading journals in that they prod students to connect to the lives…

Development and preliminary validation of the EASE: a tool to measure perceived singing voice function.

Science.gov (United States)

Phyland, Debra J; Pallant, Julie F; Benninger, Michael S; Thibeault, Susan L; Greenwood, Ken M; Smith, Julian A; Vallance, Neil

2013-07-01

Most voice self-rating tools are disease-specific measures and are not suitable for use with healthy voice users. There is a need for a tool that is sensitive to the subtleties of a singer's voice and to perceived physical changes in the singing voice mechanism as a function of load. The aim of this study was to devise and validate a scale to assess singer's perceptions of the current status of their singing voice. Ninety-five vocal health descriptors were collected from focus group interviews of singers. These were reviewed by 25 currently performing music theater (MT) singers. Based on a consensus technique, the number of descriptors was decreased to 42 items. These were administered to a sample of 284 professional MT singers using an online survey to evaluate their perception of current singing voice status. Principal component analysis identified two subsets of items. Rasch analysis was used to evaluate and refine these sets of items to form two 10-item subscales. Both subscales demonstrated good overall fit to the Rasch model, no differential item functioning by sex or age, and good internal consistency reliability. The two subscales were strongly correlated and subsequent Rasch analysis supported their combination to form a single 20-item scale with good psychometric properties. The Evaluation of the Ability to Sing Easily (EASE) is a concise clinical tool to assess singer's perceptions of the current status of their singing voice with good measurement properties. EASE may prove a useful tool to measure changes in the singing voice as indicators of the effect of vocal load. Furthermore, it may offer a valuable means for the prediction or screening of singers "at risk" of developing voice disorders. Copyright © 2013 The Voice Foundation. All rights reserved.
[Applicability of Voice Handicap Index to the evaluation of voice therapy effectiveness in teachers].

Science.gov (United States)

Niebudek-Bogusz, Ewa; Kuzańska, Anna; Błoch, Piotr; Domańska, Maja; Woźnicka, Ewelina; Politański, Piotr; Sliwińska-Kowalska, Mariola

2007-01-01

The aim of this study was to assess the applicability of Voice Handicap Index (VHI) to the evaluation of effectiveness of functional voice disorders treatment in teachers. The subjects were 45 female teachers with functional dysphonia who evaluated their voice problems according to the subjective VHI scale before and after phoniatric management. Group I (29 patients) were subjected to vocal training, whereas group II (16 patients) received only voice hygiene instructions. The results demonstrated that differences in the mean VHI score before and after phoniatric treatment were significantly higher in group 1 than in group II (p teacher's dysphonia.
Voice and Handgrip Strength Predict Reproductive Success in a Group of Indigenous African Females

Science.gov (United States)

Sorokowska, Agnieszka; Sorokowski, Piotr; Mberira, Mara; Bartels, Astrid; Gallup, Gordon G.

2012-01-01

Evolutionary accounts of human traits are often based on proxies for genetic fitness (e.g., number of sex partners, facial attractiveness). Instead of using proxies, actual differences in reproductive success is a more direct measure of Darwinian fitness. Certain voice acoustics such as fundamental frequency and measures of health such as handgrip strength correlate with proxies of fitness, yet there are few studies showing the relation of these traits to reproduction. Here, we explore whether the fundamental frequency of the voice and handgrip strength account for differences in actual reproduction among a population of natural fertility humans. Our results show that both fundamental frequency and handgrip strength predict several measures of reproductive success among a group of indigenous Namibian females, particularly amongst the elderly, with weight also predicting reproductive outcomes among males. These findings demonstrate that both hormonally regulated and phenotypic quality markers can be used as measures of Darwinian fitness among humans living under conditions that resemble the evolutionary environment of Homo sapiens. We also argue that these findings provide support for the Grandmother Hypothesis. PMID:22870251
Designing a Voice Controlled Interface For Radio : Guidelines for The First Generation of Voice Controlled Public Radio

OpenAIRE

Päärni, Anna

2017-01-01

From being a fictional element in sci-fi, voice control has become a reality, with inventions such as Apple's Siri, and interactive voice response (IVR) when calling your doctor's office. The combination of radio’s strength as a hands-free medium, public radio’s mission to reach across all platforms and the rise of voice makes up a relevant intersection; voice controlled public radio in Sweden. This thesis has aimed to investigate how radio listeners wish to interact using voice control to li...
Acoustics flow analysis in circular duct using sound intensity and dynamic mode decomposition

International Nuclear Information System (INIS)

Weyna, S

2014-01-01

Sound intensity generation in hard-walled duct with acoustic flow (no mean-flow) is treated experimentally and shown graphically. In paper, numerous methods of visualization illustrating the vortex flow (2D, 3D) can graphically explain diffraction and scattering phenomena occurring inside the duct and around open end area. Sound intensity investigation in annular duct gives a physical picture of sound waves in any duct mode. In the paper, modal energy analysis are discussed with particular reference to acoustics acoustic orthogonal decomposition (AOD). The image of sound intensity fields before and above 'cut-off' frequency region are found to compare acoustic modes which might resonate in duct. The experimental results show also the effects of axial and swirling flow. However acoustic field is extremely complicated, because pressures in non-propagating (cut-off) modes cooperate with the particle velocities in propagating modes, and vice versa. Measurement in cylindrical duct demonstrates also the cut-off phenomenon and the effect of reflection from open end. The aim of experimental study was to obtain information on low Mach number flows in ducts in order to improve physical understanding and validate theoretical CFD and CAA models that still may be improved.
Experimental Acoustic Evaluation of an Auditorium

Directory of Open Access Journals (Sweden)

Marina Dana Ţopa

2012-01-01

Full Text Available The paper presents a case history: the acoustical analysis of a rectangular auditorium. The following acoustical parameters were evaluated: early decay time, reverberation time, clarity, definition, and center time. The excitation signal was linear sweep sine and additional analysis was carried out: peak-to-noise ratio, reverberation time for empty and occupied room, standard deviation of acoustical parameters, diffusion, and just noticeable differences analysis. Conclusions about room’s destination and modeling were drawn in the end.
Application of computer voice input/output

International Nuclear Information System (INIS)

Ford, W.; Shirk, D.G.

1981-01-01

The advent of microprocessors and other large-scale integration (LSI) circuits is making voice input and output for computers and instruments practical; specialized LSI chips for speech processing are appearing on the market. Voice can be used to input data or to issue instrument commands; this allows the operator to engage in other tasks, move about, and to use standard data entry systems. Voice synthesizers can generate audible, easily understood instructions. Using voice characteristics, a control system can verify speaker identity for security purposes. Two simple voice-controlled systems have been designed at Los Alamos for nuclear safeguards applicaations. Each can easily be expanded as time allows. The first system is for instrument control that accepts voice commands and issues audible operator prompts. The second system is for access control. The speaker's voice is used to verify his identity and to actuate external devices
The contrast between alveolar and velar stops with typical speech data: acoustic and articulatory analyses.

Science.gov (United States)

Melo, Roberta Michelon; Mota, Helena Bolli; Berti, Larissa Cristina

2017-06-08

This study used acoustic and articulatory analyses to characterize the contrast between alveolar and velar stops with typical speech data, comparing the parameters (acoustic and articulatory) of adults and children with typical speech development. The sample consisted of 20 adults and 15 children with typical speech development. The analyzed corpus was organized through five repetitions of each target-word (/'kap ə/, /'tapə/, /'galo/ e /'daɾə/). These words were inserted into a carrier phrase and the participant was asked to name them spontaneously. Simultaneous audio and video data were recorded (tongue ultrasound images). The data was submitted to acoustic analyses (voice onset time; spectral peak and burst spectral moments; vowel/consonant transition and relative duration measures) and articulatory analyses (proportion of significant axes of the anterior and posterior tongue regions and description of tongue curves). Acoustic and articulatory parameters were effective to indicate the contrast between alveolar and velar stops, mainly in the adult group. Both speech analyses showed statistically significant differences between the two groups. The acoustic and articulatory parameters provided signals to characterize the phonic contrast of speech. One of the main findings in the comparison between adult and child speech was evidence of articulatory refinement/maturation even after the period of segment acquisition.
ACHIEVING THE NATURAL VOICE: THE ANALYSIS OF THE LINKLATER METHOD FROM A TRAINING PERSPECTİVE–

Directory of Open Access Journals (Sweden)

Asli YILMAZ DAVUTOGLU

2015-09-01

Full Text Available In the 20th century, one of the most widespread of the voice training methods that start with the concepts “natural voice” and “the rediscovery of voice” is the Linklater Method. The primary target group of this method is actors. With its exercises designed for the "reconstruction of the body, the voice and the mind", this method aims at utilizing the innate voice capacity. As a multidisciplinary method fostered by many a scientific discipline and Eastern teaching, Linklater method comes with a language that is imbued with sophisticated and metaphorical expressions, scientific terminology and acting jargon, which makes the method prone to false and/or superficial references. The target of the present study is to explicate with a “trainer's perspective” the fundamental concepts and propositions of the Linklater Method, most notably the “natural voice”. Also, the present study aims at analysing the relation between the basic practices of the method and recent scientific data, thus examining the mental substructure these practices are based on and their physical/technical goals. In this direction, the present study involves the adopting of a general framework with respect to the inclinations and scientific sources of voice training in the 20th century that affected Linklater's propositions, a simplified summarisation of the neuro-anatomic process producing the voice, the selection of exercises on which the principles and goals of the method can be seen concretely and the grouping of these exercises under titles pertaining to the four basic steps of voice production. In the conclusion part of the study, it is argued that this method, despite being regarded as “alternative/experimental” when compared to conventional methods in Turkey, is one of the mainstream methods in contemporary voice training and that it is shaped through a multi-purpose system whose aim is not only voice training but also to develop the creativity and
Acoustic Analysis and Design of the E-STA MSA Simulator

Science.gov (United States)

Bittinger, Samantha A.

2016-01-01

The Orion European Service Module Structural Test Article (E-STA) Acoustic Test was completed in May 2016 to verify that the European Service Module (ESM) can withstand qualification acoustic environments. The test article required an aft closeout to simulate the Multi-Purpose Crew Vehicle (MPCV) Stage Adapter (MSA) cavity, however, the flight MSA design was too cost-prohibitive to build. NASA Glenn Research Center (GRC) had 6 months to design an MSA Simulator that could recreate the qualification prediction MSA cavity sound pressure level to within a reasonable tolerance. This paper summarizes the design and analysis process to arrive at a design for the MSA Simulator, and then compares its performance to the final prediction models created prior to test.
Voice and silence in organizations

Directory of Open Access Journals (Sweden)

Moaşa, H.

2011-01-01

Full Text Available Unlike previous research on voice and silence, this article breaksthe distance between the two and declines to treat them as opposites. Voice and silence are interrelated and intertwined strategic forms ofcommunication which presuppose each other in such a way that the absence of one would minimize completely the other’s presence. Social actors are not voice, or silence. Social actors can have voice or silence, they can do both because they operate at multiple levels and deal with multiple issues at different moments in time.
Acoustic characteristics of modern Greek Orthodox Church music.

Science.gov (United States)

Delviniotis, Dimitrios S

2013-09-01

Some acoustic characteristics of the two types of vocal music of the Greek Orthodox Church Music, the Byzantine chant (BC) and ecclesiastical speech (ES), are studied in relation to the common Greek speech and the Western opera. Vocal samples were obtained, and their acoustic parameters of sound pressure level (SPL), fundamental frequency (F0), and the long-time average spectrum (LTAS) characteristics were analyzed. Twenty chanters, including two chanters-singers of opera, sang (BC) and read (ES) the same hymn of Byzantine music (BM), the two opera singers sang the same aria of opera, and common speech samples were obtained, and all audio were analyzed. The distribution of SPL values showed that the BC and ES have higher SPL by 9 and 12 dB, respectively, than common speech. The average F0 in ES tends to be lower than the common speech, and the smallest standard deviation (SD) of F0 values characterizes its monotonicity. The tone-scale intervals of BC are close enough to the currently accepted theory with SD equal to 0.24 semitones. The rate and extent of vibrato, which is rare in BC, equals 4.1 Hz and 0.6 semitones, respectively. The average LTAS slope is greatest in BC (+4.5 dB) but smaller than in opera (+5.7 dB). In both BC and ES, instead of a singer's formant appearing in an opera voice, a speaker's formant (SPF) was observed around 3300 Hz, with relative levels of +6.3 and +4.6 dB, respectively. The two vocal types of BM, BC, and ES differ both to each other and common Greek speech and opera style regarding SPL, the mean and SD of F0, the LTAS slope, and the relative level of SPF. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Analysis And Voice Recognition In Indonesian Language Using MFCC And SVM Method

Directory of Open Access Journals (Sweden)

Harvianto Harvianto

2016-06-01

Full Text Available Voice recognition technology is one of biometric technology. Sound is a unique part of the human being which made an individual can be easily distinguished one from another. Voice can also provide information such as gender, emotion, and identity of the speaker. This research will record human voices that pronounce digits between 0 and 9 with and without noise. Features of this sound recording will be extracted using Mel Frequency Cepstral Coefficient (MFCC. Mean, standard deviation, max, min, and the combination of them will be used to construct the feature vectors. This feature vectors then will be classified using Support Vector Machine (SVM. There will be two classification models. The first one is based on the speaker and the other one based on the digits pronounced. The classification model then will be validated by performing 10-fold cross-validation.The best average accuracy from two classification model is 91.83%. This result achieved using Mean + Standard deviation + Min + Max as features.
Laryngoscopic and spectral analysis of laryngeal and pharyngeal configuration in non-classical singing styles.

Science.gov (United States)

Guzman, Marco; Lanas, Andres; Olavarria, Christian; Azocar, Maria Josefina; Muñoz, Daniel; Madrid, Sofia; Monsalve, Sebastian; Martinez, Francisca; Vargas, Sindy; Cortez, Pedro; Mayerhoff, Ross M

2015-01-01

The present study aimed to assess three different singing styles (pop, rock, and jazz) with laryngoscopic, acoustic, and perceptual analysis in healthy singers at different loudness levels. Special emphasis was given to the degree of anterior-posterior (A-P) laryngeal compression, medial laryngeal compression, vertical laryngeal position (VLP), and pharyngeal compression. Prospective study. Twelve female trained singers with at least 5 years of voice training and absence of any voice pathology were included. Flexible and rigid laryngeal endoscopic examinations were performed. Voice recording was also carried out. Four blinded judges were asked to assess laryngoscopic and auditory perceptual variables using a visual analog scale. All laryngoscopic parameters showed significant differences for all singing styles. Rock showed the greatest degree for all of them. Overall A-P laryngeal compression scores demonstrated significantly higher values than overall medial compression and VLP. High loudness level produced the highest degree of A-P compression, medial compression, pharyngeal compression, and the lowest VLP for all singing styles. Additionally, rock demonstrated the highest values for alpha ratio (less steep spectral slope), L1-L0 ratio (more glottal adduction), and Leq (more vocal intensity). Statistically significant differences between the three loudness levels were also found for these acoustic parameters. Rock singing seems to be the style with the highest degree of both laryngeal and pharyngeal activity in healthy singers. Although, supraglottic activity during singing could be labeled as hyperfunctional vocal behavior, it may not necessarily be harmful, but a strategy to avoid vocal fold damage. Copyright © 2015 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
A Correlated Study of the Response of a Satellite to Acoustic Radiation Using Statistical Energy Analysis and Acoustic Test Data

International Nuclear Information System (INIS)

CAP, JEROME S.; TRACEY, BRIAN

1999-01-01

Aerospace payloads, such as satellites, are subjected to vibroacoustic excitation during launch. Sandia's MTI satellite has recently been certified to this environment using a combination of base input random vibration and reverberant acoustic noise. The initial choices for the acoustic and random vibration test specifications were obtained from the launch vehicle Interface Control Document (ICD). In order to tailor the random vibration levels for the laboratory certification testing, it was necessary to determine whether vibration energy was flowing across the launch vehicle interface from the satellite to the launch vehicle or the other direction. For frequencies below 120 Hz this issue was addressed using response limiting techniques based on results from the Coupled Loads Analysis (CLA). However, since the CLA Finite Element Analysis FEA model was only correlated for frequencies below 120 Hz, Statistical Energy Analysis (SEA) was considered to be a better choice for predicting the direction of the energy flow for frequencies above 120 Hz. The existing SEA model of the launch vehicle had been developed using the VibroAcoustic Payload Environment Prediction System (VAPEPS) computer code[1]. Therefore, the satellite would have to be modeled using VAPEPS as well. As is the case for any computational model, the confidence in its predictive capability increases if one can correlate a sample prediction against experimental data. Fortunately, Sandia had the ideal data set for correlating an SEA model of the MTI satellite--the measured response of a realistic assembly to a reverberant acoustic test that was performed during MTI's qualification test series. The first part of this paper will briefly describe the VAPEPS modeling effort and present the results of the correlation study for the VAPEPS model. The second part of this paper will present the results from a study that used a commercial SEA software package[2] to study the effects of in-plane modes and to evaluate
Acoustic analysis and mood classification of pain-relieving music.

Science.gov (United States)

Knox, Don; Beveridge, Scott; Mitchell, Laura A; MacDonald, Raymond A R

2011-09-01

Listening to preferred music (that which is chosen by the participant) has been shown to be effective in mitigating the effects of pain when compared to silence and a variety of distraction techniques. The wide range of genre, tempo, and structure in music chosen by participants in studies utilizing experimentally induced pain has led to the assertion that structure does not play a significant role, rather listening to preferred music renders the music "functionally equivalent" as regards its effect upon pain perception. This study addresses this assumption and performs detailed analysis of a selection of music chosen from three pain studies. Music analysis showed significant correlation between timbral and tonal aspects of music and measurements of pain tolerance and perceived pain intensity. Mood classification was performed using a hierarchical Gaussian Mixture Model, which indicated the majority of the chosen music expressed contentment. The results suggest that in addition to personal preference, associations with music and the listening context, emotion expressed by music, as defined by its acoustical content, is important to enhancing emotional engagement with music and therefore enhances the level of pain reduction and tolerance. © 2011 Acoustical Society of America
Intelligent acoustic data fusion technique for information security analysis

Science.gov (United States)

Jiang, Ying; Tang, Yize; Lu, Wenda; Wang, Zhongfeng; Wang, Zepeng; Zhang, Luming

2017-08-01

Tone is an essential component of word formation in all tonal languages, and it plays an important role in the transmission of information in speech communication. Therefore, tones characteristics study can be applied into security analysis of acoustic signal by the means of language identification, etc. In speech processing, fundamental frequency (F0) is often viewed as representing tones by researchers of speech synthesis. However, regular F0 values may lead to low naturalness in synthesized speech. Moreover, F0 and tone are not equivalent linguistically; F0 is just a representation of a tone. Therefore, the Electroglottography (EGG) signal is collected for deeper tones characteristics study. In this paper, focusing on the Northern Kam language, which has nine tonal contours and five level tone types, we first collected EGG and speech signals from six natural male speakers of the Northern Kam language, and then achieved the clustering distributions of the tone curves. After summarizing the main characteristics of tones of Northern Kam, we analyzed the relationship between EGG and speech signal parameters, and laid the foundation for further security analysis of acoustic signal.
Signal analysis of acoustic and flow-induced vibrations of BWR main steam line

Energy Technology Data Exchange (ETDEWEB)

Espinosa-Paredes, G., E-mail: gepe@xanum.uam.mx [División de Ciencias Básicas e Ingeniería, Universidad Autónoma Metropolitana-Iztapalapa, México, D.F. 09340 (Mexico); Prieto-Guerrero, A. [División de Ciencias Básicas e Ingeniería, Universidad Autónoma Metropolitana-Iztapalapa, México, D.F. 09340 (Mexico); Núñez-Carrera, A. [Comisión Nacional de Seguridad Nuclear y Salvaguardias, Doctor Barragán 779, Col. Narvarte, México, D.F. 03020 (Mexico); Vázquez-Rodríguez, A. [División de Ciencias Básicas e Ingeniería, Universidad Autónoma Metropolitana-Iztapalapa, México, D.F. 09340 (Mexico); Centeno-Pérez, J. [Instituto Politécnico Nacional, Escuela Superior de Física y Matemáticas Unidad Profesional “Adolfo López Mateos”, Av. IPN, s/n, México, D.F. 07738 (Mexico); Espinosa-Martínez, E.-G. [Departamento de Sistemas Energéticos, Universidad Nacional Autónoma de México, México, D.F. 04510 (Mexico); and others

2016-05-15

Highlights: • Acoustic and flow-induced vibrations of BWR are analyzed. • BWR performance after extended power uprate is considered. • Effect of acoustic side branches (ASB) is analyzed. • The ASB represents a reduction in the acoustic loads to the steam dryer. • Methodology developed for simultaneous analyzing the signals in the MSL. - Abstract: The aim of this work is the signal analysis of acoustic waves due to phenomenon known as singing in Safety Relief Valves (SRV) of the main steam lines (MSL) in a typical BWR5. The acoustic resonance in SRV standpipes and fluctuating pressure is propagated from SRV to the dryer through the MSL. The signals are analyzed with a novel method based on the Multivariate Empirical Mode Decomposition (M-EMD). The M-EMD algorithm has the potential to find common oscillatory modes (IMF) within multivariate data. Based on this fact, we implement the M-EMD technique to find the oscillatory mode in BWR considering the measurements obtained collected by the strain gauges located around the MSL. These IMF, analyzed simultaneously in time, allow obtaining an estimation of the effects of the multiple-SRV in the MSL. Two scenarios are analyzed: the first is the signal obtained before the installation of the acoustic dampers (ASB), and the second, the signal obtained after installation. The results show the effectiveness of the ASB to damp the strong resonances when the steam flow increases, which represents an important reduction in the acoustic loads to the steam dryer.
Signal analysis of acoustic and flow-induced vibrations of BWR main steam line

International Nuclear Information System (INIS)

Espinosa-Paredes, G.; Prieto-Guerrero, A.; Núñez-Carrera, A.; Vázquez-Rodríguez, A.; Centeno-Pérez, J.; Espinosa-Martínez, E.-G.

2016-01-01

Highlights: • Acoustic and flow-induced vibrations of BWR are analyzed. • BWR performance after extended power uprate is considered. • Effect of acoustic side branches (ASB) is analyzed. • The ASB represents a reduction in the acoustic loads to the steam dryer. • Methodology developed for simultaneous analyzing the signals in the MSL. - Abstract: The aim of this work is the signal analysis of acoustic waves due to phenomenon known as singing in Safety Relief Valves (SRV) of the main steam lines (MSL) in a typical BWR5. The acoustic resonance in SRV standpipes and fluctuating pressure is propagated from SRV to the dryer through the MSL. The signals are analyzed with a novel method based on the Multivariate Empirical Mode Decomposition (M-EMD). The M-EMD algorithm has the potential to find common oscillatory modes (IMF) within multivariate data. Based on this fact, we implement the M-EMD technique to find the oscillatory mode in BWR considering the measurements obtained collected by the strain gauges located around the MSL. These IMF, analyzed simultaneously in time, allow obtaining an estimation of the effects of the multiple-SRV in the MSL. Two scenarios are analyzed: the first is the signal obtained before the installation of the acoustic dampers (ASB), and the second, the signal obtained after installation. The results show the effectiveness of the ASB to damp the strong resonances when the steam flow increases, which represents an important reduction in the acoustic loads to the steam dryer.
Dysphonic Voice Pattern Analysis of Patients in Parkinson’s Disease Using Minimum Interclass Probability Risk Feature Selection and Bagging Ensemble Learning Methods

Directory of Open Access Journals (Sweden)

Yunfeng Wu

2017-01-01

Full Text Available Analysis of quantified voice patterns is useful in the detection and assessment of dysphonia and related phonation disorders. In this paper, we first study the linear correlations between 22 voice parameters of fundamental frequency variability, amplitude variations, and nonlinear measures. The highly correlated vocal parameters are combined by using the linear discriminant analysis method. Based on the probability density functions estimated by the Parzen-window technique, we propose an interclass probability risk (ICPR method to select the vocal parameters with small ICPR values as dominant features and compare with the modified Kullback-Leibler divergence (MKLD feature selection approach. The experimental results show that the generalized logistic regression analysis (GLRA, support vector machine (SVM, and Bagging ensemble algorithm input with the ICPR features can provide better classification results than the same classifiers with the MKLD selected features. The SVM is much better at distinguishing normal vocal patterns with a specificity of 0.8542. Among the three classification methods, the Bagging ensemble algorithm with ICPR features can identify 90.77% vocal patterns, with the highest sensitivity of 0.9796 and largest area value of 0.9558 under the receiver operating characteristic curve. The classification results demonstrate the effectiveness of our feature selection and pattern analysis methods for dysphonic voice detection and measurement.

Contribution of in situ acoustic emission analysis coupled with thermogravimetry to study zirconium alloy oxidation

International Nuclear Information System (INIS)

Al Haj, O.; Peres, V.; Serris, E.; Cournil, M.; Grosjean, F.; Kittel, J.; Ropital, F.

2015-01-01

Zirconium alloy (zircaloy-4) corrosion behavior under oxidizing atmosphere at high temperature was studied using thermogravimetric experiment associated with acoustic emission analysis. Under a mixture of oxygen and air in helium, an acceleration of the corrosion is observed due to the detrimental effect of nitrogen which produces zirconium nitride. The kinetic rate increases significantly after a kinetic transition (breakaway). This acceleration is accompanied by an acoustic emission (AE) activity. Most of the acoustic emission bursts were recorded after the kinetic transition or during the cooling of the sample. Acoustic emission signals analysis allows us to distinguish different populations of cracks in the ZrO 2 layer. These cracks have also been observed by SEM on post mortem cross section of oxidized samples and by in-situ microscopy observations on the top surface of the sample during oxidation. The numerous small convoluted thin cracks observed deeper in the zirconia scale are not detected by the AE technique. From these studies we can conclude that mechanisms as irreversible mechanisms, as cracks initiation and propagation, generate AE signals
Theoretical analysis of leaky surface acoustic waves of point-focused acoustic lens and some experiments

International Nuclear Information System (INIS)

Ishikawa, Isao; Suzuki, Yoshiaki; Ogura, Yukio; Katakura, Kageyoshi

1997-01-01

When a point-focused acoustic lens in the scanning acoustic microscope (SAM) is faced to test specimen and defocused to some extent, two effective echoes can be obtained. One is the echo of longitudinal wave, which is normally incident upon the specimen of an on-axis beam in the central region of the lens and is reflected normal to the lens surface, hence detected by the transducer. The other is of leaky surface acoustic waves(LSAW), which are mode converted front a narrow beam of off-axis longitudinal wave, then propagate across the surface of the specimen and reradiate at angles normal to the lens surface, thus detected by the transducer. These two echoes are either interfered or separated with each other depending ell the defocused distance. It turned out theoretically that the LSAW have a narrow focal spot in the central region of the point-focused acoustic lens, whose size is approximately 40% of the LSAW wavelength. On top of that, a wavelength of LSAW is about 50% short as that of longitudinal wave. So, It is expected that high resolution images can be obtained provided LSAW are used in the scanning acoustic microscope.
Using the Voice to Design Ceramics

DEFF Research Database (Denmark)

Hansen, Flemming Tvede; Jensen, Kristoffer

2011-01-01

Digital technology makes new possibilities in ceramic craft. This project is about how experiential knowledge that the craftsmen gains in a direct physical and tactile interaction with a responding material can be transformed and utilized in the use of digital technologies. The project presents...... to make ceramic results. The system demonstrates the close connection between digital technology and craft practice....... SoundShaping, a system to create ceramics from the human voice. Based on a generic audio feature extraction system, and the principal component analysis to ensure that the pertinent information in the voice is used, a 3D shape is created using simple geometric rules. This shape is output to a 3D printer...
Voice analysis as an objective state marker in bipolar disorder

DEFF Research Database (Denmark)

Faurholt-Jepsen, M.; Busk, Jonas; Frost, M.

2016-01-01

features with automatically generated objective smartphone data on behavioral activities (for example, number of text messages and phone calls per day) and electronic self-monitored data (mood) on illness activity would increase the accuracy as a marker of affective states. Using smartphones, voice...... features, automatically generated objective smartphone data on behavioral activities and electronic self-monitored data were collected from 28 outpatients with bipolar disorder in naturalistic settings on a daily basis during a period of 12 weeks. Depressive and manic symptoms were assessed using...... to be more accurate, sensitive and specific in the classification of manic or mixed states with an area under the curve (AUC)=0.89 compared with an AUC=0.78 for the classification of depressive states. Combining voice features with automatically generated objective smartphone data on behavioral activities...
Accuracy of Dynamic and Acoustic Analysis of Lightweight Panel Structures

DEFF Research Database (Denmark)

Kirkegaard, Poul Henning; Dickow, Kristoffer Ahrens; Andersen, Lars Vabbersgaard

2012-01-01

in such buildings is important. In the lowfrequency range, prediction of sound and vibration in building structures may be achieved by finite-element analysis (FEA). The aim of this paper is to compare the two commercial codes ABAQUS and ANSYS for FEA of an acoustic-structural coupling in a timber lightweight panel...
Similar representations of emotions across faces and voices.

Science.gov (United States)

Kuhn, Lisa Katharina; Wydell, Taeko; Lavan, Nadine; McGettigan, Carolyn; Garrido, Lúcia

2017-09-01

of emotions within each modality. We then compared the representations across modalities by computing the correlations of the representation matrices across faces and voices. We found highly correlated matrices across modalities, which suggest similar representations of emotions across faces and voices. We also showed that these results could not be explained by commonalities between low-level visual and acoustic properties of the stimuli. We thus propose that there are similar or shared coding mechanisms for emotions which may act independently of modality, despite their distinct perceptual inputs. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Acute effects of radioiodine therapy on the voice and larynx of basedow-Graves patients

International Nuclear Information System (INIS)

Isolan-Cury, Roberta Werlang; Cury, Adriano Namo; Monte, Osmar; Silva, Marta Assumpcao de Andrada e; Duprat, Andre; Marone, Marilia; Almeida, Renata de; Iglesias, Alexandre

2008-01-01

Graves's disease is the most common cause of hyperthyroidism. There are three current therapeutic options: anti-thyroid medication, surgery, and radioactive iodine (I 131). There are few data in the literature regarding the effects of radioiodine therapy on the larynx and voice. The aim of this study was: to assess the effect of radioiodine therapy on the voice of Basedow-Graves patients. Material and method: A prospective study was done. Following the diagnosis of Grave's disease, patients underwent investigation of their voice, measurement of maximum phonatory time (/a/) and the s/z ratio, fundamental frequency analysis (Praat software), laryngoscopy and (perceptive-auditory) analysis in three different conditions: pre-treatment, 4 days, and 20 days post-radioiodine therapy. Conditions are based on the inflammatory pattern of thyroid tissue (Jones et al. 1999). Results: No statistically significant differences were found in voice characteristics in these three conditions. Conclusion: Radioiodine therapy does not affect voice quality. (author)
Acute effects of radioiodine therapy on the voice and larynx of basedow-Graves patients

Energy Technology Data Exchange (ETDEWEB)

Isolan-Cury, Roberta Werlang; Cury, Adriano Namo [Sao Paulo Santa Casa de Misericordia, SP (Brazil). Medical Science School (FCMSCSP); Monte, Osmar [Sao Paulo Santa Casa de Misericordia, SP (Brazil). Physiology Department; Silva, Marta Assumpcao de Andrada e [Sao Paulo Santa Casa de Misericordia, SP (Brazil). Medical Science School (FCMSCSP). Speech Therapy School; Duprat, Andre [Sao Paulo Santa Casa de Misericordia, SP (Brazil). Medical Science School (FCMSCSP). Otorhinolaryngology Department; Marone, Marilia [Nuclimagem - Irmanity of the Sao Paulo Santa Casa de Misericordia, SP (Brazil). Nuclear Medicine Unit; Almeida, Renata de; Iglesias, Alexandre [Sao Paulo Santa Casa de Misericordia, SP (Brazil). Medical Science School (FCMSCSP). Otorhinolaryngology Department. Endocrinology and Metabology Unit

2008-07-01

Graves's disease is the most common cause of hyperthyroidism. There are three current therapeutic options: anti-thyroid medication, surgery, and radioactive iodine (I 131). There are few data in the literature regarding the effects of radioiodine therapy on the larynx and voice. The aim of this study was: to assess the effect of radioiodine therapy on the voice of Basedow-Graves patients. Material and method: A prospective study was done. Following the diagnosis of Grave's disease, patients underwent investigation of their voice, measurement of maximum phonatory time (/a/) and the s/z ratio, fundamental frequency analysis (Praat software), laryngoscopy and (perceptive-auditory) analysis in three different conditions: pre-treatment, 4 days, and 20 days post-radioiodine therapy. Conditions are based on the inflammatory pattern of thyroid tissue (Jones et al. 1999). Results: No statistically significant differences were found in voice characteristics in these three conditions. Conclusion: Radioiodine therapy does not affect voice quality. (author)
Glottal inverse filtering analysis of human voice production — A ...

Indian Academy of Sciences (India)

A (grossly) simplified manner to study the functioning of the human speech production ...... selective auditory impairment in autism: can perceive but do not attend, Proc. Natl. Acad. .... Fritzell B 1996 Voice disorders and occupations, Logoped.
Voice Biometrics for Information Assurance Applications

National Research Council Canada - National Science Library

Kang, George

2002-01-01

.... The ultimate goal of voice biometrics is to enable the use of voice as a password. Voice biometrics are "man-in-the-loop" systems in which system performance is significantly dependent on human performance...
An experimental analysis of fracture mechanisms by acoustic ...

African Journals Online (AJOL)

rupture under monotonic loading in tensile test of a carbon ... respectively the longitudinal, transversal and ..... (1) location of acoustic output source, (2) Sensors and acoustic source position for 4 channel position,. O(x,y) ..... Due to multiple.
Health monitoring of Ceramic Matrix Composites from waveform-based analysis of Acoustic Emission

Directory of Open Access Journals (Sweden)

Maillet Emmanuel

2015-01-01

Full Text Available Ceramic Matrix Composites (CMCs are anticipated for use in the hot section of aircraft engines. Their implementation requires the understanding of the various damage modes that are involved and their relation to life expectancy. Acoustic Emission (AE has been shown to be an efficient technique for monitoring damage evolution in CMCs. However, only a waveform-based analysis of AE can offer the possibility to validate and precisely examine the recorded AE data with a view to damage localization and identification. The present work fully integrates wave initiation, propagation and acquisition in the analysis of Acoustic Emission waveforms recorded at various sensors, therefore providing more reliable information to assess the relation between Acoustic Emission and damage modes. The procedure allows selecting AE events originating from damage, accurate determination of their location as well as the characterization of effects of propagation on the recorded waveforms. This approach was developed using AE data recorded during tensile tests on carbon/carbon composites. It was then applied to melt-infiltrated SiC/SiC composites.
Prospective, longitudinal electroglottographic study of voice recovery following accelerated hypofractionated radiotherapy for T1/T2 larynx cancer

Energy Technology Data Exchange (ETDEWEB)

Kazi, Rehan [Head and Neck Unit, Royal Marsden Hospital, London (United Kingdom); Institute of Cancer Research, Cancer Research UK Centre for Cell and Molecular Biology, London (United Kingdom); Venkitaraman, Ramachandran; Johnson, Catherine; Prasad, Vyas; Clarke, Peter; Newbold, Kate; Rhys-Evans, Peter; Nutting, Christopher [Head and Neck Unit, Royal Marsden Hospital, London (United Kingdom); Harrington, Kevin [Head and Neck Unit, Royal Marsden Hospital, London (United Kingdom); Institute of Cancer Research, Cancer Research UK Centre for Cell and Molecular Biology, London (United Kingdom)], E-mail: kevinh@icr.ac.uk

2008-05-15

Background and purpose: To measure voice outcomes following accelerated hypofractionated radiotherapy for larynx cancer. Materials and methods: Twenty-five patients with T1/T2 glottic cancer underwent serial electroglottographic and acoustic analysis (sustained vowel/i/ and connected speech) before radiotherapy and 1, 6 and 12 months post-treatment. Twenty-five normal subjects served as a reference control population. Results: Pre-treatment measures were significantly worse for larynx cancer patients. Median jitter (0.23% vs 0.97%, p = 0.001) and shimmer (0.62 dB vs 0.98 dB, p = 0.05) and differences in data ranges reflected greater frequency and amplitude perturbation in the larynx cancer patients. Pre-treatment Mean Phonation Time (MPT) was significantly reduced (21 s vs 14.8 s, p = 0.002) in larynx cancer patients. There was a trend towards improvement of jitter, shimmer and normalized noise energy at 12 months post-treatment. MPT improved but remained significantly worse than for normal subjects (21 s vs 16.4 s, p = 0.013). Average fundamental frequency resembled normal subjects, including improvement of the measured range (91.4-244.6 Hz in controls vs 100-201 Hz in post-treatment larynx cancer patients). Conclusions: This non-invasive technique effectively measures post-treatment vocal function in larynx cancer patients. This study demonstrated improvement of many key parameters that influence voice function over 12 months after radiotherapy.
Prospective, longitudinal electroglottographic study of voice recovery following accelerated hypofractionated radiotherapy for T1/T2 larynx cancer

International Nuclear Information System (INIS)

Kazi, Rehan; Venkitaraman, Ramachandran; Johnson, Catherine; Prasad, Vyas; Clarke, Peter; Newbold, Kate; Rhys-Evans, Peter; Nutting, Christopher; Harrington, Kevin

2008-01-01

Background and purpose: To measure voice outcomes following accelerated hypofractionated radiotherapy for larynx cancer. Materials and methods: Twenty-five patients with T1/T2 glottic cancer underwent serial electroglottographic and acoustic analysis (sustained vowel/i/ and connected speech) before radiotherapy and 1, 6 and 12 months post-treatment. Twenty-five normal subjects served as a reference control population. Results: Pre-treatment measures were significantly worse for larynx cancer patients. Median jitter (0.23% vs 0.97%, p = 0.001) and shimmer (0.62 dB vs 0.98 dB, p = 0.05) and differences in data ranges reflected greater frequency and amplitude perturbation in the larynx cancer patients. Pre-treatment Mean Phonation Time (MPT) was significantly reduced (21 s vs 14.8 s, p = 0.002) in larynx cancer patients. There was a trend towards improvement of jitter, shimmer and normalized noise energy at 12 months post-treatment. MPT improved but remained significantly worse than for normal subjects (21 s vs 16.4 s, p = 0.013). Average fundamental frequency resembled normal subjects, including improvement of the measured range (91.4-244.6 Hz in controls vs 100-201 Hz in post-treatment larynx cancer patients). Conclusions: This non-invasive technique effectively measures post-treatment vocal function in larynx cancer patients. This study demonstrated improvement of many key parameters that influence voice function over 12 months after radiotherapy
Applying Acoustical and Musicological Analysis to Detect Brain Responses to Realistic Music: A Case Study

Directory of Open Access Journals (Sweden)

Niels Trusbak Haumann

2018-05-01

Full Text Available Music information retrieval (MIR methods offer interesting possibilities for automatically identifying time points in music recordings that relate to specific brain responses. However, how the acoustical features and the novelty of the music structure affect the brain response is not yet clear. In the present study, we tested a new method for automatically identifying time points of brain responses based on MIR analysis. We utilized an existing database including brain recordings of 48 healthy listeners measured with electroencephalography (EEG and magnetoencephalography (MEG. While we succeeded in capturing brain responses related to acoustical changes in the modern tango piece Adios Nonino, we obtained less reliable brain responses with a metal rock piece and a modern symphony orchestra musical composition. However, brain responses might also relate to the novelty of the music structure. Hence, we added a manual musicological analysis of novelty in the musical structure to the computational acoustic analysis, obtaining strong brain responses even to the rock and modern pieces. Although no standardized method yet exists, these preliminary results suggest that analysis of novelty in music is an important aid to MIR analysis for investigating brain responses to realistic music.
The relation of vocal fold lesions and voice quality to voice handicap and psychosomatic well-being

NARCIS (Netherlands)

Smits, R.; Marres, H.A.; de Jong, F.

2012-01-01

BACKGROUND: Voice disorders have a multifactorial genesis and may be present in various ways. They can cause a significant communication handicap and impaired quality of life. OBJECTIVE: To assess the effect of vocal fold lesions and voice quality on voice handicap and psychosomatic well-being.
We need to talk about Temer: the strangeness in the voice

Directory of Open Access Journals (Sweden)

Luciana Iost Vinhas

2017-11-01

Full Text Available The Brazilian political moment in 2016 presents the opacity of the political discourse. The materialization of this discourse is produced in different forms of material existence. This study has as its objective to analyze Michel Temer’s voice, who was the Vice-President in Dilma Roussef’s Government. The analysis will focus on his first speech as Acting President. The analytical path of the study starts through the identification of the strangeness (ERNST, 2009 in the vocal materiality, which is understood as a discursive materiality with a different status: Michel’s voice produces an effect of failure in the ritual. We analyze, then, the sense effects produced by this sapped, choked voice, trying to relate it to the main interest in the French Discourse Analysis: the connection between ideology and unconsciousness.
EXPERIMENTAL STUDY OF FIRMWARE FOR INPUT AND EXTRACTION OF USER’S VOICE SIGNAL IN VOICE AUTHENTICATION SYSTEMS

Directory of Open Access Journals (Sweden)

O. N. Faizulaieva

2014-09-01

Full Text Available Scientific task for improving the signal-to-noise ratio for user’s voice signal in computer systems and networks during the process of user’s voice authentication is considered. The object of study is the process of input and extraction of the voice signal of authentication system user in computer systems and networks. Methods and means for input and extraction of the voice signal on the background of external interference signals are investigated. Ways for quality improving of the user’s voice signal in systems of voice authentication are investigated experimentally. Firmware means for experimental unit of input and extraction of the user’s voice signal against external interference influence are considered. As modern computer means, including mobile, have two-channel audio card, two microphones are used in the voice signal input. The distance between sonic-wave sensors is 20 mm and it provides forming one direction pattern lobe of microphone array in a desired area of voice signal registration (from 100 Hz to 8 kHz. According to the results of experimental studies, the usage of directional properties of the proposed microphone array and space-time processing of the recorded signals with implementation of constant and adaptive weighting factors has made it possible to reduce considerably the influence of interference signals. The results of firmware experimental studies for input and extraction of the user’s voice signal against external interference influence are shown. The proposed solutions will give the possibility to improve the value of the signal/noise ratio of the useful signals recorded up to 20 dB under the influence of external interference signals in the frequency range from 4 to 8 kHz. The results may be useful to specialists working in the field of voice recognition and speaker discrimination.
Dor muscular em cabeça e pescoço e medidas vocais acústicas de fonte glótica Head and neck muscles pain and glottic source acoustical vocal measures

Directory of Open Access Journals (Sweden)

Luane de Moraes Boton

2012-02-01

56 year old with signs and symptoms of Temporomandibular Joint Dysfunction (TMD took part of this study. We applied an Anamnesis questionnaire, a specific clinic exam to check pain presence in head, face, mouth, neck muscles and in the ATM, otorhinolaryngology, stomatognatical and pure tone audiometric evaluations, the voice recorder and the acoustic voice analysis by the Multi Dimensional Voice Program Advanced software from KayPentax. The results were analyzed thought the qui-square test with statistical significance of (p<0.05. RESULTS: there were statistical significance between the pain absence in the superficial masseter and the alteration in the acoustic vocal parameter, such as Voice Turbulence Index (VTI; pain absence in the temporomandibular posterior aspect joint and the alteration of pitch perturbation quotient (PPQ and the fundamental frequency variation (Vfo; pain presence in the medial pterygoid area and normality in the degree of voice breaks (DVB. CONCLUSION: we found that there was no statistical significance between the alteration in voice parameters and the presence of pain in the evaluated muscles, but some parameters with alteration can have a relation with pain absence; this suggests that other aspects of temporomandibular joint disorder can interfere in the acoustical voice parameters.
Acoustic cloaking and transformation acoustics

International Nuclear Information System (INIS)

Chen Huanyang; Chan, C T

2010-01-01

In this review, we give a brief introduction to the application of the new technique of transformation acoustics, which draws on a correspondence between coordinate transformation and material properties. The technique is formulated for both acoustic waves and linear liquid surface waves. Some interesting conceptual devices can be designed for manipulating acoustic waves. For example, we can design acoustic cloaks that make an object invisible to acoustic waves, and the cloak can either encompass or lie outside the object to be concealed. Transformation acoustics, as an analog of transformation optics, can go beyond invisibility cloaking. As an illustration for manipulating linear liquid surface waves, we show that a liquid wave rotator can be designed and fabricated to rotate the wave front. The acoustic transformation media require acoustic materials which are anisotropic and inhomogeneous. Such materials are difficult to find in nature. However, composite materials with embedded sub-wavelength resonators can in principle be made and such 'acoustic metamaterials' can exhibit nearly arbitrary values of effective density and modulus tensors to satisfy the demanding material requirements in transformation acoustics. We introduce resonant sonic materials and Helmholtz resonators as examples of acoustic metamaterials that exhibit resonant behaviour in effective density and effective modulus. (topical review)

Some links on this page may take you to non-federal websites. Their policies may differ from this site.