WorldWideScience

Sample records for voice fundamental frequency

  1. Fundamental frequency and voice perturbation measures in smokers and non-smokers: An acoustic and perceptual study

    Science.gov (United States)

    Freeman, Allison

    This research examined the fundamental frequency and perturbation (jitter % and shimmer %) measures in young adult (20-30 year-old) and middle-aged adult (40-55 year-old) smokers and non-smokers; there were 36 smokers and 36 non-smokers. Acoustic analysis was carried out utilizing one task: production of sustained /a/. These voice samples were analyzed utilizing Multi-Dimensional Voice Program (MDVP) software, which provided values for fundamental frequency, jitter %, and shimmer %. These values were analyzed for trends regarding smoking status, age, and gender. Statistical significance was found regarding the fundamental frequency, jitter %, and shimmer % for smokers as compared to non-smokers; smokers were found to have significantly lower fundamental frequency values, and significantly higher jitter % and shimmer % values. Statistical significance was not found regarding fundamental frequency, jitter %, and shimmer % for age group comparisons. With regard to gender, statistical significance was found regarding fundamental frequency; females were found to have significantly higher fundamental frequencies as compared to males. However, the relationships between gender and jitter % and shimmer % lacked statistical significance. These results indicate that smoking negatively affects voice quality. This study also examined the ability of untrained listeners to identify smokers and non-smokers based on their voices. Results of this voice perception task suggest that listeners are not able to accurately identify smokers and non-smokers, as statistical significance was not reached. However, despite a lack of significance, trends in data suggest that listeners are able to utilize voice quality to identify smokers and non-smokers.
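The perturbation measures named in this abstract can be illustrated with a minimal sketch. It assumes the common "local" definitions (mean absolute difference between consecutive cycles, divided by the mean value, times 100); whether these match the formulas MDVP uses internally is an assumption, and the function names and sample values are hypothetical.

```python
def perturbation_percent(values):
    """Mean absolute cycle-to-cycle difference as a percentage of the mean value."""
    if len(values) < 2:
        raise ValueError("need at least two glottal cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_value = sum(values) / len(values)
    return 100.0 * mean_diff / mean_value


def jitter_percent(periods_s):
    """Jitter %: cycle-to-cycle variation of the pitch period (seconds)."""
    return perturbation_percent(periods_s)


def shimmer_percent(peak_amplitudes):
    """Shimmer %: cycle-to-cycle variation of the peak amplitude."""
    return perturbation_percent(peak_amplitudes)


def mean_f0_hz(periods_s):
    """Mean fundamental frequency: number of cycles divided by total duration."""
    return len(periods_s) / sum(periods_s)


# Hypothetical cycle measurements from a sustained /a/ (not data from the study).
periods = [0.0050, 0.0051, 0.0049, 0.0050]
amplitudes = [0.80, 0.78, 0.82, 0.80]
print(mean_f0_hz(periods), jitter_percent(periods), shimmer_percent(amplitudes))
```

Extracting the per-cycle periods and amplitudes from the waveform is the hard part in practice and is not covered here.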

  2. [Fundamental frequency analysis - a contribution to the objective examination of the speaking and singing voice (author's transl)].

    Science.gov (United States)

    Schultz-Coulon, H J

    1975-07-01

    The applicability of a newly developed fundamental frequency analyzer to diagnosis in phoniatrics is reviewed. During routine voice examination, the analyzer allows a quick and accurate measurement of fundamental frequency and sound level of the speaking voice, and of vocal range and maximum phonation time. By computing fundamental frequency histograms, the median fundamental frequency and the total pitch range can be better determined and compared. Objective studies of certain technical faculties of the singing voice, which usually are estimated subjectively by the speech therapist, may now be done by means of this analyzer. Several examples demonstrate the differences between correct and incorrect phonation. These studies compare the pitch perturbations during the crescendo and decrescendo of a swell-tone, and show typical traces of staccato, trill and yodel. Conclusions of the study indicate that fundamental frequency analysis is a valuable supplemental method for objective voice examination.
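The histogram-based summary described in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the analyzer's actual procedure: the 10 Hz bin width, the convention that 0 marks an unvoiced frame, and the function name `f0_summary` are all hypothetical.

```python
import math
from statistics import median


def f0_summary(f0_hz, bin_width_hz=10.0):
    """Histogram, median, and total pitch range of a frame-wise F0 track."""
    voiced = [f for f in f0_hz if f > 0]  # assumption: 0 marks unvoiced frames
    if not voiced:
        raise ValueError("no voiced frames")
    histogram = {}
    for f in voiced:
        lower_edge = math.floor(f / bin_width_hz) * bin_width_hz
        histogram[lower_edge] = histogram.get(lower_edge, 0) + 1
    # Pitch range in semitones: 12 * log2(highest F0 / lowest F0).
    range_semitones = 12.0 * math.log2(max(voiced) / min(voiced))
    return {
        "histogram": dict(sorted(histogram.items())),
        "median_hz": median(voiced),
        "range_semitones": range_semitones,
    }


# Hypothetical F0 track in Hz.
print(f0_summary([0, 118, 121, 125, 0, 130, 142, 119]))
```

The median is used rather than the mean because a few octave errors in the F0 track would otherwise pull the summary value noticeably.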

  3. Influences of Fundamental Frequency, Formant Frequencies, Aperiodicity, and Spectrum Level on the Perception of Voice Gender

    Science.gov (United States)

    Skuk, Verena G.; Schweinberger, Stefan R.

    2014-01-01

    Purpose: To determine the relative importance of acoustic parameters (fundamental frequency [F0], formant frequencies [FFs], aperiodicity, and spectrum level [SL]) on voice gender perception, the authors used a novel parameter-morphing approach that, unlike spectral envelope shifting, allows the application of nonuniform scale factors to transform…

  4. Instantaneous Fundamental Frequency Estimation with Optimal Segmentation for Nonstationary Voiced Speech

    DEFF Research Database (Denmark)

    Nørholm, Sidsel Marie; Jensen, Jesper Rindom; Christensen, Mads Græsbøll

    2016-01-01

    In speech processing, the speech is often considered stationary within segments of 20–30 ms even though it is well known not to be true. In this paper, we take the non-stationarity of voiced speech into account by using a linear chirp model to describe the speech signal. We propose a maximum...... likelihood estimator of the fundamental frequency and chirp rate of this model, and show that it reaches the Cramer-Rao bound. Since the speech varies over time, a fixed segment length is not optimal, and we propose to make a segmentation of the signal based on the maximum a posteriori (MAP) criterion. Using...... of the chirp model than the harmonic model to the speech signal. The methods are based on an assumption of white Gaussian noise, and, therefore, two prewhitening filters are also proposed....

  5. Variations in voice level and fundamental frequency with changing background noise level and talker-to-listener distance while wearing hearing protectors: A pilot study

    DEFF Research Database (Denmark)

    Bouserhal, Rachel E.; MacDonald, Ewen; Falk, Tiago H.

    2016-01-01

    in voice level and fundamental frequency in noise and with varying talker-to-listener distance. Study sample: Twelve participants with a mean age of 28 participated in this study. Results: Compared to existing data, results show a trend similar to the open ear condition with the exception of the occluded...

  6. A wearable multichannel tactile display of voice fundamental frequency.

    Science.gov (United States)

    Yeung, E; Boothroyd, A; Redmond, C

    1988-12-01

    This paper describes a wearable sensory aid that provides the deaf with tactually encoded information about intonation. Fundamental frequency is represented as both place and rate of vibration in a linear array of solenoids. Pitch extraction is accomplished through low-pass filtering and peak detection. A microcomputer is used to measure pitch period, which in turn determines which of the solenoids is actuated. By comparing consecutive periods, the system discriminates against random, noise-related inputs. The device is switchable between 1-, 8-, and 16-channel operation. The electronics package is contained in a case that may be worn on a belt. The solenoid array is worn on the forearm. The system is powered by five rechargeable lithium cells and runs for at least 6 hours between charges. Proposed developments include the incorporation of digital pitch extraction methods and the option to use the spatial output dimension to encode speech parameters other than fundamental frequency.
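The period-to-solenoid step described in this abstract might look roughly like the sketch below. The abstract does not give the mapping, so everything specific here is an assumption made for illustration: the logarithmic spacing of channels, the 80-400 Hz range, the 20% consistency tolerance, and the function name `select_channel`.

```python
import math


def select_channel(period_s, prev_period_s, n_channels=8,
                   f_lo=80.0, f_hi=400.0, tolerance=0.2):
    """Map a measured pitch period to a solenoid index, or None if rejected."""
    if prev_period_s is None or period_s <= 0 or prev_period_s <= 0:
        return None
    # Consecutive-period check: a large jump is treated as noise.
    if abs(period_s - prev_period_s) / prev_period_s > tolerance:
        return None
    f0 = min(max(1.0 / period_s, f_lo), f_hi)  # clamp to the displayed range
    # Position of F0 within the range on a log scale, from 0.0 to 1.0.
    position = math.log(f0 / f_lo) / math.log(f_hi / f_lo)
    return min(int(position * n_channels), n_channels - 1)


# Two consistent 5 ms periods (200 Hz) select a channel; an erratic jump does not.
print(select_channel(0.005, 0.005))
print(select_channel(0.005, 0.010))
```

With `n_channels=1` every accepted period maps to channel 0, so only the rate of vibration carries pitch information, which is consistent with the 1-channel mode mentioned above.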

  7. Predicting Achievable Fundamental Frequency Ranges in Vocalization Across Species.

    Directory of Open Access Journals (Sweden)

    Ingo Titze

    2016-06-01

    Vocal folds are used as sound sources in various species, but it is unknown how vocal fold morphologies are optimized for different acoustic objectives. Here we identify two main variables affecting range of vocal fold vibration frequency, namely vocal fold elongation and tissue fiber stress. A simple vibrating string model is used to predict fundamental frequency ranges across species of different vocal fold sizes. While average fundamental frequency is predominantly determined by vocal fold length (larynx size), range of fundamental frequency is facilitated by (1) laryngeal muscles that control elongation and by (2) nonlinearity in tissue fiber tension. One adaptation that would increase fundamental frequency range is greater freedom in joint rotation or gliding of two cartilages (thyroid and cricoid), so that vocal fold length change is maximized. Alternatively, tissue layers can develop to bear a disproportionate fiber tension (i.e., a ligament with high density collagen fibers), increasing the fundamental frequency range and thereby vocal versatility. The range of fundamental frequency across species is thus not simply one-dimensional, but can be conceptualized as the dependent variable in a multi-dimensional morphospace. In humans, this could allow for variations that could be clinically important for voice therapy and vocal fold repair. Alternative solutions could also have importance in vocal training for singing and other highly-skilled vocalizations.
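The vibrating string model mentioned in this abstract can be made concrete with the textbook ideal-string relation F0 = (1 / 2L) * sqrt(stress / density). The sketch below uses only that relation; the paper's exact formulation is not given in the abstract, and the density value and example numbers are assumptions chosen for illustration.

```python
import math

TISSUE_DENSITY_KG_M3 = 1040.0  # assumed value, close to that of water


def string_f0_hz(length_m, stress_pa, density_kg_m3=TISSUE_DENSITY_KG_M3):
    """Fundamental frequency of an ideal string fixed at both ends."""
    return math.sqrt(stress_pa / density_kg_m3) / (2.0 * length_m)


# Hypothetical values: a 10 mm fold at 4160 Pa of fiber stress.
print(string_f0_hz(0.010, 4160.0))
# Elongation alone lowers F0; the rise in fiber stress that comes with
# elongation is what raises it, which is why stress nonlinearity matters.
print(string_f0_hz(0.012, 4160.0), string_f0_hz(0.012, 4 * 4160.0))
```

In this relation F0 scales with 1/L but only with the square root of stress, so a wide F0 range needs stress to grow steeply as the fold is stretched.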

  8. The Siren song of vocal fundamental frequency for romantic relationships

    Directory of Open Access Journals (Sweden)

    Sarah Weusthoff

    2013-07-01

    A multitude of factors contribute to why and how romantic relationships are formed as well as whether they ultimately succeed or fail. Drawing on evolutionary models of attraction and speech production as well as integrative models of relationship functioning, this review argues that paralinguistic cues (more specifically the fundamental frequency of the voice) that are initially a strong source of attraction also increase couples’ risk for relationship failure. Conceptual similarities and differences between the multiple operationalizations and interpretations of vocal fundamental frequency are discussed and guidelines are presented for understanding both convergent and non-convergent findings. Implications for clinical practice and future research are discussed.

  9. The siren song of vocal fundamental frequency for romantic relationships.

    Science.gov (United States)

    Weusthoff, Sarah; Baucom, Brian R; Hahlweg, Kurt

    2013-01-01

    A multitude of factors contribute to why and how romantic relationships are formed as well as whether they ultimately succeed or fail. Drawing on evolutionary models of attraction and speech production as well as integrative models of relationship functioning, this review argues that paralinguistic cues (more specifically the fundamental frequency of the voice) that are initially a strong source of attraction also increase couples' risk for relationship failure. Conceptual similarities and differences between the multiple operationalizations and interpretations of vocal fundamental frequency are discussed and guidelines are presented for understanding both convergent and non-convergent findings. Implications for clinical practice and future research are discussed.

  10. Fundamental frequency characteristics of Jordanian Arabic speakers.

    Science.gov (United States)

    Natour, Yaser S; Wingate, Judith M

    2009-09-01

    This study is the first in a series of investigations designed to test the acoustic characteristics of the normal Arabic voice. The subjects were three hundred normal Jordanian Arabic speakers (100 adult males, 100 adult females, and 100 children). The subjects produced a sustained phonation of the vowel /a:/ and stated their complete names (i.e. first, second, third and surname) using a carrier phrase. The samples were analyzed using the Multi Dimensional Voice Program (MDVP). Fundamental frequency (F0) from the /a:/ and speaking fundamental frequency (SF0) from the sentence were analyzed. Results revealed a significant difference of both F0 and SF0 values among adult Jordanian Arabic-speaking males (F0=131.34Hz +/- 18.65, SF0=137.45 +/- 18.93), females (F0=231.13Hz +/- 20.86, SF0=230.84 +/- 16.50) and children (F0=270.93Hz +/- 20.01, SF0=278.04 +/- 32.07). Comparison with other ethnicities indicated that F0 values of adult Jordanian Arabic-speaking males and females are generally consistent with adult Caucasian and African-American values. However, for Jordanian Arabic-speaking children, F0 values tended to be higher than those of their Western counterparts. SF0 values for adult Jordanian Arabic-speaking males are generally consistent with the adult Caucasian male SF0 values. However, SF0 values of adult Jordanian Arabic-speaking females and children were relatively higher than the reported Western values. It is recommended that speech-language pathologists in Arabic-speaking countries, Jordan in particular, utilize the new data provided (F0 and SF0) when evaluating and/or treating Arabic-speaking patients. Due to its cross-linguistic variability, SF0 emerged as a preferred measurement when conducting cross-cultural comparisons of voice features.

  11. Objective voice parameters in Colombian school workers with healthy voices

    NARCIS (Netherlands)

    L.C. Cantor Cutiva (Lady Catherine); A. Burdorf (Alex)

    2015-01-01

    Objectives: To characterize the objective voice parameters among school workers, and to identify associated factors of three objective voice parameters, namely fundamental frequency, sound pressure level and maximum phonation time. Materials and methods: We conducted a cross-sectional

  12. Objective Voice Parameters in Colombian School Workers with Healthy Voices

    Directory of Open Access Journals (Sweden)

    Lady Catherine Cantor Cutiva

    2015-09-01

    Objectives: To characterize the objective voice parameters among school workers, and to identify associated factors of three objective voice parameters, namely fundamental frequency, sound pressure level and maximum phonation time. Materials and methods: We conducted a cross-sectional study among 116 Colombian teachers and 20 Colombian non-teachers. After signing the informed consent form, participants filled out a questionnaire. Then, a voice sample was recorded and evaluated perceptually by a speech therapist and by objective voice analysis with Praat software. Short-term environmental measurements of sound level, temperature, humidity, and reverberation time were conducted during visits at the workplaces, such as classrooms and offices. Linear regression analysis was used to determine associations between individual and work-related factors and objective voice parameters. Results: Compared with men, women had higher fundamental frequency (201 Hz for teachers and 209 for non-teachers vs. 120 Hz for teachers and 127 for non-teachers) and sound pressure level (82 dB vs. 80 dB), and shorter maximum phonation time (around 14 seconds vs. around 16 seconds). Female teachers younger than 50 years of age evidenced a significant tendency to speak with lower fundamental frequency and shorter MPT compared with female teachers older than 50 years of age. Female teachers had significantly higher fundamental frequency (66 Hz), higher sound pressure level (2 dB) and shorter phonation time (2 seconds) than male teachers. Conclusion: Female teachers younger than 50 years of age had significantly lower F0 and shorter MPT compared with those older than 50 years of age. The multivariate analysis showed that gender was a much more important determinant of variations in F0, SPL and MPT than age and teaching occupation. Objectively measured temperature also contributed to the changes in SPL among school workers.

  13. Voice similarity in identical twins.

    Science.gov (United States)

    Van Gysel, W D; Vercammen, J; Debruyne, F

    2001-01-01

    If people are asked to tell apart visually the two individuals of a monozygotic twin (MT) pair, they usually have difficulty. Does this problem also exist when listening to twin voices? Twenty female and 10 male MT voices were randomly assembled with one "strange" voice to get voice trios. The listeners (10 female students in Speech and Language Pathology) were asked to label the twins (voices 1-2, 1-3 or 2-3) in two conditions: two standard sentences read aloud and a 2.5-second midsection of a sustained /a/. The proportion of correctly labelled twins was 82% and 63% for female voices and 74% and 52% for male voices, for the sentences and the sustained /a/ respectively, in both conditions significantly greater than chance (33%). The acoustic analysis revealed a high intra-twin correlation for the speaking fundamental frequency (SFF) of the sentences and the fundamental frequency (F0) of the sustained /a/. So the voice pitch could have been a useful characteristic in the perceptual identification of the twins. We conclude that there is a greater perceptual resemblance between the voices of identical twins than between voices without genetic relationship. The identification, however, is not perfect. The voice pitch possibly contributes to the correct twin identifications.

  14. Correlation analysis of the physiological factors controlling fundamental voice frequency.

    Science.gov (United States)

    Atkinson, J E

    1978-01-01

    A technique has been developed to obtain a quantitative measure of correlation between electromyographic (EMG) activity of various laryngeal muscles, subglottal air pressure, and the fundamental frequency of vibration of the vocal folds (Fo). Data were collected and analyzed on one subject, a native speaker of American English. The results show that an analysis of this type can provide a useful measure of correlation between the physiological and acoustical events in speech and, furthermore, can yield detailed insights into the organization and nature of the speech production process. In particular, based on these results, a model is suggested of Fo control involving laryngeal state functions that seems to agree with present knowledge of laryngeal control and experimental evidence.

  15. Spatiotemporal frequency characteristics of cerebral oscillations during the perception of fundamental frequency contour changes in one-syllable intonation.

    Science.gov (United States)

    Ueno, Sanae; Okumura, Eiichi; Remijn, Gerard B; Yoshimura, Yuko; Kikuchi, Mitsuru; Shitamichi, Kiyomi; Nagao, Kikuko; Mochiduki, Masayuki; Haruta, Yasuhiro; Hayashi, Norio; Munesue, Toshio; Tsubokawa, Tsunehisa; Oi, Manabu; Nakatani, Hideo; Higashida, Haruhiro; Minabe, Yoshio

    2012-05-02

    Accurate perception of fundamental frequency (F0) contour changes in the human voice is important for understanding a speaker's intonation, and consequently also his/her attitude. In this study, we investigated the neural processes involved in the perception of F0 contour changes in the Japanese one-syllable interjection "ne" in 21 native-Japanese listeners. A passive oddball paradigm was applied in which "ne" with a high falling F0 contour, used when urging a reaction from the listener, was randomly presented as a rare deviant among a frequent "ne" syllable with a flat F0 contour (i.e., meaningless intonation). We applied an adaptive spatial filtering method to the neuromagnetic time course recorded by whole-head magnetoencephalography (MEG) and estimated the spatiotemporal frequency dynamics of event-related cerebral oscillatory changes in the oddball paradigm. Our results demonstrated a significant elevation of beta band event-related desynchronization (ERD) in the right temporal and frontal areas, in time windows from 100 to 300 and from 300 to 500 ms after the onset of deviant stimuli (high falling F0 contour). This is the first study to reveal detailed spatiotemporal frequency characteristics of cerebral oscillations during the perception of intonational (not lexical) F0 contour changes in the human voice. The results further confirmed that the right hemisphere is associated with perception of intonational F0 contour information in the human voice, especially in early time windows. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.

  16. Numerical analysis of effects of transglottal pressure change on fundamental frequency of phonation.

    Science.gov (United States)

    Deguchi, Shinji; Matsuzaki, Yuji; Ikeda, Tadashige

    2007-02-01

    In humans, a decrease in transglottal pressure (Pt) causes an increase in the fundamental frequency of phonation (F0) only at a specific voice pitch within the modal register, the mechanism of which remains unclear. In the present study, numerical analyses were performed to investigate the mechanism of the voice pitch-dependent positive change of F0 due to Pt decrease. The airflow and the airway, including the vocal folds, were modeled in terms of mechanics of fluid and structure. Simulations of phonation using the numerical model indicated that Pt affects both the average position and the average amplitude of vocal fold self-excited oscillation in a non-monotonic manner. This effect results in voice pitch-dependent responses of F0 to Pt decreases, including the positive response of F0 as actually observed in humans. The findings of the present study highlight the importance of considering self-excited oscillation of the vocal folds in elucidation of the phonation mechanism.

  17. Fundamental Frequency and Direction-of-Arrival Estimation for Multichannel Speech Enhancement

    DEFF Research Database (Denmark)

    Karimian-Azari, Sam

    Audio systems receive the speech signals of interest usually in the presence of noise. The noise has profound impacts on the quality and intelligibility of the speech signals, and it is therefore clear that the noisy signals must be cleaned up before being played back, stored, or analyzed. We can...... estimate the speech signal of interest from the noisy signals using a priori knowledge about it. A human speech signal is broadband and consists of both voiced and unvoiced parts. The voiced part is quasi-periodic with a time-varying fundamental frequency (or pitch as it is commonly referred to). We...... their time differences which eventually may further reduce the effects of noise. This thesis introduces a number of principles and methods to estimate periodic signals in noisy environments with application to multichannel speech enhancement. We propose model-based signal enhancement concerning the model...

  18. YIN, a fundamental frequency estimator for speech and music

    Science.gov (United States)

    de Cheveigné, Alain; Kawahara, Hideki

    2002-04-01

    An algorithm is presented for the estimation of the fundamental frequency (F0) of speech or musical sounds. It is based on the well-known autocorrelation method with a number of modifications that combine to prevent errors. The algorithm has several desirable features. Error rates are about three times lower than the best competing methods, as evaluated over a database of speech recorded together with a laryngograph signal. There is no upper limit on the frequency search range, so the algorithm is suited for high-pitched voices and music. The algorithm is relatively simple and may be implemented efficiently and with low latency, and it involves few parameters that must be tuned. It is based on a signal model (periodic signal) that may be extended in several ways to handle various forms of aperiodicity that occur in particular applications. Finally, interesting parallels may be drawn with models of auditory processing.
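The core of the autocorrelation-style approach described in this abstract can be sketched in three steps: a squared-difference function over candidate lags, a cumulative-mean normalization of that function, and an absolute threshold that picks the first sufficiently deep dip. The code below is a simplified illustration only. It omits the parabolic interpolation and local refinement of the published algorithm and makes no voiced/unvoiced decision; the function name and the default parameter values are assumptions.

```python
import math


def yin_f0(frame, sample_rate, f_min=60.0, threshold=0.1):
    """Estimate F0 (Hz) of one frame with a simplified YIN-style procedure."""
    tau_max = int(sample_rate / f_min)
    window = len(frame) - tau_max
    if window <= 0:
        raise ValueError("frame too short for the chosen f_min")

    # Step 1: squared difference between the frame and its lagged copy.
    diff = [0.0] * (tau_max + 1)
    for tau in range(1, tau_max + 1):
        diff[tau] = sum((frame[j] - frame[j + tau]) ** 2 for j in range(window))

    # Step 2: divide each value by the running mean of the values so far.
    cmnd = [1.0] * (tau_max + 1)
    running_sum = 0.0
    for tau in range(1, tau_max + 1):
        running_sum += diff[tau]
        if running_sum > 0:
            cmnd[tau] = diff[tau] * tau / running_sum

    # Step 3: first lag below the threshold, followed down to its local minimum.
    for tau in range(2, tau_max):
        if cmnd[tau] < threshold:
            while tau + 1 <= tau_max and cmnd[tau + 1] < cmnd[tau]:
                tau += 1
            return sample_rate / tau

    # Fallback when no dip is deep enough: lag with the global minimum.
    best_tau = min(range(1, tau_max + 1), key=cmnd.__getitem__)
    return sample_rate / best_tau


# Synthetic 200 Hz tone sampled at 8 kHz.
rate = 8000
tone = [math.sin(2 * math.pi * 200 * n / rate) for n in range(400)]
print(yin_f0(tone, rate))
```

Because the search starts at short lags and only `f_min` bounds the longest lag, high F0 values are not excluded, which matches the abstract's remark about there being no upper limit on the search range. Without interpolation the estimate is quantized to integer lags.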

  19. Prior and posterior probabilistic models of uncertainties in a model for producing voice

    International Nuclear Information System (INIS)

    Cataldo, Edson; Sampaio, Rubens; Soize, Christian

    2010-01-01

    The aim of this paper is to use Bayesian statistics to update a probability density function related to the tension parameter, which is one of the main parameters responsible for the changing of the fundamental frequency of a voice signal, generated by a mechanical/mathematical model for producing voiced sounds. We follow a parametric approach for stochastic modeling, which requires the adoption of random variables to represent the uncertain parameters present in the cited model. For each random variable, a probability density function is constructed using the Maximum Entropy Principle and the Monte Carlo method is used to generate voice signals as the output of the model. Then, a probability density function of the voice fundamental frequency is constructed. The random variables are fit to experimental data so that the probability density function of the fundamental frequency obtained by the model is as close as possible to a probability density function obtained from experimental data. New values are obtained experimentally for the fundamental frequency and they are used to update the probability density function of the tension parameter, via Bayes's Theorem.

  20. Playful Interaction with Voice Sensing Modular Robots

    DEFF Research Database (Denmark)

    Heesche, Bjarke; MacDonald, Ewen; Fogh, Rune

    2013-01-01

    This paper describes a voice sensor, suitable for modular robotic systems, which estimates the energy and fundamental frequency, F0, of the user’s voice. Through a number of example applications and tests with children, we observe how the voice sensor facilitates playful interaction between child...... children and two different robot configurations. In future work, we will investigate if such a system can motivate children to improve voice control and explore how to extend the sensor to detect emotions in the user’s voice....

  1. [Assessment of voice acoustic parameters in female teachers with diagnosed occupational voice disorders].

    Science.gov (United States)

    Niebudek-Bogusz, Ewa; Fiszer, Marta; Sliwińska-Kowalska, Mariola

    2005-01-01

    Laryngovideostroboscopy is the method most frequently used in the assessment of voice disorders. However, the employment of quantitative methods, such as voice acoustic analysis, is essential for evaluating the effectiveness of prophylactic and therapeutic activities as well as for objective medical certification of larynx pathologies. The aim of this study was to examine voice acoustic parameters in female teachers with occupational voice diseases. Acoustic analysis (IRIS software) was performed in 66 female teachers, including 35 teachers with occupational voice diseases and 31 with functional dysphonia. The teachers with occupational voice diseases presented a lower average fundamental frequency (193 Hz) compared to the group with functional dysphonia (209 Hz) and to the normative value (236 Hz), whereas other acoustic parameters did not differ significantly between the two groups. Voice acoustic analysis, when applied separately from vocal loading, cannot be used as a testing method to verify the diagnosis of occupational voice disorders.

  2. Connections between voice ergonomic risk factors in classrooms and teachers' voice production.

    Science.gov (United States)

    Rantala, Leena M; Hakala, Suvi; Holmqvist, Sofia; Sala, Eeva

    2012-01-01

    The aim of the study was to investigate if voice ergonomic risk factors in classrooms correlated with acoustic parameters of teachers' voice production. The voice ergonomic risk factors in the fields of working culture, working postures and indoor air quality were assessed in 40 classrooms using the Voice Ergonomic Assessment in Work Environment - Handbook and Checklist. Teachers (32 females, 8 males) from the above-mentioned classrooms recorded text readings before and after a working day. Fundamental frequency, sound pressure level (SPL) and the slope of the spectrum (alpha ratio) were analyzed. The higher the number of the risk factors in the classrooms, the higher SPL the teachers used and the more strained the males' voices (increased alpha ratio) were. The SPL was already higher before the working day in the teachers with higher risk than in those with lower risk. In the working environment with many voice ergonomic risk factors, speakers increase voice loudness and use more strained voice quality (males). A practical implication of the results is that voice ergonomic assessments are needed in schools. Copyright © 2013 S. Karger AG, Basel.

  3. Gender classification in children based on speech characteristics: using fundamental and formant frequencies of Malay vowels.

    Science.gov (United States)

    Zourmand, Alireza; Ting, Hua-Nong; Mirhassani, Seyed Mostafa

    2013-03-01

    Speech is one of the prevalent communication mediums for humans. Identifying the gender of a child speaker based on his/her speech is crucial in telecommunication and speech therapy. This article investigates the use of fundamental and formant frequencies from sustained vowel phonation to distinguish the gender of Malay children aged between 7 and 12 years. The Euclidean minimum distance and multilayer perceptron were used to classify the gender of 360 Malay children based on different combinations of fundamental and formant frequencies (F0, F1, F2, and F3). The Euclidean minimum distance with normalized frequency data achieved a classification accuracy of 79.44%, which was higher than that of the nonnormalized frequency data. Age-dependent modeling was used to improve the accuracy of gender classification. The Euclidean distance method obtained 84.17% based on the optimal classification accuracy for all age groups. The accuracy was further increased to 99.81% using multilayer perceptron based on mel-frequency cepstral coefficients. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
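A minimum-distance classifier of the kind named in this abstract can be sketched as follows: each class is represented by the centroid of its training samples, and a new sample is assigned to the class whose centroid is nearest in Euclidean distance. The normalization the authors used is not stated in the abstract, so the z-score scaling here is an assumption, as are the function names and the made-up (F0, F1) values.

```python
import math


def fit(samples, labels):
    """Compute per-feature z-score parameters and one centroid per class."""
    n_features = len(samples[0])
    means = [sum(s[i] for s in samples) / len(samples) for i in range(n_features)]
    stds = []
    for i in range(n_features):
        variance = sum((s[i] - means[i]) ** 2 for s in samples) / len(samples)
        stds.append(math.sqrt(variance) or 1.0)  # guard against zero spread
    centroids = {}
    for label in set(labels):
        members = [normalize(s, means, stds)
                   for s, l in zip(samples, labels) if l == label]
        centroids[label] = [sum(m[i] for m in members) / len(members)
                            for i in range(n_features)]
    return means, stds, centroids


def normalize(sample, means, stds):
    """Scale each feature to zero mean and unit variance."""
    return [(v - m) / sd for v, m, sd in zip(sample, means, stds)]


def classify(sample, model):
    """Return the label of the nearest class centroid."""
    means, stds, centroids = model
    z = normalize(sample, means, stds)
    return min(centroids, key=lambda label: math.dist(z, centroids[label]))


# Made-up (F0, F1) values in Hz, purely illustrative.
train = [(250, 900), (260, 950), (220, 800), (230, 820)]
labels = ["girl", "girl", "boy", "boy"]
model = fit(train, labels)
print(classify((255, 920), model))
```

Normalizing first keeps a feature with large numeric values, such as a formant near 1000 Hz, from dominating the distance over F0.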

  4. Estudo da freqüência fundamental da voz de idosas portadoras de diferentes graus de perda auditiva [Study of the fundamental frequency in elderly women with hearing loss]

    Directory of Open Access Journals (Sweden)

    Giovana dos Santos Baraldi

    2007-06-01

    Hearing impairment is among the disorders most often reported by the elderly population. The auditory feedback system is known to be essential for monitoring vocal parameters such as fundamental frequency. AIM: To correlate hearing with the F0 (fundamental frequency) values of the voice in elderly women with different degrees of hearing sensitivity. STUDY DESIGN: Descriptive cross-sectional. MATERIAL AND METHODS: A sample of 30 elderly women, mean age 76.23, with normal hearing or symmetric sloping sensorineural hearing loss. They underwent an interview, a hearing assessment (pure-tone threshold audiometry, speech recognition percentage index [IPRF] and immittance testing) and a voice assessment. The results of both assessments were correlated. RESULTS: The F0 of voice production in elderly women with mild loss (144.44) was significantly lower than for moderate (160.3), moderately severe (188.23) and severe loss (201.27), whether the degree of hearing loss was classified using low or high frequencies. CONCLUSION: The higher the degree of hearing loss, the higher the fundamental frequency value found.

    Increased life expectancy raises demands for special attention for the elderly population; speech, language and hearing science deals with their communication disorders. Hearing loss is a common disorder affecting this age group. It is known that the auditory feedback system is essential to human vocalizing, as it organizes voice production. AIM: To assess and correlate the hearing system and the Fundamental Frequency (F0) of women who have variable degrees of sensorineural hearing loss. MATERIAL AND METHOD: a cross-sectional descriptive study. 30 women with a mean age of 75.95 (SD = 7.41) were included. Inclusion criteria were: symmetric sensorineural hearing loss, a high-frequency sloping configuration, and a type A tympanogram. Subjects underwent Pure Tone Audiometry, a Word Recognition Test, Tympanometry

  5. Emotional state and its impact on voice authentication accuracy

    Science.gov (United States)

    Voznak, Miroslav; Partila, Pavol; Penhaker, Marek; Peterek, Tomas; Tomala, Karel; Rezac, Filip; Safarik, Jakub

    2013-05-01

    The paper deals with increasing the accuracy of voice authentication methods. The developed algorithm first extracts segmental parameters, such as Zero Crossing Rate, the Fundamental Frequency and Mel-frequency cepstral coefficients from voice. Based on these parameters, the neural network classifier detects the speaker's emotional state. These parameters shape the distribution of neurons in Kohonen maps, forming clusters of neurons on the map characterizing a particular emotional state. Using regression analysis, we can calculate the function of the parameters of individual emotional states. This relationship increases voice authentication accuracy and prevents unjust rejection.

  6. Perceiving a stranger's voice as being one's own: a 'rubber voice' illusion?

    Directory of Open Access Journals (Sweden)

    Zane Z Zheng

    2011-04-01

    Full Text Available We describe an illusion in which a stranger's voice, when presented as the auditory concomitant of a participant's own speech, is perceived as a modified version of their own voice. When the congruence between utterance and feedback breaks down, the illusion is also broken. Compared to a baseline condition in which participants heard their own voice as feedback, hearing a stranger's voice induced robust changes in the fundamental frequency (F0) of their production. Moreover, the shift in F0 appears to be feedback dependent, since shift patterns depended reliably on the relationship between the participant's own F0 and the stranger-voice F0. The shift in F0 was evident both when the illusion was present and after it was broken, suggesting that auditory feedback from production may be used separately for self-recognition and for vocal motor control. Our findings indicate that self-recognition of voices, like other body attributes, is malleable and context dependent.

  7. Perceptual-Auditory and Acoustical Analysis of the Voices of Transgender Women.

    Science.gov (United States)

    Schwarz, Karine; Fontanari, Anna Martha Vaitses; Costa, Angelo Brandelli; Soll, Bianca Machado Borba; da Silva, Dhiordan Cardoso; de Sá Villas-Bôas, Anna Paula; Cielo, Carla Aparecida; Bastilha, Gabriele Rodrigues; Ribeiro, Vanessa Veis; Dorfman, Maria Elza Kazumi Yamaguti; Lobato, Maria Inês Rodrigues

    2017-09-28

    Voice is an important gender marker in the transition process as a transgender individual accepts a new gender identity. The objectives of this study were to describe and relate aspects of a perceptual-auditory analysis and the fundamental frequency (F0) of male-to-female (MtF) transsexual individuals. A case-control study was carried out with individuals aged 19-52 years who attended the Gender Identity Program of the Hospital de Clínicas of Porto Alegre. Vocal recordings from the MtF transgender and cisgender individuals (vowel /a:/ and six phrases of Consensus Auditory Perceptual Evaluation Voice [CAPE-V]) were edited and randomly coded before storage in a Dropbox folder. The voices (vowel /a:/) were analyzed by consensus on the same day by two judge speech therapists who had more than 10 years of experience in the voice area using the GRBASI perceptual-auditory vocal evaluation scale. Acoustic analysis of the voices was performed using the advanced Multi-Dimensional Voice Program software. The resonance focus and the degrees of masculinity and femininity for each voice recording were determined by listening to the CAPE-V phrases, for the same judges. There were significant differences between the groups regarding a greater frequency of subjects with F0 between 80 and 150 Hz (P = 0.003), and a greater frequency of hypernasal resonant focus (P < 0.001) in the MtF cases and greater frequency of subjects with absence of roughness (P = 0.031) in the control group. The MtF group of individuals showed altered vertical resonant focus, more masculine voices, and lower fundamental frequencies. The control group showed a significant absence of roughness. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  8. Fast fundamental frequency estimation

    DEFF Research Database (Denmark)

    Nielsen, Jesper Kjær; Jensen, Tobias Lindstrøm; Jensen, Jesper Rindom

    2017-01-01

    Modelling signals as being periodic is common in many applications. Such periodic signals can be represented by a weighted sum of sinusoids with frequencies being an integer multiple of the fundamental frequency. Due to its widespread use, numerous methods have been proposed to estimate the funda...
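
    As an illustration of the harmonic model described in this abstract, the following is a minimal sketch, not the fast algorithm the authors propose (the abstract is truncated before describing it). It builds a periodic signal as a weighted sum of sinusoids at integer multiples of a fundamental and recovers the fundamental with a plain grid search that sums spectral power at each candidate's harmonics. The function names, sampling rate, amplitudes, and search grid are all assumptions chosen for the example.

```python
import numpy as np

def harmonic_signal(f0, amps, fs, n):
    """Weighted sum of sinusoids at integer multiples of f0 (in Hz)."""
    t = np.arange(n) / fs
    return sum(a * np.cos(2 * np.pi * f0 * (k + 1) * t) for k, a in enumerate(amps))

def estimate_f0(x, fs, n_harmonics, grid):
    """Grid search: keep the candidate whose harmonics capture the most power."""
    t = np.arange(len(x)) / fs
    k = np.arange(1, n_harmonics + 1)
    best_f0, best_score = None, -np.inf
    for f0 in grid:
        # Complex exponentials at the candidate's harmonic frequencies.
        basis = np.exp(-2j * np.pi * f0 * np.outer(k, t))
        score = np.sum(np.abs(basis @ x) ** 2)
        if score > best_score:
            best_f0, best_score = f0, score
    return best_f0

# Hypothetical example: 150 Hz fundamental, three harmonics, 0.1 s at 8 kHz.
fs = 8000
x = harmonic_signal(150.0, [1.0, 0.5, 0.25], fs, 800)
f0_hat = estimate_f0(x, fs, n_harmonics=3, grid=np.arange(80.0, 400.0, 1.0))
```

    In this noise-free setup `f0_hat` should land on or next to 150 Hz. If `n_harmonics` were set higher than the number of harmonics actually present, a candidate at half the fundamental could collect the same power, which is one reason practical estimators also have to choose the model order.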

  9. Mechanics of human voice production and control.

    Science.gov (United States)

    Zhang, Zhaoyan

    2016-10-01

    As the primary means of communication, voice plays an important role in daily life. Voice also conveys personal information such as social status, personal traits, and the emotional state of the speaker. Mechanically, voice production involves complex fluid-structure interaction within the glottis and its control by laryngeal muscle activation. An important goal of voice research is to establish a causal theory linking voice physiology and biomechanics to how speakers use and control voice to communicate meaning and personal information. Establishing such a causal theory has important implications for clinical voice management, voice training, and many speech technology applications. This paper provides a review of voice physiology and biomechanics, the physics of vocal fold vibration and sound production, and laryngeal muscular control of the fundamental frequency of voice, vocal intensity, and voice quality. Current efforts to develop mechanical and computational models of voice production are also critically reviewed. Finally, issues and future challenges in developing a causal theory of voice production and perception are discussed.

  10. Voice following radiotherapy

    International Nuclear Information System (INIS)

    Stoicheff, M.L.

    1975-01-01

    This study was undertaken to provide information on the voice of patients following radiotherapy for glottic cancer. Part I presents findings from questionnaires returned by 227 of 235 patients successfully irradiated for glottic cancer from 1960 through 1971. Part II presents preliminary findings on the speaking fundamental frequencies of 22 irradiated patients. Normal to near-normal voice was reported by 83 percent of the 227 patients; however, 80 percent did indicate persisting vocal difficulties such as fatiguing of voice with much usage, inability to sing, reduced loudness, hoarse voice quality and inability to shout. Amount of talking during treatments appeared to affect length of time for voice to recover following treatments in those cases where it took from nine to 26 weeks; also, with increasing years since treatment, patients rated their voices more favorably. Smoking habits following treatments improved significantly with only 27 percent smoking heavily as compared with 65 percent prior to radiation therapy. No correlation was found between smoking (during or after treatments) and vocal ratings or between smoking and length of time for voice to recover. There was no relationship found between reported vocal ratings and stage of the disease

  11. Voice Disorder Classification Based on Multitaper Mel Frequency Cepstral Coefficients Features

    Directory of Open Access Journals (Sweden)

    Ömer Eskidere

    2015-01-01

    Full Text Available Mel Frequency Cepstral Coefficients (MFCCs) are widely used to extract essential information from a voice signal and have become a popular feature extractor in audio processing. However, MFCC features are usually calculated from a single window (taper), which yields estimates with large variance. This study investigates reducing that variance for the classification of two voice qualities (normal voice and disordered voice) using multitaper MFCC features. We also compare their performance with newly proposed windowing techniques and the conventional single-taper technique. The results demonstrate that the adapted weighted Thomson multitaper method distinguishes normal voice from disordered voice better than the conventional single-taper (Hamming window) technique and two newly proposed windowing methods. The multitaper MFCC features may be helpful in identifying voices at risk for a real pathology that has to be proven later.
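
    To make the variance argument concrete, here is a minimal sketch comparing a single Hamming-windowed periodogram with a multitaper average. It uses sine tapers as a simple stand-in; the Thomson and adapted-weight variants evaluated in the study are not reproduced here. Function names, taper count, and frame length are assumptions. In an MFCC pipeline, the resulting spectrum estimate would take the place of the single-window periodogram before the mel filterbank, logarithm, and DCT steps.

```python
import numpy as np

def sine_tapers(n, k):
    """Return k orthonormal sine tapers of length n, shape (k, n)."""
    idx = np.arange(1, n + 1)
    orders = np.arange(1, k + 1)[:, None]
    return np.sqrt(2.0 / (n + 1)) * np.sin(np.pi * orders * idx / (n + 1))

def multitaper_psd(frame, k=6):
    """Average the periodograms obtained with k different tapers."""
    tapers = sine_tapers(len(frame), k)
    spectra = np.abs(np.fft.rfft(tapers * frame, axis=1)) ** 2
    return spectra.mean(axis=0)

def single_taper_psd(frame):
    """Conventional estimate: one Hamming-windowed periodogram."""
    w = np.hamming(len(frame))
    w = w / np.sqrt(np.sum(w ** 2))  # unit energy, comparable to the sine tapers
    return np.abs(np.fft.rfft(w * frame)) ** 2

# Synthetic white-noise frames (illustrative data only, not from the study).
rng = np.random.default_rng(0)
frames = rng.standard_normal((200, 256))
var_multi = np.array([multitaper_psd(f) for f in frames]).var(axis=0).mean()
var_single = np.array([single_taper_psd(f) for f in frames]).var(axis=0).mean()
```

    Averaging over several tapers trades some frequency resolution for a steadier estimate, so `var_multi` should come out well below `var_single` on these synthetic frames.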

  12. Mobile Communication Devices, Ambient Noise, and Acoustic Voice Measures.

    Science.gov (United States)

    Maryn, Youri; Ysenbaert, Femke; Zarowski, Andrzej; Vanspauwen, Robby

    2017-03-01

    The ability to move with mobile communication devices (MCDs; ie, smartphones and tablet computers) may induce differences in microphone-to-mouth positioning and use in noise-packed environments, and thus influence reliability of acoustic voice measurements. This study investigated differences in various acoustic voice measures between six recording equipments in backgrounds with low and increasing noise levels. One chain of continuous speech and sustained vowel from 50 subjects with voice disorders (all separated by silence intervals) was radiated and re-recorded in an anechoic chamber with five MCDs and one high-quality recording system. These recordings were acquired in one condition without ambient noise and in four conditions with increased ambient noise. A total of 10 acoustic voice markers were obtained in the program Praat. Differences between MCDs and noise condition were assessed with Friedman repeated-measures test and posthoc Wilcoxon signed-rank tests, both for related samples, after Bonferroni correction. (1) Except median fundamental frequency and seven nonsignificant differences, MCD samples have significantly higher acoustic markers than clinical reference samples in minimal environmental noise. (2) Except median fundamental frequency, jitter local, and jitter rap, all acoustic measures on samples recorded with the reference system experienced significant influence from room noise levels. Fundamental frequency is resistant to recording system, environmental noise, and their combination. All other measures, however, were impacted by both recording system and noise condition, and especially by their combination, often already in the reference/baseline condition without added ambient noise. Caution is therefore warranted regarding implementation of MCDs as clinical recording tools, particularly when applied for treatment outcomes assessments. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
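
    For readers unfamiliar with the perturbation measures named above, the sketch below computes two of them from a list of glottal period durations, following the definitions commonly given for Praat: jitter (local) is the mean absolute difference between consecutive periods divided by the mean period, and jitter (RAP) is the mean absolute difference between a period and the average of it and its two neighbours, divided by the mean period. The function names and period values are made up for illustration; extracting periods from a recording is a separate step not shown.

```python
def jitter_local(periods):
    """Mean absolute difference of consecutive periods over the mean period."""
    diffs = [abs(b - a) for a, b in zip(periods[:-1], periods[1:])]
    return (sum(diffs) / len(diffs)) / (sum(periods) / len(periods))

def jitter_rap(periods):
    """Relative average perturbation, using a three-period moving average."""
    devs = [
        abs(periods[i] - (periods[i - 1] + periods[i] + periods[i + 1]) / 3)
        for i in range(1, len(periods) - 1)
    ]
    return (sum(devs) / len(devs)) / (sum(periods) / len(periods))

# Hypothetical period durations in seconds (alternating 10 ms and 11 ms).
periods = [0.010, 0.011, 0.010, 0.011]
local_pct = 100 * jitter_local(periods)
rap_pct = 100 * jitter_rap(periods)
```

    Both functions return a ratio, so multiplying by 100 gives the percentage form in which these measures are usually reported. A perfectly regular sequence of periods gives zero for both.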

  13. Voice Quality and Gender Stereotypes: A Study of Lebanese Women With Reinke's Edema.

    Science.gov (United States)

    Matar, Nayla; Portes, Cristel; Lancia, Leonardo; Legou, Thierry; Baider, Fabienne

    2016-12-01

    Women with Reinke's edema (RW) report being mistaken for men during telephone conversations. For this reason, their masculine-sounding voices are interesting for the study of gender stereotypes. The study's objective is to verify their complaint and to understand the cues used in gender identification. Using a self-evaluation study, we verified RW's perception of their own voices. We compared the acoustic parameters of vowels produced by 10 RW to those produced by 10 men and 10 women with healthy voices (hereafter referred to as NW) in Lebanese Arabic. We conducted a perception study for the evaluation of RW, healthy men's, and NW voices by naïve listeners. RW self-evaluated their voices as masculine and their gender identities as feminine. The acoustic parameters that distinguish RW from NW voices concern fundamental frequency, spectral slope, harmonicity of the voicing signal, and complexity of the spectral envelope. Naïve listeners very often rate RW as surely masculine. Listeners may rate RW's gender incorrectly. These incorrect gender ratings are correlated with acoustic measures of fundamental frequency and voice quality. Further investigations will reveal the contribution of each of these parameters to gender perception and guide the treatment plan of patients complaining of a gender ambiguous voice.

  14. Diagnostic value of voice acoustic analysis in assessment of occupational voice pathologies in teachers.

    Science.gov (United States)

    Niebudek-Bogusz, Ewa; Fiszer, Marta; Kotylo, Piotr; Sliwinska-Kowalska, Mariola

    2006-01-01

    It has been shown that teachers are at risk of developing occupational dysphonia, which accounts for over 25% of all occupational diseases diagnosed in Poland. The most frequently used method of diagnosing voice diseases is videostroboscopy. However, to facilitate objective evaluation of voice efficiency as well as medical certification of occupational voice disorders, it is crucial to implement quantitative methods of voice assessment, particularly voice acoustic analysis. The aim of the study was to assess the results of acoustic analysis in 66 female teachers (aged 40-64 years), including 35 subjects with occupational voice pathologies (e.g., vocal nodules) and 31 subjects with functional dysphonia. The acoustic analysis was performed using the IRIS software, before and after a 30-minute vocal loading test. All participants were subjected also to laryngological and videostroboscopic examinations. After the vocal effort, the acoustic parameters displayed statistically significant abnormalities, mostly lowered fundamental frequency (Fo) and incorrect values of shimmer and noise to harmonic ratio. To conclude, quantitative voice acoustic analysis using the IRIS software seems to be an effective complement to voice examinations, which is particularly helpful in diagnosing occupational dysphonia.

  15. Default Bayesian Estimation of the Fundamental Frequency

    DEFF Research Database (Denmark)

    Nielsen, Jesper Kjær; Christensen, Mads Græsbøll; Jensen, Søren Holdt

    2013-01-01

    Joint fundamental frequency and model order estimation is an important problem in several applications. In this paper, a default estimation algorithm based on a minimum of prior information is presented. The algorithm is developed in a Bayesian framework, and it can be applied to both real... Moreover, several approximations of the posterior distributions on the fundamental frequency and the model order are derived, and one of the state-of-the-art joint fundamental frequency and model order estimators is demonstrated to be a special case of one of these approximations. The performance...

  16. A Kalman-based Fundamental Frequency Estimation Algorithm

    DEFF Research Database (Denmark)

    Shi, Liming; Nielsen, Jesper Kjær; Jensen, Jesper Rindom

    2017-01-01

    Fundamental frequency estimation is an important task in speech and audio analysis. Harmonic model-based methods typically have superior estimation accuracy. However, such methods usually assume that the fundamental frequency and amplitudes are stationary over a short time frame. In this pape...

  17. 33 CFR 86.03 - Limits of fundamental frequencies.

    Science.gov (United States)

    2010-07-01

    33 CFR, Navigation and Navigable Waters (2010-07-01). Coast Guard, Department of Homeland Security. Section 86.03, Limits of fundamental frequencies: To ensure a wide variety of whistle characteristics, the fundamental...

  18. Variations in voice level and fundamental frequency with changing background noise level and talker-to-listener distance while wearing hearing protectors: A pilot study.

    Science.gov (United States)

    Bouserhal, Rachel E; Macdonald, Ewen N; Falk, Tiago H; Voix, Jérémie

    2016-01-01

    Speech production in noise with varying talker-to-listener distance has been well studied for the open ear condition. However, occluding the ear canal can affect the auditory feedback and cause deviations from the models presented for the open-ear condition. Communication is a main concern for people wearing hearing protection devices (HPD). Although practical, radio communication is cumbersome, as it does not distinguish designated receivers. A smarter radio communication protocol must be developed to alleviate this problem. Thus, it is necessary to model speech production in noise while wearing HPDs. Such a model opens the door to radio communication systems that distinguish receivers and offer more efficient communication between persons wearing HPDs. This paper presents the results of a pilot study aimed to investigate the effects of occluding the ear on changes in voice level and fundamental frequency in noise and with varying talker-to-listener distance. Twelve participants with a mean age of 28 participated in this study. Compared to existing data, results show a trend similar to the open ear condition with the exception of the occluded quiet condition. This implies that a model can be developed to better understand speech production for the occluded ear.

  19. Acoustic cues for the recognition of self-voice and other-voice

    Directory of Open Access Journals (Sweden)

    Mingdi Xu

    2013-10-01

    Full Text Available Self-recognition, being indispensable for successful social communication, has become a major focus in current social neuroscience. The physical aspects of the self are most typically manifested in the face and voice. Compared with the wealth of studies on self-face recognition, self-voice recognition (SVR) has not gained much attention. Converging evidence has suggested that the fundamental frequency (F0) and formant structures serve as the key acoustic cues for other-voice recognition (OVR). However, little is known about which, and how, acoustic cues are utilized for SVR as opposed to OVR. To address this question, we independently manipulated the F0 and formant information of recorded voices and investigated their contributions to SVR and OVR. Japanese participants were presented with recorded vocal stimuli and were asked to identify the speaker, either themselves or one of their peers. Six groups of 5 peers of the same sex participated in the study. Under conditions where the formant information was fully preserved and where only the frequencies lower than the third formant (F3) were retained, accuracies of SVR deteriorated significantly with the modulation of the F0, and the results were comparable for OVR. By contrast, under a condition where only the frequencies higher than F3 were retained, the accuracy of SVR was significantly higher than that of OVR throughout the range of F0 modulations, and the F0 scarcely affected the accuracies of SVR and OVR. Our results indicate that while both F0 and formant information are involved in SVR, as well as in OVR, the advantage of SVR is manifested only when major formant information for speech intelligibility is absent. These findings imply the robustness of self-voice representation, possibly by virtue of auditory familiarity and other factors such as its association with motor/articulatory representation.

  20. Acoustic analysis after radiotherapy in T1 vocal cord carcinoma: a new approach to the analysis of voice quality

    International Nuclear Information System (INIS)

    Rovirosa, Angeles; Martinez-Celdran, Eugenio; Ortega, Alicia; Ascaso, Carlos; Abellana, Rosa; Velasco, Mercedes; Bonet, Montserrat; Herrera, Carmen; Casas, Francesc; Francisco, Rosa Maria; Arenas, Meritxell; Hernandez, Victor; Sanchez-Reyes, Alberto; Leon, Concha; Traserra, Jordi; Biete, Albert

    2000-01-01

    Purpose: To study acoustic voice parameters (fundamental frequency, jitter, shimmer, and harmonics-to-noise ratio) in extended vowel production, oral reading of a standard paragraph, spontaneous speech, and a song in patients irradiated for Tis-T1 vocal cord carcinoma. Methods and Materials: Eighteen male patients irradiated for Tis-T1 vocal cord carcinoma and a control group of 31 nonirradiated subjects of the same age were included in a study of acoustic voice analysis. The control group had been rigorously selected for voice quality; in the irradiated group, two-thirds of the cases had a previous history of smoking, and patients had undergone a vocal cord biopsy. Radiotherapy patients were treated with a 6 MV Linac, receiving a total dose of 66 Gy at 2 Gy/day, with median treatment areas of 28 cm². Acoustic voice analysis was performed 1 year after radiotherapy; the patients' voices in extended vowel production, oral reading of a standard paragraph, spontaneous speech, and a song were tape-recorded and analyzed with a Kay Elemetrics Computerized Speech Lab (model CSL no. 4300). Fundamental frequency, jitter, shimmer, and harmonics-to-noise ratio were obtained in each case. Mann-Whitney analysis was used for statistical tests. Results: The irradiated group presented higher values of fundamental frequency, jitter, shimmer, and harmonics-to-noise ratio. Mann-Whitney analysis showed significant differences for fundamental frequency and jitter in vowel production, oral reading, spontaneous speech, and song. Shimmer showed differences only in vowel production, and harmonics-to-noise ratio only in oral reading and song. Conclusions: In our study, only fundamental frequency and jitter showed significantly increased values relative to the control group in all the acoustic situations. Sustained vowel production showed the worst values of the acoustic parameters in comparison with the other acoustic situations. This study seems to suggest that more work should be done in this field.

  1. Acoustic analysis of voice in children with cleft palate and velopharyngeal insufficiency.

    Science.gov (United States)

    Villafuerte-Gonzalez, Rocio; Valadez-Jimenez, Victor M; Hernandez-Lopez, Xochiquetzal; Ysunza, Pablo Antonio

    2015-07-01

    Acoustic analysis of voice can provide instrumental data concerning vocal abnormalities. These findings can be used for monitoring clinical course in cases of voice disorders. Cleft palate severely affects the structure of the vocal tract. Hence, voice quality can also be affected. The aim was to study whether the main acoustic parameters of voice, including fundamental frequency, shimmer, and jitter, are significantly different in patients with a repaired cleft palate, as compared with normal children without speech, language, and voice disorders. Fourteen patients with repaired unilateral cleft lip and palate and persistent or residual velopharyngeal insufficiency (VPI) were studied. A control group was assembled with healthy volunteer subjects matched by age and gender. Hypernasality and nasal emission were perceptually assessed in patients with VPI. Size of the gap as assessed by videonasopharyngoscopy was classified in patients with VPI. Acoustic voice parameters, including fundamental frequency (F0), shimmer, and jitter, were compared between patients with VPI and control subjects. F0 was significantly higher in male patients as compared with male controls. Shimmer was significantly higher in patients with VPI regardless of gender. Moreover, patients with moderate VPI showed a significantly higher shimmer perturbation, regardless of gender. Although future research regarding voice disorders in patients with VPI is needed, at the present time it seems reasonable to include strategies for voice therapy in the speech and language pathology intervention plan for patients with VPI. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  2. Reproducibility of Automated Voice Range Profiles, a Systematic Literature Review

    DEFF Research Database (Denmark)

    Printz, Trine; Rosenberg, Tine; Godballe, Christian

    2018-01-01

    Objective: Reliable voice range profiles are of great importance when measuring effects and side effects from surgery affecting voice capacity. Automated recording systems are increasingly used, but the reproducibility of results is uncertain. Our objective was to identify and review the existing literature on test-retest accuracy of the automated voice range profile assessment. Study design: Systematic review. Data sources: PubMed, Scopus, Cochrane Library, ComDisDome, Embase, and CINAHL (EBSCO). Methods: We conducted a systematic literature search of six databases from 1983 to 2016. The following keywords were used: phonetogram, voice range profile, and acoustic voice analysis. Inclusion criteria were automated recording procedure, healthy voices, and no intervention between test and retest. Test-retest values concerning fundamental frequency and voice intensity were reviewed. Results: Of 483...

  3. The singer's voice range profile: female professional opera soloists.

    Science.gov (United States)

    Lamarche, Anick; Ternström, Sten; Pabon, Peter

    2010-07-01

    This work concerns the collection of 30 voice range profiles (VRPs) of female operatic voice. We address the questions: Is there a need for a singer's protocol in VRP acquisition? Are physiological measurements sufficient or should the measurement of performance capabilities also be included? Can we address the female singing voice in general or is there a case for categorizing voices when studying phonetographic data? Subjects performed a series of structured tasks involving both standard speech voice protocols and additional singing tasks. Singers also completed an extensive questionnaire. Physiological VRPs differ from performance VRPs. Two new VRP metrics, the voice area above a defined level threshold and the dynamic range independent from the fundamental frequency (F(0)), were found to be useful in the analysis of singer VRPs. Task design had no effect on performance VRP outcomes. Voice category differences were mainly attributable to phonation frequency-based information. Results support the clinical importance of addressing the vocal instrument as it is used in performance. Equally important is the elaboration of a protocol suitable for the singing voice. The given context and instructions can be more important than task design for performance VRPs. Yet, for physiological VRP recordings, task design remains critical. Both types of VRPs are suggested for a singer's voice evaluation. Copyright (c) 2010 The Voice Foundation. Published by Mosby, Inc. All rights reserved.

  4. Voice Use Among Music Theory Teachers: A Voice Dosimetry and Self-Assessment Study.

    Science.gov (United States)

    Schiller, Isabel S; Morsomme, Dominique; Remacle, Angélique

    2017-07-25

    This study aimed (1) to investigate music theory teachers' professional and extra-professional vocal loading and background noise exposure, (2) to determine the correlation between vocal loading and background noise, and (3) to determine the correlation between vocal loading and self-evaluation data. Using voice dosimetry, 13 music theory teachers were monitored for one workweek. The parameters analyzed were voice sound pressure level (SPL), fundamental frequency (F0), phonation time, vocal loading index (VLI), and noise SPL. Spearman correlation was used to correlate vocal loading parameters (voice SPL, F0, and phonation time) and noise SPL. Each day, the subjects self-assessed their voice using visual analog scales. VLI and self-evaluation data were correlated using Spearman correlation. Vocal loading parameters and noise SPL were significantly higher in the professional than in the extra-professional environment. Voice SPL, phonation time, and female subjects' F0 correlated positively with noise SPL. VLI correlated with self-assessed voice quality, vocal fatigue, and amount of singing and speaking voice produced. Teaching music theory is a profession with high vocal demands. More background noise is associated with increased vocal loading and may indirectly increase the risk for voice disorders. Correlations between VLI and self-assessments suggest that these teachers are well aware of their vocal demands and feel their effect on voice quality and vocal fatigue. Visual analog scales seem to represent a useful tool for subjective vocal loading assessment and associated symptoms in these professional voice users. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  5. Voice parameters and videonasolaryngoscopy in children with vocal nodules: a longitudinal study, before and after voice therapy.

    Science.gov (United States)

    Valadez, Victor; Ysunza, Antonio; Ocharan-Hernandez, Esther; Garrido-Bustamante, Norma; Sanchez-Valerio, Araceli; Pamplona, Ma C

    2012-09-01

    Vocal Nodules (VN) are a functional voice disorder associated with voice misuse and abuse in children. There are few reports addressing vocal parameters in children with VN, especially after a period of vocal rehabilitation. The purpose of this study is to describe measurements of vocal parameters including Fundamental Frequency (FF), Shimmer (S), and Jitter (J), videonasolaryngoscopy examination, and clinical perceptual assessment, before and after voice therapy in children with VN. Voice therapy was provided using visual support through Speech-Viewer software. Twenty patients with VN were studied. An acoustical analysis of voice was performed and compared with data from subjects from a control group matched by age and gender. Also, clinical perceptual assessment of voice and videonasolaryngoscopy were performed on all patients with VN. After a period of voice therapy, provided with visual support using Speech Viewer-III (SV-III-IBM) software, new acoustical analyses, perceptual assessments, and videonasolaryngoscopies were performed. Before the onset of voice therapy, there was a significant difference (p... therapy period, a significant improvement (p... vocal nodules were no longer discernible on the vocal folds in any of the cases. SV-III software seems to be a safe and reliable method for providing voice therapy in children with VN. Acoustic voice parameters, perceptual data, and videonasolaryngoscopy were significantly improved after the speech therapy period was completed. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.

  6. The effect of oxandrolone on voice frequency in growth hormone-treated girls with Turner syndrome

    NARCIS (Netherlands)

    Menke, L.A.; Sas, T.C.J.; Koningsbrugge, S.H. van; Ridder, M.A. de; Zandwijken, G.R.; Boersma, B.; Dejonckere, P.H.; Muinck Keizer-Schrama, S.M.P.F. de; Otten, B.J.; Wit, J.M.

    2011-01-01

    OBJECTIVES/HYPOTHESIS: Oxandrolone (Ox) increases height gain but may also cause voice deepening in growth hormone (GH)-treated girls with Turner syndrome (TS). We assessed the effect of Ox on objective and subjective speaking voice frequency in GH-treated girls with TS. STUDY DESIGN: A multicenter,

  7. Work-related voice disorder

    Directory of Open Access Journals (Sweden)

    Paulo Eduardo Przysiezny

    2015-04-01

    Full Text Available INTRODUCTION: Dysphonia is the main symptom of the disorders of oral communication. However, voice disorders also present with other symptoms such as difficulty in maintaining the voice (asthenia), vocal fatigue, variation in habitual vocal fundamental frequency, hoarseness, lack of vocal volume and projection, loss of vocal efficiency, and weakness when speaking. There are several proposals for the etiologic classification of dysphonia: functional, organofunctional, organic, and work-related voice disorder (WRVD). OBJECTIVE: To conduct a literature review on WRVD and on the current Brazilian labor legislation. METHODS: This was a review article with bibliographical research conducted on the PubMed and Bireme databases, using the terms "work-related voice disorder", "occupational dysphonia", "dysphonia and labor legislation", and a review of relevant labor and social security laws. CONCLUSION: WRVD is frequently listed as a reason for work absenteeism, functional rehabilitation, or prolonged absence from work. Currently, forensic physicians have no comparative parameters to help with the analysis of vocal disorders. In certain situations WRVD may cause work disability. This disorder may be labor-related, or be an adjuvant factor to work-related diseases.

  8. Accurate Estimation of Low Fundamental Frequencies from Real-Valued Measurements

    DEFF Research Database (Denmark)

    Christensen, Mads Græsbøll

    2013-01-01

    In this paper, the difficult problem of estimating low fundamental frequencies from real-valued measurements is addressed. The methods commonly employed do not take the phenomena encountered in this scenario into account and thus fail to deliver accurate estimates. The reason for this is that they employ asymptotic approximations that are violated when the harmonics are not well-separated in frequency, something that happens when the observed signal is real-valued and the fundamental frequency is low. To mitigate this, we analyze the problem and present some exact fundamental frequency estimators...

  9. Acoustic markers to differentiate gender in prepubescent children's speaking and singing voice.

    Science.gov (United States)

    Guzman, Marco; Muñoz, Daniel; Vivero, Martin; Marín, Natalia; Ramírez, Mirta; Rivera, María Trinidad; Vidal, Carla; Gerhard, Julia; González, Catalina

    2014-10-01

    Investigation sought to determine whether there is any acoustic variable to objectively differentiate gender in children with normal voices. A total of 30 children, 15 boys and 15 girls, with perceptually normal voices were examined. They were between 7 and 10 years old (mean: 8.1, SD: 0.7 years). Subjects were required to perform the following phonatory tasks: (1) to phonate sustained vowels [a:], [i:], [u:], (2) to read a phonetically balanced text, and (3) to sing a song. Acoustic analysis included long-term average spectrum (LTAS), fundamental frequency (F0), speaking fundamental frequency (SFF), equivalent continuous sound level (Leq), linear predictive code (LPC) to obtain formant frequencies, perturbation measures, harmonic to noise ratio (HNR), and Cepstral peak prominence (CPP). Auditory perceptual analysis was performed by four blinded judges to determine gender. No significant gender-related differences were found for most acoustic variables. Perceptual assessment showed good intra and inter rater reliability for gender. Cepstrum for [a:], alpha ratio in text, shimmer for [i:], F3 in [a:], and F3 in [i:], were the parameters that composed the multivariate logistic regression model to best differentiate male and female children's voices. Since perceptual assessment reliably detected gender, it is likely that other acoustic markers (not evaluated in the present study) are able to make clearer gender differences. For example, gender-specific patterns of intonation may be a more accurate feature for differentiating gender in children's voices. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  10. Voice and Handgrip Strength Predict Reproductive Success in a Group of Indigenous African Females

    Science.gov (United States)

    Sorokowska, Agnieszka; Sorokowski, Piotr; Mberira, Mara; Bartels, Astrid; Gallup, Gordon G.

    2012-01-01

    Evolutionary accounts of human traits are often based on proxies for genetic fitness (e.g., number of sex partners, facial attractiveness). Instead of using proxies, actual differences in reproductive success are a more direct measure of Darwinian fitness. Certain voice acoustics such as fundamental frequency and measures of health such as handgrip strength correlate with proxies of fitness, yet there are few studies showing the relation of these traits to reproduction. Here, we explore whether the fundamental frequency of the voice and handgrip strength account for differences in actual reproduction among a population of natural fertility humans. Our results show that both fundamental frequency and handgrip strength predict several measures of reproductive success among a group of indigenous Namibian females, particularly amongst the elderly, with weight also predicting reproductive outcomes among males. These findings demonstrate that both hormonally regulated and phenotypic quality markers can be used as measures of Darwinian fitness among humans living under conditions that resemble the evolutionary environment of Homo sapiens. We also argue that these findings provide support for the Grandmother Hypothesis. PMID:22870251

  11. Colour and texture associations in voice-induced synaesthesia

    Directory of Open Access Journals (Sweden)

    Anja Moos

    2013-09-01

    Full Text Available Voice-induced synaesthesia, a form of synaesthesia in which synaesthetic perceptions are induced by the sounds of people’s voices, appears to be relatively rare and has not been systematically studied. In this study we investigated the synaesthetic colour and visual texture perceptions experienced in response to different types of voice quality (e.g. nasal, whisper, falsetto). Experiences of three different groups (self-reported voice synaesthetes, phoneticians and controls) were compared using both qualitative and quantitative analysis in a study conducted online. Whilst, in the qualitative analysis, synaesthetes used more colour and texture terms to describe voices than either phoneticians or controls, only weak differences, and many similarities, between groups were found in the quantitative analysis. Notable consistent results between groups were the matching of higher speech fundamental frequencies with lighter and redder colours, the matching of whispery voices with smoke-like textures and the matching of harsh and creaky voices with textures resembling dry cracked soil. These data are discussed in the light of current thinking about definitions and categorizations of synaesthesia, especially in cases where individuals apparently have a range of different synaesthetic inducers.

  12. Color and texture associations in voice-induced synesthesia

    Science.gov (United States)

    Moos, Anja; Simmons, David; Simner, Julia; Smith, Rachel

    2013-01-01

    Voice-induced synesthesia, a form of synesthesia in which synesthetic perceptions are induced by the sounds of people's voices, appears to be relatively rare and has not been systematically studied. In this study we investigated the synesthetic color and visual texture perceptions experienced in response to different types of “voice quality” (e.g., nasal, whisper, falsetto). Experiences of three different groups—self-reported voice synesthetes, phoneticians, and controls—were compared using both qualitative and quantitative analysis in a study conducted online. Whilst, in the qualitative analysis, synesthetes used more color and texture terms to describe voices than either phoneticians or controls, only weak differences, and many similarities, between groups were found in the quantitative analysis. Notable consistent results between groups were the matching of higher speech fundamental frequencies with lighter and redder colors, the matching of “whispery” voices with smoke-like textures, and the matching of “harsh” and “creaky” voices with textures resembling dry cracked soil. These data are discussed in the light of current thinking about definitions and categorizations of synesthesia, especially in cases where individuals apparently have a range of different synesthetic inducers. PMID:24032023

  13. Advanced Time-Frequency Representation in Voice Signal Analysis

    Directory of Open Access Journals (Sweden)

    Dariusz Mika

    2018-03-01

    Full Text Available The most commonly used time-frequency representation in voice signal analysis is the spectrogram. This representation belongs to Cohen's class, the class of time-frequency energy distributions. In terms of resolution, the spectrogram is not optimal. Cohen's class contains representations with better resolution properties. All of them are created by smoothing the Wigner-Ville distribution (WVD), which has the best resolution but also the strongest harmful interference. The smoothing function used determines the compromise between resolution and suppression of the harmful interference terms. Another class of time-frequency energy distributions is the affine class. In terms of readability, the best properties are obtained by redistributing the energy through a general methodology, referred to as reassignment, that can be applied to any time-frequency representation. Reassigned distributions efficiently combine a reduction of the interference terms, provided by a well-adapted smoothing kernel, with an increased concentration of the signal components.
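    The resolution limitation of the spectrogram mentioned above can be made concrete with a minimal sketch. It is not code from the article: it assumes a basic Hann-windowed short-time Fourier transform, and the helper name, window lengths, and test tone are illustrative. A long window gives fine frequency spacing but coarse time localization; a short window does the opposite.

```python
import numpy as np


def spectrogram(x, fs, win_len, hop):
    """Squared-magnitude STFT with a Hann window (hypothetical helper).

    Returns (freqs, times, power) with power shaped (n_freqs, n_frames).
    """
    window = np.hanning(win_len)
    starts = list(range(0, len(x) - win_len + 1, hop))
    frames = np.array([x[s:s + win_len] * window for s in starts])
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    freqs = np.fft.rfftfreq(win_len, d=1.0 / fs)
    times = (np.array(starts) + win_len / 2) / fs
    return freqs, times, power.T


# Illustrative input: one second of a 200 Hz tone sampled at 8 kHz.
fs = 8000.0
t = np.arange(8000) / fs
x = np.sin(2 * np.pi * 200.0 * t)

# Long window (50 ms): 20 Hz bin spacing, each frame spans 50 ms.
freqs_long, times_long, p_long = spectrogram(x, fs, win_len=400, hop=100)
# Short window (10 ms): 100 Hz bin spacing, each frame spans 10 ms.
freqs_short, times_short, p_short = spectrogram(x, fs, win_len=80, hop=20)

peak_long = freqs_long[np.argmax(p_long.mean(axis=1))]
peak_short = freqs_short[np.argmax(p_short.mean(axis=1))]
print("bin spacing (long, short):", freqs_long[1], freqs_short[1])
print("peak frequency (long, short):", peak_long, peak_short)
```

    Smoothed Wigner-Ville and reassigned distributions, as discussed in the abstract, aim to relax this trade-off; they are not implemented in this sketch.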

  14. Frequência fundamental de crianças da cidade de Niterói / Fundamental frequency for children in the municipality of Niterói

    Directory of Open Access Journals (Sweden)

    Tereza Cristina Andrade Schott

    2009-06-01

    the boys, with an overall mean value of 238.44 Hz. Due to the small difference; we obtained 237.57 Hz for the girls and 233.31 Hz for the boys. CONCLUSION: the findings enabled the comparison with previously carried out research and contributed providing the literature with new data for the standardization of the fundamental frequency of Brazilian Children's Voices, opening a new channel for further research.

  15. Fundamental Frequency and Model Order Estimation Using Spatial Filtering

    DEFF Research Database (Denmark)

    Karimian-Azari, Sam; Jensen, Jesper Rindom; Christensen, Mads Græsbøll

    2014-01-01

    In signal processing applications of harmonic-structured signals, estimates of the fundamental frequency and number of harmonics are often necessary. In real scenarios, a desired signal is contaminated by different levels of noise and interferers, which complicate the estimation of the signal parameters. In this paper, we present an estimation procedure for harmonic-structured signals in situations with strong interference using spatial filtering, or beamforming. We jointly estimate the fundamental frequency and the constrained model order through the output of the beamformers. Besides that, we extend this procedure to account for inharmonicity using unconstrained model order estimation. The simulations show that beamforming improves the performance of the joint estimates of fundamental frequency and the number of harmonics in low signal to interference (SIR) levels, and an experiment...

  16. A new method for selectively enhancing hemisphere processing: voice frequency amplification influences the strength of attribute framing.

    Science.gov (United States)

    McCormick, Michael; Seta, John J

    2012-01-01

    An attribute framing effect occurs when positive or negative associations produced by positive or negative frames are mapped onto evaluations resulting in a more favourable evaluation for the positively framed attribute. We used a new voice frequency manipulation to differentially enhance right versus left hemisphere processing. In doing so we found a strong attribute framing effect when a speaker with a low-frequency voice enhanced the contextual processing style of the right hemisphere. However, a framing effect was not obtained when a speaker with a high-frequency voice enhanced the inferential/analytical processing style of the left hemisphere. At the theoretical level our results provide evidence that the contextual processing style of the right hemisphere is especially susceptible to associative implications, such as those found in attribute framing manipulations. At the applied level we provide a simple method for altering the effectiveness of persuasion messages.

  17. Wendler glottoplasty and voice-therapy in male-to-female transsexuals: results in pre and post-surgery assessment.

    Science.gov (United States)

    Casado, Juan C; O'Connor, Carlos; Angulo, María S; Adrián, José A

    2016-01-01

    With the development of new ENT techniques, many male-to-female transsexuals request a surgical procedure to raise the fundamental frequency of the voice (feminization). The ENT specialist and the voice-therapist have to use an interdisciplinary approach to this growing social demand. The aim of this study was to show the results in a group of transsexual patients after Wendler's anterior synechiae, with additional voice-therapy treatment. Ten male-to-female transsexual patients who had Wendler glottoplasty and voice-therapy were assessed. The surgical procedure consisted of a de-epithelialization of the anterior third of both vocal folds; this area was sutured and the surface of both vocal folds was vaporised with a laser diode. Pre- and post-surgery voice assessment consisted of measuring fundamental frequency (Fo) and maximum phonation time, administering the transgender self-assessment questionnaire (TSEQ) and obtaining perceptual voice assessment by inter-rater agreement. All the patients significantly increased their Fo (106 Hz on average) after the treatment. Furthermore, significant improvements were shown in self-reported satisfaction and in the degree of voice feminization. No improvements in the maximum phonation time were observed. Wendler glottoplasty is a surgical procedure that contributes to feminising the voice, with good medium-term results and without noteworthy medical complications. The increase in vocal tone was observed using several pre- and post-surgery control measures and voice therapy. Copyright © 2014 Elsevier España, S.L.U. and Sociedad Española de Otorrinolaringología y Patología Cérvico-Facial. All rights reserved.

  18. [Voice assessment and demographic data of applicants for a school of speech therapists].

    Science.gov (United States)

    Reiter, R; Brosch, S

    2008-05-01

    Demographic data, subjective and objective voice analysis as well as self-assessment of voice quality from applicants for a school of speech therapists were investigated. Demographic data from 116 applicants were collected and their voice quality assessed by three independent judges. An objective evaluation was done by maximum phonation time, average fundamental frequency, dynamic range and percent of jitter and shimmer by means of the Goettinger Hoarseness diagram. Self-assessment of voice quality was done with the "voice handicap index questionnaire". The twenty successful applicants had a physiological voice in 95 %, they were all musical and had university entrance qualifications. Subjective voice assessment showed a hoarse voice in 16 % of the applicants. In this subgroup an unphysiological vocal use was observed in 72 % and a reduced articulation in 45 %. The objective voice parameters did not show a significant difference between the 3 groups. Self-assessment of the voice was inconspicuous in all applicants. Applicants with general qualification for university entrance, musicality and a physiological voice were more likely to be successful. There were main differences between self-assessment of voice and quantitative analysis or subjective assessment by three independent judges.

  19. Transgender Voice and Communication Treatment: A Retrospective Chart Review of 25 Cases

    Science.gov (United States)

    Hancock, Adrienne B.; Garabedian, Laura M.

    2013-01-01

    Background: People transitioning from male to female (MTF) gender seek speech-language pathology services when they feel their voice is betraying their genuine self or perhaps is the last obstacle to representing their authentic gender. Speaking fundamental frequency (pitch) and resonance are most often targets in treatment because the combination…

  20. Detecting vocal fatigue in student singers using acoustic measures of mean fundamental frequency, jitter, shimmer, and harmonics-to-noise ratio

    Science.gov (United States)

    Sisakun, Siphan

    2000-12-01

    The purpose of this study is to explore the ability of four acoustic parameters, mean fundamental frequency, jitter, shimmer, and harmonics-to-noise ratio, to detect vocal fatigue in student singers. The participants are 15 voice students, who perform two distinct tasks, data collection task and vocal fatiguing task. The data collection task includes the sustained vowel /a/, reading a standard passage, and self-rate on a vocal fatigue form. The vocal fatiguing task is the vocal practice of musical scores for a total of 45 minutes. The four acoustic parameters are extracted using the software EZVoicePlus. The data analyses are performed to answer eight research questions. The first four questions relate to correlations of the self-rating scale and each of the four parameters. The next four research questions relate to differences in the parameters over time using one-factor repeated measures analysis of variance (ANOVA). The result yields a proposed acoustic profile of vocal fatigue in student singers. This profile is characterized by increased fundamental frequency; slightly decreased jitter; slightly decreased shimmer; and slightly increased harmonics-to-noise ratio. The proposed profile requires further investigation.
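    For readers unfamiliar with the perturbation measures named above, the sketch below shows one widely used definition of local jitter and local shimmer: the mean absolute difference between consecutive cycle values, divided by the mean value, expressed as a percentage. It is an assumption that this matches the definitions used by EZVoicePlus in the study; the function name and the input values are made up for illustration only.

```python
import numpy as np


def local_perturbation_percent(values):
    """Mean absolute cycle-to-cycle difference relative to the mean, in percent.

    Applied to glottal periods this gives local jitter; applied to
    cycle peak amplitudes it gives local shimmer (hypothetical helper).
    """
    values = np.asarray(values, dtype=float)
    if values.size < 2:
        raise ValueError("at least two cycles are required")
    return 100.0 * np.mean(np.abs(np.diff(values))) / np.mean(values)


# Made-up cycle measurements from a sustained vowel (not study data).
periods_ms = [10.0, 11.0, 10.0, 11.0]   # durations of consecutive glottal cycles
peak_amplitudes = [1.0, 0.9, 1.0, 0.9]  # peak amplitude of each cycle

jitter_percent = local_perturbation_percent(periods_ms)
shimmer_percent = local_perturbation_percent(peak_amplitudes)
mean_f0_hz = 1000.0 / np.mean(periods_ms)  # reciprocal of the mean period

print(f"jitter {jitter_percent:.2f} %, shimmer {shimmer_percent:.2f} %, "
      f"mean F0 {mean_f0_hz:.1f} Hz")
```

    Harmonics-to-noise ratio requires a waveform-level analysis and is not covered by this sketch.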

  1. Emotional Prosody Measurement (EPM): A voice-based evaluation method for psychological therapy effectiveness

    NARCIS (Netherlands)

    van den Broek, Egon; Bos, Lodewijk; Laxminarayan, Swamy; Marsh, Andy

    2004-01-01

    The voice embodies three sources of information: speech, the identity, and the emotional state of the speaker (i.e., emotional prosody). The latter feature is resembled by the variability of the F0 (also named fundamental frequency of pitch) (SD F0). To extract this feature, Emotional Prosody

  2. Acoustic and aerodynamic measures of the voice during pregnancy.

    Science.gov (United States)

    Hancock, Adrienne B; Gross, Heather E

    2015-01-01

    Known influences of sex hormones on the voice would suggest pregnancy hormones could have an effect, yet studies using acoustic measures have not indicated changes. Additionally, no examination of the voice before the third trimester has been reported. Effect of pregnancy on the voice is relatively unexplored yet could be quite relevant to female speakers and singers. It is possible that spectral and aerodynamic measures would be more sensitive to tissue-level changes caused by pregnancy hormones. In this first longitudinal study of a 32-year-old woman's pregnancy, weekly voice samples were analyzed for acoustic (fundamental frequency, perturbation ratios of shimmer and jitter, Harmonic-to-Noise Ratio, spectral measures, and maximum phonation time) and aerodynamic (average airflow, peak flow, AC/DC ratio, open quotient, and speed quotient) parameters. All measures appeared generally stable during weeks 11-39 of pregnancy compared with 21 weeks postpartum. Slight decrease in minimum airflow and open speed quotient may reflect suspected vocal fold tissue changes. It is recommended that future studies monitor and test correlations among hormone levels, visual analyses of vocal fold mucosa, aerodynamic function, and glottal efficiency. Copyright © 2015 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  3. Immediate acoustic effects of straw phonation exercises in subjects with dysphonic voices.

    Science.gov (United States)

    Guzman, Marco; Higueras, Diego; Fincheira, Catherine; Muñoz, Daniel; Guajardo, Carlos; Dowdall, Jayme

    2013-04-01

    This study sought to measure any acoustic changes in the speaking voice immediately after phonation exercises involving plastic straws versus phonation exercises with the open vowel /a/. Forty-one primary school teachers with slightly dysphonic voices were asked to participate in four phonatory tasks. Phonetically balanced text at habitual intensity level and speaking fundamental frequency was recorded. Acoustical analysis with long-term average spectrum was performed. Significant changes after therapy for the experimental group include the alpha ratio, L1-L0 ratio and ratio between 1-5 kHz and 5-8 kHz. The results indicate that the use of phonatory tasks with straw exercises can have immediate therapeutic acoustic effects in dysphonic voices. Long-term effects were not assessed in this study.

  4. Correlations between Sportsmen’s Morpho-Functional Measurements and Voice Acoustic Variables

    Directory of Open Access Journals (Sweden)

    Rexhepi Agron M.

    2016-12-01

    Full Text Available Purpose. Since human voice characteristics are specific to each individual, numerous anthropological studies have been oriented to find significant relationships between voice and morpho-functional features. The goal of this study was to identify the correlation between seven morpho-functional variables and six voice acoustic parameters in sportsmen. Methods. Following the protocols of the International Biological Program, seven morpho-functional variables and six voice acoustic parameters have been measured in 88 male professional athletes from Kosovo, aged 17-35 years, during the period of April-October 2013. The statistical analysis was accomplished through the SPSS program, version 20. The obtained data were analysed through descriptive parameters and with Spearman’s method of correlation analysis. Results. Spearman’s method of correlation showed significant negative correlations (R = -0.215 to -0.613; p = 0.05) between three voice acoustic variables of the fundamental frequency of the voice sample (Mean, Minimum, and Maximum Pitch) and six morpho-functional measures (Body Height, Body Weight, Margaria-Kalamen Power Test, Sargent Jump Test, Pull-up Test, and VO2max.abs). Conclusions. The significant correlations imply that the people with higher stature have longer vocal cords and a lower voice. These results encourage investigations on predicting sportsmen’s functional abilities on the basis of their voice acoustic parameters.
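    As a reminder of what Spearman's method computes, the sketch below ranks both variables (averaging tied ranks) and takes the Pearson correlation of the ranks. The study itself used SPSS; this code, its function names, and the five height and pitch values are illustrative assumptions and are not data from the article.

```python
import numpy as np


def average_ranks(values):
    """Ranks starting at 1, with tied values sharing their average rank."""
    values = np.asarray(values, dtype=float)
    order = np.argsort(values, kind="mergesort")
    ranks = np.empty(len(values))
    ranks[order] = np.arange(1, len(values) + 1)
    for v in np.unique(values):
        tied = values == v
        ranks[tied] = ranks[tied].mean()
    return ranks


def spearman_rho(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return float(np.corrcoef(average_ranks(x), average_ranks(y))[0, 1])


# Made-up example values (not study data): taller speakers, lower mean pitch.
body_height_cm = [170, 175, 180, 185, 190]
mean_pitch_hz = [130, 125, 118, 120, 105]

rho = spearman_rho(body_height_cm, mean_pitch_hz)
print(f"Spearman rho = {rho:.2f}")
```

    A negative coefficient, as in this toy example, corresponds to the direction of the relationship reported in the abstract; significance testing is omitted here.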

  5. Evaluation of the effectiveness of a voice training program for teachers.

    Science.gov (United States)

    Pizolato, Raquel Aparecida; Beltrati Cornacchioni Rehder, Maria Inês; dos Santos Dias, Carlos Tadeu; de Castro Meneghim, Marcelo; Bovi Ambrosano, Glaúcia Maria; Mialhe, Fábio Luiz; Pereira, Antonio Carlos

    2013-09-01

    To investigate the effects of a voice education program for teachers, covering vocal function exercises and voice hygiene, and to compare the teachers' voice quality pre- and post-vocal exercise. A random sample of 102 subjects was divided into two groups: experimental group (29 women and seven men) with vocal hygiene and training exercises and control group (52 women and 14 men) with vocal hygiene. Two sessions were held about voice hygiene for the control group and five sessions for the experimental group, one being with reference to the vocal hygiene habit and four vocal exercise sessions. Acoustic analysis of the vowel [i] was made pre- and post-vocal exercise and for the situations of initial and final evaluation of the educational program. Student t test (paired) and Proc MIXED (repeated measures) were used for analyses with level of significance (α = 0.05). The training exercises for posture and cervical relaxation decreased the mean of fundamental frequency (f(0)) for men (P = 0.04), and for the phonation, intensity, and frequency exercises, there was a significant increase for f(0) in women (P = 0.02) and glottal to noise excitation ratio (P = 0.04). There was no statistically significant difference in intergroup evaluations after 3 months. The control group presented increased mean voice intensity in the final evaluation (P = 0.01). Voice training exercises showed a positive and immediate impact on the teacher's quality of voice, but it was not sustained longitudinally, suggesting that actions for this purpose should be continued at schools. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.

  6. Relationship Between Voice and Motor Disabilities of Parkinson's Disease.

    Science.gov (United States)

    Majdinasab, Fatemeh; Karkheiran, Siamak; Soltani, Majid; Moradi, Negin; Shahidi, Gholamali

    2016-11-01

    To evaluate voice of Iranian patients with Parkinson's disease (PD) and find any relationship between motor disabilities and acoustic voice parameters as speech motor components. We evaluated 27 Farsi-speaking PD patients and 21 age- and sex-matched healthy persons as control. Motor performance was assessed by the Unified Parkinson's Disease Rating Scale part III and Hoehn and Yahr rating scale in the "on" state. Acoustic voice evaluation, including fundamental frequency (f0), standard deviation of f0, minimum of f0, maximum of f0, shimmer, jitter, and harmonic to noise ratio, was done using the Praat software via /a/ prolongation. No difference was seen between the voice of the patients and the voice of the controls. f0 and its variation had a significant correlation with the duration of the disease, but did not have any relationships with the Unified Parkinson's Disease Rating Scale part III. Only limited relationship was observed between voice and motor disabilities. Tremor is an important main feature of PD that affects motor and phonation systems. Females had an older age at onset, more prolonged disease, and more severe motor disabilities (not statistically significant), but phonation disorders were more frequent in males and showed more relationship with severity of motor disabilities. Voice is affected by PD earlier than many other motor components and is more sensitive to disease progression. Tremor is the most effective part of PD that impacts voice. PD has more effect on voice of male versus female patients. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  7. Orthogonal frequency division multiple access fundamentals and applications

    CERN Document Server

    Jiang, Tao; Zhang, Yan

    2010-01-01

    Supported by the expert-level advice of pioneering researchers, Orthogonal Frequency Division Multiple Access Fundamentals and Applications provides a comprehensive and accessible introduction to the foundations and applications of one of the most promising access technologies for current and future wireless networks. It includes authoritative coverage of the history, fundamental principles, key techniques, and critical design issues of OFDM systems. Covering various techniques of effective resource management for OFDM/OFDMA-based wireless communication systems, this cutting-edge reference:

  8. Finding your mate at a cocktail party: frequency separation promotes auditory stream segregation of concurrent voices in multi-species frog choruses.

    Directory of Open Access Journals (Sweden)

    Vivek Nityananda

    Full Text Available Vocal communication in crowded social environments is a difficult problem for both humans and nonhuman animals. Yet many important social behaviors require listeners to detect, recognize, and discriminate among signals in a complex acoustic milieu comprising the overlapping signals of multiple individuals, often of multiple species. Humans exploit a relatively small number of acoustic cues to segregate overlapping voices (as well as other mixtures of concurrent sounds, like polyphonic music). By comparison, we know little about how nonhuman animals are adapted to solve similar communication problems. One important cue enabling source segregation in human speech communication is that of frequency separation between concurrent voices: differences in frequency promote perceptual segregation of overlapping voices into separate "auditory streams" that can be followed through time. In this study, we show that frequency separation (ΔF) also enables frogs to segregate concurrent vocalizations, such as those routinely encountered in mixed-species breeding choruses. We presented female gray treefrogs (Hyla chrysoscelis) with a pulsed target signal (simulating an attractive conspecific call) in the presence of a continuous stream of distractor pulses (simulating an overlapping, unattractive heterospecific call). When the ΔF between target and distractor was small (e.g., ≤3 semitones), females exhibited low levels of responsiveness, indicating a failure to recognize the target as an attractive signal when the distractor had a similar frequency. Subjects became increasingly more responsive to the target, as indicated by shorter latencies for phonotaxis, as the ΔF between target and distractor increased (e.g., ΔF = 6-12 semitones). These results support the conclusion that gray treefrogs, like humans, can exploit frequency separation as a perceptual cue to segregate concurrent voices in noisy social environments. The ability of these frogs to segregate

  9. Speech task effects on acoustic measure of fundamental frequency in Cantonese-speaking children.

    Science.gov (United States)

    Ma, Estella P-M; Lam, Nina L-N

    2015-12-01

    Speaking fundamental frequency (F0) is a voice measure frequently used to document changes in vocal performance over time. Knowing the intra-subject variability of speaking F0 has implications on its clinical usefulness. The present study examined the speaking F0 elicited from three speech tasks in Cantonese-speaking children. The study also compared the variability of speaking F0 elicited from different speech tasks. Fifty-six vocally healthy Cantonese-speaking children (31 boys and 25 girls) aged between 7.0 and 10.11 years participated. For each child, speaking F0 was elicited using speech tasks at three linguistic levels (sustained vowel /a/ prolongation, reading aloud a sentence and passage). Two types of variability, within-session (trial-to-trial) and across-session (test-retest) variability, were compared across speech tasks. Significant differences in mean speaking F0 values were found between speech tasks. Mean speaking F0 value elicited from sustained vowel phonations was significantly higher than those elicited from the connected speech tasks. The variability of speaking F0 was higher in sustained vowel prolongation than that in connected speech. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  10. Voice Quality after Treatment for T1a Glottic Carcinoma - Radiotherapy Versus Laser Cordectomy

    International Nuclear Information System (INIS)

    Krengli, Marco; Policarpo, Mario; Manfredda, Irene; Aluffi, Paolo; Gambaro, Giuseppina; Panella, Massimiliano; Pia, Francesco

    2004-01-01

    The purpose of this study was to assess the anatomic and functional outcomes and compare the voice quality in patients affected by T1a glottic carcinoma treated with curative intent with radiotherapy or laser cordectomy. Fifty-seven cases were analysed: 27 after curative radiotherapy and 30 after laser cordectomy. All patients were studied with videolaryngostroboscopy, voice analysis by narrow spectrogram, and vocal parameters (Jitter, Shimmer, noise/harmonic ratio, and diplophonia). Videolaryngostroboscopy showed severe glottic inadequacy in 25% of cases treated with radiation and insufficient compensation 'ventricular band' or 'with arytenoid hyperadduction' in 65% of cases after surgery. Severe dysphonia on the electro-acoustic analysis of voice was observed in 25% of cases after radiation and 70% after laser (p<0.001). Fundamental frequency and vocal parameters showed more favourable results in the radiation group (p<0.001). Voice assessment showed better results after radiotherapy compared with laser cordectomy. Voice outcome should be carefully considered in the treatment decision for T1 glottic carcinoma

  11. Acute effects of radioiodine therapy on the voice and larynx of Basedow-Graves patients

    International Nuclear Information System (INIS)

    Isolan-Cury, Roberta Werlang; Cury, Adriano Namo; Monte, Osmar; Silva, Marta Assumpcao de Andrada e; Duprat, Andre; Marone, Marilia; Almeida, Renata de; Iglesias, Alexandre

    2008-01-01

    Graves' disease is the most common cause of hyperthyroidism. There are three current therapeutic options: anti-thyroid medication, surgery, and radioactive iodine (I 131). There are few data in the literature regarding the effects of radioiodine therapy on the larynx and voice. The aim of this study was to assess the effect of radioiodine therapy on the voice of Basedow-Graves patients. Material and method: A prospective study was done. Following the diagnosis of Graves' disease, patients underwent investigation of their voice, measurement of maximum phonatory time (/a/) and the s/z ratio, fundamental frequency analysis (Praat software), laryngoscopy and (perceptive-auditory) analysis in three different conditions: pre-treatment, 4 days, and 20 days post-radioiodine therapy. Conditions are based on the inflammatory pattern of thyroid tissue (Jones et al. 1999). Results: No statistically significant differences were found in voice characteristics in these three conditions. Conclusion: Radioiodine therapy does not affect voice quality. (author)

  12. Acute effects of radioiodine therapy on the voice and larynx of Basedow-Graves patients

    Energy Technology Data Exchange (ETDEWEB)

    Isolan-Cury, Roberta Werlang; Cury, Adriano Namo [Sao Paulo Santa Casa de Misericordia, SP (Brazil). Medical Science School (FCMSCSP); Monte, Osmar [Sao Paulo Santa Casa de Misericordia, SP (Brazil). Physiology Department; Silva, Marta Assumpcao de Andrada e [Sao Paulo Santa Casa de Misericordia, SP (Brazil). Medical Science School (FCMSCSP). Speech Therapy School; Duprat, Andre [Sao Paulo Santa Casa de Misericordia, SP (Brazil). Medical Science School (FCMSCSP). Otorhinolaryngology Department; Marone, Marilia [Nuclimagem - Irmanity of the Sao Paulo Santa Casa de Misericordia, SP (Brazil). Nuclear Medicine Unit; Almeida, Renata de; Iglesias, Alexandre [Sao Paulo Santa Casa de Misericordia, SP (Brazil). Medical Science School (FCMSCSP). Otorhinolaryngology Department. Endocrinology and Metabology Unit

    2008-07-01

    Graves' disease is the most common cause of hyperthyroidism. There are three current therapeutic options: anti-thyroid medication, surgery, and radioactive iodine (I 131). There are few data in the literature regarding the effects of radioiodine therapy on the larynx and voice. The aim of this study was to assess the effect of radioiodine therapy on the voice of Basedow-Graves patients. Material and method: A prospective study was done. Following the diagnosis of Graves' disease, patients underwent investigation of their voice, measurement of maximum phonatory time (/a/) and the s/z ratio, fundamental frequency analysis (Praat software), laryngoscopy, and auditory-perceptual analysis in three different conditions: pre-treatment, 4 days, and 20 days post-radioiodine therapy. Conditions are based on the inflammatory pattern of thyroid tissue (Jones et al. 1999). Results: No statistically significant differences were found in voice characteristics in these three conditions. Conclusion: Radioiodine therapy does not affect voice quality. (author)

  13. Fundamental Frequency Tuning and Its Influence on LHC 200MHz ACN Cavity

    CERN Document Server

    Linnecar, Trevor Paul R; Tückmantel, Joachim; CERN. Geneva. SPS and LHC Division

    2001-01-01

    To study the influence of the tuner on the fundamental mode frequency, the Q factor as well as the shunt impedance of the LHC 200MHz ACN cavities, 3D simulations have been done in the frequency domain using MAFIA. Curves giving the variation of RF frequency and other RF parameters with tuner position relative to the inner surface of the cavity have been obtained for the fundamental mode. This paper details the simulation results.

  14. [Acoustic and aerodynamic characteristics of the oesophageal voice].

    Science.gov (United States)

    Vázquez de la Iglesia, F; Fernández González, S

    2005-12-01

    The aim of the study was to characterize the physiology and pathophysiology of esophageal voice using objective aerodynamic and acoustic parameters (quantitative and qualitative). The subjects were 33 laryngectomized patients (all male) who underwent an aerodynamic, acoustic, and perceptual protocol. There is a statistical association between the qualitative acoustic and aerodynamic parameters (phonation flow chart type, sound spectrum, perceptual analysis) and the quantitative parameters (neoglottic pressure, phonation flow, phonation time, fundamental frequency, maximum intensity sound level, speech rate). Nevertheless, such observations do not always translate into practical resources for clinical practice. We consider that the findings may pragmatically add new resources for more effective vocal rehabilitation of these patients. The method applied gives a good understanding of the physiology of esophageal voice and supports rehabilitation aimed at improving oral communication skills in the laryngectomee population.

  15. Musculoskeletal Pain and Occupational Variables in Teachers With Voice Disorders and in Those With Healthy Voices-A Pilot Study.

    Science.gov (United States)

    da Silva Vitor, Jhonatan; Siqueira, Larissa Thaís Donalonso; Ribeiro, Vanessa Veis; Ramos, Janine Santos; Brasolotto, Alcione Ghedini; Silverio, Kelly Cristina Alves

    2017-07-01

    This study aimed to compare musculoskeletal pain perception in teachers with voice disorders and in those with healthy voices, and to investigate the relationship between musculoskeletal pain and occupational variables (ie, work journey per week and working period). Forty-three classroom teachers were divided into two groups: dysphonic group (DG), 32 classroom teachers with voice complaints and voice disorders; and non-DG, 11 classroom teachers without voice complaints and who are vocally healthy. The musculoskeletal pain investigation survey was used to investigate the frequency and intensity of the pain. Occupational variables, such as work journey per week and working period, were investigated by the Voice Production Condition-Teacher questionnaire. The statistical tests used were the Spearman correlation (P ≤ 0.05) and the Mann-Whitney U test (P ≤ 0.05). There was no difference between the frequency and the intensity of musculoskeletal pain regarding dysphonia. Work journey per week was positively related to the frequency and the intensity of laryngeal pain in the DG. The working period had a negative relationship to the frequency and the intensity of musculoskeletal pain in the submandibular region in the DG. Classroom teachers with voice disorders and those with healthy voices do not have differences regarding the frequency and the intensity of musculoskeletal pain. Besides dysphonia, pain is an important symptom to be considered in classroom teachers. The occupational variables contributed to the presence of musculoskeletal pain in the region near the larynx, which appears to be directly proportional to work journey per week and inversely proportional to the working period. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  16. Fast and Statistically Efficient Fundamental Frequency Estimation

    DEFF Research Database (Denmark)

    Nielsen, Jesper Kjær; Jensen, Tobias Lindstrøm; Jensen, Jesper Rindom

    2016-01-01

    Fundamental frequency estimation is a very important task in many applications involving periodic signals. For computational reasons, fast autocorrelation-based estimation methods are often used despite parametric estimation methods having superior estimation accuracy. However, these parametric...... a recursive solver. Via benchmarks, we demonstrate that the computation time is reduced by approximately two orders of magnitude. The proposed fast algorithm is available for download online....

  17. Mobile voice health monitoring using a wearable accelerometer sensor and a smartphone platform.

    Science.gov (United States)

    Mehta, Daryush D; Zañartu, Matías; Feng, Shengran W; Cheyne, Harold A; Hillman, Robert E

    2012-11-01

    Many common voice disorders are chronic or recurring conditions that are likely to result from faulty and/or abusive patterns of vocal behavior, referred to generically as vocal hyperfunction. An ongoing goal in clinical voice assessment is the development and use of noninvasively derived measures to quantify and track the daily status of vocal hyperfunction so that the diagnosis and treatment of such behaviorally based voice disorders can be improved. This paper reports on the development of a new, versatile, and cost-effective clinical tool for mobile voice monitoring that acquires the high-bandwidth signal from an accelerometer sensor placed on the neck skin above the collarbone. Using a smartphone as the data acquisition platform, the prototype device provides a user-friendly interface for voice use monitoring, daily sensor calibration, and periodic alert capabilities. Pilot data are reported from three vocally normal speakers and three subjects with voice disorders to demonstrate the potential of the device to yield standard measures of fundamental frequency and sound pressure level and model-based glottal airflow properties. The smartphone-based platform enables future clinical studies for the identification of the best set of measures for differentiating between normal and hyperfunctional patterns of voice use.

  18. Do women's voices provide cues of the likelihood of ovulation? The importance of sampling regime.

    Directory of Open Access Journals (Sweden)

    Julia Fischer

    The human voice provides a rich source of information about individual attributes such as body size, developmental stability and emotional state. Moreover, there is evidence that female voice characteristics change across the menstrual cycle. A previous study reported that women speak with higher fundamental frequency (F0) in the high-fertility compared to the low-fertility phase. To gain further insights into the mechanisms underlying this variation in perceived attractiveness and the relationship between vocal quality and the timing of ovulation, we combined hormone measurements and acoustic analyses, to characterize voice changes on a day-to-day basis throughout the menstrual cycle. Voice characteristics were measured from free speech as well as sustained vowels. In addition, we asked men to rate vocal attractiveness from selected samples. The free speech samples revealed marginally significant variation in F0 with an increase prior to and a distinct drop during ovulation. Overall variation throughout the cycle, however, precluded unequivocal identification of the period with the highest conception risk. The analysis of vowel samples revealed a significant increase in degree of unvoiceness and noise-to-harmonic ratio during menstruation, possibly related to an increase in tissue water content. Neither estrogen nor progestogen levels predicted the observed changes in acoustic characteristics. The perceptual experiments revealed a preference by males for voice samples recorded during the pre-ovulatory period compared to other periods in the cycle. While overall we confirm earlier findings in that women speak with a higher and more variable fundamental frequency just prior to ovulation, the present study highlights the importance of taking the full range of variation into account before drawing conclusions about the value of these cues for the detection of ovulation.

  19. Voice Quality after Treatment for T1a Glottic Carcinoma - Radiotherapy Versus Laser Cordectomy

    Energy Technology Data Exchange (ETDEWEB)

    Krengli, Marco; Policarpo, Mario; Manfredda, Irene; Aluffi, Paolo; Gambaro, Giuseppina; Panella, Massimiliano; Pia, Francesco [Univ. of Piemonte Orientale 'Amedeo Avogadro', Novara (Italy). Div. of Radiotherapy]

    2004-04-01

    The purpose of this study was to assess the anatomic and functional outcomes and compare the voice quality in patients affected by T1a glottic carcinoma treated with curative intent with radiotherapy or laser cordectomy. Fifty-seven cases were analysed: 27 after curative radiotherapy and 30 after laser cordectomy. All patients were studied with videolaryngostroboscopy, voice analysis by narrow spectrogram, and vocal parameters (Jitter, Shimmer, noise/harmonic ratio, and diplophonia). Videolaryngostroboscopy showed severe glottic inadequacy in 25% of cases treated with radiation and insufficient compensation 'ventricular band' or 'with arytenoid hyperadduction' in 65% of cases after surgery. Severe dysphonia on the electro-acoustic analysis of voice was observed in 25% of cases after radiation and 70% after laser (p<0.001). Fundamental frequency and vocal parameters showed more favourable results in the radiation group (p<0.001). Voice assessment showed better results after radiotherapy compared with laser cordectomy. Voice outcome should be carefully considered in the treatment decision for T1 glottic carcinoma.

  20. [Environmental factors and vocal habits regarding pre-school teachers and functionaries suffering voice disorders].

    Science.gov (United States)

    Barreto-Munévar, Deisy P; Cháux-Ramos, Oriana M; Estrada-Rangel, Mónica A; Sánchez-Morales, Jenifer; Moreno-Angarita, Marisol; Camargo-Mendoza, Maryluz

    2011-06-01

    The aim was to determine the relationship between vocal habits and environmental/occupational conditions and the presence of vocal disturbance (dysphonia) in teachers and functionaries working at community-based, initial childhood education centres (kindergartens). This was a descriptive study which adopted a cross-sectional approach using 198 participants and was developed in three phases. Phase 1 consisted of identifying participants having the highest risk of presenting vocal disturbance. Phase 2 consisted of observation-analysis concerning the voice use and vocal habits of participants who had been identified in phase 1. Phase 3 consisted of perceptual and computational assessment of participants' voices using Wilson's vocal profile and the multidimensional voice program. Individuals having pitch breaks, throat clearing, increased voice intensity, and gastro-oesophageal reflux were found to present below standard fundamental frequency (FF). Subjects having altered breathing and increased voice intensity were identified as having above standard shimmer and jitter acoustic values. A high rate of inability to work was found due to vocal disturbance. It is thus suggested that there is a correlation between vocal habits and vocal disorders presented by preschool teachers in kindergarten settings.

  1. Voice amplification as a means of reducing vocal load for elementary music teachers.

    Science.gov (United States)

    Morrow, Sharon L; Connor, Nadine P

    2011-07-01

    Music teachers are over four times more likely than classroom teachers to develop voice disorders and greater than eight times more likely to have voice-related problems than the general public. Research has shown that individual voice-use parameters of phonation time, fundamental frequency and vocal intensity, as well as vocal load as calculated by cycle dose and distance dose are significantly higher for music teachers than their classroom teacher counterparts. Finding effective and inexpensive prophylactic measures to decrease vocal load for music teachers is an important aspect for voice preservation for this group of professional voice users. The purpose of this study was to determine the effects of voice amplification on vocal intensity and vocal load in the workplace as measured using a KayPENTAX Ambulatory Phonation Monitor (APM) (KayPENTAX, Lincoln Park, NJ). Seven music teachers were monitored for 1 workweek using an APM to determine average vocal intensity (dB sound pressure level [SPL]) and vocal load as calculated by cycle dose and distance dose. Participants were monitored a second week while using a voice amplification unit (Asyst ChatterVox; Asyst Communications Company, Inc., Indian Creek, IL). Significant decreases in mean vocal intensity of 7.00-dB SPL (Pmusic teachers in the classroom. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.

  2. Fundamental Frequency Estimation using Polynomial Rooting of a Subspace-Based Method

    DEFF Research Database (Denmark)

    Jensen, Jesper Rindom; Christensen, Mads Græsbøll; Jensen, Søren Holdt

    2010-01-01

    improvements compared to HMUSIC. First, by using the proposed method we can obtain an estimate of the fundamental frequency without doing a grid search as in HMUSIC. This is because the fundamental frequency is estimated as the argument of the root lying closest to the unit circle. Second, we obtain...... a higher spectral resolution compared to HMUSIC, which is a property of polynomial rooting methods. Our simulation results show that the proposed method is applicable to real-life signals, and that we in most cases obtain a higher spectral resolution than HMUSIC....

  3. The electronic cry: Voice and gender in electroacoustic music

    NARCIS (Netherlands)

    Bosma, H.M.

    2013-01-01

    The voice provides an entrance to discuss gender and related fundamental issues in electroacoustic music that are relevant as well in other musical genres and outside of music per se: the role of the female voice; the use of language versus non-verbal vocal sounds; the relation of voice, embodiment

  4. Observations of the relationship between noise exposure and preschool teacher voice usage in day-care center environments.

    Science.gov (United States)

    Lindstrom, Fredric; Waye, Kerstin Persson; Södersten, Maria; McAllister, Anita; Ternström, Sten

    2011-03-01

    Although the relationship between noise exposure and vocal behavior (the Lombard effect) is well established, actual vocal behavior in the workplace is still relatively unexamined. The first purpose of this study was to investigate correlations between noise level and both voice level and voice average fundamental frequency (F₀) for a population of preschool teachers in their normal workplace. The second purpose was to study the vocal behavior of each teacher to investigate whether individual vocal behaviors or certain patterns could be identified. Voice and noise data were obtained for female preschool teachers (n=13) in their workplace, using wearable measurement equipment. Correlations between noise level and voice level, and between voice level and F₀, were calculated for each participant and ranged from 0.07 to 0.87 for voice level and from 0.11 to 0.78 for F₀. The large spread of the correlation coefficients indicates that the teachers react individually to the noise exposure. For example, some teachers increase their voice-to-noise level ratio when the noise is reduced, whereas others do not. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.

  5. METHODS FOR QUALITY ENHANCEMENT OF USER VOICE SIGNAL IN VOICE AUTHENTICATION SYSTEMS

    Directory of Open Access Journals (Sweden)

    O. N. Faizulaieva

    2014-03-01

    The rationale for using the voice of computer system users in the authentication process is substantiated. The scientific task of improving the signal/noise ratio of the user voice signal in the authentication system is considered. The object of study is the process of input and output of the voice signal of the authentication system user in computer systems and networks. Methods and means for input and extraction of the voice signal against external interference signals are investigated, and methods for quality enhancement of the user voice signal in voice authentication systems are suggested. As modern computer facilities, including mobile ones, have two-channel audio cards, the use of two microphones is proposed in the voice signal input system of the authentication system. In addition, the task of forming a lobe of the microphone array in the desired area of voice signal registration (100 Hz to 8 kHz) is solved. The directional properties of the proposed microphone array make it possible to reduce the influence of external interference signals by a factor of two to three in the frequency range from 4 to 8 kHz. The possibilities for implementing space-time processing of the recorded signals using constant and adaptive weighting factors are investigated. The simulation results of the proposed system for input and extraction of signals during digital processing of narrowband signals are presented. The proposed solutions make it possible to improve the signal/noise ratio of the recorded useful signals by 10 to 20 dB under the influence of external interference signals in the frequency range from 4 to 8 kHz. The results may be useful to specialists working in the field of voice recognition and speaker discrimination.

  6. Comparison of voice-use profiles between elementary classroom and music teachers.

    Science.gov (United States)

    Morrow, Sharon L; Connor, Nadine P

    2011-05-01

    Among teachers, music teachers are roughly four times more likely than classroom teachers to develop voice-related problems. Although it has been established that music teachers use their voices at high intensities and durations in the course of their workday, voice-use profiles concerning the amount and intensity of vocal use and vocal load have neither been quantified nor has vocal load for music teachers been compared with classroom teachers using these same voice-use parameters. In this study, total phonation time, fundamental frequency (F₀), and vocal intensity (dB SPL [sound pressure level]) were measured or estimated directly using a KayPENTAX Ambulatory Phonation Monitor (KayPENTAX, Lincoln Park, NJ). Vocal load was calculated as cycle and distance dose, as defined by Švec et al (2003), which integrates total phonation time, F₀, and vocal intensity. Twelve participants (n = 7 elementary music teachers and n = 5 elementary classroom teachers) were monitored during five full teaching days of one workweek to determine average vocal load for these two groups of teachers. Statistically significant differences in all measures were found between the two groups (P vocal loads for music teachers are substantially higher than those experienced by classroom teachers (P vocal load may have immediate clinical and educational benefits in vocal health in music teachers. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
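
    A minimal sketch of how the cycle and distance doses named in this record could be accumulated from frame-level monitor output. The function name `vocal_doses`, the frame layout, and the per-frame vibration amplitude input are assumptions made for illustration; the abstract does not say how the monitor derives amplitude. The cycle dose is taken here as the number of vocal fold oscillations summed over voiced frames, and the distance dose as four times the amplitude times F₀ summed over the same frames, which is how these doses are commonly defined.

```python
def vocal_doses(frames, frame_duration):
    """Accumulate time, cycle, and distance doses.

    frames: iterable of (voiced, f0_hz, amplitude_m) tuples, one per analysis
            frame (hypothetical layout; amplitude_m is an assumed input).
    frame_duration: length of one frame in seconds.
    Returns (time_dose_s, cycle_dose_cycles, distance_dose_m).
    """
    time_dose = 0.0
    cycle_dose = 0.0
    distance_dose = 0.0
    for voiced, f0_hz, amplitude_m in frames:
        if not voiced:
            continue  # unvoiced frames contribute nothing to any dose
        time_dose += frame_duration
        # Number of oscillation cycles completed during this frame.
        cycle_dose += f0_hz * frame_duration
        # Each cycle covers roughly four amplitudes of travel.
        distance_dose += 4.0 * amplitude_m * f0_hz * frame_duration
    return time_dose, cycle_dose, distance_dose
```

    For instance, ten voiced 50 ms frames at 200 Hz with an assumed 1 mm amplitude give about 0.5 s of phonation, 100 cycles, and 0.4 m of travel.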

  7. Rotational structure of the five lowest frequency fundamental vibrational states of dimethylsulfoxide

    Science.gov (United States)

    Cuisset, Arnaud; Drumel, Marie-Aline Martin; Hindle, Francis; Mouret, Gaël; Sadovskií, Dmitrií A.

    2013-10-01

    We report on the successful extended analysis of the high-frequency (200-700 GHz) part of the gas phase (sub)mm-wave spectra of dimethylsulfoxide (DMSO). The spectrum was recorded at 100 kHz resolution using a solid state subTHz spectrometer. The five lowest energy fundamental vibrational states of DMSO with frequencies below 400 cm⁻¹ were observed as sidebands along with the main 0←0 band. Neglecting the internal rotation of methyls, our rotational Hamiltonian reproduced the spectrum to subMHz accuracy. We have found that the asymmetric bending state ν23 is the only low frequency fundamental vibrational state with the "anomalous" rotational structure uncovered in Cuisset et al. [1].

  8. Bayesian analysis of rotating machines - A statistical approach to estimate and track the fundamental frequency

    DEFF Research Database (Denmark)

    Pedersen, Thorkild Find

    2003-01-01

    Rotating and reciprocating mechanical machines emit acoustic noise and vibrations when they operate. Typically, the noise and vibrations are concentrated in narrow frequency bands related to the running speed of the machine. The frequency of the running speed is referred to as the fundamental frequency and the related frequencies as orders of the fundamental frequency. When analyzing rotating or reciprocating machines it is important to know the running speed. Usually this requires direct access to the rotating parts in order to mount a dedicated tachometer probe. In this thesis different......

  9. Pitch (F0) and formant profiles of human vowels and vowel-like baboon grunts: The role of vocalizer body size and voice-acoustic allometry

    Science.gov (United States)

    Rendall, Drew; Kollias, Sophie; Ney, Christina; Lloyd, Peter

    2005-02-01

    Key voice features, namely fundamental frequency (F0) and formant frequencies, can vary extensively between individuals. Much of the variation can be traced to differences in the size of the larynx and vocal-tract cavities, but whether these differences in turn simply reflect differences in speaker body size (i.e., neutral vocal allometry) remains unclear. Quantitative analyses were therefore undertaken to test the relationship between speaker body size and voice F0 and formant frequencies for human vowels. To test the taxonomic generality of the relationships, the same analyses were conducted on the vowel-like grunts of baboons, whose phylogenetic proximity to humans and similar vocal production biology and voice acoustic patterns recommend them for such comparative research. For adults of both species, males were larger than females and had lower mean voice F0 and formant frequencies. However, beyond this, F0 variation did not track body-size variation between the sexes in either species, nor within sexes in humans. In humans, formant variation correlated significantly with speaker height but only in males and not in females. Implications for general vocal allometry are discussed as are implications for speech origins theories, and challenges to them, related to laryngeal position and vocal tract length.

  10. Effects of melody and technique on acoustical and musical features of western operatic singing voices.

    Science.gov (United States)

    Larrouy-Maestri, Pauline; Magis, David; Morsomme, Dominique

    2014-05-01

    The operatic singing technique is frequently used in classical music. Several acoustical parameters of this specific technique have been studied but how these parameters combine remains unclear. This study aims to further characterize the Western operatic singing technique by observing the effects of melody and technique on acoustical and musical parameters of the singing voice. Fifty professional singers performed two contrasting melodies (popular song and romantic melody) with two vocal techniques (with and without operatic singing technique). The common quality parameters (energy distribution, vibrato rate, and extent), perturbation parameters (standard deviation of the fundamental frequency, signal-to-noise ratio, jitter, and shimmer), and musical features (fundamental frequency of the starting note, average tempo, and sound pressure level) of the 200 sung performances were analyzed. The results regarding the effect of melody and technique on the acoustical and musical parameters show that the choice of melody had a limited impact on the parameters observed, whereas a particular vocal profile appeared depending on the vocal technique used. This study confirms that vocal technique affects most of the parameters examined. In addition, the observation of quality, perturbation, and musical parameters contributes to a better understanding of the Western operatic singing technique. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.

  11. Fundamental Frequency Extraction Method using Central Clipping and its Importance for the Classification of Emotional State

    Directory of Open Access Journals (Sweden)

    Pavol Partila

    2012-01-01

    The paper deals with the classification of emotional state. We implemented a method for extracting the fundamental frequency of the speech signal by means of central clipping and examined the correlation between emotional state and fundamental speech frequency. For this purpose, we applied an exploratory data analysis approach. The ANOVA (analysis of variance) test confirmed that a change in the speaker's emotional state changes the fundamental frequency of the human vocal tract. The main contribution of the paper lies in the investigation of the central clipping method by means of ANOVA.
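
    The central clipping approach named in this record can be sketched as follows. This is an illustrative outline only, not the authors' implementation: the clipping ratio of 0.3, the 60 to 400 Hz search range, the use of a plain autocorrelation peak search after clipping, and the function names are all assumptions.

```python
import math


def center_clip(frame, ratio=0.3):
    """Zero samples below ratio * peak and shift the remainder toward zero."""
    threshold = ratio * max(abs(s) for s in frame)
    clipped = []
    for s in frame:
        if s > threshold:
            clipped.append(s - threshold)
        elif s < -threshold:
            clipped.append(s + threshold)
        else:
            clipped.append(0.0)
    return clipped


def estimate_f0(frame, sample_rate, f_min=60.0, f_max=400.0, clip_ratio=0.3):
    """Return an F0 estimate in Hz, or None if no periodicity is found."""
    if not frame:
        return None
    clipped = center_clip(frame, clip_ratio)
    lag_min = int(sample_rate / f_max)  # shortest candidate period in samples
    lag_max = int(sample_rate / f_min)  # longest candidate period in samples
    best_lag, best_value = 0, 0.0
    for lag in range(lag_min, lag_max + 1):
        # Autocorrelation of the clipped frame at this lag.
        value = sum(clipped[n] * clipped[n + lag]
                    for n in range(len(clipped) - lag))
        if value > best_value:
            best_lag, best_value = lag, value
    return sample_rate / best_lag if best_lag else None
```

    Clipping removes low-amplitude detail (largely formant structure) so that the autocorrelation peak at the pitch period stands out; the estimate is quantized to whole-sample lags, so a 150 Hz tone sampled at 8 kHz comes back as roughly 151 Hz.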

  12. Voice and Speech Quality Perception Assessment and Evaluation

    CERN Document Server

    Jekosch, Ute

    2005-01-01

    Foundations of Voice and Speech Quality Perception starts out with the fundamental question of: "How do listeners perceive voice and speech quality and how can these processes be modeled?" Any quantitative answers require measurements. This is natural for physical quantities but harder to imagine for perceptual measurands. This book approaches the problem by actually identifying major perceptual dimensions of voice and speech quality perception, defining units wherever possible and offering paradigms to position these dimensions into a structural skeleton of perceptual speech and voice quality. The emphasis is placed on voice and speech quality assessment of systems in artificial scenarios. Many scientific fields are involved. This book bridges the gap between two quite diverse fields, engineering and humanities, and establishes the new research area of Voice and Speech Quality Perception.

  13. Clinical voice analysis of Carnatic singers.

    Science.gov (United States)

    Arunachalam, Ravikumar; Boominathan, Prakash; Mahalingam, Shenbagavalli

    2014-01-01

    Carnatic singing is a classical South Indian style of music that involves rigorous training to produce an "open throated" loud, predominantly low-pitched singing, embedded with vocal nuances in higher pitches. Voice problems in singers are not uncommon. The objective was to report the nature of voice problems and apply a routine protocol to assess the voice. Forty-five trained performing singers (females: 36 and males: 9) who reported to a tertiary care hospital with voice problems underwent voice assessment. The study analyzed their problems and the clinical findings. Voice change, difficulty in singing higher pitches, and voice fatigue were major complaints. Most of the singers suffered laryngopharyngeal reflux that coexisted with muscle tension dysphonia and chronic laryngitis. Speaking voices were rated predominantly as "moderate deviation" on GRBAS (Grade, Rough, Breathy, Asthenia, and Strain). Maximum phonation time ranged from 4 to 29 seconds (females: 10.2, standard deviation [SD]: 5.28 and males: 15.7, SD: 5.79). Singing frequency range was reduced (females: 21.3 Semitones and males: 23.99 Semitones). Dysphonia severity index (DSI) scores ranged from -3.5 to 4.91 (females: 0.075 and males: 0.64). Singing frequency range and DSI did not show significant difference between sex and across clinical diagnosis. Self-perception using voice disorder outcome profile revealed overall severity score of 5.1 (SD: 2.7). Findings are discussed from a clinical intervention perspective. Study highlighted the nature of voice problems (hyperfunctional) and required modifications in assessment protocol for Carnatic singers. Need for regular assessments and vocal hygiene education to maintain good vocal health are emphasized as outcomes. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
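
    The singing frequency range in this record is reported in semitones. As a hedged illustration (the function name and the example frequencies are hypothetical and are not data from the study), the interval between the lowest and highest sustained frequencies converts to equal-tempered semitones as 12 times the base-2 logarithm of their ratio:

```python
import math


def semitone_range(f_low_hz, f_high_hz):
    """Interval between two frequencies in equal-tempered semitones."""
    if f_low_hz <= 0 or f_high_hz <= 0:
        raise ValueError("frequencies must be positive")
    return 12.0 * math.log2(f_high_hz / f_low_hz)
```

    One octave (for example 220 Hz to 440 Hz) corresponds to 12 semitones, so a range of about 21 semitones is a little under two octaves.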

  14. Voice Habits and Behaviors: Voice Care Among Flamenco Singers.

    Science.gov (United States)

    Garzón García, Marina; Muñoz López, Juana; Y Mendoza Lara, Elvira

    2017-03-01

    The purpose of this study is to analyze the vocal behavior of flamenco singers, as compared with classical music singers, to establish a differential vocal profile of voice habits and behaviors in flamenco music. Bibliographic review was conducted, and the Singer's Vocal Habits Questionnaire, an experimental tool designed by the authors to gather data regarding hygiene behavior, drinking and smoking habits, type of practice, voice care, and symptomatology perceived in both the singing and the speaking voice, was administered. We interviewed 94 singers, divided into two groups: the flamenco experimental group (FEG, n = 48) and the classical control group (CCG, n = 46). Frequency analysis, a Likert scale, and discriminant and exploratory factor analysis were used to obtain a differential profile for each group. The FEG scored higher than the CCG in speaking voice symptomatology. The FEG scored significantly higher than the CCG in use of "inadequate vocal technique" when singing. Regarding voice habits, the FEG scored higher in "lack of practice and warm-up" and "environmental habits." A total of 92.6% of the subjects classified themselves correctly in each group. The Singer's Vocal Habits Questionnaire has proven effective in differentiating flamenco and classical singers. Flamenco singers are exposed to numerous vocal risk factors that make them more prone to vocal fatigue, mucosa dehydration, phonotrauma, and muscle stiffness than classical singers. Further research is needed in voice training in flamenco music, as a means to strengthen the voice and enable it to meet the requirements of this musical genre. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  15. The Effect of Communications Medium on the Fundamental Frequency of Speech.

    Science.gov (United States)

    Noll, A. Michael

    1978-01-01

    Describes the results of preliminary experiments to investigate the effects of communications medium (face-to-face and two-way closed circuit television) on the fundamental frequency of speakers in a dyadic communications situation. (JMF)

  16. The Belt voice: Acoustical measurements and esthetic correlates

    Science.gov (United States)

    Bounous, Barry Urban

    This dissertation explores the esthetic attributes of the Belt voice through spectral acoustical analysis. The process of understanding the nature and safe practice of Belt is just beginning, whereas the understanding of classical singing is well established. The unique nature of the Belt sound provides difficulties for voice teachers attempting to evaluate the quality and appropriateness of a particular sound or performance. This study attempts to provide answers to the question "does Belt conform to a set of measurable esthetic standards?" In answering this question, this paper expands on a previous study of the esthetic attributes of the classical baritone voice (see "Vocal Beauty", NATS Journal 51,1) which also drew some tentative conclusions about the Belt voice but which had an inadequate sample pool of subjects from which to draw. Further, this study demonstrates that it is possible to scientifically investigate the realm of musical esthetics in the singing voice. It is possible to go beyond the "a trained voice compared to an untrained voice" paradigm when evaluating quantitative vocal parameters and actually investigate what truly beautiful voices do. There are functions of sound energy (measured in dB) transference which may affect the nervous system in predictable ways and which can be measured and associated with esthetics. This study does not show consistency in measurements for absolute beauty (taste) even among belt teachers and researchers but does show some markers with varying degrees of importance which may point to a difference between our cognitive learned response to singing and our emotional, more visceral response to sounds. The markers which are significant in determining vocal beauty are: (1) Vibrancy-Characteristics of vibrato including speed, width, and consistency (low variability). (2) Spectral makeup-Ratio of partial strength above the fundamental to the fundamental. (3) Activity of the voice-The quantity of energy being produced. 
(4

  17. Joint fundamental frequency and order estimation using optimal filtering

    Directory of Open Access Journals (Sweden)

    Jakobsson Andreas

    2011-01-01

    In this paper, the problem of jointly estimating the number of harmonics and the fundamental frequency of periodic signals is considered. We show how this problem can be solved using a number of methods that either are or can be interpreted as filtering methods in combination with a statistical model selection criterion. The methods in question are the classical comb filtering method, a maximum likelihood method, and some filtering methods based on optimal filtering that have recently been proposed, while the model selection criterion is derived herein from the maximum a posteriori principle. The asymptotic properties of the optimal filtering methods are analyzed and an order-recursive efficient implementation is derived. Finally, the estimators have been compared in computer simulations that show that the optimal filtering methods perform well under various conditions. It has previously been demonstrated that the optimal filtering methods perform extremely well with respect to fundamental frequency estimation under adverse conditions, and this fact, combined with the new results on model order estimation and efficient implementation, suggests that these methods form an appealing alternative to classical methods for analyzing multi-pitch signals.
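
    The joint search over fundamental frequency and number of harmonics can be illustrated with a minimal sketch. It is not the paper's optimal-filtering estimator: it assumes white Gaussian noise, uses harmonic summation as an approximate maximum likelihood fit, and substitutes a BIC-style penalty for the paper's MAP criterion. The function names, grid, and penalty are all assumptions made for illustration.

```python
import numpy as np

def harmonic_energy(x, fs, f0, order):
    """Approximate energy explained by `order` harmonics of f0 (harmonic summation)."""
    n = np.arange(len(x))
    harmonics = f0 * np.arange(1, order + 1)
    basis = np.exp(-2j * np.pi * np.outer(harmonics, n) / fs)
    return 2.0 / len(x) * np.sum(np.abs(basis @ x) ** 2)

def estimate_f0_and_order(x, fs, f0_grid, max_order):
    """Grid search over (f0, order) minimising a penalised residual-variance cost."""
    x = np.asarray(x, dtype=float)
    n_samples = len(x)
    total = np.sum(x ** 2)
    best = (np.inf, None, 0)
    for f0 in f0_grid:
        # keep every harmonic strictly below the Nyquist frequency
        top = min(max_order, int(np.ceil(fs / (2.0 * f0))) - 1)
        for order in range(1, top + 1):
            resid = max(total - harmonic_energy(x, fs, f0, order), 1e-12)
            # assumed BIC-style penalty: 2 parameters per harmonic plus f0
            cost = (n_samples * np.log(resid / n_samples)
                    + (2 * order + 1) * np.log(n_samples))
            if cost < best[0]:
                best = (cost, float(f0), order)
    return best[1], best[2]
```

    The penalty term is what prevents the search from settling on a sub-harmonic: a candidate at half the true fundamental explains the same energy but needs twice as many harmonics to do so.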

  18. The interaction of tone with voicing and foot structure: evidence from Kera phonetics and phonology

    Science.gov (United States)

    Pearce, Mary Dorothy

    This thesis uses acoustic measurements as a basis for the phonological analysis of the interaction of tone with voicing and foot structure in Kera (a Chadic language). In both tone spreading and vowel harmony, the iambic foot acts as a domain for spreading. Further evidence for the foot comes from measurements of duration, intensity and vowel quality. Kera is unusual in combining a tone system with a partially independent metrical system based on iambs. In words containing more than one foot, the foot is the tone bearing unit (TBU), but in shorter words, the TBU is the syllable. In perception and production experiments, results show that Kera speakers, unlike English and French speakers, use the fundamental frequency as the principal cue to the "voicing" contrast. Voice onset time (VOT) has only a minor role. Historically, tones probably developed from voicing through a process of tonogenesis, but synchronically, the feature voice is no longer contrastive and VOT is used in an enhancing role. Some linguists have claimed that Kera is a key example for their controversial theory of long-distance voicing spread. But as voice is not part of Kera phonology, this thesis gives counter-evidence to the voice spreading claim. An important finding from the experiments is that the phonological grammars are different between village women, men moving to town and town men. These differences are attributed to French contact. The interaction between Kera tone and voicing and contact with French have produced changes from a 2-way voicing contrast, through a 3-way tonal contrast, to a 2-way voicing contrast plus another contrast with short VOT. These diachronic and synchronic tone/voicing facts are analysed using laryngeal features and Optimality Theory. This thesis provides a body of new data, detailed acoustic measurements, and an analysis incorporating current theoretical issues in phonology, which make it of interest to Africanists and theoreticians alike.

  19. Fundamental Frequency Estimation of the Speech Signal Compressed by MP3 Algorithm Using PCC Interpolation

    Directory of Open Access Journals (Sweden)

    MILIVOJEVIC, Z. N.

    2010-02-01

    In this paper, the fundamental frequency estimation results for the MP3-modeled speech signal are analyzed. The fundamental frequency was estimated by the Picking-Peaks algorithm with Parametric Cubic Convolution (PCC) interpolation. The efficiency of PCC was tested for the Catmull-Rom, Greville, and Greville two-parametric kernels. Depending on the MSE, a window that gives optimal results was chosen.
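
    As a rough illustration of peak picking refined by cubic convolution, the sketch below uses the one-parameter Keys kernel, for which alpha = -0.5 gives the Catmull-Rom case. The Greville kernels tested in the paper are not reproduced here, and the function names and the fine-grid search are assumptions for illustration only.

```python
import numpy as np

def keys_kernel(s, alpha=-0.5):
    """One-parameter cubic convolution kernel; alpha = -0.5 is Catmull-Rom."""
    s = np.abs(np.asarray(s, dtype=float))
    inner = (alpha + 2) * s**3 - (alpha + 3) * s**2 + 1   # |s| <= 1
    outer = alpha * (s**3 - 5 * s**2 + 8 * s - 4)         # 1 < |s| < 2
    return np.where(s <= 1, inner, np.where(s < 2, outer, 0.0))

def interpolate(samples, x, alpha=-0.5):
    """Cubic convolution estimate of `samples` at fractional index x."""
    samples = np.asarray(samples, dtype=float)
    k0 = int(np.floor(x))
    idx = np.arange(k0 - 1, k0 + 3)                # four nearest samples
    clipped = np.clip(idx, 0, len(samples) - 1)    # repeat edge samples
    return float(np.sum(samples[clipped] * keys_kernel(x - idx, alpha)))

def refined_peak(samples, alpha=-0.5, steps=200):
    """Pick the largest sample, then search a fine grid around it."""
    k = int(np.argmax(samples))
    lo, hi = max(k - 1, 0), min(k + 1, len(samples) - 1)
    grid = np.linspace(lo, hi, 2 * steps + 1)
    values = [interpolate(samples, x, alpha) for x in grid]
    return float(grid[int(np.argmax(values))])
```

    Applied to a magnitude spectrum, the returned fractional bin index multiplied by the bin spacing gives a frequency estimate that is not limited to the discrete bin centres.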

  20. Hearing history influences voice gender perceptual performance in cochlear implant users.

    Science.gov (United States)

    Kovačić, Damir; Balaban, Evan

    2010-12-01

    The study was carried out to assess the role that five hearing history variables (chronological age, age at onset of deafness, age of first cochlear implant [CI] activation, duration of CI use, and duration of known deafness) play in the ability of CI users to identify speaker gender. Forty-one juvenile CI users participated in two voice gender identification tasks. In a fixed, single-interval task, subjects listened to a single speech item from one of 20 adult male or 20 adult female speakers and had to identify speaker gender. In an adaptive speech-based voice gender discrimination task with the fundamental frequency difference between the voices as the adaptive parameter, subjects listened to a pair of speech items presented in sequential order, one of which was always spoken by an adult female and the other by an adult male. Subjects had to identify the speech item spoken by the female voice. Correlation and regression analyses between perceptual scores in the two tasks and the hearing history variables were performed. Subjects fell into three performance groups: (1) those who could distinguish voice gender in both tasks, (2) those who could distinguish voice gender in the adaptive but not the fixed task, and (3) those who could not distinguish voice gender in either task. Gender identification performance for single voices in the fixed task was significantly and negatively related to the duration of deafness before cochlear implantation (shorter deafness yielded better performance), whereas performance in the adaptive task was weakly but significantly related to age at first activation of the CI device, with earlier activations yielding better scores. The existence of a group of subjects able to perform adaptive discrimination but unable to identify the gender of singly presented voices demonstrates the potential dissociability of the skills required for these two tasks, suggesting that duration of deafness and age of cochlear implantation could have

  1. The IE Middle Voice: A Study in Syntactic Strategy and Syntactic Change.

    Science.gov (United States)

    Barber, Elizabeth

    The active/passive system of English grew out of a Proto-Indo-European (PIE) system where the fundamental distinction was between active and middle voices. The middle voice included within its functions the relationship that now would be known as passive. The PIE voice system is preserved in ancient Greek and Sanskrit, and in the former, the…

  2. Applicability of the Arabic version of Vocal Tract Discomfort Scale (VTDS) with student singers as professional voice users.

    Science.gov (United States)

    Darawsheh, Wesam B; Natour, Yaser S; Sada, Eve G

    2018-07-01

    This pilot study aimed to evaluate the internal consistency, convergent construct validity and criterion validity of the Arabic version of the Vocal Tract Discomfort Scale (VTDS), and to investigate the correlation between the scores of the VTDS, the VHI and the acoustic measures of fundamental frequency (F0), shimmer, jitter and signal-to-noise ratio (SNR). This was a cross-sectional study with 97 participants (47 males and 50 females; mean age 20.5 ± 2.1 years; 31 student singers and 66 other non-professional voice user students). Participants had no self-perceived voice disorders; they completed the VTDS-Arab scale and the Voice Handicap Index (VHI-Arab), and recorded a vocal sample of /a:/ at a comfortable level. Internal consistency, signifying reliability, was confirmed by Cronbach's α = 0.884 and 0.874 for the VTDS-Arab frequency and severity subscales, respectively. A moderate positive correlation was found between the VTDS-Arab (frequency, severity, total) and the VHI-Arab total, with Pearson's correlation coefficients of r = 0.459, 0.430 and 0.451, respectively. Weak correlations were found between all of the acoustic measures and the scores of the VTDS-Arab and VHI-Arab (total and subscales). The area under the curve for the VTDS was AUC = 0.824, 0.804 and 0.817 for the VTDS frequency, VTDS severity and VTDS total, respectively. The VTDS-Arab is a valid and reliable tool in measuring vocal tract sensations and predicting the perception of vocal handicap in student singers and can be used to predict the vocal load among professional voice users.
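
    For readers unfamiliar with the internal consistency statistic reported above, a minimal sketch of the standard Cronbach's alpha formula follows. The function name and the respondents-by-items layout of the input are assumptions; no data from the study are reproduced.

```python
import numpy as np

def cronbach_alpha(scores):
    """Cronbach's alpha for a (respondents x items) score matrix."""
    x = np.asarray(scores, dtype=float)
    k = x.shape[1]                                   # number of items
    item_variances = x.var(axis=0, ddof=1).sum()     # sum of per-item variances
    total_variance = x.sum(axis=1).var(ddof=1)       # variance of summed scores
    return k / (k - 1) * (1.0 - item_variances / total_variance)
```

    Alpha approaches 1 when the items rise and fall together across respondents, which is why values such as those reported for the two subscales are read as evidence of reliability.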

  3. Vocal tract length and formant frequency dispersion correlate with body size in rhesus macaques.

    Science.gov (United States)

    Fitch, W T

    1997-08-01

    Body weight, length, and vocal tract length were measured for 23 rhesus macaques (Macaca mulatta) of various sizes using radiographs and computer graphic techniques. Linear predictive coding analysis of tape-recorded threat vocalizations was used to determine vocal tract resonance frequencies ("formants") for the same animals. A new acoustic variable is proposed, "formant dispersion," which should theoretically depend upon vocal tract length. Formant dispersion is the averaged difference between successive formant frequencies, and was found to be closely tied to both vocal tract length and body size. Despite the common claim that voice fundamental frequency (F0) provides an acoustic indication of body size, repeated investigations have failed to support such a relationship in many vertebrate species including humans. Formant dispersion, unlike voice pitch, is proposed to be a reliable predictor of body size in macaques, and probably many other species.
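
    The definition of formant dispersion given above translates directly into a few lines of code. The sketch below also includes a length estimate based on the textbook idealization of the vocal tract as a uniform tube closed at one end, in which successive resonances are spaced by c / (2L); that tube model, the speed-of-sound value, and the function names are assumptions for illustration and are not taken from the record.

```python
def formant_dispersion(formants_hz):
    """Average difference between successive formant frequencies, in Hz."""
    if len(formants_hz) < 2:
        raise ValueError("need at least two formants")
    diffs = [b - a for a, b in zip(formants_hz, formants_hz[1:])]
    return sum(diffs) / len(diffs)

def tube_length_cm(dispersion_hz, speed_of_sound_cm_s=35000.0):
    """Length of an idealised uniform tube (closed at one end) with this spacing."""
    return speed_of_sound_cm_s / (2.0 * dispersion_hz)
```

    Because the differences telescope, the dispersion equals the distance between the highest and lowest formant divided by the number of gaps, so it depends on how many formants are measured.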

  4. Differences between self-assessment and external rating of voice with regard to sex characteristics, age, and attractiveness.

    Science.gov (United States)

    Sandmann, Katja; am Zehnhoff-Dinnesen, Antoinette; Schmidt, Claus-Michael; Rosslau, Ken; Lang-Roth, Ruth; Burgmer, Markus; Knief, Arne; Matulat, Peter; Vauth, Melanie; Deuster, Dirk

    2014-01-01

    This study investigates differences between the self-assessment and external rating of a person's voice with regard to sex characteristics, age, and attractiveness of the voice and mean fundamental frequency (F0). Cross-sectional study. A group of 47 participants with a balanced sex distribution was recruited and the following data were collected: videostroboscopy, voice range profile, F0, self-assessment questionnaire (attractiveness, masculinity or femininity of voice, and appearance), Voice Handicap Index, and questionnaires to determine levels of depression and quality of life. External rating was performed by four experts and four laymen. In both sexes, fair to moderate significant correlations between the self-assessment of masculinity (men)/femininity (women) of voice and masculinity/femininity of appearance could be found, but not between the self-assessment of attractiveness of voice and appearance. In men, a statistically significant correlation was found between external ratings and self-assessment of attractiveness and, with the exception of the female rating group, of masculinity. In women, self-assessment of femininity and attractiveness of voice did not correlate to a statistically significant extent with the evaluation of the external rater. Additionally, the statistical correlation between estimated and real ages was high. Although the objective parameters of age and gender identification could be rated with a high degree of accuracy, subjective parameters showed significant differences between self-assessment and external rating, in particular in rating women's voices. Taking these findings into account in treatments for modifying voice could impede successful interventions. As one consequence, we recommend summarizing target agreements in detail before the treatment. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.

  5. Welding characteristics of 27, 40 and 67 kHz ultrasonic plastic welding systems using fundamental- and higher-resonance frequencies.

    Science.gov (United States)

    Tsujino, Jiromaru; Hongoh, Misugi; Yoshikuni, Masafumi; Hashii, Hidekazu; Ueoka, Tetsugi

    2004-04-01

    The welding characteristics of 27, 40 and 67 kHz ultrasonic plastic welding systems driven only at the fundamental-resonance frequency were compared, and those of welding systems driven at the fundamental and several higher resonance frequencies simultaneously were also studied. At high frequency, welding characteristics can be improved due to the larger vibration loss of plastic materials. For welding of rather thin or small specimens, larger welded area and weld strength were obtained as the fundamental frequency of the welding system became higher and as more higher-resonance frequencies were driven simultaneously.

  6. Perceptual and acoustic outcomes of voice therapy for male-to-female transgender individuals immediately after therapy and 15 months later.

    Science.gov (United States)

    Gelfer, Marylou Pausewang; Tice, Ruthanne M

    2013-05-01

    The present study examined how effectively listeners' perceptions of gender could be changed from male to female for male-to-female (MTF) transgender (TG) clients based on the voice signal alone, immediately after voice therapy and at long-term follow-up. Short- and long-term changes in masculinity and femininity ratings and acoustic measures of speaking fundamental frequency (SFF) and vowel formant frequencies were also investigated. Prospective treatment study. Five MTF TG clients, five control female speakers, and five control male speakers provided a variety of speech samples for later analysis. The TG clients then underwent 8 weeks of voice therapy. Voice samples were collected immediately at the termination of therapy and again 15 months later. Two groups of listeners were recruited to evaluate gender and provide masculinity and femininity ratings. Perceptual results revealed that TG subjects were perceived as female 1.9% of the time in the pretest, 50.8% of the time in the immediate posttest, and 33.1% of the time in the long-term posttest. The TG speakers were also perceived as significantly less masculine and more feminine in the immediate posttest and the long-term posttest compared with the pre-test. Some acoustic measures showed significant differences between the pretest and the immediate posttest and long-term posttest. It appeared that 8 weeks of voice therapy could result in vocal changes in MTF TG individuals that persist at least partially for up to 15 months. However, some TG subjects were more successful with voice feminization than others. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.

  7. Vocal parameters and voice-related quality of life in adult women with and without ovarian function.

    Science.gov (United States)

    Ferraz, Pablo Rodrigo Rocha; Bertoldo, Simão Veras; Costa, Luanne Gabrielle Morais; Serra, Emmeliny Cristini Nogueira; Silva, Eduardo Magalhães; Brito, Luciane Maria Oliveira; Chein, Maria Bethânia da Costa

    2013-05-01

    To identify the perceptual and acoustic parameters of voice in adult women with and without ovarian function and its impact on quality of life related to voice. Cross-sectional and analytical study with 106 women divided into two groups: G1, with ovarian function (n=43) and G2, without physiological ovarian function (n=63). The women were instructed to sustain the vowel "a" and the sounds of /s/ and /z/ in habitual pitch and loudness. They were also asked to classify their voices and answer the voice-related quality of life (V-RQOL) questionnaire. The perceptual analysis of the vocal samples was performed by three speech-language pathologists using the GRBASI (G: grade; R: roughness; B: breathiness; A: asthenia; S: strain; I: instability) scale. The acoustic analysis was carried out with the software VoxMetria 2.7h (CTS Informatica). The data were analyzed using descriptive statistics. In the perceptual analysis, both groups showed a mild deviation for the parameters roughness, strain, and instability, but only G2 showed a mild impact for the overall degree of dysphonia. The mean fundamental frequency was significantly lower for G2, with a difference of 17.41 Hz between the two groups. There was no impact on V-RQOL in any of the V-RQOL domains for this group. With the menopause, there is a change in women's voices, impacting on some voice parameters. However, there is no direct impact on their quality of life related to voice. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.

  8. Contemporary Commercial Music Singing Students-Voice Quality and Vocal Function at the Beginning of Singing Training.

    Science.gov (United States)

    Sielska-Badurek, Ewelina M; Sobol, Maria; Olszowska, Katarzyna; Niemczyk, Kazimierz

    2017-10-03

    The purpose of this study was to assess the voice quality and the vocal tract function in popular singing students at the beginning of their singing training at the High School of Music. This is a retrospective cross-sectional study. The study consisted of 45 popular singing students (35 females and 10 males, mean age: 19.9 ± 2.8 years). They were assessed in the first 2 months of their 4-year singing training at the High School of Music, between 2013 and 2016. Voice quality and vocal tract function were evaluated using videolaryngostroboscopy, palpation of the vocal tract structures, the perceptual speaking and singing voice assessment, acoustic analysis, maximal phonation time, the Voice Handicap Index, and the Singing Voice Handicap Index (SVHI). Twenty-two percent of Contemporary Commercial Music singing students began their education in the High School, with vocal nodules. Palpation of the vocal tract structure showed in 50% correct motions and tension in speaking and in 39.3% in singing. Perceptual voice assessment showed in 80% proper speaking voice quality and in 82.4% proper singing voice quality. The mean vocal fundamental frequency while speaking in females was 214 Hz and in males was 116 Hz. Dysphonia Severity Index was at the level of 2, and maximum phonation time was 17.7 seconds. The Voice Handicap Index and the SVHI remained within the normal range: 7.5 and 19, respectively. Perceptual singing voice assessment correlated with the SVHI (P = 0.006). Twenty-two percent of the Contemporary Commercial Music singing students began their education in the High School, with organic vocal fold lesions. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  9. Measuring positive and negative affect in the voiced sounds of African elephants (Loxodonta africana).

    Science.gov (United States)

    Soltis, Joseph; Blowers, Tracy E; Savage, Anne

    2011-02-01

    As in other mammals, there is evidence that the African elephant voice reflects affect intensity, but it is less clear if positive and negative affective states are differentially reflected in the voice. An acoustic comparison was made between African elephant "rumble" vocalizations produced in negative social contexts (dominance interactions), neutral social contexts (minimal social activity), and positive social contexts (affiliative interactions) by four adult females housed at Disney's Animal Kingdom®. Rumbles produced in the negative social context exhibited higher and more variable fundamental frequencies (F(0)) and amplitudes, longer durations, increased voice roughness, and higher first formant locations (F1), compared to the neutral social context. Rumbles produced in the positive social context exhibited similar shifts in most variables (F(0) variation, amplitude, amplitude variation, duration, and F1), but the magnitude of response was generally less than that observed in the negative context. Voice roughness and F(0) observed in the positive social context remained similar to that observed in the neutral context. These results are most consistent with the vocal expression of affect intensity, in which the negative social context elicited higher intensity levels than the positive context, but differential vocal expression of positive and negative affect cannot be ruled out.

  10. Double Fourier analysis for Emotion Identification in Voiced Speech

    International Nuclear Information System (INIS)

    Sierra-Sosa, D.; Bastidas, M.; Ortiz P, D.; Quintero, O.L.

    2016-01-01

    We propose a novel analysis alternative, based on two Fourier Transforms, for emotion recognition from speech. Fourier analysis allows different signals to be displayed and synthesized in terms of power spectral density distributions. A spectrogram of the voice signal is obtained by performing a short-time Fourier Transform with Gaussian windows; this spectrogram portrays frequency-related features, such as vocal tract resonances and quasi-periodic excitations during voiced sounds. Emotions induce such characteristics in speech, which become apparent in spectrogram time-frequency distributions. Later, the signal's time-frequency representation from the spectrogram is considered an image and processed through a 2-dimensional Fourier Transform in order to perform spatial Fourier analysis on it. Finally, features related to emotions in voiced speech are extracted and presented. (paper)
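
    The two-stage transform described above can be sketched with NumPy alone. The frame length, hop size, Gaussian width, and the log compression applied before the second transform are all assumptions chosen for illustration; the record does not state the authors' settings, and the function names are hypothetical.

```python
import numpy as np

def gaussian_stft_magnitude(signal, frame_len=256, hop=64, sigma=32.0):
    """Magnitude spectrogram from a short-time Fourier transform with a Gaussian window."""
    signal = np.asarray(signal, dtype=float)
    n = np.arange(frame_len) - (frame_len - 1) / 2.0
    window = np.exp(-0.5 * (n / sigma) ** 2)
    starts = range(0, len(signal) - frame_len + 1, hop)
    frames = np.stack([signal[s:s + frame_len] * window for s in starts])
    return np.abs(np.fft.rfft(frames, axis=1)).T   # shape: (freq bins, time frames)

def double_fourier(signal, **stft_kwargs):
    """Treat the spectrogram as an image and take its 2-D Fourier magnitude."""
    spec = gaussian_stft_magnitude(signal, **stft_kwargs)
    image = np.log1p(spec)                         # assumed compression step
    image = image - image.mean()                   # remove the DC offset
    return np.abs(np.fft.fftshift(np.fft.fft2(image)))
```

    In the second transform, regular harmonic spacing along the frequency axis and rhythmic modulation along the time axis both show up as concentrated energy away from the centre, which is the kind of structure from which features could be extracted.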

  11. Familiar Person Recognition: Is Autonoetic Consciousness More Likely to Accompany Face Recognition Than Voice Recognition?

    Science.gov (United States)

    Barsics, Catherine; Brédart, Serge

    2010-11-01

    Autonoetic consciousness is a fundamental property of human memory, enabling us to experience mental time travel, to recollect past events with a feeling of self-involvement, and to project ourselves in the future. Autonoetic consciousness is a characteristic of episodic memory. By contrast, awareness of the past associated with a mere feeling of familiarity or knowing relies on noetic consciousness, depending on semantic memory integrity. Present research was aimed at evaluating whether conscious recollection of episodic memories is more likely to occur following the recognition of a familiar face than following the recognition of a familiar voice. Recall of semantic information (biographical information) was also assessed. Previous studies that investigated the recall of biographical information following person recognition used faces and voices of famous people as stimuli. In this study, the participants were presented with personally familiar people's voices and faces, thus avoiding the presence of identity cues in the spoken extracts and allowing a stricter control of frequency exposure with both types of stimuli (voices and faces). In the present study, the rate of retrieved episodic memories, associated with autonoetic awareness, was significantly higher from familiar faces than familiar voices even though the level of overall recognition was similar for both these stimuli domains. The same pattern was observed regarding semantic information retrieval. These results and their implications for current Interactive Activation and Competition person recognition models are discussed.

  12. Influence of Traffic Vehicles Against Ground Fundamental Frequency Prediction using Ambient Vibration Technique

    Science.gov (United States)

    Kamarudin, A. F.; Noh, M. S. Md; Mokhatar, S. N.; Anuar, M. A. Mohd; Ibrahim, A.; Ibrahim, Z.; Daud, M. E.

    2018-04-01

    The ambient vibration (AV) technique is widely used nowadays for ground fundamental frequency prediction. The technique is easy, quick and non-destructive, requires few operators, and gives reliable results. The input motions of ambient vibration are collected from surrounding natural and artificial excitations. However, carefully controlled data acquisition must be implemented to reduce the intrusion of short-period noise that could affect the quality of the frequency prediction at an investigated site. In this study, the effect of primary noise intrusion under peak (morning, afternoon and evening) and off-peak (early morning) traffic flows (only 8 meters from sensor to road shoulder) on the stability and quality of ground fundamental frequency prediction was investigated. No specific standard is available for AV data acquisition and processing. Thus, some field and processing parameters recommended by previous studies and guidelines were considered. Two 1 Hz tri-axial seismometer sensors were positioned close together in front of the main entrance of Universiti Tun Hussein Onn Malaysia. Recordings of 15 minutes were taken during peak and off-peak periods of traffic flow. All passing vehicles were counted and grouped into four classes. The three components of the ambient vibration time series, recorded in the North-South (NS), East-West (EW) and vertical (UD) directions, were automatically computed into the Horizontal to Vertical Spectral Ratio (HVSR) using the open source software GEOPSY to determine the fundamental ground frequency, Fo. Single sharp peak patterns of the HVSR curves were obtained at peak frequencies between 1.33 and 1.38 Hz, which fall under the soft to dense soil classification. Although identical HVSR curve patterns with close frequency predictions were obtained under both periods of AV measurement, the total numbers of stable, quality windows selected for HVSR computation were significantly different, but both satisfied the requirement
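
    The spectral ratio at the heart of the HVSR method can be sketched for a single time window as follows. This is a simplified illustration, not a description of what GEOPSY does internally: it assumes a Hann taper, combines the two horizontal spectra by a squared average (one common convention), and omits the window selection, spectral smoothing, and averaging over many windows that a real analysis relies on. The function name is hypothetical.

```python
import numpy as np

def hvsr_single_window(ns, ew, ud, fs):
    """Horizontal-to-vertical spectral ratio for one window of three-component data."""
    ns, ew, ud = (np.asarray(c, dtype=float) for c in (ns, ew, ud))
    taper = np.hanning(len(ud))
    freqs = np.fft.rfftfreq(len(ud), d=1.0 / fs)
    amp_ns, amp_ew, amp_ud = (np.abs(np.fft.rfft(c * taper)) for c in (ns, ew, ud))
    # assumed convention: squared average of the two horizontal amplitude spectra
    horizontal = np.sqrt((amp_ns ** 2 + amp_ew ** 2) / 2.0)
    return freqs, horizontal / np.maximum(amp_ud, 1e-12)
```

    The frequency at which the averaged ratio peaks is then read off as the fundamental frequency estimate, which in the study above fell between 1.33 and 1.38 Hz.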

  13. Voice acoustic patterns of patients diagnosed with vibroacoustic disease

    Directory of Open Access Journals (Sweden)

    Ana Mendes

    2006-07-01

    Background: Long-term exposure to low frequency noise (LFN) (≤ 500 Hz, including infrasound) may lead to the development of vibroacoustic disease (VAD), a systemic pathology characterized by the abnormal growth of extra-cellular matrices. The respiratory system is a target for LFN. Fibrosis of the respiratory tract epithelia was observed in VAD patients through biopsy, and confirmed in animal models exposed to LFN. Voice acoustic analysis can detect vocal fold variations of mass, tension, muscular and neural activity. Frequency perturbation (jitter), amplitude perturbation (shimmer) and harmonic-to-noise ratio (HNR) are used in the evaluation of the vocal function, and can be indicators of the presence and degree of severity of vocal pathology. Since the respiratory system is the energy source of the phonation process, this raises questions about the effects of VAD on voice production. The purpose of this study was to determine if voice acoustic parameters of VAD patients are different from normative data. Methods: Nine individuals (5 males and 4 females) diagnosed with VAD were recorded performing spoken and sung tasks. The spoken tasks included sustaining vowels and fricatives. The sung tasks consisted of maximum phonational frequency range (MPFR). Voice acoustic parameters analysed were: fundamental frequency (F0), jitter, shimmer, HNR and temporal measures. Results: Compared with normative data, both males and females diagnosed with VAD exhibited increased F0, shimmer and HNR. Jitter, MPFR and one temporal measure were reduced. Conclusions: VAD individuals presented voice acoustic parameter differences in spectral, temporal and perturbation measures, which may be indicative of small morphological changes in the phonatory system.
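
    Jitter and shimmer, as used above, are cycle-to-cycle perturbation measures. A minimal sketch of one common "local" definition is given below: the mean absolute difference between consecutive cycles, expressed as a percentage of the mean. Analysis packages differ in the exact variant they report, so this formula and the function name are assumptions rather than the definition used in the study.

```python
import numpy as np

def local_perturbation_percent(values):
    """Mean absolute cycle-to-cycle difference as a percentage of the mean value.

    Pass glottal cycle periods to obtain jitter (%), or cycle peak
    amplitudes to obtain shimmer (%).
    """
    v = np.asarray(values, dtype=float)
    if v.size < 2:
        raise ValueError("need at least two cycles")
    return 100.0 * np.mean(np.abs(np.diff(v))) / np.mean(v)
```

    A perfectly periodic voice gives 0% on both measures; irregular vocal fold vibration raises them.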

  14. Electroglottographic analysis of actresses and nonactresses' voices in different levels of intensity.

    Science.gov (United States)

    Master, Suely; Guzman, Marco; Carlos de Miranda, Helder; Lloyd, Adam

    2013-03-01

    Previous studies with long-term average spectrum (LTAS) showed the importance of the glottal source for understanding the projected voices of actresses. In this study, electroglottographic (EGG) analysis was used to investigate the contribution of the glottal source to the projected voice, comparing actresses and nonactresses' voices, in different levels of intensity. Thirty actresses and 30 nonactresses sustained vowels in habitual, moderate, and loud intensity levels. The EGG variables were contact quotient (CQ), closing quotient (QCQ), and opening quotient (QOQ). Other variables were sound pressure level (SPL) and fundamental frequency (F0). A KayPENTAX EGG was used. Variables were inputted in a general linear model. Actresses showed significantly higher values for SPL, in all levels, and both groups increased SPL significantly while changing from habitual to moderate and further to loud. There were no significant differences between groups for EGG quotients. There were significant differences between the levels only for F0 and CQ for both groups. SPL was significantly higher among actresses in all intensity levels, but in the EGG analysis, no differences were found. This apparently weak contribution of the glottal source in the supposedly projected voices of actresses, contrary to previous LTAS studies, might be because of a higher subglottal pressure or perhaps greater vocal tract contribution in SPL. Results from the present study suggest that trained subjects did not produce a significantly higher SPL than untrained individuals by increasing the cost in terms of higher vocal fold collision and hence more impact stress. Future research should explore the difference between trained and nontrained voices by aerodynamic measurements to evaluate the relationship between physiologic findings and the acoustic and EGG data. Moreover, further studies should consider both types of vocal tasks, sustained vowel and running speech, for both EGG and LTAS analysis

  15. Changes in speaking fundamental frequency characteristics with aging.

    Science.gov (United States)

    Nishio, Masaki; Niimi, Seiji

    2008-01-01

    Changes in speaking fundamental frequency (SFF) associated with aging were studied in a total of 374 healthy normal speakers (187 males and 187 females) from adolescent to older age groups. Participants were asked to read a sample passage aloud, and acoustic analysis was performed. The main results were as follows: (1) Males exhibited no significant trend for SFF changes in aging. However, a slight increase was observed in participants aged 70 years or older. (2) Females in their 30s and 40s showed obviously lower frequencies than those in their 20s. Across all age groups, including the 80s, SFF tended to decrease markedly in association with aging. (3) The degree of SFF change in association with aging was much larger in females than in males. In addition, reference intervals (mean +/- 1.96 SD) obtained for males and females in each age group are considered useful for clinical detection of abnormalities of SFF, as well as for detection of laryngeal diseases causing SFF abnormality. 2008 S. Karger AG, Basel.
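
    The reference interval mentioned in this abstract (mean +/- 1.96 SD) can be sketched as follows. This is a minimal illustration only: the function names and the SFF values are hypothetical and are not taken from the study, and the sample standard deviation is assumed.

```python
from statistics import mean, stdev


def reference_interval(values, z=1.96):
    """Return (lower, upper) = mean -/+ z * sample standard deviation."""
    m = mean(values)
    sd = stdev(values)  # sample SD (n - 1 denominator); an assumption
    return m - z * sd, m + z * sd


def is_outside(value, interval):
    """True if a measured value falls outside the reference interval."""
    lower, upper = interval
    return value < lower or value > upper


# Hypothetical SFF values in Hz for one age group (illustrative only).
sff_hz = [110.0, 118.0, 121.0, 125.0, 130.0, 116.0]
interval = reference_interval(sff_hz)
print(interval, is_outside(200.0, interval))
```

    A value outside the interval would only flag a speaker for closer examination; the abstract does not describe any further decision rule.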

  16. Discrimination of fundamental frequency of synthesized vowel sounds in a noise background

    NARCIS (Netherlands)

    Scheffers, M.T.M.

    1984-01-01

    An experiment was carried out, investigating the relationship between the just noticeable difference of fundamental frequency (jndf0) of three stationary synthesized vowel sounds in noise and the signal-to-noise ratio. To this end the S/N ratios were measured at which listeners could just

  17. Voice recognition through phonetic features with Punjabi utterances

    Science.gov (United States)

    Kaur, Jasdeep; Juglan, K. C.; Sharma, Vishal; Upadhyay, R. K.

    2017-07-01

    This paper deals with the perception and disorders of speech in the Punjabi language. In view of the importance of voice identification, various parameters of speaker identification have been studied. The speech material was recorded with a tape recorder in the speakers' normal and disguised modes of utterance. From the recorded speech material, utterances free from noise etc. were selected for auditory and acoustic spectrographic analysis. The comparison of normal and disguised speech of seven subjects is reported. The fundamental frequency (F0) at similar places, plosive duration at certain phonemes, amplitude ratio (A1:A2), etc. were compared in normal and disguised speech. It was found that the formant frequency of normal and disguised speech remains almost the same only if it is compared at the position of the same vowel quality and quantity. If the vowel is more closed or more open in the disguised utterance, the formant frequency changes in comparison to the normal utterance. The ratio of the amplitudes (A1:A2) is found to be speaker dependent. It remains unchanged in the disguised utterance. However, this value may shift in the disguised utterance if cross-sectioning is not done at the same location.

  18. High-frequency energy in singing and speech

    Science.gov (United States)

    Monson, Brian Bruce

    While human speech and the human voice generate acoustical energy up to (and beyond) 20 kHz, the energy above approximately 5 kHz has been largely neglected. Evidence is accruing that this high-frequency energy contains perceptual information relevant to speech and voice, including percepts of quality, localization, and intelligibility. The present research was an initial step in the long-range goal of characterizing high-frequency energy in singing voice and speech, with particular regard for its perceptual role and its potential for modification during voice and speech production. In this study, a database of high-fidelity recordings of talkers was created and used for a broad acoustical analysis and general characterization of high-frequency energy, as well as specific characterization of phoneme category, voice and speech intensity level, and mode of production (speech versus singing) by high-frequency energy content. Directionality of radiation of high-frequency energy from the mouth was also examined. The recordings were used for perceptual experiments wherein listeners were asked to discriminate between speech and voice samples that differed only in high-frequency energy content. Listeners were also subjected to gender discrimination tasks, mode-of-production discrimination tasks, and transcription tasks with samples of speech and singing that contained only high-frequency content. The combination of these experiments has revealed that (1) human listeners are able to detect very subtle level changes in high-frequency energy, and (2) human listeners are able to extract significant perceptual information from high-frequency energy.

  19. Voice Range Profiles of Middle School and High School Choral Directors

    Science.gov (United States)

    Schwartz, Sandra M.

    2009-01-01

    Vocal demands of teaching are significant, and this challenge is compounded for choral directors who depend on the voice for communicating information or demonstrating music concepts. The purpose of this study is to examine the frequency and intensity of middle and high school choral directors' voices and to compare choral directors' voices with…

  20. Trends in Singing Voice Research: An Innovative Approach.

    Science.gov (United States)

    Pestana, Pedro Melo; Vaz-Freitas, Susana; Manso, Maria Conceição

    2018-01-11

    The objectives of this study were to trace and describe research patterns in singing voice, to compare the amount of published research over time, to identify journals that published most papers on "singing voice," and to establish the most frequent research topics. The study uses qualitative and quantitative approaches through descriptive statistics, text mining, and clustering. The authors conducted a search to identify scientific papers. The titles and abstracts were analyzed regarding word frequency and relations between them, through hierarchical cluster analysis and co-occurrence networks. The frequency of journals was calculated, as well as the amount of papers across time. Since 1949, 754 papers were published and an increase was noticed. Even though 162 journals were identified by the authors, the Journal of Voice holds the majority of papers, in every analyzed period. An evolution of studied topics is described. Up to 2010, the main theme was professional singers, especially classical and opera interpreters. Since then, voice quality and the effects of training gathered more attention. The growing interest in singing has been conspicuous since the first indexed paper. However, it has been slightly slowing down. Until 2010, great importance was given to the voice quality of singers and their occupational demands. Acoustic analysis was widely used to study the effects of training. Since 2010, the concern with functionality is increasing, rather than the organic voice structures. Musical perception studies have been a trend, as well as the use of electroglottography. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  1. Joint DOA and Fundamental Frequency Estimation Methods based on 2-D Filtering

    DEFF Research Database (Denmark)

    Jensen, Jesper Rindom; Christensen, Mads Græsbøll; Jensen, Søren Holdt

    2010-01-01

    of the fundamental frequency and the DOA of spatio-temporarily sampled periodic signals. The first and simplest method is based on the 2-D periodogram, whereas the second method is a generalization of the 2-D Capon method. In the experimental part, both qualitative and quantitative measurements show that the proposed...

  2. What makes a voice masculine: physiological and acoustical correlates of women's ratings of men's vocal masculinity.

    Science.gov (United States)

    Cartei, Valentina; Bond, Rod; Reby, David

    2014-09-01

    Men's voices contain acoustic cues to body size and hormonal status, which have been found to affect women's ratings of speaker size, masculinity and attractiveness. However, the extent to which these voice parameters mediate the relationship between speakers' fitness-related features and listener's judgments of their masculinity has not yet been investigated. We audio-recorded 37 adult heterosexual males performing a range of speech tasks and asked 20 adult heterosexual female listeners to rate speakers' masculinity on the basis of their voices only. We then used a two-level (speaker within listener) path analysis to examine the relationships between the physiological (testosterone, height), acoustic (fundamental frequency or F0, and resonances or ΔF) and perceptual dimensions (listeners' ratings) of speakers' masculinity. Overall, results revealed that male speakers who were taller and had higher salivary testosterone levels also had lower F0 and ΔF, and were in turn rated as more masculine. The relationship between testosterone and perceived masculinity was essentially mediated by F0, while that of height and perceived masculinity was partially mediated by both F0 and ΔF. These observations confirm that women listeners attend to sexually dimorphic voice cues to assess the masculinity of unseen male speakers. In turn, variation in these voice features correlate with speakers' variation in stature and hormonal status, highlighting the interdependence of these physiological, acoustic and perceptual dimensions. Copyright © 2014. Published by Elsevier Inc.

  3. The eye-voice span during reading aloud

    Directory of Open Access Journals (Sweden)

    Jochen Laubrock

    2015-09-01

    Although eye movements during reading are modulated by cognitive processing demands, they also reflect visual sampling of the input, and possibly preparation of output for speech or the inner voice. By simultaneously recording eye movements and the voice during reading aloud, we obtained an output measure that constrains the length of time spent on cognitive processing. Here we investigate the dynamics of the eye-voice span (EVS), the distance between eye and voice. We show that the EVS is regulated immediately during fixation of a word by either increasing fixation duration or programming a regressive eye movement against the reading direction. EVS size at the beginning of a fixation was positively correlated with the likelihood of regressions and refixations. Regression probability was further increased if the EVS was still large at the end of a fixation: if adjustment of fixation duration did not sufficiently reduce the EVS during a fixation, then a regression rather than a refixation followed with high probability. We further show that the EVS can help understand cognitive influences on fixation duration during reading: in mixed model analyses, the EVS was a stronger predictor of fixation durations than either word frequency or word length. The EVS modulated the influence of several other predictors on single fixation durations. For example, word-N frequency effects were larger with a large EVS, especially when word N-1 frequency was low. Finally, a comparison of single fixation durations during oral and silent reading showed that reading is governed by similar principles in both reading modes, although EVS maintenance and articulatory processing also cause some differences. In summary, the eye-voice span is regulated by adjusting fixation duration and/or by programming a regressive eye movement when the eye-voice span gets too large. Overall, the EVS appears to be directly related to updating of the working memory buffer during reading.

  4. Perceptual adaptation of voice gender discrimination with spectrally shifted vowels.

    Science.gov (United States)

    Li, Tianhao; Fu, Qian-Jie

    2011-08-01

    To determine whether perceptual adaptation improves voice gender discrimination of spectrally shifted vowels and, if so, which acoustic cues contribute to the improvement. Voice gender discrimination was measured for 10 normal-hearing subjects, during 5 days of adaptation to spectrally shifted vowels, produced by processing the speech of 5 male and 5 female talkers with 16-channel sine-wave vocoders. The subjects were randomly divided into 2 groups; one subjected to 50-Hz, and the other to 200-Hz, temporal envelope cutoff frequencies. No preview or feedback was provided. There was significant adaptation in voice gender discrimination with the 200-Hz cutoff frequency, but significant improvement was observed only for 3 female talkers with F(0) > 180 Hz and 3 male talkers with F(0) gender discrimination under spectral shift conditions with perceptual adaptation, but spectral shift may limit the exclusive use of spectral information and/or the use of formant structure on voice gender discrimination. The results have implications for cochlear implant users and for understanding voice gender discrimination.

  5. Speech enhancement via Mel-scale Wiener filtering with a frequency-wise voice activity detector

    International Nuclear Information System (INIS)

    Kim, Han Jun; Kim, Hwa Soo; Cho, Young Man

    2007-01-01

    This paper presents a speech enhancement system that enables comfortable communication inside an automobile. Two novel concepts are proposed in an effort to improve two major building blocks of existing speech enhancement systems: a voice activity detector (VAD) and a noise filtering algorithm. The proposed VAD classifies a given data frame as speech or noise at each frequency, enabling frequency-wise updates of the noise statistics and thereby improving the effectiveness of the noise filtering algorithm by providing more up-to-date noise statistics. The well-known Wiener filter is adopted in this paper as the accompanying noise filtering algorithm, which results in significant noise suppression. Yet the musical noise present in most Wiener filter-based systems prompts the idea of applying the Wiener filter on the Mel scale, on which the human auditory system responds to external stimulation. It turns out that the Mel-scale Wiener filter creates some masking effects and thereby reduces musical noise significantly, leading to a smooth transition between data frames.
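
    The two building blocks named in this abstract can be illustrated with a minimal sketch. It assumes a textbook per-bin Wiener gain based on power subtraction and a recursive noise update with smoothing factor `alpha`; both function names and all parameter choices are hypothetical, and neither the paper's Mel-scale formulation nor its actual VAD is reproduced here.

```python
def wiener_gain(noisy_power, noise_power, floor=0.0):
    """Per-bin Wiener-style gain: estimated speech power / noisy power.

    Speech power is estimated by power subtraction (an assumption).
    """
    gains = []
    for p_y, p_n in zip(noisy_power, noise_power):
        if p_y <= 0.0:
            gains.append(floor)
            continue
        speech_est = max(p_y - p_n, 0.0)
        gains.append(max(speech_est / p_y, floor))
    return gains


def update_noise(noise_power, noisy_power, is_speech, alpha=0.9):
    """Frequency-wise noise update: only bins flagged as noise are updated."""
    return [
        p_n if speech else alpha * p_n + (1.0 - alpha) * p_y
        for p_n, p_y, speech in zip(noise_power, noisy_power, is_speech)
    ]


# Hypothetical three-bin power spectra (illustrative values only).
print(wiener_gain([4.0, 1.0, 0.0], [1.0, 2.0, 1.0]))
print(update_noise([1.0, 1.0], [3.0, 3.0], [True, False], alpha=0.5))
```

    Updating the noise estimate bin by bin lets bins without speech track the noise even while other bins in the same frame contain speech, which is the benefit the abstract attributes to the frequency-wise VAD.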

  6. Effects of a three-week vocal exercise program using the Finnish Kuukka exercises on the speaking voice of Norwegian broadcast journalism students.

    Science.gov (United States)

    Bele, Irene; Laukkanen, Anne-Maria; Sipilä, Laura

    2010-12-01

    Nine broadcast journalism students attended 10 hours of training in Kuukka vocal exercises, which aim at producing a ringing vocal quality. Nine control subjects received no training. A text was read at habitual loudness before and after the course. Five speech specialists evaluated the text samples for perceptual voice quality and analyzed mean fundamental frequency (F0), equivalent sound level (Leq), and long-term average spectrum (LTAS). For the Training Group, voice quality improved and correlated negatively with firmness and timbre (less firm and darker qualities being considered more desirable), and F0 increased slightly. Leq increased significantly in both groups. The results show positive and perceivable differences after the course. However, the intended ring was not reached, possibly because the training period was too short.

  7. EXPERIMENTAL STUDY OF FIRMWARE FOR INPUT AND EXTRACTION OF USER’S VOICE SIGNAL IN VOICE AUTHENTICATION SYSTEMS

    Directory of Open Access Journals (Sweden)

    O. N. Faizulaieva

    2014-09-01

    The scientific task of improving the signal-to-noise ratio of the user’s voice signal in computer systems and networks during voice authentication is considered. The object of study is the process of input and extraction of the voice signal of an authentication system user in computer systems and networks. Methods and means for input and extraction of the voice signal against a background of external interference signals are investigated. Ways of improving the quality of the user’s voice signal in voice authentication systems are investigated experimentally. Firmware for an experimental unit for input and extraction of the user’s voice signal under external interference is considered. Because modern computing devices, including mobile ones, have a two-channel audio card, two microphones are used for voice signal input. The distance between the sound-wave sensors is 20 mm, which forms a single directivity-pattern lobe of the microphone array in the desired region of voice signal registration (from 100 Hz to 8 kHz). According to the results of the experimental studies, using the directional properties of the proposed microphone array and space-time processing of the recorded signals with constant and adaptive weighting factors made it possible to reduce considerably the influence of interference signals. The results of the experimental firmware studies for input and extraction of the user’s voice signal under external interference are shown. The proposed solutions make it possible to improve the signal-to-noise ratio of the recorded useful signals up to 20 dB under the influence of external interference signals in the frequency range from 4 to 8 kHz. The results may be useful to specialists working in the field of voice recognition and speaker discrimination.
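
    The 20 mm spacing and the 8 kHz upper limit quoted in this abstract can be related through the usual half-wavelength rule for a two-element array. The sketch below is an illustration under stated assumptions: a speed of sound of 343 m/s is assumed, the function names are hypothetical, and the abstract itself does not say that this rule motivated the spacing.

```python
import math

SPEED_OF_SOUND_M_S = 343.0  # assumed value for air at about 20 degrees C


def max_unaliased_frequency(spacing_m, c=SPEED_OF_SOUND_M_S):
    """Highest frequency whose half wavelength is at least the spacing."""
    return c / (2.0 * spacing_m)


def inter_mic_delay(spacing_m, angle_deg, c=SPEED_OF_SOUND_M_S):
    """Arrival-time difference (s) for a plane wave at angle_deg from broadside."""
    return spacing_m * math.sin(math.radians(angle_deg)) / c


spacing = 0.020  # 20 mm, as stated in the abstract
print(max_unaliased_frequency(spacing))  # about 8.6 kHz
print(inter_mic_delay(spacing, 90.0))    # largest possible delay, in seconds
```

    Under these assumptions the limit lies slightly above 8 kHz, so a single main lobe over the 100 Hz to 8 kHz range is plausible for this spacing.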

  8. Effects of Parkinson's Disease on Fundamental Frequency Variability in Running Speech.

    Science.gov (United States)

    Bowen, Leah K; Hands, Gabrielle L; Pradhan, Sujata; Stepp, Cara E

    2013-09-01

    In Parkinson's Disease (PD), qualitative speech changes such as decreased variation in pitch and loudness are common, but quantitative vocal changes are not well documented. The variability of fundamental frequency (F0) in 32 individuals (23 male) with PD both ON and OFF levodopa medication was compared with 32 age-matched healthy controls (23 male). Participants read a single paragraph and estimates of fundamental frequency (F0) variability were determined for the entire reading passage as well as for the first and last sentences of the passage separately. F0 variability was significantly increased in controls relative to both PD groups and PD patients showed significantly higher F0 variability while ON medication relative to OFF. No significant effect of group was seen in the change in F0 variability from the beginning to the end of the reading passage. Female speakers were found to have higher F0 variability than males. F0 variability was both significantly reduced in PD relative to controls and significantly increased in patients with PD during use of dopaminergic medications. F0 variability changes over the course of reading a paragraph may not be indicative of PD but rather dependent on non-disease factors such as the linguistic characteristics of the text.
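
    The abstract does not state which estimate of F0 variability was used. As one common possibility, the sketch below computes the standard deviation of an F0 contour in semitones relative to the speaker's mean F0; the function name, the convention that 0 Hz marks an unvoiced frame, and the example contours are all assumptions for illustration.

```python
import math
from statistics import mean, stdev


def f0_sd_semitones(f0_hz):
    """Sample SD of an F0 contour in semitones re: the mean voiced F0.

    Frames with F0 <= 0 are treated as unvoiced and ignored (assumption).
    """
    voiced = [f for f in f0_hz if f > 0.0]
    ref = mean(voiced)
    semitones = [12.0 * math.log2(f / ref) for f in voiced]
    return stdev(semitones)


# Hypothetical contours in Hz (illustrative only, not study data).
flat = [120.0, 121.0, 0.0, 119.0, 120.0]
lively = [100.0, 140.0, 0.0, 110.0, 160.0]
print(f0_sd_semitones(flat), f0_sd_semitones(lively))
```

    Expressing variability in semitones rather than Hz makes speakers with different mean F0, such as the male and female participants, more directly comparable.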

  9. Fast LCMV-based Methods for Fundamental Frequency Estimation

    DEFF Research Database (Denmark)

    Jensen, Jesper Rindom; Glentis, George-Othon; Christensen, Mads Græsbøll

    2013-01-01

    peaks and require matrix inversions for each point in the search grid. In this paper, we therefore consider fast implementations of LCMV-based fundamental frequency estimators, exploiting the estimators' inherently low displacement rank of the used Toeplitz-like data covariance matrices, using...... with several orders of magnitude, but, as we show, further computational savings can be obtained by the adoption of an approximative IAA-based data covariance matrix estimator, reminiscent of the recently proposed Quasi-Newton IAA technique. Furthermore, it is shown how the considered pitch estimators can...... as such either the classic time domain averaging covariance matrix estimator, or, if aiming for an increased spectral resolution, the covariance matrix resulting from the application of the recent iterative adaptive approach (IAA). The proposed exact implementations reduce the required computational complexity...

  10. An exploratory study of voice change associated with healthy speakers after transcutaneous electrical stimulation to laryngeal muscles.

    Science.gov (United States)

    Fowler, Linda P; Gorham-Rowan, Mary; Hapner, Edie R

    2011-01-01

    The purpose of this study was to determine if measurable changes in fundamental frequency (F(0)) and relative sound level (RSL) occurred in healthy speakers after transcutaneous electrical stimulation (TES) as applied via VitalStim (Chattanooga Group, Chattanooga, TN). A prospective, repeated-measures design. Ten healthy female and 10 healthy male speakers, 20-53 years of age, participated in the study. All participants were nonsmokers and reported negative history for voice disorders. Participants received 1 hour of TES while engaged in eating, drinking, and conversation to simulate a typical dysphagia therapy protocol. Voice recordings were obtained before and immediately after TES. The voice samples consisted of a sustained vowel task and reading of the Rainbow Passage. Measurements of F(0) and RSL were obtained using TF32 (Milenkovic, 2005, University of Wisconsin). The participants also reported any sensations 5 minutes and 24 hours after TES. Measurable changes in F(0) and RSL were found for both tasks but were variable in direction and magnitude. These changes were not statistically significant. Subjective comments ranged from reports of a vocal warm-up feeling to delayed onset muscle soreness. These findings demonstrate that application of TES produces measurable changes in F(0) and RSL. However, the direction and magnitude of these changes are highly variable. Further research is needed to determine factors that may affect the extent to which TES contributes to significant changes in voice. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.

  11. Detection of Pathological Voice Using Cepstrum Vectors: A Deep Learning Approach.

    Science.gov (United States)

    Fang, Shih-Hau; Tsao, Yu; Hsiao, Min-Jing; Chen, Ji-Ying; Lai, Ying-Hui; Lin, Feng-Chuan; Wang, Chi-Te

    2018-03-19

    Computerized detection of voice disorders has attracted considerable academic and clinical interest in the hope of providing an effective screening method for voice diseases before endoscopic confirmation. This study proposes a deep-learning-based approach to detect pathological voice and examines its performance and utility compared with other automatic classification algorithms. This study retrospectively collected 60 normal voice samples and 402 pathological voice samples of 8 common clinical voice disorders in a voice clinic of a tertiary teaching hospital. We extracted Mel frequency cepstral coefficients from 3-second samples of a sustained vowel. The performances of three machine learning algorithms, namely, deep neural network (DNN), support vector machine, and Gaussian mixture model, were evaluated based on a fivefold cross-validation. Collective cases from the voice disorder database of MEEI (Massachusetts Eye and Ear Infirmary) were used to verify the performance of the classification mechanisms. The experimental results demonstrated that DNN outperforms Gaussian mixture model and support vector machine. Its accuracy in detecting voice pathologies reached 94.26% and 90.52% in male and female subjects, based on three representative Mel frequency cepstral coefficient features. When applied to the MEEI database for validation, the DNN also achieved a higher accuracy (99.32%) than the other two classification algorithms. By stacking several layers of neurons with optimized weights, the proposed DNN algorithm can fully utilize the acoustic features and efficiently differentiate between normal and pathological voice samples. Based on this pilot study, future research may proceed to explore more application of DNN from laboratory and clinical perspectives. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  12. Connections between voice ergonomic risk factors and voice symptoms, voice handicap, and respiratory tract diseases.

    Science.gov (United States)

    Rantala, Leena M; Hakala, Suvi J; Holmqvist, Sofia; Sala, Eeva

    2012-11-01

    The aim of the study was to investigate the connections between voice ergonomic risk factors found in classrooms and voice-related problems in teachers. Voice ergonomic assessment was performed in 39 classrooms in 14 elementary schools by means of a Voice Ergonomic Assessment in Work Environment--Handbook and Checklist. The voice ergonomic risk factors assessed included working culture, noise, indoor air quality, working posture, stress, and access to a sound amplifier. Teachers from the above-mentioned classrooms reported their voice symptoms, respiratory tract diseases, and completed a Voice Handicap Index (VHI). The more voice ergonomic risk factors found in the classroom the higher were the teachers' total scores on voice symptoms and VHI. Stress was the factor that correlated most strongly with voice symptoms. Poor indoor air quality increased the occurrence of laryngitis. Voice ergonomics were poor in the classrooms studied and voice ergonomic risk factors affected the voice. It is important to convey information on voice ergonomics to education administrators and those responsible for school planning and taking care of school buildings. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.

  13. A study of VHI scores and acoustic features in street vendors as occupational voice users.

    Science.gov (United States)

    Natour, Yaser S; Darawsheh, Wesam B; Bashiti, Sara; Wari, Majd; Taha, Juhayna; Odeh, Thair

    To investigate acoustic features of phonation and perception of voice handicap in street vendors. Eighty-eight participants (44 street vendors, 44 controls) were recruited. The mean age of the group was 38.9±16.0 years (range: 20-78 years). Scores of the Arabic version of the Voice Handicap Index (VHI-Arab) were used for analysis. Acoustic measures of fundamental frequency (F0), jitter, shimmer, and signal-to-noise ratio (SNR) were also analyzed. Analysis showed a significant difference between street vendors and controls in the total score of the VHI-Arab (p<0.001) as well as scores of all three VHI-Arab subsections: functional (p<0.001), physical (p<0.001), and emotional (p=0.025). Weak correlations were found among all of the VHI scores and acoustic measures (-0.219≤ r≤0.355), except for SNR, where moderate negative correlations were found (r=-0.555; -0.4) between the VHI (physical and total) scores and SNR values. Significant differences were also found in F0, jitter, and SNR among specific subgroups of street vendors when stratified by weekly hours worked (p<0.05), and in jitter (p=0.39) when stratified by educational level. Perception of voice handicap and a possible effect on vocal quality in street vendors were noted. The effect of factors, namely work hours and educational level, on voice quality should be further studied. Copyright © 2017. Published by Elsevier Inc.

  14. Protonated Nitrous Oxide, NNOH(+): Fundamental Vibrational Frequencies and Spectroscopic Constants from Quartic Force Fields

    Science.gov (United States)

    Huang, Xinchuan; Fortenberry, Ryan C.; Lee, Timothy J.

    2013-01-01

    The interstellar presence of protonated nitrous oxide has been suspected for some time. Using established high-accuracy quantum chemical techniques, spectroscopic constants and fundamental vibrational frequencies are provided for the lower energy O-protonated isomer of this cation and its deuterated isotopologue. The vibrationally-averaged B0 and C0 rotational constants are within 6 MHz of their experimental values and the D_J quartic distortion constants agree with experiment to within 3%. The known gas phase O-H stretch of NNOH(+) is 3330.91 cm^-1, and the vibrational configuration interaction computed result is 3330.9 cm^-1. Other spectroscopic constants are also provided, as are the rest of the fundamental vibrational frequencies for NNOH(+) and its deuterated isotopologue. This high-accuracy data should serve to better inform future observational or experimental studies of the rovibrational bands of protonated nitrous oxide in the ISM and the laboratory.

  15. "Ring" in the solo child singing voice.

    Science.gov (United States)

    Howard, David M; Williams, Jenevora; Herbst, Christian T

    2014-03-01

    Listeners often describe the voices of solo child singers as being "pure" or "clear"; these terms would suggest that the voice is not only pleasant but also clearly audible. The audibility or clarity could be attributed to the presence of high-frequency partials in the sound: a "brightness" or "ring." This article aims to investigate spectrally the acoustic nature of this ring phenomenon in children's solo voices, and in particular, relating it to their "nonring" production. Additionally, this is set in the context of establishing to what extent, if any, the spectral characteristics of ring are shared with those of the singer's formant cluster associated with professional adult opera singers in the 2.5-3.5 kHz region. A group of child solo singers, acknowledged as outstanding by a singing teacher who specializes in teaching professional child singers, were recorded in a major UK concert hall performing Come unto him, all ye that labour, from the aria He shall feed his flock from The Messiah by GF Handel. Their singing was accompanied by a recording of a piano played through in-ear headphones. Sound pressure recordings were made from well within the critical distance in the hall. The singers were observed to produce notes with and without ring, and these recordings were analyzed in the frequency domain to investigate their spectra. The results indicate that there is evidence to suggest that ring in child solo singers is carried in two areas of the output spectrum: first in the singer's formant cluster region, centered around 4 kHz, which is more than 1000 Hz higher than what is observed in adults; and second in the region around 7.5-11 kHz where a significant strengthening of harmonic presence is observed. A perceptual test has been carried out demonstrating that 94% of 62 listeners label a synthesized version of the calculated overall average ring spectrum for all subjects as having ring when compared with a synthesized version of the calculated overall average nonring

  16. The effectiveness of voice therapy for teachers with dysphonia.

    Science.gov (United States)

    Niebudek-Bogusz, E; Sznurowska-Przygocka, B; Fiszer, M; Kotyło, P; Sinkiewicz, A; Modrzewska, M; Sliwinska-Kowalska, M

    2008-01-01

    An incorrect voice emission is a risk factor for developing occupational voice disorders. The study aimed at assessing the effectiveness of voice therapy in female teachers with dysphonia. The study comprised 133 subjects with voice disorders, taking part in a vocal training programme. A reference group for the present study included 53 teachers with dysphonia. Questionnaire surveys, phoniatric examination and videostroboscopic evaluation were conducted at initial and control examination. In the study group, an improvement after the vocal training was noted in most of the reported symptoms and also in some quantitative parameters of phoniatric examinations compared to the findings for the reference group. The number of patients who assessed their voice as normal increased significantly after the vocal training (2.3 vs. 46.6%). A significant increase in the mean maximum phonation time, from 13.3 to 16.6 s, was observed. The same applied to voice frequency range (increase from 171 to 226.8 Hz). The outcomes of vocal training, such as a subjective improvement of voice quality and an increase in the quantitative parameters (prolonged maximum phonation time, extended voice range) seem to be important parameters for monitoring the effectiveness of training in correct voice emission. 2008 S. Karger AG, Basel.

  17. Nonlinear dynamic mechanism of vocal tremor from voice analysis and model simulations

    Science.gov (United States)

    Zhang, Yu; Jiang, Jack J.

    2008-09-01

    Nonlinear dynamic analysis and model simulations are used to study the nonlinear dynamic characteristics of vocal folds with vocal tremor, which can typically be characterized by low-frequency modulation and aperiodicity. Tremor voices from patients with disorders such as paresis, Parkinson's disease, hyperfunction, and adductor spasmodic dysphonia show low-dimensional characteristics, differing from random noise. Correlation dimension analysis statistically distinguishes tremor voices from normal voices. Furthermore, a nonlinear tremor model is proposed to study the vibrations of the vocal folds with vocal tremor. Fractal dimensions and positive Lyapunov exponents demonstrate the evidence of chaos in the tremor model, where amplitude and frequency play important roles in governing vocal fold dynamics. Nonlinear dynamic voice analysis and vocal fold modeling may provide a useful set of tools for understanding the dynamic mechanism of vocal tremor in patients with laryngeal diseases.

  18. Classification of voice disorder in children with cochlear implantation and hearing aid using multiple classifier fusion

    Directory of Open Access Journals (Sweden)

    Tayarani Hamid

    2011-01-01

    Background: Speech production and speech phonetic features gradually improve in children by obtaining audio feedback after cochlear implantation or using hearing aids. The aim of this study was to develop and evaluate automated classification of voice disorder in children with cochlear implantation and hearing aids. Methods: We considered 4 disorder categories in children's voice using the following definitions: Level_1: Children who produce spontaneous phonation and use words spontaneously and imitatively. Level_2: Children who produce spontaneous phonation, use words spontaneously and make short sentences imitatively. Level_3: Children who produce spontaneous phonations, use words and arbitrary sentences spontaneously. Level_4: Normal children without any hearing loss background. Thirty Persian children participated in the study, including six children in each level from one to three and 12 children in level four. Voice samples of five isolated Persian words "mashin", "mar", "moosh", "gav" and "mouz" were analyzed. Four levels of the voice quality were considered, the higher the level the less significant the speech disorder. "Frame-based" and "word-based" features were extracted from voice signals. The frame-based features include intensity, fundamental frequency, formants, nasality and approximate entropy and word-based features include phase space features and wavelet coefficients. For frame-based features, hidden Markov models were used as classifiers and for word-based features, a neural network was used. Results: After classifier fusion with three methods (Majority Voting Rule, Linear Combination and Stacked fusion), the best classification rates were obtained using frame-based and word-based features with the MVR rule (level 1: 100%, level 2: 93.75%, level 3: 100%, level 4: 94%).
Conclusions: The results of this study may help speech pathologists follow up voice disorder recovery in children with cochlear implantation or hearing aid who are
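
    The Majority Voting Rule named in this abstract can be sketched in a few lines. This is a minimal illustration and not the authors' implementation: the function name `majority_vote`, the three example classifier outputs, and the tie-breaking rule (the first label seen wins) are all assumptions made for the example.

```python
from collections import Counter

def majority_vote(predictions):
    """Return the label chosen by most classifiers.

    Ties are broken in favour of the label that appears first
    in `predictions` (an assumption; the abstract does not say).
    """
    counts = Counter(predictions)
    best = max(counts.values())
    for label in predictions:
        if counts[label] == best:
            return label

# Hypothetical level predictions (1-4) from three classifiers for four samples.
hmm_frame = [1, 2, 3, 4]
nn_phase = [1, 2, 4, 4]
nn_wavelet = [1, 3, 3, 4]

fused = [majority_vote(votes) for votes in zip(hmm_frame, nn_phase, nn_wavelet)]
print(fused)  # -> [1, 2, 3, 4]
```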

  19. Joint Spatio-Temporal Filtering Methods for DOA and Fundamental Frequency Estimation

    DEFF Research Database (Denmark)

    Jensen, Jesper Rindom; Christensen, Mads Græsbøll; Benesty, Jacob

    2015-01-01

    some attention in the community and is quite promising for several applications. The proposed methods are based on optimal, adaptive filters that leave the desired signal, having a certain DOA and fundamental frequency, undistorted and suppress everything else. The filtering methods simultaneously...... operate in space and time, whereby it is possible to resolve cases that are otherwise problematic for pitch estimators or DOA estimators based on beamforming. Several special cases and improvements are considered, including a method for estimating the covariance matrix based on the recently proposed...

  20. Effects of Parkinson’s Disease on Fundamental Frequency Variability in Running Speech

    Science.gov (United States)

    Bowen, Leah K.; Hands, Gabrielle L.; Pradhan, Sujata; Stepp, Cara E.

    2013-01-01

    In Parkinson’s Disease (PD), qualitative speech changes such as decreased variation in pitch and loudness are common, but quantitative vocal changes are not well documented. The variability of fundamental frequency (F0) in 32 individuals (23 male) with PD both ON and OFF levodopa medication was compared with 32 age-matched healthy controls (23 male). Participants read a single paragraph and estimates of fundamental frequency (F0) variability were determined for the entire reading passage as well as for the first and last sentences of the passage separately. F0 variability was significantly increased in controls relative to both PD groups and PD patients showed significantly higher F0 variability while ON medication relative to OFF. No significant effect of group was seen in the change in F0 variability from the beginning to the end of the reading passage. Female speakers were found to have higher F0 variability than males. F0 variability was both significantly reduced in PD relative to controls and significantly increased in patients with PD during use of dopaminergic medications. F0 variability changes over the course of reading a paragraph may not be indicative of PD but rather dependent on non-disease factors such as the linguistic characteristics of the text. PMID:25838754

  1. Protective Strategies Against Dysphonia in Teachers: Preliminary Results Comparing Voice Amplification and 0.9% NaCl Nebulization.

    Science.gov (United States)

    Masson, Maria Lúcia Vaz; de Araújo, Tânia Maria

    2018-03-01

    This study aimed to compare the effects of two protective strategies, voice amplification (VA) and 0.9% NaCl nebulization (NEB), on teachers' voice in the work setting. An interventional evaluator-blind study was conducted, assigning 53 teachers from two public high schools to one of the two protective strategy groups (VA or NEB). Vocal function was assessed in a sound-treated booth before and after a 4-week period. Assessment included the severity of voice impairment (Consensus Auditory-Perceptual Evaluation of Voice [CAPE-V]), acoustic analysis of fundamental frequency (f0), sound pressure level (SPL), jitter, shimmer, glottal-to-noise excitation ratio (GNE), noise (VoxMetria), and the self-rated Screening Index for Voice Disorder (SIVD). Data were statistically analyzed using SPSS Statistics (version 22) with a significance level of P ≤ 0.05. Effect size was calculated using Cohen's d coefficient. There were no statistical differences between groups at baseline in terms of age, sex, time of teaching, teaching workload, and voice outcomes, except for SPL. During postintervention between groups, NEB displayed lower SIVD scores (VA = 3; NEB = 0; P = 0.018) and VA had lower acoustic irregularity (VA = 3.19; NEB = 3.69; P = 0.027), with moderate to large effect size. Postintervention within-groups decreased CAPE-V for VA (pretest = 31.97; posttest = 28.24; P = 0.021) and SIVD for NEB (pretest = 3; posttest = 0; P = 0.001). SPL decreased in both groups, NEB decreased in men only, and VA decreased in both men and women. NEB increased f0 for female participants (P ≤ 0.001). Both VA and NEB may help mitigate dysphonia in different pathways, being potential interventions for protecting teachers' voices in the work setting. An ongoing study with a control group will further support these preliminary results. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
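
    The effect size mentioned here, Cohen's d, is the difference between two group means divided by a standard deviation. The sketch below uses the pooled-standard-deviation form, which is one common convention; the abstract does not say which variant was used, and the function name and sample values are illustrative only.

```python
import math
from statistics import mean, variance

def cohens_d(group_a, group_b):
    """Cohen's d using the pooled sample standard deviation (assumed variant)."""
    n_a, n_b = len(group_a), len(group_b)
    pooled_var = (
        (n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)
    ) / (n_a + n_b - 2)
    return (mean(group_a) - mean(group_b)) / math.sqrt(pooled_var)

# Made-up scores for two small groups, not data from the study.
print(cohens_d([2, 4, 6], [1, 3, 5]))
```

    By the usual rule of thumb, values near 0.5 are read as a moderate effect and values near 0.8 or above as a large one.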

  2. Discriminating male and female voices: differentiating pitch and gender.

    Science.gov (United States)

    Latinus, Marianne; Taylor, Margot J

    2012-04-01

    Gender is salient, socially critical information obtained from faces and voices, yet the brain processes underlying gender discrimination have not been well studied. We investigated neural correlates of gender processing of voices in two ERP studies. In the first, ERP differences were seen between female and male voices starting at 87 ms, in both spatial-temporal and peak analyses, particularly the fronto-central N1 and P2. As pitch differences may drive gender differences, the second study used normal, high- and low-pitch voices. The results of these studies suggested that differences in pitch produced early effects (27-63 ms). Gender effects were seen on N1 (120 ms) with implicit pitch processing (study 1), but were not seen with manipulations of pitch (study 2), demonstrating that N1 was modulated by attention. P2 (between 170 and 230 ms) discriminated male from female voices, independent of pitch. Thus, these data show that there are two stages in voice gender processing; a very early pitch or frequency discrimination and a later more accurate determination of gender at the P2 latency.

  3. Acoustic Measures of Voice and Physiologic Measures of Autonomic Arousal during Speech as a Function of Cognitive Load.

    Science.gov (United States)

    MacPherson, Megan K; Abur, Defne; Stepp, Cara E

    2017-07-01

    This study aimed to determine the relationship among cognitive load condition and measures of autonomic arousal and voice production in healthy adults. A prospective study design was conducted. Sixteen healthy young adults (eight men, eight women) produced a sentence containing an embedded Stroop task in each of two cognitive load conditions: congruent and incongruent. In both conditions, participants said the font color of the color words instead of the word text. In the incongruent condition, font color differed from the word text, creating an increase in cognitive load relative to the congruent condition in which font color and word text matched. Three physiologic measures of autonomic arousal (pulse volume amplitude, pulse period, and skin conductance response amplitude) and four acoustic measures of voice (sound pressure level, fundamental frequency, cepstral peak prominence, and low-to-high spectral energy ratio) were analyzed for eight sentence productions in each cognitive load condition per participant. A logistic regression model was constructed to predict the cognitive load condition (congruent or incongruent) using subject as a categorical predictor and the three autonomic measures and four acoustic measures as continuous predictors. It revealed that skin conductance response amplitude, cepstral peak prominence, and low-to-high spectral energy ratio were significantly associated with cognitive load condition. During speech produced under increased cognitive load, healthy young adults show changes in physiologic markers of heightened autonomic arousal and acoustic measures of voice quality. Future work is necessary to examine these measures in older adults and individuals with voice disorders. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  4. Anti-voice adaptation suggests prototype-based coding of voice identity

    Directory of Open Access Journals (Sweden)

    Marianne Latinus

    2011-07-01

    We used perceptual aftereffects induced by adaptation with anti-voice stimuli to investigate voice identity representations. Participants learned a set of voices then were tested on a voice identification task with vowel stimuli morphed between identities, after different conditions of adaptation. In Experiment 1, participants chose the identity opposite to the adapting anti-voice significantly more often than the other two identities (e.g., after being adapted to anti-A, they identified the average voice as A). In Experiment 2, participants showed a bias for identities opposite to the adaptor specifically for anti-voice, but not for non anti-voice adaptors. These results are strikingly similar to adaptation aftereffects observed for facial identity. They are compatible with a representation of individual voice identities in a multidimensional perceptual voice space referenced on a voice prototype.

  5. Emotional Prosody Measurement (EPM): a voice-based evaluation method for psychological therapy effectiveness.

    Science.gov (United States)

    van den Broek, Egon L

    2004-01-01

    The voice embodies three sources of information: speech, the identity, and the emotional state of the speaker (i.e., emotional prosody). The last of these is reflected in the variability of F0 (the fundamental frequency, perceived as pitch), measured as its standard deviation (SD F0). To extract this feature, Emotional Prosody Measurement (EPM) was developed, which consists of 1) speech recording, 2) removal of speckle noise, 3) a Fourier Transform to extract the F0-signal, and 4) the determination of SD F0. After a pilot study in which six participants mimicked emotions by their voice, the core experiment was conducted to see whether EPM is successful. Twenty-five patients suffering from a panic disorder with agoraphobia participated. Two methods (story-telling and reliving) were used to trigger anxiety and were compared with comparable but more relaxed conditions. This resulted in a unique database of speech samples that was used to compare the EPM with the Subjective Unit of Distress to validate it as a measure for anxiety/stress. The experimental manipulation of anxiety proved to be successful and EPM proved to be a successful evaluation method for psychological therapy effectiveness.
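
    The four EPM steps can be sketched with NumPy as follows. This is a rough illustration under stated assumptions, not the published method: a 3-point median filter stands in for the noise-removal step, F0 is taken as the strongest FFT bin between 75 and 400 Hz in each frame, and the function names, frame sizes and test signals are all hypothetical.

```python
import numpy as np

def median3(x):
    """3-point median filter, a stand-in for the noise-removal step."""
    padded = np.pad(x, 1, mode="edge")
    stacked = np.stack([padded[:-2], padded[1:-1], padded[2:]])
    return np.median(stacked, axis=0)

def f0_track(signal, sample_rate, frame_len=2048, hop=512, fmin=75.0, fmax=400.0):
    """Per-frame F0 estimate: the strongest FFT bin between fmin and fmax."""
    window = np.hanning(frame_len)
    freqs = np.fft.rfftfreq(frame_len, d=1.0 / sample_rate)
    band = (freqs >= fmin) & (freqs <= fmax)
    estimates = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start:start + frame_len] * window
        spectrum = np.abs(np.fft.rfft(frame))
        estimates.append(freqs[band][np.argmax(spectrum[band])])
    return np.array(estimates)

def sd_f0(signal, sample_rate):
    """Standard deviation of the frame-wise F0 track (SD F0), in Hz."""
    return float(np.std(f0_track(median3(signal), sample_rate), ddof=1))

# Synthetic stand-ins for a recording: a steady tone and a rising glide.
sr = 16000
t = np.arange(sr) / sr
tone = np.sin(2 * np.pi * 200 * t)
glide = np.sin(2 * np.pi * (150 * t + 75 * t ** 2))  # sweeps 150 Hz to 300 Hz
print(sd_f0(tone, sr), sd_f0(glide, sr))
```

    A monotone signal gives an SD F0 near zero, and a signal whose pitch moves gives a larger value. Peak-picking a raw spectrum is fragile on real speech, so a practical system would likely use a dedicated pitch tracker.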

  6. Type and severity of pain during phonation in professional voice users and nonvocal professionals.

    Science.gov (United States)

    Van Lierde, Kristiane M; Dijckmans, Joke; Scheffel, Lara; Behlau, Mara

    2012-09-01

    The purpose of this study was to determine the presence, frequency, and intensity of pain during speaking in professional voice users and nonvocal professionals and to determine if the presence of pain is significantly related to the profile of the professional voice user. Based on the available literature, significantly more pain symptoms in professional voice users can be hypothesized. Sample survey. To characterize the presence, type, and degree of pain symptoms during speaking, a questionnaire was used. Pain severity was measured by means of a numerical rating scale. Fifty-five percent (176/320) of the nonvocal professionals and 84% (698/832) of the professional voice users mentioned the presence of one or more pain symptoms during speaking. Throat pain was mentioned as the most common pain in both the professional voice users and the nonvocal professionals. The professional voice users showed significantly more throat, neck, shoulder, headache, ear, and back pain. Moreover, the intensity of throat pain was significantly increased in the professional voice users. This study showed evidence that several types of pain are present with significantly greater frequency in professional voice users. Vocal screening strategies, diagnostic, and treatment protocols should include the assessment of the type and severity of pain. Currently, the voice clinic is working on improving the diagnostic protocol with the objective of defining the combination of tests, which best diagnose voice problems and related complaints and which evaluate progress in vocal characteristics and pain after rehabilitation. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.

  7. It's not what you hear, it's the way you think about it: appraisals as determinants of affect and behaviour in voice hearers.

    Science.gov (United States)

    Peters, E R; Williams, S L; Cooke, M A; Kuipers, E

    2012-07-01

    Previous studies have suggested that beliefs about voices mediate the relationship between actual voice experience and behavioural and affective response. We investigated beliefs about voice power (omnipotence), voice intent (malevolence/benevolence) and emotional and behavioural response (resistance/engagement) using the Beliefs About Voices Questionnaire - Revised (BAVQ-R) in 46 voice hearers. Distress was assessed using a wide range of measures: voice-related distress, depression, anxiety, self-esteem and suicidal ideation. Voice topography was assessed using measures of voice severity, frequency and intensity. We predicted that beliefs about voices would show a stronger association with distress than voice topography. Omnipotence had the strongest associations with all measures of distress included in the study whereas malevolence was related to resistance, and benevolence to engagement. As predicted, voice severity, frequency and intensity were not related to distress once beliefs were accounted for. These results concur with previous findings that beliefs about voice power are key determinants of distress in voice hearers, and should be targeted specifically in psychological interventions.

  8. Assessments of Voice Use and Voice Quality among College/University Singing Students Ages 18–24 through Ambulatory Monitoring with a Full Accelerometer Signal

    Science.gov (United States)

    Schloneger, Matthew; Hunter, Eric

    2016-01-01

    The multiple social and performance demands placed on college/university singers could put their still developing voices at risk. Previous ambulatory monitoring studies have analyzed the duration, intensity, and frequency (in Hz) of voice use among such students. Nevertheless, no studies to date have incorporated the simultaneous acoustic voice quality measures into the acquisition of these measures to allow for direct comparison during the same voicing period. Such data could provide greater insight into how young singers use their voices, as well as identify potential correlations between vocal dose and acoustic changes in voice quality. The purpose of this study was to assess the voice use and estimated voice quality of college/university singing students (18–24 y/o, N = 19). Ambulatory monitoring was conducted over three full, consecutive weekdays measuring voice from an unprocessed accelerometer signal measured at the neck. From this signal were analyzed traditional vocal dose metrics such as phonation percentage, dose time, cycle dose, and distance dose. Additional acoustic measures included perceived pitch, pitch strength, LTAS slope, alpha ratio, dB SPL 1–3 kHz, and harmonic-to-noise ratio. Major findings from more than 800 hours of recording indicated that among these students (a) higher vocal doses correlated significantly with greater voice intensity, more vocal clarity and less perturbation; and (b) there were significant differences in some acoustic voice quality metrics between non-singing, solo singing and choral singing. PMID:26897545
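
    Three of the vocal dose metrics listed here can be computed directly from a frame-wise F0 track, as the sketch below shows. It follows the usual definitions (time dose is total voiced time, cycle dose is the number of vocal fold oscillations, phonation percentage is voiced time over total time). The function name, the frame format and the sample values are assumptions. Distance dose is left out because it also needs a vibration amplitude estimate.

```python
def vocal_doses(f0_frames, frame_seconds):
    """Compute simple vocal dose metrics from a frame-wise F0 track.

    f0_frames: per-frame F0 in Hz; 0 or None marks an unvoiced frame.
    frame_seconds: duration represented by each frame.
    """
    voiced = [f0 for f0 in f0_frames if f0]
    total_time = len(f0_frames) * frame_seconds
    time_dose = len(voiced) * frame_seconds   # seconds spent voicing
    cycle_dose = sum(voiced) * frame_seconds  # approximate oscillation count
    phonation_pct = 100.0 * time_dose / total_time if total_time else 0.0
    return {
        "phonation_pct": phonation_pct,
        "time_dose_s": time_dose,
        "cycle_dose_cycles": cycle_dose,
    }

# Ten hypothetical 50 ms frames, three of them voiced.
doses = vocal_doses([0, 200, 200, 0, 250, 0, 0, 0, 0, 0], 0.05)
print(doses)
```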

  9. Correlation of the Dysphonia Severity Index (DSI), Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V), and Gender in Brazilians With and Without Voice Disorders.

    Science.gov (United States)

    Nemr, Katia; Simões-Zenari, Marcia; de Souza, Glaucia S; Hachiya, Adriana; Tsuji, Domingos H

    2016-11-01

    This study aims to analyze the Dysphonia Severity Index (DSI) in Brazilians with or without voice disorders and investigate DSI's correlation with gender and auditory-perceptual evaluation data obtained via the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) protocol. A total of 66 Brazilian adults from both genders participated in the study, including 24 patients with dysphonia confirmed on laryngeal examination (dysphonic group [DG]) and 42 volunteers without voice or hearing complaints and without auditory-perceptual voice disorders (nondysphonic group [NDG]). The vocal tasks included in CAPE-V and DSI were performed and recorded. Data were analyzed by means of the independent t test, the Mann-Whitney U test, and Pearson correlation at the 5% significance level. Differences were found in the mean DSI values between the DG and the NDG. Differences were also found in all DSI items between the groups, except for the highest frequency parameter. In the DG, a moderate negative correlation was detected between overall dysphonia severity (CAPE-V) and DSI value, and between breathiness and DSI value, and a weak negative correlation was detected between DSI value and roughness. In the NDG, the maximum phonation time was higher among males. In both groups, the highest frequency parameter was higher among females. The DSI discriminated among Brazilians with or without voice disorders. A correlation was found between some aspects of the DSI and the CAPE-V but not between DSI and gender. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
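
    For readers unfamiliar with the DSI, the index is a weighted sum of four measurements. The sketch below uses the weights commonly cited from the original DSI publication (Wuyts et al., 2000); the abstract itself does not restate the formula, so the coefficients should be read as an assumption about what was applied. The function and parameter names and the input values are illustrative.

```python
def dysphonia_severity_index(mpt_s, f0_high_hz, intensity_low_db, jitter_pct):
    """DSI as a weighted sum of four voice measurements.

    mpt_s: maximum phonation time in seconds
    f0_high_hz: highest attainable frequency in Hz
    intensity_low_db: lowest attainable intensity in dB
    jitter_pct: jitter in percent
    """
    return (
        0.13 * mpt_s
        + 0.0053 * f0_high_hz
        - 0.26 * intensity_low_db
        - 1.18 * jitter_pct
        + 12.4
    )

# Made-up measurements for one speaker, not data from the study.
print(round(dysphonia_severity_index(20, 800, 55, 0.5), 2))  # -> 4.35
```

    Higher values indicate better voice quality. The highest frequency and the maximum phonation time enter with positive weights, which is why group differences in those two items matter for the overall index.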

  10. Comparison of acoustic voice characteristics in smoking and nonsmoking teachers

    Directory of Open Access Journals (Sweden)

    Šehović Ivana

    2012-01-01

    The voices of vocal professionals are exposed to considerable challenges, i.e. there is a high probability of voice alterations. Smoking, allergies and respiratory infections greatly affect the voice and can change its acoustic characteristics. In smokers, the mass of the vocal folds increases, resulting in changes in the vocal fold vibratory cycle. Pathological changes of the vocal folds deform the acoustic signal and affect voice production. As vocal professionals, teachers are much more affected by voice disorders than average speakers. The aim of this study was to examine the differences in acoustic parameters of voice between smoking and nonsmoking teachers, in a sample of vocal professionals. The sample consisted of 60 female subjects, aged from 25 to 59. For voice analysis we used Computer lab, model 4300, 'Kay Elemetrics Corporation'. The statistical significance of differences in the values of acoustic parameters between smokers and nonsmokers was tested by ANOVA. Results showed that in the sample of female teachers, professional use of voice combined with the smoking habit can be linked to changes in voice parameters. Comparing smokers and nonsmokers, average values of the parameters of short-term and long-term disturbances of frequency and amplitude proved to be significantly different.

  11. Modeling hemoglobin at optical frequency using the unconditionally stable fundamental ADI-FDTD method.

    Science.gov (United States)

    Heh, Ding Yu; Tan, Eng Leong

    2011-04-12

    This paper presents the modeling of hemoglobin at optical frequency (250 nm - 1000 nm) using the unconditionally stable fundamental alternating-direction-implicit finite-difference time-domain (FADI-FDTD) method. An accurate model based on complex conjugate pole-residue pairs is proposed to model the complex permittivity of hemoglobin at optical frequency. Two hemoglobin concentrations at 15 g/dL and 33 g/dL are considered. The model is then incorporated into the FADI-FDTD method for solving electromagnetic problems involving interaction of light with hemoglobin. The computation of transmission and reflection coefficients of a half space hemoglobin medium using the FADI-FDTD validates the accuracy of our model and method. The specific absorption rate (SAR) distribution of human capillary at optical frequency is also shown. While maintaining accuracy, the unconditionally stable FADI-FDTD method exhibits high efficiency in modeling hemoglobin.

  12. A Study of the Effect of Emotional State upon the Variation of the Fundamental Frequency of a Speaker

    Directory of Open Access Journals (Sweden)

    Marius Vasile GHIURCAU

    2010-01-01

    Telephone banking or brokering, building access systems or forensics are some of the areas in which speaker recognition is continuously developing. Fundamental frequency represents an important speech feature used in these applications. In this paper we present a study of the effect of emotional state of a speaker upon the variation of the fundamental frequency of the speech signal. Human beings are quite frequently overwhelmed by various emotions and most of the time one cannot really control these emotional states. For the purpose of our work we have used the Berlin emotional speech database which contains utterances of 10 speakers in different emotional situations: happy, angry, fearful, bored and neutral. The mean fundamental frequency and also the standard deviation for every speaker in all the emotional states were computed. The results show a very strong influence of the emotional state upon frequency variation.
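
    The per-speaker, per-emotion statistics described here amount to grouping F0 measurements and taking a mean and a standard deviation for each group. A minimal sketch follows; the record layout, the function name `f0_stats` and the numbers are hypothetical and are not taken from the Berlin database.

```python
from collections import defaultdict
from statistics import mean, stdev

def f0_stats(records):
    """Group F0 values by (speaker, emotion) and return (mean, SD) per group.

    records: iterable of (speaker, emotion, f0_hz) tuples.
    """
    grouped = defaultdict(list)
    for speaker, emotion, f0 in records:
        grouped[(speaker, emotion)].append(f0)
    return {
        key: (mean(values), stdev(values) if len(values) > 1 else 0.0)
        for key, values in grouped.items()
    }

# Made-up F0 values in Hz for one speaker in two emotional states.
records = [
    ("s1", "neutral", 100.0),
    ("s1", "neutral", 110.0),
    ("s1", "angry", 180.0),
    ("s1", "angry", 220.0),
]
stats = f0_stats(records)
print(stats[("s1", "neutral")], stats[("s1", "angry")])
```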

  13. Acute effects of inhaling Oud incense on voice of Saudi adults.

    Science.gov (United States)

    Mesallam, Tamer A; Farahat, Mohamed; Shoeib, Rasha; Alharethy, Sami; Alshahwan, Abdulaziz; Murry, Thomas; Almalkia, Khalid

    2015-01-01

    As in most Arab countries, incense burning, including Oud, is widely practiced in Saudi Arabia. The effects of Oud incense on voice have not been examined. Thus, the aim of this study was to examine the short-term effects of Oud incense on laryngeal symptoms and voice acoustics in normal Saudi adults. A prospective study was carried out at King Abdulaziz University Hospital between July 2012 and Jan 2014. Study subjects were recruited on a volunteer basis. A total of 72 adults (44.4% males and 55.6% females) were exposed to Oud incense smoke for 5 minutes while sitting 1 m away from an electrical sensor in a closed room. Symptom and acoustic voice analyses were performed pre-exposure and immediately post-exposure. A total of 27.8% of the subjects reported throat and voice symptoms after 5 minutes of exposure. Some frequency-related acoustic measures increased in male and female subjects after exposure to Oud incense. However, the difference between the pre- and post-exposure measures was not statistically significant. One third of the study subjects reported voice-related symptoms following exposure to Oud incense. Despite the absence of a statistically significant difference, some frequency-based acoustic parameters increased following exposure to Oud incense smoke.

  14. Dominant distortion classification for pre-processing of vowels in remote biomedical voice analysis

    DEFF Research Database (Denmark)

    Poorjam, Amir Hossein; Jensen, Jesper Rindom; Little, Max A

    2017-01-01

    for pathological voice assessments and investigate the impact of four major types of distortion that are commonly present during recording or transmission in voice analysis, namely: background noise, reverberation, clipping and compression, on Mel-frequency cepstral coefficients (MFCCs) – the most widely...

  15. Acoustic analysis of voice captured in the pharynx above the glottic source through a microphone on a laryngo-fiberscope

    Directory of Open Access Journals (Sweden)

    Erica E. Fukuyama

    2001-01-01

    Kay Elemetrics’ Computerized Speech Lab 4300B Model. Samples of the sustained vowels /a/, /i/ and /u/ were picked up in three distinct ways. Firstly, by a common external microphone placed at 15 cm from the mouth. Secondly, a special microphone was placed in the pharynx 1.5 cm above the vocal folds. Lastly, the same special microphone was placed externally at 2 cm from the mouth. Twelve acoustic parameters regarding fundamental frequency, amplitude and noise of each vowel were compared statistically according to the way the voice was picked up. Results: Statistically significant differences were found between the voice picked up by the common external microphone and by the special one with regard to fundamental frequency, frequency and amplitude variability, and noise. Conclusion: The difference between the sound coming from the glottic source and the sound from the external voice shows alterations experienced by the voice during its passage through the vocal tract.

  16. One (rating) from many (observations): Factors affecting the individual assessment of voice behavior in groups.

    Science.gov (United States)

    Podsakoff, Nathan P; Maynes, Timothy D; Whiting, Steven W; Podsakoff, Philip M

    2015-07-01

    This article reports an investigation into how individuals form perceptions of overall voice behavior in group contexts. More specifically, the authors examine the effect of the proportion of group members exhibiting voice behavior in the group, the frequency of voice events in the group, and the measurement item referent (group vs. individual) on an individual's ratings of group voice behavior. In addition, the authors examine the effect that measurement item referent has on the magnitude of the relationship observed between an individual's ratings of group voice behavior and perceptions of group performance. Consistent with hypotheses, the results from 1 field study (N = 220) and 1 laboratory experiment (N = 366) indicate that: (a) when group referents were used, raters relied on the frequency of voice events (and not the proportion of group members exhibiting voice) to inform their ratings of voice behavior, whereas the opposite was true when individual-referent items were used, and (b) the magnitude of the relationship between observers' ratings of group voice behavior and their perceptions of group performance was higher when raters used group-referent, as opposed to individual-referent, items. The authors discuss the implications of their findings for scholars interested in studying behavioral phenomena occurring in teams, groups, and work units in organizational behavior research. (c) 2015 APA, all rights reserved.

  17. The self or the voice? Relative contributions of self-esteem and voice appraisal in persistent auditory hallucinations.

    Science.gov (United States)

    Fannon, Dominic; Hayward, Peter; Thompson, Neil; Green, Nicola; Surguladze, Simon; Wykes, Til

    2009-07-01

    Persistent auditory hallucinations are common, disabling and difficult to treat. Cognitive behavioural therapy is recommended in their treatment though there is limited empirical evidence of the role of cognitive factors in the formation and persistence of voices. Low self-esteem is thought to play a causal and maintaining role in a range of clinical disorders, particularly depression, which is prevalent and disabling in schizophrenia. It was hypothesized that low self-esteem is prominent in, and contributes to, depression in voice hearers. Beliefs about persistent auditory hallucinations were investigated in 82 patients using the Beliefs About Voices Questionnaire--revised in a cross-sectional design. Self-esteem and depression were assessed using standardized measures. Depression and low self-esteem were prominent as were beliefs about the omnipotence and malevolence of auditory hallucinations. Beliefs about the uncontrollability and dominance of auditory hallucinations and low self-esteem were significantly correlated with depression. Low self-esteem did not mediate the effect of beliefs about auditory hallucinations--both acted independently to contribute to depression in this sample of patients with schizophrenia and persistent auditory hallucinations. Low self-esteem is of fundamental importance to the understanding of affective disturbance in voice hearers. Therapeutic interventions need to address both the appraisal of self and hallucinations in schizophrenia. Measures which ameliorate low self-esteem can be expected to improve depressed mood in this patient group. Further elucidation of the mechanisms involved can strengthen existing models of positive psychotic symptoms and provide targets for more effective treatments.

  18. Instantaneous and Frequency-Warped Signal Processing Techniques for Auditory Source Separation.

    Science.gov (United States)

    Wang, Avery Li-Chun

    This thesis summarizes several contributions to the areas of signal processing and auditory source separation. The philosophy of Frequency-Warped Signal Processing is introduced as a means for separating the AM and FM contributions to the bandwidth of a complex-valued, frequency-varying sinusoid p(n), transforming it into a signal with slowly-varying parameters. This transformation facilitates the removal of p(n) from an additive mixture while minimizing the amount of damage done to other signal components. The average winding rate of a complex-valued phasor is explored as an estimate of the instantaneous frequency. Theorems are provided showing the robustness of this measure. To implement frequency tracking, a Frequency-Locked Loop algorithm is introduced which uses the complex winding error to update its frequency estimate. The input signal is dynamically demodulated and filtered to extract the envelope. This envelope may then be remodulated to reconstruct the target partial, which may be subtracted from the original signal mixture to yield a new, quickly-adapting form of notch filtering. Enhancements to the basic tracker are made which, under certain conditions, attain the Cramér-Rao bound for the instantaneous frequency estimate. To improve tracking, the novel idea of Harmonic-Locked Loop tracking, using N harmonically constrained trackers, is introduced for tracking signals such as voices and certain musical instruments. The estimated fundamental frequency is computed from a maximum-likelihood weighting of the N tracking estimates, making it highly robust. The result is that harmonic signals, such as voices, can be isolated from complex mixtures in the presence of other spectrally overlapping signals. Additionally, since phase information is preserved, the resynthesized harmonic signals may be removed from the original mixtures with relatively little damage to the residual signal. Finally, a new methodology is given for designing linear-phase FIR filters
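
    The idea of using the average winding rate of a complex phasor as a frequency estimate can be illustrated with a minimal sketch. This is not the thesis's algorithm: the function names, the test tone, and the choice of a magnitude-weighted average of the sample-to-sample phase step are assumptions made here for illustration only.

```python
import cmath
import math


def winding_rate_frequency(samples, sample_rate):
    """Estimate frequency in Hz from the average phase advance per sample.

    Assumption for illustration: the "average winding rate" is taken as the
    angle of the sum of z[n] * conj(z[n-1]), i.e. a magnitude-weighted mean
    phase step. The thesis may define and weight it differently.
    """
    acc = sum(cur * prev.conjugate() for prev, cur in zip(samples, samples[1:]))
    phase_step = cmath.phase(acc)  # radians per sample, in (-pi, pi]
    return phase_step * sample_rate / (2.0 * math.pi)


def complex_tone(freq_hz, sample_rate, n):
    """Hypothetical helper: a unit-amplitude complex sinusoid of n samples."""
    return [cmath.exp(2j * math.pi * freq_hz * k / sample_rate) for k in range(n)]


tone = complex_tone(220.0, 8000.0, 256)
estimate = winding_rate_frequency(tone, 8000.0)
```

    For a clean tone the estimate matches the tone frequency up to rounding error; the estimate is only unambiguous while the true phase step stays within half a cycle per sample.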

  19. Two-component network model in voice identification technologies

    Directory of Open Access Journals (Sweden)

    Edita K. Kuular

    2018-03-01

    Among the most important parameters that determine the effectiveness of voice-based biometric systems, along with reliability and noise immunity, is the speed of identification and verification of a person. This parameter is especially critical when processing large-scale voice databases in real time. Many research studies in this area aim at developing new algorithms, and improving existing ones, for representing and processing voice records to ensure high performance of voice biometric systems. Here it seems promising to apply a modern approach based on a complex network platform for solving large, complex problems with many elements while taking their interrelationships into account. For example, some known works on the analysis and recognition of faces from photographs transform images into complex networks for subsequent processing by standard techniques. One of the first applications of complex networks to sound series (musical and speech analysis) is the description of frequency characteristics by constructing network models, that is, converting the series into networks. On the network ontology platform, a previously proposed technique of audio information representation, aimed at automatic analysis and speaker recognition, has been developed further. This involves converting the information into an associative semantic (cognitive) network structure with both amplitude and frequency components. Two speaker exemplars were recorded and transformed into the corresponding networks, and their topological metrics were then compared. The set of topological metrics for each network model (amplitude and frequency) is a vector, and together they form a matrix that serves as a digital "network" voiceprint. The proposed network approach, with its sensitivity to personal conditions (physiological, psychological, emotional), might be useful not only for person identification

  20. A Novel Fast and Secure Approach for Voice Encryption Based on DNA Computing

    Science.gov (United States)

    Kakaei Kate, Hamidreza; Razmara, Jafar; Isazadeh, Ayaz

    2018-06-01

    Today, in the world of information communication, voice information has a particular importance. One way to protect voice data from attacks is voice encryption. Encryption algorithms use various techniques such as hashing, chaotic, mixing, and many others. In this paper, an algorithm is proposed for voice encryption based on three different schemes to increase the flexibility and strength of the algorithm. The proposed algorithm uses an innovative encoding scheme, the DNA encryption technique and a permutation function to provide a secure and fast solution for voice encryption. The algorithm is evaluated based on various measures including signal-to-noise ratio, peak signal-to-noise ratio, correlation coefficient, signal similarity and signal frequency content. The results demonstrate the applicability of the proposed method to secure and fast encryption of voice files.
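
    The abstract does not specify its encoding scheme. As a rough sketch of what a DNA-style encoding step can look like, the example below maps each pair of bits to one nucleotide letter using a common textbook convention (00 to A, 01 to C, 10 to G, 11 to T). The mapping and the function names are assumptions, and this step is a reversible encoding only; it provides no secrecy on its own.

```python
BASES = "ACGT"  # assumed 2-bit mapping: 00->A, 01->C, 10->G, 11->T


def bytes_to_dna(data: bytes) -> str:
    """Encode each byte as four nucleotide letters, most significant bits first."""
    letters = []
    for byte in data:
        for shift in (6, 4, 2, 0):
            letters.append(BASES[(byte >> shift) & 0b11])
    return "".join(letters)


def dna_to_bytes(strand: str) -> bytes:
    """Invert bytes_to_dna; the strand length must be a multiple of four."""
    if len(strand) % 4 != 0:
        raise ValueError("strand length must be a multiple of 4")
    out = bytearray()
    for i in range(0, len(strand), 4):
        value = 0
        for base in strand[i:i + 4]:
            value = (value << 2) | BASES.index(base)
        out.append(value)
    return bytes(out)


encoded = bytes_to_dna(b"\x1b")  # 0x1b = 00 01 10 11
```

    In a scheme of the kind the abstract describes, a keyed substitution and a permutation would be applied on top of such an encoding; those steps are not shown here because the source gives no details.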

  1. Effects of first formant onset frequency on [-voice] judgments result from auditory processes not specific to humans.

    Science.gov (United States)

    Kluender, K R; Lotto, A J

    1994-02-01

    When F1-onset frequency is lower, longer F1 cut-back (VOT) is required for human listeners to perceive synthesized stop consonants as voiceless. K. R. Kluender [J. Acoust. Soc. Am. 90, 83-96 (1991)] found comparable effects of F1-onset frequency on the "labeling" of stop consonants by Japanese quail (Coturnix coturnix japonica) trained to distinguish stop consonants varying in F1 cut-back. In that study, CVs were synthesized with natural-like rising F1 transitions, and endpoint training stimuli differed in the onset frequency of F1 because a longer cut-back resulted in a higher F1 onset. In order to assess whether earlier results were due to auditory predispositions or due to animals having learned the natural covariance between F1 cut-back and F1-onset frequency, the present experiment was conducted with synthetic continua having either a relatively low (375 Hz) or high (750 Hz) constant-frequency F1. Six birds were trained to respond differentially to endpoint stimuli from three series of synthesized /CV/s varying in duration of F1 cut-back. Second and third formant transitions were appropriate for labial, alveolar, or velar stops. Despite the fact that there was no opportunity for animal subjects to use experienced covariation of F1-onset frequency and F1 cut-back, quail typically exhibited shorter labeling boundaries (more voiceless stops) for intermediate stimuli of the continua when F1 frequency was higher. Responses by human subjects listening to the same stimuli were also collected. Results lend support to the earlier conclusion that part or all of the effect of F1 onset frequency on perception of voicing may be adequately explained by general auditory processes. (ABSTRACT TRUNCATED AT 250 WORDS)

  2. [Comparison of cepstral coefficients to other voice evaluation parameters in patients with occupational dysphonia].

    Science.gov (United States)

    Niebudek-Bogusz, Ewa; Strumiłło, Paweł; Wiktorowicz, Justyna; Sliwińska-Kowalska, Mariola

    2013-01-01

    BACKGROUND Special consideration has recently been given to cepstral analysis with mel-frequency cepstral coefficients (MFCCs). The aim of this study was to assess the applicability of MFCCs in acoustic analysis for diagnosing occupational dysphonia in comparison to subjective and objective parameters of voice evaluation. The study comprised 2 groups, one of 55 female teachers (mean age: 45 years) with occupational dysphonia confirmed by videostroboscopy and 40 female controls with normal voice (mean age: 43 years). The acoustic samples involving sustained vowels "a" and four standardized sentences were analyzed by computed analysis of MFCCs. The results were compared to acoustic parameters of jitter and shimmer groups, noise to harmonic ratio, Yanagihara index evaluating the grade of hoarseness, the aerodynamic parameter: maximum phonation time and also subjective parameters: GRBAS perceptual scale and Voice Handicap Index (VHI). The compared results revealed differences between the study and control groups, significant for MFCC2, MFCC3, MFCC5, MFCC6, MFCC8, MFCC10, particularly for MFCC6 (p teachers correlated with all eight objective parameters, also showed the significant relation with perceptual voice feature A (asthenity) of subjective scale GRBAS, characteristic of weak tired voice. The cepstral analysis with mel frequency cepstral coefficients is a promising tool for evaluating occupational voice disorders, capable of reflecting the perceptual voice features better than other methods of acoustic analysis.

  3. The effects of voice and manual control mode on dual task performance

    Science.gov (United States)

    Wickens, C. D.; Zenyuh, J.; Culp, V.; Marshak, W.

    1986-01-01

    Two fundamental principles of human performance, compatibility and resource competition, are combined with two structural dichotomies in the human information processing system, manual versus voice output and left versus right cerebral hemisphere, in order to predict the optimum combination of voice and manual control with either hand for time-sharing performance of a discrete and a continuous task. Eight right-handed male subjects performed a discrete first-order tracking task, time-shared with an auditorily presented Sternberg Memory Search Task. Each task could be controlled by voice, or by the left or right hand, in all possible combinations except for a dual voice mode. When performance was analyzed in terms of a dual-task decrement from single-task control conditions, the following variables influenced time-sharing efficiency, in diminishing order of magnitude: (1) the modality of control (discrete manual control of tracking was superior to discrete voice control of tracking, and the converse was true with the memory search task); (2) response competition (performance was degraded when both tasks were responded to manually); (3) hemispheric competition (performance degraded whenever two tasks were controlled by the left hemisphere, i.e., by voice or right-handed control). The results confirm the value of predictive models in voice control implementation.

  4. Familiarity and Voice Representation: From Acoustic-Based Representation to Voice Averages

    Directory of Open Access Journals (Sweden)

    Maureen Fontaine

    2017-07-01

    The ability to recognize an individual from their voice is a widespread ability with a long evolutionary history. Yet, the perceptual representation of familiar voices is ill-defined. In two experiments, we explored the neuropsychological processes involved in the perception of voice identity. We specifically explored the hypothesis that familiar voices (trained-to-familiar voices in Experiment 1 and famous voices in Experiment 2) are represented as a whole complex pattern, well approximated by the average of multiple utterances produced by a single speaker. In Experiment 1, participants learned three voices over several sessions, and performed a three-alternative forced-choice identification task on original voice samples and several “speaker averages,” created by morphing across varying numbers of different vowels (e.g., [a] and [i]) produced by the same speaker. In Experiment 2, the same participants performed the same task on voice samples produced by familiar speakers. The two experiments showed that for famous voices, but not for trained-to-familiar voices, identification performance increased and response times decreased as a function of the number of utterances in the averages. This study sheds light on the perceptual representation of familiar voices, and demonstrates the power of averages in recognizing familiar voices. The speaker average captures the unique characteristics of a speaker, and thus retains the information essential for recognition; it acts as a prototype of the speaker.

  5. Stigma and need for care in individuals who hear voices.

    Science.gov (United States)

    Vilhauer, Ruvanee P

    2017-02-01

    Voice hearing experiences, or auditory verbal hallucinations, occur in healthy individuals as well as in individuals who need clinical care, but news media depict voice hearing primarily as a symptom of mental illness, particularly schizophrenia. This article explores whether, and how, public perception of an exaggerated association between voice hearing and mental illness might influence individuals' need for clinical care. A narrative literature review was conducted, using relevant peer-reviewed research published in the English language. Stigma may prevent disclosure of voice hearing experiences. Non-disclosure can prevent access to sources of normalizing information and lead to isolation, loss of social support and distress. Internalization of stigma and concomitantly decreased self-esteem could potentially affect features of voices such as perceived voice power, controllability, negativity and frequency, as well as distress. Increased distress may result in a decrease in functioning and increased need for clinical care. The literature reviewed suggests that stigma has the potential to increase need for care through many interrelated pathways. However, the ability to draw definitive conclusions was constrained by the designs of the studies reviewed. Further research is needed to confirm the findings of this review.

  6. Age-related changes to spectral voice characteristics affect judgments of prosodic, segmental, and talker attributes for child and adult speech

    Science.gov (United States)

    Dilley, Laura C.; Wieland, Elizabeth A.; Gamache, Jessica L.; McAuley, J. Devin; Redford, Melissa A.

    2013-01-01

    Purpose: As children mature, changes in voice spectral characteristics covary with changes in speech, language, and behavior. Spectral characteristics were manipulated to alter the perceived ages of talkers’ voices while leaving critical acoustic-prosodic correlates intact, to determine whether perceived age differences were associated with differences in judgments of prosodic, segmental, and talker attributes. Method: Speech was modified by lowering formants and fundamental frequency, for 5-year-old children’s utterances, or raising them, for adult caregivers’ utterances. Next, participants differing in awareness of the manipulation (Exp. 1a) or amount of speech-language training (Exp. 1b) made judgments of prosodic, segmental, and talker attributes. Exp. 2 investigated the effects of spectral modification on intelligibility. Finally, in Exp. 3 trained analysts used formal prosody coding to assess prosodic characteristics of spectrally-modified and unmodified speech. Results: Differences in perceived age were associated with differences in ratings of speech rate, fluency, intelligibility, likeability, anxiety, cognitive impairment, and speech-language disorder/delay; effects of training and awareness of the manipulation on ratings were limited. There were no significant effects of the manipulation on intelligibility or formally coded prosody judgments. Conclusions: Age-related voice characteristics can greatly affect judgments of speech and talker characteristics, raising cautionary notes for developmental research and clinical work. PMID:23275414

  7. [Voice disorders in female teachers assessed by Voice Handicap Index].

    Science.gov (United States)

    Niebudek-Bogusz, Ewa; Kuzańska, Anna; Woźnicka, Ewelina; Sliwińska-Kowalska, Mariola

    2007-01-01

    The aim of this study was to assess the application of the Voice Handicap Index (VHI) in the diagnosis of occupational voice disorders in female teachers. The subjective assessment of voice by VHI was performed in fifty subjects with dysphonia diagnosed in laryngovideostroboscopic examination. The control group comprised 30 women whose jobs did not involve vocal effort. The results of the total VHI score and each of its subscales (functional, emotional and physical) were significantly worse in the study group than in controls (p teachers estimated their own voice problems as a moderate disability, while 12% of them reported severe voice disability. However, all non-teachers assessed their voice problems as slight; their results fell at the lowest level of the VHI score. This study confirmed that VHI as a tool for self-assessment of voice can be a significant contribution to the diagnosis of occupational dysphonia.

  8. Prospective, longitudinal electroglottographic study of voice recovery following accelerated hypofractionated radiotherapy for T1/T2 larynx cancer

    Energy Technology Data Exchange (ETDEWEB)

    Kazi, Rehan [Head and Neck Unit, Royal Marsden Hospital, London (United Kingdom); Institute of Cancer Research, Cancer Research UK Centre for Cell and Molecular Biology, London (United Kingdom); Venkitaraman, Ramachandran; Johnson, Catherine; Prasad, Vyas; Clarke, Peter; Newbold, Kate; Rhys-Evans, Peter; Nutting, Christopher [Head and Neck Unit, Royal Marsden Hospital, London (United Kingdom); Harrington, Kevin [Head and Neck Unit, Royal Marsden Hospital, London (United Kingdom); Institute of Cancer Research, Cancer Research UK Centre for Cell and Molecular Biology, London (United Kingdom)], E-mail: kevinh@icr.ac.uk

    2008-05-15

    Background and purpose: To measure voice outcomes following accelerated hypofractionated radiotherapy for larynx cancer. Materials and methods: Twenty-five patients with T1/T2 glottic cancer underwent serial electroglottographic and acoustic analysis (sustained vowel /i/ and connected speech) before radiotherapy and 1, 6 and 12 months post-treatment. Twenty-five normal subjects served as a reference control population. Results: Pre-treatment measures were significantly worse for larynx cancer patients. Median jitter (0.23% vs 0.97%, p = 0.001) and shimmer (0.62 dB vs 0.98 dB, p = 0.05) and differences in data ranges reflected greater frequency and amplitude perturbation in the larynx cancer patients. Pre-treatment Mean Phonation Time (MPT) was significantly reduced (21 s vs 14.8 s, p = 0.002) in larynx cancer patients. There was a trend towards improvement of jitter, shimmer and normalized noise energy at 12 months post-treatment. MPT improved but remained significantly worse than for normal subjects (21 s vs 16.4 s, p = 0.013). Average fundamental frequency resembled normal subjects, including improvement of the measured range (91.4-244.6 Hz in controls vs 100-201 Hz in post-treatment larynx cancer patients). Conclusions: This non-invasive technique effectively measures post-treatment vocal function in larynx cancer patients. This study demonstrated improvement of many key parameters that influence voice function over 12 months after radiotherapy.

  9. Prospective, longitudinal electroglottographic study of voice recovery following accelerated hypofractionated radiotherapy for T1/T2 larynx cancer

    International Nuclear Information System (INIS)

    Kazi, Rehan; Venkitaraman, Ramachandran; Johnson, Catherine; Prasad, Vyas; Clarke, Peter; Newbold, Kate; Rhys-Evans, Peter; Nutting, Christopher; Harrington, Kevin

    2008-01-01

    Background and purpose: To measure voice outcomes following accelerated hypofractionated radiotherapy for larynx cancer. Materials and methods: Twenty-five patients with T1/T2 glottic cancer underwent serial electroglottographic and acoustic analysis (sustained vowel /i/ and connected speech) before radiotherapy and 1, 6 and 12 months post-treatment. Twenty-five normal subjects served as a reference control population. Results: Pre-treatment measures were significantly worse for larynx cancer patients. Median jitter (0.23% vs 0.97%, p = 0.001) and shimmer (0.62 dB vs 0.98 dB, p = 0.05) and differences in data ranges reflected greater frequency and amplitude perturbation in the larynx cancer patients. Pre-treatment Mean Phonation Time (MPT) was significantly reduced (21 s vs 14.8 s, p = 0.002) in larynx cancer patients. There was a trend towards improvement of jitter, shimmer and normalized noise energy at 12 months post-treatment. MPT improved but remained significantly worse than for normal subjects (21 s vs 16.4 s, p = 0.013). Average fundamental frequency resembled normal subjects, including improvement of the measured range (91.4-244.6 Hz in controls vs 100-201 Hz in post-treatment larynx cancer patients). Conclusions: This non-invasive technique effectively measures post-treatment vocal function in larynx cancer patients. This study demonstrated improvement of many key parameters that influence voice function over 12 months after radiotherapy.

  10. Singing voice outcomes following singing voice therapy.

    Science.gov (United States)

    Dastolfo-Hromack, Christina; Thomas, Tracey L; Rosen, Clark A; Gartner-Schmidt, Jackie

    2016-11-01

    The objectives of this study were to describe singing voice therapy (SVT), describe referred patient characteristics, and document the outcomes of SVT. Retrospective. Records of patients receiving SVT between June 2008 and June 2013 were reviewed (n = 51). All diagnoses were included. Demographic information, number of SVT sessions, and symptom severity were retrieved from the medical record. Symptom severity was measured via the 10-item Singing Voice Handicap Index (SVHI-10). Treatment outcome was analyzed by diagnosis, history of previous training, and SVHI-10. SVHI-10 scores decreased following SVT (mean change = 11, 40% decrease) (P singing lessons (n = 10) also completed an average of three SVT sessions. Primary muscle tension dysphonia (MTD1) and benign vocal fold lesion (lesion) were the most common diagnoses. Most patients (60%) had previous vocal training. SVHI-10 decrease was not significantly different between MTD and lesion. This is the first outcome-based study of SVT in a disordered population. Diagnosis of MTD or lesion did not influence treatment outcomes. Duration of SVT was short (approximately three sessions). Voice care providers are encouraged to partner with a singing voice therapist to provide optimal care for the singing voice. This study supports the use of SVT as a tool for the treatment of singing voice disorders. 4 Laryngoscope, 126:2546-2551, 2016. © 2016 The American Laryngological, Rhinological and Otological Society, Inc.

  11. Simulation of a Smith-Purcell free-electron laser with sidewalls: Copious emission at the fundamental frequency

    International Nuclear Information System (INIS)

    Donohue, J. T.; Gardelle, J.

    2011-01-01

    The two-dimensional theory of the Smith-Purcell free-electron laser of Andrews and Brau [H. L. Andrews and C. A. Brau, Phys. Rev. ST Accel. Beams 7, 070701 (2004)] predicts that coherent Smith-Purcell radiation can occur only at harmonics of the frequency of the evanescent wave that is resonant with the beam. A particle-in-cell simulation shows that in a three-dimensional context, where the lamellar grating has sidewalls, coherent Smith-Purcell radiation can be copiously emitted at the fundamental frequency, for a well-defined range of beam energy.

  12. Artificially intelligent recognition of Arabic speaker using voice print-based local features

    Science.gov (United States)

    Mahmood, Awais; Alsulaiman, Mansour; Muhammad, Ghulam; Akram, Sheeraz

    2016-11-01

    Local features for any pattern recognition system are based on information extracted locally. In this paper, a local feature extraction technique was developed. The feature was extracted in the time-frequency plane by taking the moving average along the diagonal directions of that plane. It captured the time-frequency events, producing a unique pattern for each speaker that can be viewed as a voice print of the speaker. Hence, we referred to this technique as the voice print-based local feature. The proposed feature was compared to other features, including the mel-frequency cepstral coefficient (MFCC), for speaker recognition using two different databases. One of the databases used in the comparison is a subset of an LDC database that consisted of two short sentences uttered by 182 speakers. The proposed feature attained a 98.35% recognition rate compared to 96.7% for MFCC using the LDC subset.
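
    A moving average along the diagonals of a time-frequency matrix can be sketched as follows. The abstract gives no window length, normalization, or boundary handling, so the window width, the "valid region only" output, and the function name are all assumptions for illustration rather than the authors' feature definition.

```python
def diagonal_moving_average(spec, width, direction=1):
    """Average `width` cells along a diagonal of a 2-D time-frequency grid.

    spec      : list of rows (time frames), each a list of band energies
    width     : number of cells averaged along the diagonal (assumed parameter)
    direction : +1 follows rising frequency over time, -1 falling frequency
    Only positions where the whole window fits inside the grid are returned.
    """
    rows, cols = len(spec), len(spec[0])
    out = []
    for t in range(rows - width + 1):
        row = []
        for f in range(cols - width + 1):
            start = f if direction == 1 else f + width - 1
            total = sum(spec[t + k][start + direction * k] for k in range(width))
            row.append(total / width)
        out.append(row)
    return out


# Toy 2x2 grid with energy only on the rising diagonal.
toy = [[1.0, 0.0],
       [0.0, 1.0]]
rising = diagonal_moving_average(toy, 2, direction=1)
falling = diagonal_moving_average(toy, 2, direction=-1)
```

    On the toy grid the rising-diagonal average responds fully while the falling-diagonal average is zero, which is the kind of direction-selective response such a feature relies on.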

  13. Speak up-related climate and its association with healthcare workers' speaking up and withholding voice behaviours: a cross-sectional survey in Switzerland.

    Science.gov (United States)

    Schwappach, David; Richard, Aline

    2018-03-23

    To determine frequencies of healthcare workers' (HCWs) speak up-related behaviours and the association of speak up-related safety climate with speaking up and withholding voice. Cross-sectional survey of doctors and nurses. Data were analysed using multilevel logistic regression models. Setting: 4 hospitals with a total of nine sites from the German, French and Italian speaking parts of Switzerland. Survey data were collected from 979 nurses and doctors. Frequencies of perceived patient safety concerns, of withholding voice and of speaking up behaviour. Speak up-related climate measures included psychological safety, encouraging environment and resignation. Perceived patient safety concerns were frequent among doctors and nurses (between 62% and 80% reported at least one safety concern during the last 4 weeks depending on the single items). Withholding voice was reported by 19%-39% of HCWs. Speaking up was reported by more than half of HCWs (55%-76%). The frequency of perceived concerns during the last 4 weeks was positively associated with both speaking up (OR=2.7, pspeaking up frequency (OR=1.3, p=0.005) and lower withholding voice frequency (OR=0.82, p=0.006). Resignation was associated with withholding voice (OR=1.5, pspeak up-supportive safety climate for staff safety-related communication behaviours, specifically withholding voice. This study indicates that a poor climate, in particular high levels of resignation among HCWs, is linked to frequent 'silence' of HCWs but not inversely associated with frequent speaking up. Interventions addressing safety-related voicing behaviours should discriminate between withholding voice and speaking up. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  14. I like my voice better: self-enhancement bias in perceptions of voice attractiveness.

    Science.gov (United States)

    Hughes, Susan M; Harrison, Marissa A

    2013-01-01

    Previous research shows that the human voice can communicate a wealth of nonsemantic information; preferences for voices can predict health, fertility, and genetic quality of the speaker, and people often use voice attractiveness, in particular, to make these assessments of others. But it is not known what we think of the attractiveness of our own voices as others hear them. In this study eighty men and women rated the attractiveness of an array of voice recordings of different individuals and were not told that their own recorded voices were included in the presentation. Results showed that participants rated their own voices as sounding more attractive than others had rated their voices, and participants also rated their own voices as sounding more attractive than they had rated the voices of others. These findings suggest that people may engage in vocal implicit egotism, a form of self-enhancement.

  15. Hyperfine-resolved transition frequency list of fundamental vibration bands of H35Cl and H37Cl

    Science.gov (United States)

    Iwakuni, Kana; Sera, Hideyuki; Abe, Masashi; Sasada, Hiroyuki

    2014-12-01

    Sub-Doppler resolution spectroscopy of the fundamental vibration bands of H35Cl and H37Cl has been carried out from 87.1 to 89.9 THz. We have determined the absolute transition frequencies of the hyperfine-resolved R(0) to R(4) transitions with a typical uncertainty of 10 kHz. We have also obtained six molecular constants for each isotopomer in the vibrational excited state, which reproduce the determined frequencies with a standard deviation of about 10 kHz.

  16. Associations between the Transsexual Voice Questionnaire (TVQMtF) and self-report of voice femininity and acoustic voice measures.

    Science.gov (United States)

    Dacakis, Georgia; Oates, Jennifer; Douglas, Jacinta

    2017-11-01

    The Transsexual Voice Questionnaire (TVQMtF) was designed to capture the voice-related perceptions of individuals whose gender identity as female is the opposite of their birth-assigned gender (MtF women). Evaluation of the psychometric properties of the TVQMtF is ongoing. To investigate associations between TVQMtF scores and (1) self-perceptions of voice femininity and (2) acoustic parameters of voice pitch and voice quality in order to evaluate further the validity of the TVQMtF. A strong correlation between TVQMtF scores and self-ratings of voice femininity was predicted, but no association between TVQMtF scores and acoustic measures of voice pitch and quality was proposed. Participants were 148 MtF women (mean age 48.14 years) recruited from the La Trobe Communication Clinic and the clinics of three doctors specializing in transgender health. All participants completed the TVQMtF and 34 of these participants also provided a voice sample for acoustic analysis. Pearson product-moment correlation analysis was conducted to examine the associations between TVQMtF scores and (1) self-perceptions of voice femininity and (2) acoustic measures of F0, jitter (%), shimmer (dB) and harmonic-to-noise ratio (HNR). Strong negative correlations between the participants' perceptions of their voice femininity and the TVQMtF scores demonstrated that for this group of MtF women a low self-rating of voice femininity was associated with more frequent negative voice-related experiences. This association was strongest with the vocal-functioning component of the TVQMtF. These strong correlations and high levels of shared variance between the TVQMtF and a measure of a related construct provide evidence for the convergent validity of the TVQMtF. The absence of significant correlations between the TVQMtF and the acoustic data is consistent with the equivocal findings of earlier research. This finding indicates that these two measures assess different aspects of the voice

  17. Dysphonic Voice Pattern Analysis of Patients in Parkinson’s Disease Using Minimum Interclass Probability Risk Feature Selection and Bagging Ensemble Learning Methods

    Directory of Open Access Journals (Sweden)

    Yunfeng Wu

    2017-01-01

    Analysis of quantified voice patterns is useful in the detection and assessment of dysphonia and related phonation disorders. In this paper, we first study the linear correlations between 22 voice parameters of fundamental frequency variability, amplitude variations, and nonlinear measures. The highly correlated vocal parameters are combined by using the linear discriminant analysis method. Based on the probability density functions estimated by the Parzen-window technique, we propose an interclass probability risk (ICPR) method to select the vocal parameters with small ICPR values as dominant features and compare it with the modified Kullback-Leibler divergence (MKLD) feature selection approach. The experimental results show that the generalized logistic regression analysis (GLRA), support vector machine (SVM), and Bagging ensemble algorithm, when given the ICPR features as input, can provide better classification results than the same classifiers with the MKLD-selected features. The SVM is much better at distinguishing normal vocal patterns, with a specificity of 0.8542. Among the three classification methods, the Bagging ensemble algorithm with ICPR features can identify 90.77% of vocal patterns, with the highest sensitivity of 0.9796 and the largest area under the receiver operating characteristic curve, 0.9558. The classification results demonstrate the effectiveness of our feature selection and pattern analysis methods for dysphonic voice detection and measurement.

  18. Motorcycle Start-stop System based on Intelligent Biometric Voice Recognition

    Science.gov (United States)

    Winda, A.; E Byan, W. R.; Sofyan; Armansyah; Zariantin, D. L.; Josep, B. G.

    2017-03-01

    The mechanical key currently used on motorcycles is vulnerable to burglary and can be stolen or misplaced. Intelligent biometric voice recognition is proposed as an alternative to this mechanism. The proposed system decides whether the voice belongs to the user and whether the word uttered by the user is ‘On’ or ‘Off’. The decision is sent to an Arduino in order to start or stop the engine. The recorded voice is processed to obtain features that are later used as input to the proposed system. The Mel-Frequency Cepstral Coefficient (MFCC) is adopted as the feature extraction technique. The extracted feature is then used as input to the SVM-based identifier. Experimental results confirm the effectiveness of the proposed intelligent voice recognition and word recognition system. They show that the proposed method produces good training and testing accuracy, 99.31% and 99.43%, respectively. Moreover, the proposed system shows a false rejection rate (FRR) and false acceptance rate (FAR) of 0.18% and 17.58%, respectively. For intelligent word recognition, the training and testing accuracies are 100% and 96.3%, respectively.
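
    The FRR and FAR figures quoted above follow the usual definitions: the share of genuine attempts that are rejected and the share of impostor attempts that are accepted. A minimal sketch of that arithmetic is given below; the trial counts and the function name `error_rates` are hypothetical, chosen only for illustration, and do not come from the paper.

```python
def error_rates(genuine_accepted, genuine_total, impostor_accepted, impostor_total):
    """Return (FRR, FAR) as percentages."""
    # FRR: genuine attempts that were wrongly rejected.
    frr = 100.0 * (genuine_total - genuine_accepted) / genuine_total
    # FAR: impostor attempts that were wrongly accepted.
    far = 100.0 * impostor_accepted / impostor_total
    return frr, far

# Hypothetical trial counts, not the study's data.
frr, far = error_rates(genuine_accepted=98, genuine_total=100,
                       impostor_accepted=5, impostor_total=50)
print(f"FRR = {frr:.2f}%, FAR = {far:.2f}%")
```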

  19. The future of Asian feminisms: confronting fundamentalisms, conflicts and neo-liberalism

    NARCIS (Netherlands)

    Katjasungkana, N.; Wieringa, S.E.

    2012-01-01

    This book on the future of Asian feminisms, confronting fundamentalisms, conflicts, and neo-liberalism is a critical contribution to the rising voices of Asian women’s studies scholars and activists. It is based on the ongoing research and advocacy work of the Kartini Asia Network, founded in 2003

  20. Comparison of cepstral coefficients to other voice evaluation parameters in patients with occupational dysphonia

    Directory of Open Access Journals (Sweden)

    Ewa Niebudek-Bogusz

    2013-12-01

    Full Text Available Background: Special consideration has recently been given to cepstral analysis with mel-frequency cepstral coefficients (MFCCs). The aim of this study was to assess the applicability of MFCCs in acoustic analysis for diagnosing occupational dysphonia in comparison to subjective and objective parameters of voice evaluation. Materials and Methods: The study comprised 2 groups, one of 55 female teachers (mean age: 45 years) with occupational dysphonia confirmed by videostroboscopy and 40 female controls with normal voice (mean age: 43 years). The acoustic samples involving sustained vowels "a" and four standardized sentences were analyzed by computed analysis of MFCCs. The results were compared to acoustic parameters of jitter and shimmer groups, noise to harmonic ratio, Yanagihara index evaluating the grade of hoarseness, the aerodynamic parameter: maximum phonation time and also subjective parameters: GRBAS perceptual scale and Voice Handicap Index (VHI). Results: The compared results revealed differences between the study and control groups, significant for MFCC2, MFCC3, MFCC5, MFCC6, MFCC8, MFCC10, particularly for MFCC6 (p < 0.001) and MFCC8 (p < 0.009), which may suggest their clinical applicability. In the study group, MFCC4, MFCC8 and MFCC10 correlated significantly with the major objective parameters of voice assessment. Moreover, MFCC8 coefficient, which in the female teachers correlated with all eight objective parameters, also showed the significant relation with perceptual voice feature A (asthenity) of subjective scale GRBAS, characteristic of weak tired voice. Conclusions: The cepstral analysis with mel frequency cepstral coefficients is a promising tool for evaluating occupational voice disorders, capable of reflecting the perceptual voice features better than other methods of acoustic analysis. Med Pr 2013;64(6):805–816

  1. Role of the Internal Superior Laryngeal Nerve in the Motor Responses of Vocal Cords and the Related Voice Acoustic Changes

    Science.gov (United States)

    Seifpanahi, Sadegh; Izadi, Farzad; Jamshidi, Ali-Ashraf; Torabinezhad, Farhad; Sarrafzadeh, Javad; Mohammadi, Siavash

    2016-01-01

    Background: Repeated efforts by researchers to impose voice changes by laryngeal surface electrical stimulation (SES) have come to no avail. The present pre-experimental study employed a novel method for SES application so as to evoke the motor potential of the internal superior laryngeal nerve (ISLN) and create voice changes. Methods: Thirty-two normal individuals (22 females and 10 males) participated in this study. The subjects were selected from the students of Iran University of Medical Sciences in 2014. Two monopolar active electrodes were placed on the thyrohyoid space at the location of the ISLN entrance to the larynx and 1 dispersive electrode was positioned on the back of the neck. A current with special programmed parameters was applied to stimulate the ISLN via the active electrodes and simultaneously the resultant acoustic changes were evaluated. All the means of the acoustic parameters during SES and rest periods were compared using the paired t-test. Results: The findings indicated significant changes (P=0.00) in most of the acoustic parameters during SES presentation compared with rest. The mean of fundamental frequency standard deviation (SD F0) at rest was 1.54 (SD=0.55) versus 4.15 (SD=3.00) for the SES period. The other investigated parameters comprised fundamental frequency (F0), minimum F0, jitter, shimmer, harmonic-to-noise ratio (HNR), mean intensity, and minimum intensity. Conclusion: These findings demonstrated significant changes in most of the important acoustic features, suggesting that the stimulation of the ISLN via SES could induce motor changes in the vocal folds. The clinical applicability of the method utilized in the current study in patients with vocal fold paralysis requires further research. PMID:27582586
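
    The paired t-test used to compare rest and SES periods works on the per-subject differences. The sketch below shows the statistic under that standard definition; the five value pairs are invented for illustration and are not the study's measurements, and `paired_t` is a hypothetical helper name.

```python
import math
from statistics import mean, stdev

def paired_t(before, after):
    """Return (t statistic, degrees of freedom) for paired samples."""
    if len(before) != len(after) or len(before) < 2:
        raise ValueError("need two equal-length samples with at least two pairs")
    # The test is a one-sample t-test on the within-subject differences.
    diffs = [a - b for a, b in zip(after, before)]
    standard_error = stdev(diffs) / math.sqrt(len(diffs))
    return mean(diffs) / standard_error, len(diffs) - 1

# Hypothetical SD F0 values for five subjects (not from the study).
rest = [1.2, 1.5, 1.9, 1.4, 1.7]
ses = [3.0, 4.1, 5.2, 2.8, 4.4]

t_value, dof = paired_t(rest, ses)
print(f"t = {t_value:.2f} with {dof} degrees of freedom")
```

    The function divides by the standard deviation of the differences, so it fails if every subject changes by exactly the same amount; a library routine such as SciPy's `ttest_rel` would also return the p-value.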

  2. Role of the Internal Superior Laryngeal Nerve in the Motor Responses of Vocal Cords and the Related Voice Acoustic Changes

    Directory of Open Access Journals (Sweden)

    Sadegh Seifpanahi

    2016-09-01

    Full Text Available Background: Repeated efforts by researchers to impose voice changes by laryngeal surface electrical stimulation (SES) have come to no avail. The present pre-experimental study employed a novel method for SES application so as to evoke the motor potential of the internal superior laryngeal nerve (ISLN) and create voice changes. Methods: Thirty-two normal individuals (22 females and 10 males) participated in this study. The subjects were selected from the students of Iran University of Medical Sciences in 2014. Two monopolar active electrodes were placed on the thyrohyoid space at the location of the ISLN entrance to the larynx and 1 dispersive electrode was positioned on the back of the neck. A current with special programmed parameters was applied to stimulate the ISLN via the active electrodes and simultaneously the resultant acoustic changes were evaluated. All the means of the acoustic parameters during SES and rest periods were compared using the paired t-test. Results: The findings indicated significant changes (P=0.00) in most of the acoustic parameters during SES presentation compared with rest. The mean of fundamental frequency standard deviation (SD F0) at rest was 1.54 (SD=0.55) versus 4.15 (SD=3.00) for the SES period. The other investigated parameters comprised fundamental frequency (F0), minimum F0, jitter, shimmer, harmonic-to-noise ratio (HNR), mean intensity, and minimum intensity. Conclusion: These findings demonstrated significant changes in most of the important acoustic features, suggesting that the stimulation of the ISLN via SES could induce motor changes in the vocal folds. The clinical applicability of the method utilized in the current study in patients with vocal fold paralysis requires further research.

  3. Sound induced activity in voice sensitive cortex predicts voice memory ability

    Directory of Open Access Journals (Sweden)

    Rebecca Watson

    2012-04-01

    Full Text Available The ‘temporal voice areas’ (TVAs) (Belin et al., 2000) of the human brain show greater neuronal activity in response to human voices than to other categories of nonvocal sounds. However, a direct link between TVA activity and voice perception behaviour has not yet been established. Here we show that a functional magnetic resonance imaging (fMRI) measure of activity in the TVAs predicts individual performance at a separately administered voice memory test. This relation holds when general sound memory ability is taken into account. These findings provide the first evidence that the TVAs are specifically involved in voice cognition.

  4. Investigation of a glottal related harmonics-to-noise ratio and spectral tilt as indicators of glottal noise in synthesized and human voice signals.

    LENUS (Irish Health Repository)

    Murphy, Peter J

    2008-03-01

    The harmonics-to-noise ratio (HNR) of the voiced speech signal has implicitly been used to infer information regarding the turbulent noise level at the glottis. However, two problems exist for inferring glottal noise attributes from the HNR of the speech wave form: (i) the measure is fundamental frequency (f0) dependent for equal levels of glottal noise, and (ii) any deviation from signal periodicity affects the ratio, not just turbulent noise. An alternative harmonics-to-noise ratio formulation [glottal related HNR (GHNR')] is proposed to overcome the former problem. In GHNR' a mean over the spectral range of interest of the HNRs at specific harmonic/between-harmonic frequencies (expressed in linear scale) is calculated. For the latter issue [(ii)] two spectral tilt measures are shown, using synthesis data, to be sensitive to glottal noise while at the same time being comparatively insensitive to other glottal aperiodicities. The theoretical development predicts that the spectral tilt measures reduce as noise levels increase. A conventional HNR estimator, GHNR' and two spectral tilt measures are applied to a data set of 13 pathological and 12 normal voice samples. One of the tilt measures and GHNR' are shown to provide statistically significant differentiating power over a conventional HNR estimator.

  5. Integrating cues of social interest and voice pitch in men's preferences for women's voices.

    Science.gov (United States)

    Jones, Benedict C; Feinberg, David R; Debruine, Lisa M; Little, Anthony C; Vukovic, Jovana

    2008-04-23

    Most previous studies of vocal attractiveness have focused on preferences for physical characteristics of voices such as pitch. Here we examine the content of vocalizations in interaction with such physical traits, finding that vocal cues of social interest modulate the strength of men's preferences for raised pitch in women's voices. Men showed stronger preferences for raised pitch when judging the voices of women who appeared interested in the listener than when judging the voices of women who appeared relatively disinterested in the listener. These findings show that voice preferences are not determined solely by physical properties of voices and that men integrate information about voice pitch and the degree of social interest expressed by women when forming voice preferences. Women's preferences for raised pitch in women's voices were not modulated by cues of social interest, suggesting that the integration of cues of social interest and voice pitch when men judge the attractiveness of women's voices may reflect adaptations that promote efficient allocation of men's mating effort.

  6. Voice Therapy Practices and Techniques: A Survey of Voice Clinicians.

    Science.gov (United States)

    Mueller, Peter B.; Larson, George W.

    1992-01-01

    Eighty-three voice disorder therapists' ratings of statements regarding voice therapy practices indicated that vocal nodules are the most frequent disorder treated; vocal abuse and hard glottal attack elimination, counseling, and relaxation were preferred treatment approaches; and voice therapy is more effective with adults than with children.…

  7. Improvement of Microtremor Data Filtering and Processing Methods Used in Determining the Fundamental Frequency of Urban Areas

    Science.gov (United States)

    Mousavi Anzehaee, Mohammad; Adib, Ahmad; Heydarzadeh, Kobra

    2015-10-01

    How microtremor data are collected, filtered, and processed has a considerable effect on the accuracy with which dynamic soil parameters are estimated. In this paper, a running variance method was used to improve the automatic detection of data sections contaminated by local perturbations. In this method, the running variance of the microtremor data is computed using a sliding window. The resulting signal is then used to remove the ranges of data affected by perturbations from the original data. Additionally, to determine the fundamental frequency of a site, this study proposes a method based on statistical characteristics. Statistical characteristics, such as the probability density graph and the average and standard deviation of all the frequencies corresponding to the maximum peaks in the H/V spectra of all data windows, are used to differentiate the real peaks from the false peaks resulting from perturbations. The methods have been applied to the data recorded for the City of Meybod in central Iran. Experimental results show that the applied methods successfully reduce the effects of extensive local perturbations on microtremor data and ultimately estimate the fundamental frequency more accurately than other common methods.
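
    The running-variance idea can be sketched as follows. The abstract does not give the window length or the rejection rule, so the sketch assumes a simple rule, namely flagging any window whose variance exceeds a multiple of the median window variance. The trace, the `factor` threshold, and the function names are hypothetical.

```python
from statistics import median, pvariance

def running_variance(signal, window):
    """Population variance of every length-`window` slice of `signal`."""
    if window < 2 or window > len(signal):
        raise ValueError("window must be between 2 and len(signal)")
    return [pvariance(signal[i:i + window])
            for i in range(len(signal) - window + 1)]

def perturbed_mask(signal, window, factor):
    """Flag samples covered by any window whose variance is unusually high.

    Assumed rule: a window is perturbed when its variance exceeds
    `factor` times the median window variance.
    """
    variances = running_variance(signal, window)
    threshold = factor * median(variances)
    mask = [False] * len(signal)
    for start, value in enumerate(variances):
        if value > threshold:
            for i in range(start, start + window):
                mask[i] = True
    return mask

# Synthetic trace: low-amplitude noise with one short burst (made-up data).
trace = [0.1, -0.1, 0.1, -0.1, 0.1, -0.1, 5.0, -5.0,
         0.1, -0.1, 0.1, -0.1, 0.1, -0.1]
mask = perturbed_mask(trace, window=4, factor=10.0)
clean = [x for x, bad in zip(trace, mask) if not bad]
print(clean)
```

    Because every sample inside a flagged window is dropped, the rule is conservative: it also removes a few quiet samples on either side of the burst.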

  8. Representative voice in different organizational contexts : a study of 40 departments of a Dutch childcare organization

    NARCIS (Netherlands)

    Pauksztat, Birgit; Wittek, Rafael

    2011-01-01

    'Representative voice' can be defined as actions in which one or more speakers represent others when speaking up about a problem at the workplace or making a suggestion. The purpose of this paper is to introduce the concept of representative voice, assess the frequency of its occurrence and examine

  9. Examining explanations for fundamental frequency's contribution to speech intelligibility in noise

    Science.gov (United States)

    Schlauch, Robert S.; Miller, Sharon E.; Watson, Peter J.

    2005-09-01

    Laures and Weismer [JSLHR, 42, 1148 (1999)] reported that speech with natural variation in fundamental frequency (F0) is more intelligible in noise than speech with a flattened F0 contour. Cognitive-linguistic based explanations have been offered to account for this drop in intelligibility for the flattened condition, but a lower-level mechanism related to auditory streaming may be responsible. Numerous psychoacoustic studies have demonstrated that modulating a tone enables a listener to segregate it from background sounds. To test these rival hypotheses, speech recognition in noise was measured for sentences with six different F0 contours: unmodified, flattened at the mean, natural but exaggerated, reversed, and frequency modulated (rates of 2.5 and 5.0 Hz). The 180 stimulus sentences were produced by five talkers (30 sentences per condition). Speech recognition scores for fifteen listeners replicate earlier findings showing that flattening the F0 contour results in a roughly 10% reduction in recognition of key words compared with the natural condition. Although the exaggerated condition produced results comparable to those of the flattened condition, the other conditions with unnatural F0 contours all yielded significantly poorer performance than the flattened condition. These results support the cognitive, linguistic-based explanations for the reduction in performance.

  10. Voices Not Heard: Voice-Use Profiles of Elementary Music Teachers, the Effects of Voice Amplification on Vocal Load, and Perceptions of Issues Surrounding Voice Use

    Science.gov (United States)

    Morrow, Sharon L.

    2009-01-01

    Teachers represent the largest group of occupational voice users and have voice-related problems at a rate of over twice that found in the general population. Among teachers, music teachers are roughly four times more likely than classroom teachers to develop voice-related problems. Although it has been established that music teachers use their…

  11. Behavioural evidence of a dissociation between voice gender categorization and phoneme categorization using auditory morphed stimuli

    Directory of Open Access Journals (Sweden)

    Cyril R Pernet

    2014-01-01

    Full Text Available Both voice gender and speech perception rely on neuronal populations located in the peri-sylvian areas. However, whilst functional imaging studies suggest a left versus right hemisphere and anterior versus posterior dissociation between voice and speech categorization, psycholinguistic studies on talker variability suggest that these two processes (voice and speech categorization) share common mechanisms. In this study, we investigated the categorical perception of voice gender (male vs. female) and phonemes (/pa/ vs. /ta/) using the same stimulus continua generated by morphing. This allowed the investigation of behavioural differences while controlling acoustic characteristics, since the same stimuli were used in both tasks. Despite a higher acoustic dissimilarity between items during the phoneme categorization task (a male and female voice producing the same phonemes) than the gender task (the same person producing 2 phonemes), results showed that speech information is being processed much faster than voice information. In addition, f0 or timbre equalization did not affect RT, which disagrees with the classical psycholinguistic models in which voice information is stripped away or normalized to access phonetic content. Also, despite similar response (percentages) and perceptual (d’) curves, a reverse correlation analysis on acoustic features revealed, as expected, that the formant frequencies of the consonant distinguished stimuli in the phoneme task, but that only the vowel formant frequencies distinguish stimuli in the gender task. The 2nd set of results thus also disagrees with models postulating that the same acoustic information is used for voice and speech. Altogether these results suggest that voice gender categorization and phoneme categorization are dissociated at an early stage on the basis of different enhanced acoustic features that are diagnostic to the task at hand.
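
    The perceptual (d’) curves mentioned above rest on the signal-detection sensitivity index. The abstract does not say how d’ was derived along the morph continua, so the sketch below shows only the textbook form, d’ = z(hit rate) − z(false-alarm rate); the rates used are hypothetical.

```python
from statistics import NormalDist

def d_prime(hit_rate, false_alarm_rate):
    """Sensitivity index: difference of the z-transformed rates."""
    z = NormalDist().inv_cdf  # inverse of the standard normal CDF
    return z(hit_rate) - z(false_alarm_rate)

# Hypothetical response rates for one pair of morphed stimuli.
print(round(d_prime(0.90, 0.20), 3))
```

    Rates of exactly 0 or 1 have no finite z-score, so `inv_cdf` raises an error for them; in practice such rates are usually adjusted slightly before the transform.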

  12. Dimensionality in voice quality.

    Science.gov (United States)

    Bele, Irene Velsvik

    2007-05-01

    This study concerns speaking voice quality in a group of male teachers (n = 35) and male actors (n = 36), as the purpose was to investigate normal and supranormal voices. The goal was the development of a method of valid perceptual evaluation for normal to supranormal and resonant voices. The voices (text reading at two loudness levels) had been evaluated by 10 listeners, for 15 vocal characteristics using VA scales. In this investigation, the results of an exploratory factor analysis of the vocal characteristics used in this method are presented, reflecting four dimensions of major importance for normal and supranormal voices. Special emphasis is placed on the effects on voice quality of a change in the loudness variable, as two loudness levels are studied. Furthermore, the vocal characteristics Sonority and Ringing voice quality are paid special attention, as the essence of the term "resonant voice" was a basic issue throughout a doctoral dissertation where this study was included.

  13. Tutorial and Guidelines on Measurement of Sound Pressure Level in Voice and Speech

    Science.gov (United States)

    Švec, Jan G.; Granqvist, Svante

    2018-01-01

    Purpose: Sound pressure level (SPL) measurement of voice and speech is often considered a trivial matter, but the measured levels are often reported incorrectly or incompletely, making them difficult to compare among various studies. This article aims at explaining the fundamental principles behind these measurements and providing guidelines to…

  14. Muscular tension and body posture in relation to voice handicap and voice quality in teachers with persistent voice complaints.

    Science.gov (United States)

    Kooijman, P G C; de Jong, F I C R S; Oudes, M J; Huinck, W; van Acht, H; Graamans, K

    2005-01-01

    The aim of this study was to investigate the relationship between extrinsic laryngeal muscular hypertonicity and deviant body posture on the one hand and voice handicap and voice quality on the other hand in teachers with persistent voice complaints and a history of voice-related absenteeism. The study group consisted of 25 female teachers. A voice therapist assessed extrinsic laryngeal muscular tension and a physical therapist assessed body posture. The assessed parameters were clustered in categories. The parameters in the different categories represent the same function. Further, a tension/posture index was created, which is the summation of the different parameters. The different parameters and the index were related to the Voice Handicap Index (VHI) and the Dysphonia Severity Index (DSI). The scores of the VHI and the individual parameters differ significantly except for the posterior weight bearing and tension of the sternocleidomastoid muscle. There was also a significant difference between the individual parameters and the DSI, except for tension of the cricothyroid muscle and posterior weight bearing. The score of the tension/posture index correlates significantly with both the VHI and the DSI. In a linear regression analysis, the combination of hypertonicity of the sternocleidomastoid, the geniohyoid muscles and posterior weight bearing is the most important predictor for a high voice handicap. The combination of hypertonicity of the geniohyoid muscle, posterior weight bearing, high position of the hyoid bone, hypertonicity of the cricothyroid muscle and anteroposition of the head is the most important predictor for a low DSI score. The results of this study show that the higher the score of the index, the higher the score of the voice handicap and the worse the voice quality. Moreover, the results are indicative of the importance of assessment of muscular tension and body posture in the diagnosis of voice disorders.

  15. Mindfulness of voices, self-compassion, and secure attachment in relation to the experience of hearing voices.

    Science.gov (United States)

    Dudley, James; Eames, Catrin; Mulligan, John; Fisher, Naomi

    2018-03-01

    Developing compassion towards oneself has been linked to improvement in many areas of psychological well-being, including psychosis. Furthermore, developing a non-judgemental, accepting way of relating to voices is associated with lower levels of distress for people who hear voices. These factors have also been associated with secure attachment. This study explores associations between the constructs of mindfulness of voices, self-compassion, and distress from hearing voices and how secure attachment style related to each of these variables. Cross-sectional online. One hundred and twenty-eight people (73% female; M age  = 37.5; 87.5% Caucasian) who currently hear voices completed the Self-Compassion Scale, Southampton Mindfulness of Voices Questionnaire, Relationships Questionnaire, and Hamilton Programme for Schizophrenia Voices Questionnaire. Results showed that mindfulness of voices mediated the relationship between self-compassion and severity of voices, and self-compassion mediated the relationship between mindfulness of voices and severity of voices. Self-compassion and mindfulness of voices were significantly positively correlated with each other and negatively correlated with distress and severity of voices. Mindful relation to voices and self-compassion are associated with reduced distress and severity of voices, which supports the proposed potential benefits of mindful relating to voices and self-compassion as therapeutic skills for people experiencing distress by voice hearing. Greater self-compassion and mindfulness of voices were significantly associated with less distress from voices. These findings support theory underlining compassionate mind training. Mindfulness of voices mediated the relationship between self-compassion and distress from voices, indicating a synergistic relationship between the constructs. Although the current findings do not give a direction of causation, consideration is given to the potential impact of mindful and

  16. Vocal fold elasticity of the Rocky Mountain elk (Cervus elaphus nelsoni) – producing high fundamental frequency vocalization with a very long vocal fold

    OpenAIRE

    Riede, Tobias; Titze, Ingo R.

    2008-01-01

    The vocal folds of male Rocky Mountain elk (Cervus elaphus nelsoni) are about 3 cm long. If fundamental frequency were to be predicted by a simple vibrating string formula, as is often done for the human larynx, such long vocal folds would bear enormous stress to produce the species-specific mating call with an average fundamental frequency of 1 kHz. Predictions would be closer to 50 Hz. Vocal fold histology revealed the presence of a large vocal ligament between the vocal fold epithelium and...
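
    The string-formula argument in this abstract can be checked with a short calculation. The ideal-string relation is F0 = sqrt(stress / density) / (2 × length). The 3 cm length and the 1 kHz and 50 Hz frequencies come from the abstract; the tissue density of 1040 kg/m³ is an assumed soft-tissue value, and the function names are hypothetical.

```python
import math

def string_f0(length_m, stress_pa, density_kg_m3):
    """Fundamental frequency of an ideal vibrating string, in Hz."""
    return math.sqrt(stress_pa / density_kg_m3) / (2.0 * length_m)

def stress_for_f0(length_m, f0_hz, density_kg_m3):
    """Invert the string formula: stress = density * (2 * length * F0)**2."""
    return density_kg_m3 * (2.0 * length_m * f0_hz) ** 2

TISSUE_DENSITY = 1040.0  # kg/m^3, assumed value, not given in the abstract
LENGTH = 0.03            # 3 cm vocal fold length, from the abstract

high = stress_for_f0(LENGTH, 1000.0, TISSUE_DENSITY)  # stress for a 1 kHz call
low = stress_for_f0(LENGTH, 50.0, TISSUE_DENSITY)     # stress for 50 Hz
print(f"1 kHz: {high / 1e6:.2f} MPa, 50 Hz: {low / 1e3:.2f} kPa")
```

    Under these assumptions a 1 kHz fundamental needs roughly 3.7 MPa of stress against about 9.4 kPa at 50 Hz, a 400-fold difference, which is why the simple string model points to a much lower frequency for folds of this length.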

  17. Trends in musical theatre voice: an analysis of audition requirements for singers.

    Science.gov (United States)

    Green, Kathryn; Freeman, Warren; Edwards, Matthew; Meyer, David

    2014-05-01

    The American musical theatre industry is a multibillion dollar business in which the requirements for singers are varied and complex. This study identifies the musical genres and voice requirements that are currently most requested at professional auditions to help voice teachers, pedagogues, and physicians who work with musical theatre singers understand the demands of their clients' business. Frequency count. One thousand two hundred thirty-eight professional musical theatre audition listings were gathered over a 6-month period, and information from each listing was categorized and entered into a spreadsheet for analysis. The results indicate that four main genres of music were requested over a wide variety of styles, with more than half of auditions requesting genre categories that may not be served by traditional or classical voice technique alone. To adequately prepare young musical theatre performers for the current job market and keep the performers healthily making the sounds required by the industry, new singing styles may need to be studied and integrated into voice training that only teaches classical styles. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.

  18. Voice gender discrimination provides a measure of more than pitch-related perception in cochlear implant users.

    Science.gov (United States)

    Li, Tianhao; Fu, Qian-Jie

    2011-08-01

    (1) To investigate whether voice gender discrimination (VGD) could be a useful indicator of the spectral and temporal processing abilities of individual cochlear implant (CI) users; (2) To examine the relationship between VGD and speech recognition with CI when comparable acoustic cues are used for both perception processes. VGD was measured using two talker sets with different inter-gender fundamental frequencies (F0), as well as different acoustic CI simulations. Vowel and consonant recognition in quiet and noise were also measured and compared with VGD performance. Eleven postlingually deaf CI users. The results showed that (1) mean VGD performance differed for different stimulus sets, (2) VGD and speech recognition performance varied among individual CI users, and (3) individual VGD performance was significantly correlated with speech recognition performance under certain conditions. VGD measured with selected stimulus sets might be useful for assessing not only pitch-related perception, but also spectral and temporal processing by individual CI users. In addition to improvements in spectral resolution and modulation detection, the improvement in higher modulation frequency discrimination might be particularly important for CI users in noisy environments.

  19. Face the voice

    DEFF Research Database (Denmark)

    Lønstrup, Ansa

    2014-01-01

    will be based on a reception aesthetic and phenomenological approach, the latter as presented by Don Ihde in his book Listening and Voice. Phenomenologies of Sound , and my analytical sketches will be related to theoretical statements concerning the understanding of voice and media (Cavarero, Dolar, La......Belle, Neumark). Finally, the article will discuss the specific artistic combination and our auditory experience of mediated human voices and sculpturally projected faces in an art museum context under the general conditions of the societal panophonia of disembodied and mediated voices, as promoted by Steven...

  20. "Voice Forum" The Human Voice as Primary Instrument in Music Therapy

    DEFF Research Database (Denmark)

    Pedersen, Inge Nygaard; Storm, Sanne

    2009-01-01

    Aspects will be drawn on the human voice as tool for embodying our psychological and physiological state, and attempting integration of feelings. Presentations and dialogues on different methods and techniques in "Therapy related body-and voice work.", as well as the human voice as a tool for non...

  1. The Influence of Fundamental Frequency and Sound Pressure Level Range on Breathing Patterns in Female Classical Singing

    Science.gov (United States)

    Collyer, Sally; Thorpe, C. William; Callaghan, Jean; Davis, Pamela J.

    2008-01-01

    Purpose: This study investigated the influence of fundamental frequency (F0) and sound pressure level (SPL) range on respiratory behavior in classical singing. Method: Five trained female singers performed an 8-s messa di voce (a crescendo and decrescendo on one F0) across their musical F0 range. Lung volume (LV) change was estimated, and…

  2. Very low bit rate voice for packetized mobile applications

    International Nuclear Information System (INIS)

    Knittle, C.D.; Malone, K.T.

    1991-01-01

    This paper reports that transmitting digital voice via packetized mobile communications systems that employ relatively short packet lengths and narrow bandwidths often necessitates very low bit rate coding of the voice data. Sandia National Laboratories is currently developing an efficient voice coding system operating at 800 bits per second (bps). The coding scheme is a modified version of the 2400 bps NSA LPC-10e standard. The most significant modification to the LPC-10e scheme is the vector quantization of the line spectrum frequencies associated with the synthesis filters. An outline of a hardware implementation for the 800 bps coder is presented. The speech quality of the coder is generally good, although speaker recognition is not possible. Further research is being conducted to reduce the memory requirements and complexity of the vector quantizer, and to increase the quality of the reconstructed speech. This work may be of use in dealing with nuclear materials.

  3. Writing with Voice

    Science.gov (United States)

    Kesler, Ted

    2012-01-01

    In this Teaching Tips article, the author argues for a dialogic conception of voice, based in the work of Mikhail Bakhtin. He demonstrates a dialogic view of voice in action, using two writing examples about the same topic from his daughter, a fifth-grade student. He then provides five practical tips for teaching a dialogic conception of voice in…

  4. Marshall’s Voice

    Directory of Open Access Journals (Sweden)

    Halper Thomas

    2017-12-01

    Most judicial opinions, for a variety of reasons, do not speak with the voice of identifiable judges, but an analysis of several of John Marshall’s best known opinions reveals a distinctive voice, with its characteristic language and style of argumentation. The power of this voice helps to account for the influence of his views.

  5. A pneumatic Bionic Voice prosthesis-Pre-clinical trials of controlling the voice onset and offset.

    Science.gov (United States)

    Ahmadi, Farzaneh; Noorian, Farzad; Novakovic, Daniel; van Schaik, André

    2018-01-01

    Despite emergent progress in many fields of bionics, a functional Bionic Voice prosthesis for laryngectomy patients (larynx amputees) has not yet been achieved, leading to a lifetime of vocal disability for these patients. This study introduces a novel framework of Pneumatic Bionic Voice Prostheses as an electronic adaptation of the Pneumatic Artificial Larynx (PAL) device. The PAL is a non-invasive mechanical voice source, driven exclusively by respiration with an exceptionally high voice quality, comparable to the existing gold standard of Tracheoesophageal (TE) voice prosthesis. Following PAL design closely as the reference, Pneumatic Bionic Voice Prostheses seem to have a strong potential to substitute the existing gold standard by generating a similar voice quality while remaining non-invasive and non-surgical. This paper designs the first Pneumatic Bionic Voice prosthesis and evaluates its onset and offset control against the PAL device through pre-clinical trials on one laryngectomy patient. The evaluation on a database of more than five hours of continuous/isolated speech recordings shows a close match between the onset/offset control of the Pneumatic Bionic Voice and the PAL with an accuracy of 98.45 ±0.54%. When implemented in real-time, the Pneumatic Bionic Voice prosthesis controller has an average onset/offset delay of 10 milliseconds compared to the PAL. Hence it addresses a major disadvantage of previous electronic voice prostheses, including myoelectric Bionic Voice, in meeting the short time-frames of controlling the onset/offset of the voice in continuous speech.

  6. Temporal and spatio-temporal vibrotactile displays for voice fundamental frequency: an initial evaluation of a new vibrotactile speech perception aid with normal-hearing and hearing-impaired individuals.

    Science.gov (United States)

    Auer, E T; Bernstein, L E; Coulter, D C

    1998-10-01

    Four experiments were performed to evaluate a new wearable vibrotactile speech perception aid that extracts fundamental frequency (F0) and displays the extracted F0 as a single-channel temporal or an eight-channel spatio-temporal stimulus. Specifically, we investigated the perception of intonation (i.e., question versus statement) and emphatic stress (i.e., stress on the first, second, or third word) under Visual-Alone (VA), Visual-Tactile (VT), and Tactile-Alone (TA) conditions and compared performance using the temporal and spatio-temporal vibrotactile display. Subjects were adults with normal hearing in experiments I-III and adults with severe to profound hearing impairments in experiment IV. Both versions of the vibrotactile speech perception aid successfully conveyed intonation. Vibrotactile stress information was successfully conveyed, but vibrotactile stress information did not enhance performance in VT conditions beyond performance in VA conditions. In experiment III, which involved only intonation identification, a reliable advantage for the spatio-temporal display was obtained. Differences between subject groups were obtained for intonation identification, with more accurate VT performance by those with normal hearing. Possible effects of long-term hearing status are discussed.

  7. Tips for Healthy Voices

    Science.gov (United States)

    ... prevent voice problems and maintain a healthy voice: Drink water (stay well hydrated): Keeping your body well hydrated by drinking plenty of water each day (6-8 glasses) is essential to maintaining a healthy voice. The ...

  8. Your Cheatin' Voice Will Tell on You: Detection of Past Infidelity from Voice.

    Science.gov (United States)

    Hughes, Susan M; Harrison, Marissa A

    2017-01-01

    Evidence suggests that many physical, behavioral, and trait qualities can be detected solely from the sound of a person's voice, irrespective of the semantic information conveyed through speech. This study examined whether raters could accurately assess the likelihood that a person has cheated on committed, romantic partners simply by hearing the speaker's voice. Independent raters heard voice samples of individuals who self-reported that they either cheated or had never cheated on their romantic partners. To control for aspects that may clue a listener to the speaker's mate value, we used voice samples that did not differ between these groups for voice attractiveness, age, voice pitch, and other acoustic measures. We found that participants indeed rated the voices of those who had a history of cheating as more likely to cheat. Male speakers were given higher ratings for cheating, while female raters were more likely to ascribe the likelihood to cheat to speakers. Additionally, we manipulated the pitch of the voice samples, and for both sexes, the lower pitched versions were consistently rated to be from those who were more likely to have cheated. Regardless of the pitch manipulation, speakers were able to assess actual history of infidelity; the one exception was that men's accuracy decreased when judging women whose voices were lowered. These findings expand upon the idea that the human voice may be of value as a cheater detection tool and very thin slices of vocal information are all that is needed to make certain assessments about others.

  9. Quality of the voice after injection of hyaluronic acid into the vocal fold.

    Science.gov (United States)

    Szkiełkowska, Agata; Miaśkiewicz, Beata; Remacle, Marc; Krasnodębska, Paulina; Skarżyński, Henryk

    2013-04-17

    Voice disorders resulting from glottic insufficiency are a significant clinical problem in everyday phoniatric practice. One method of treatment is injection laryngoplasty. Our study aimed to assess the voice quality of patients treated with hyaluronic acid injection into the vocal fold. We studied 25 patients suffering from dysphonia, conducting laryngological and phoniatric examination, including videostroboscopy and acoustic voice analysis, before the operation and 1, 3, and 6 months later. In all cases there was complete or almost complete glottic closure after the operation. One month after the procedure, videostroboscopic examination revealed reappearance of vocal fold vibration in 8 cases; after 3 months this had risen to 15 cases. Perceptual voice quality (as assessed by the GRBAS scale) in patients with glottic insufficiency was improved. The most significant improvement was obtained 1 month after surgery (p=0.0002), and within the next months further statistically significant improvements (p=0.000002) were noted. Multidimensional voice analysis showed statistically significant and rapid improvement in frequency parameters, especially vFo. Other parameters were also improved 3 and 6 months after surgery. Injection of hyaluronic acid into the vocal fold improves phonatory functions of the larynx and the quality of voice in patients with glottic insufficiency. It may be a safe and conservative method for treatment of voice disorders. Hyaluronic acid injection to the vocal fold is an easy, effective, and fast method for restoration of good voice quality.

  10. Unfamiliar voice identification: Effect of post-event information on accuracy and voice ratings

    Directory of Open Access Journals (Sweden)

    Harriet Mary Jessica Smith

    2014-04-01

    This study addressed the effect of misleading post-event information (PEI) on voice ratings, identification accuracy, and confidence, as well as the link between verbal recall and accuracy. Participants listened to a dialogue between male and female targets, then read misleading information about voice pitch. Participants engaged in verbal recall, rated voices on a feature checklist, and made a lineup decision. Accuracy rates were low, especially on target-absent lineups. Confidence and accuracy were unrelated, but the number of facts recalled about the voice predicted later lineup accuracy. There was a main effect of misinformation on ratings of target voice pitch, but there was no effect on identification accuracy or confidence ratings. As voice lineup evidence from earwitnesses is used in courts, the findings have potential applied relevance.

  11. A pneumatic Bionic Voice prosthesis-Pre-clinical trials of controlling the voice onset and offset.

    Directory of Open Access Journals (Sweden)

    Farzaneh Ahmadi

    Despite emergent progress in many fields of bionics, a functional Bionic Voice prosthesis for laryngectomy patients (larynx amputees) has not yet been achieved, leading to a lifetime of vocal disability for these patients. This study introduces a novel framework of Pneumatic Bionic Voice Prostheses as an electronic adaptation of the Pneumatic Artificial Larynx (PAL) device. The PAL is a non-invasive mechanical voice source, driven exclusively by respiration with an exceptionally high voice quality, comparable to the existing gold standard of Tracheoesophageal (TE) voice prosthesis. Following PAL design closely as the reference, Pneumatic Bionic Voice Prostheses seem to have a strong potential to substitute the existing gold standard by generating a similar voice quality while remaining non-invasive and non-surgical. This paper designs the first Pneumatic Bionic Voice prosthesis and evaluates its onset and offset control against the PAL device through pre-clinical trials on one laryngectomy patient. The evaluation on a database of more than five hours of continuous/isolated speech recordings shows a close match between the onset/offset control of the Pneumatic Bionic Voice and the PAL with an accuracy of 98.45 ±0.54%. When implemented in real-time, the Pneumatic Bionic Voice prosthesis controller has an average onset/offset delay of 10 milliseconds compared to the PAL. Hence it addresses a major disadvantage of previous electronic voice prostheses, including myoelectric Bionic Voice, in meeting the short time-frames of controlling the onset/offset of the voice in continuous speech.

  12. A pneumatic Bionic Voice prosthesis—Pre-clinical trials of controlling the voice onset and offset

    Science.gov (United States)

    Noorian, Farzad; Novakovic, Daniel; van Schaik, André

    2018-01-01

    Despite emergent progress in many fields of bionics, a functional Bionic Voice prosthesis for laryngectomy patients (larynx amputees) has not yet been achieved, leading to a lifetime of vocal disability for these patients. This study introduces a novel framework of Pneumatic Bionic Voice Prostheses as an electronic adaptation of the Pneumatic Artificial Larynx (PAL) device. The PAL is a non-invasive mechanical voice source, driven exclusively by respiration with an exceptionally high voice quality, comparable to the existing gold standard of Tracheoesophageal (TE) voice prosthesis. Following PAL design closely as the reference, Pneumatic Bionic Voice Prostheses seem to have a strong potential to substitute the existing gold standard by generating a similar voice quality while remaining non-invasive and non-surgical. This paper designs the first Pneumatic Bionic Voice prosthesis and evaluates its onset and offset control against the PAL device through pre-clinical trials on one laryngectomy patient. The evaluation on a database of more than five hours of continuous/isolated speech recordings shows a close match between the onset/offset control of the Pneumatic Bionic Voice and the PAL with an accuracy of 98.45 ±0.54%. When implemented in real-time, the Pneumatic Bionic Voice prosthesis controller has an average onset/offset delay of 10 milliseconds compared to the PAL. Hence it addresses a major disadvantage of previous electronic voice prostheses, including myoelectric Bionic Voice, in meeting the short time-frames of controlling the onset/offset of the voice in continuous speech. PMID:29466455

  13. Correlation between acoustic parameters and Voice Handicap Index in dysphonic teachers.

    Science.gov (United States)

    Niebudek-Bogusz, E; Woznicka, E; Zamyslowska-Szmytke, E; Sliwinska-Kowalska, M

    2010-01-01

    The aim of this study was to investigate the relationship between acoustic analysis and biopsychosocial implications of voice problems, evaluated by the Voice Handicap Index (VHI). The study comprised 120 female teachers with voice disorders, evaluated by videolaryngostroboscopy. 60.8% of this group were diagnosed as having functional dysphonia and 39.2% had dysphonia with benign vocal fold masses (nodules and polyps). The controls consisted of 30 euphonic women. The correlations between VHI and acoustic analysis were assessed in both groups using the Pearson correlation coefficient and regression analysis. In teachers, the total VHI score was over 5 times as high as in controls (p […] teachers, significant positive correlations were found between the total VHI score and the frequency perturbation parameters and amplitude perturbation parameters when both statistical methods were used. These acoustic parameters also significantly correlated with the score on the functional and emotional subscales, but rarely with the physical subscale of the VHI. The study revealed a significant relationship between the objective voice measurements and the VHI. The results confirmed that VHI can be a valuable tool for assessing biopsychosocial implications of occupational dysphonia and should be incorporated in multidimensional voice evaluation. (c) 2010 S. Karger AG, Basel.

  14. A Novel Voice Sensor for the Detection of Speech Signals

    Directory of Open Access Journals (Sweden)

    Kun-Ching Wang

    2013-12-01

    Developing a novel voice sensor to detect human voices requires features that are robust to noise. A voice sensor is also called voice activity detection (VAD). Because formant structure appears only in the speech spectrogram (well known as the voiceprint), Wu et al. were the first to use band-spectral entropy (BSE) to describe the characteristics of voiceprints. However, the performance of VAD based on the BSE feature was degraded in colored noise (or voiceprint-like noise) environments. To solve this problem, we propose the two-dimensional part-band energy entropy (TD-PBEE) parameter, based on two variables, the part-band partition number along the frequency index and the long-term window size along the time index, to further improve the BSE-based VAD algorithm. The two variables can efficiently represent the characteristics of voiceprints on each critical frequency band and use long-term information for noisy speech spectrograms, respectively. The TD-PBEE parameter can be regarded as a PBEE parameter over time. First, the strength of voiceprints can be partly enhanced by using four entropies applied to four part-bands. We can use the four part-band energy entropies to describe the voiceprints in detail. Because speech and various noises are non-stationary, we then use long-term information processing to refine the PBEE, so that voice-like noise can be distinguished from noisy speech through the concept of PBEE with long-term information. Our experiments show that the proposed feature extraction with the TD-PBEE parameter is quite insensitive to background noise. The proposed TD-PBEE-based VAD algorithm is evaluated for four types of noise and five signal-to-noise ratio (SNR) levels. We find that the accuracy of the proposed TD-PBEE-based VAD algorithm, averaged over all noises and all SNR levels, is better than that of the other VAD algorithms considered.
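
    As a rough illustration of the entropy feature this abstract builds on, the sketch below computes a simple band energy entropy for one frame. It is a simplified stand-in, not the TD-PBEE parameter or the authors' algorithm: the equal-width band split, the naive DFT, and all function names are assumptions made for the example. Energy concentrated in one band gives an entropy near zero, while a flat spectrum gives an entropy near the logarithm of the number of bands.

```python
import cmath
import math
from typing import List, Sequence


def power_spectrum(frame: Sequence[float]) -> List[float]:
    """Power of the non-negative-frequency DFT bins (naive O(n^2) DFT)."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(frame))
        spectrum.append(abs(coeff) ** 2)
    return spectrum


def band_energies(power: Sequence[float], num_bands: int) -> List[float]:
    """Sum the power over contiguous bands of roughly equal width."""
    size = len(power)
    edges = [round(i * size / num_bands) for i in range(num_bands + 1)]
    return [sum(power[edges[i]:edges[i + 1]]) for i in range(num_bands)]


def band_energy_entropy(frame: Sequence[float], num_bands: int = 4) -> float:
    """Shannon entropy (in nats) of the normalized band energies."""
    energies = band_energies(power_spectrum(frame), num_bands)
    total = sum(energies)
    if total <= 0.0:
        return 0.0  # silent frame: define the entropy as zero
    probabilities = [energy / total for energy in energies]
    return -sum(p * math.log(p) for p in probabilities if p > 0.0)


if __name__ == "__main__":
    tone = [1.0, -1.0] * 32        # all energy in the highest bin
    impulse = [1.0] + [0.0] * 63   # flat spectrum across all bins
    print(band_energy_entropy(tone))
    print(band_energy_entropy(impulse))
```

    A detector along these lines would threshold the entropy frame by frame. The long-term window described in the abstract would correspond to smoothing or pooling such values over neighbouring frames, which this sketch does not attempt.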

  15. Effect of MP4 Therapy Videos on Adherence to Voice Therapy Home Practice in Children With Dysphonia.

    Science.gov (United States)

    Braden, Maia N; van Leer, Eva

    2017-01-01

    Voice disorders in children are often treated with behavioral voice therapy, which requires home practice of exercises. Previous studies with adults demonstrated increased practice frequency when patients were given videos of a clinician and patient performing therapy tasks. The purpose of this study was to determine whether videos of practice exercises would increase adherence to therapy in children. The study used a randomized double crossover research design. Twenty-eight patients, aged 6-18, referred for voice therapy were included in the study. Two conditions were alternated on a weekly basis: standard-of-care therapy and standard-of-care therapy with video models added. Participants recorded practice frequency and participated in semi-structured interviews, which were analyzed for themes. Participants practiced an average of 1.79 times per day without videos and 1.72 with videos (P = 0.743), indicating no significant difference between conditions. There was also no age group effect (P = 0.314). Qualitative analysis of interview responses established the following themes: (1) I knew how to do my exercises, (2) I didn't like seeing/hearing myself, (3) Videos helped me remember to practice, (4) I didn't like the video player itself, (5) The videos didn't make a difference with practice, and (6) Practicing was no fun. Video models of therapy tasks do not appear to influence adherence to home practice frequency in children with voice disorders, in contrast to findings in adults. Videos were found useful by several participants as reminders to practice. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  16. Prospective clinical study on long-term swallowing function and voice quality in advanced head and neck cancer patients treated with concurrent chemoradiotherapy and preventive swallowing exercises.

    Science.gov (United States)

    Kraaijenga, Sophie A C; van der Molen, Lisette; Jacobi, Irene; Hamming-Vrieze, Olga; Hilgers, Frans J M; van den Brekel, Michiel W M

    2015-11-01

    Concurrent chemoradiotherapy (CCRT) for advanced head and neck cancer (HNC) is associated with substantial early and late side effects, most notably regarding swallowing function, but also regarding voice quality and quality of life (QoL). Despite increased awareness/knowledge on acute dysphagia in HNC survivors, long-term (i.e., beyond 5 years) prospectively collected data on objective and subjective treatment-induced functional outcomes (and their impact on QoL) still are scarce. The objective of this study was the assessment of long-term CCRT-induced results on swallowing function and voice quality in advanced HNC patients. The study was conducted as a randomized controlled trial on preventive swallowing rehabilitation (2006-2008) in a tertiary comprehensive HNC center with twenty-two disease-free and evaluable HNC patients as participants. Multidimensional assessment of functional sequels was performed with videofluoroscopy, mouth opening measurements, Functional Oral Intake Scale, acoustic voice parameters, and (study specific, SWAL-QoL, and VHI) questionnaires. Outcome measures at 6 years post-treatment were compared with results at baseline and at 2 years post-treatment. At a mean follow-up of 6.1 years most initial tumor-, and treatment-related problems remained similarly low to those observed after 2 years follow-up, except increased xerostomia (68%) and increased (mild) pain (32%). Acoustic voice analysis showed less voicedness, increased fundamental frequency, and more vocal effort for the tumors located below the hyoid bone (n = 12), without recovery to baseline values. Patients' subjective vocal function (VHI score) was good. Functional swallowing and voice problems at 6 years post-treatment are minimal in this patient cohort, originating from preventive and continued post-treatment rehabilitation programs.

  17. Fundamentals of EMS, NMS and OSS/BSS

    CERN Document Server

    Sathyan, Jithesh

    2011-01-01

    In this era where data and voice services are available at a push of a button, service providers have virtually limitless options for reaching their customers with value-added services. The changes in services and underlying networks that this always-on culture creates make it essential for service providers to understand the evolving business logic and appropriate support systems for service delivery, billing, and revenue assurance. Supplying an end-to-end understanding of telecom management layers, Fundamentals of EMS, NMS and OSS/BSS is a complete guide to telecom resource and service manag

  18. Mediatization: a concept, multiple voices

    Directory of Open Access Journals (Sweden)

    Pedro Gilberto GOMES

    2016-12-01

    Mediatization has increasingly become a key concept, fundamental to describing the present and the history of media and the communicative change taking place. It has thus become part of a whole; it cannot be seen as a separate sphere. In this perspective, mediatization is used as a concept to describe the expansion of the different technical means and to consider the interrelationships between communicative change, means, and sociocultural change. However, although many researchers use the concept of mediatization, each gives it the meaning that best suits their needs. Thus, the concept of mediatization is treated with multiple voices. This paper discusses this problem and presents a preliminary position on the matter.

  19. The Role of Occupational Voice Demand and Patient-Rated Impairment in Predicting Voice Therapy Adherence.

    Science.gov (United States)

    Ebersole, Barbara; Soni, Resha S; Moran, Kathleen; Lango, Miriam; Devarajan, Karthik; Jamal, Nausheen

    2018-05-01

    Examine the relationship among the severity of patient-perceived voice impairment, perceptual dysphonia severity, occupational voice demand, and voice therapy adherence. Identify clinical predictors of increased risk for therapy nonadherence. A retrospective cohort study of patients presenting with a chief complaint of persistent dysphonia at an interdisciplinary voice center was done. The Voice Handicap Index-10 (VHI-10) and the Voice-Related Quality of Life (V-RQOL) survey scores, clinician rating of dysphonia severity using the Grade score from the Grade, Roughness, Breathiness, Asthenia, and Strain scale, occupational voice demand, and patient demographics were tested for associations with therapy adherence, defined as completion of the treatment plan. Classification and Regression Tree (CART) analysis was performed to establish thresholds for nonadherence risk. Of 166 patients evaluated, 111 were recommended for voice therapy. The therapy nonadherence rate was 56%. Occupational voice demand category, VHI-10, and V-RQOL scores were the only factors significantly correlated with therapy adherence (P […] demand are significantly more likely to be nonadherent with therapy than those with high occupational voice demand (P […] 40 is a significant cutoff point for predicting therapy nonadherence (P […] demand and patient perception of impairment are significantly and independently correlated with therapy adherence. A VHI-10 score of ≤9 or a V-RQOL score of >40 is a significant cutoff point for predicting nonadherence risk. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  20. Low frequency mechanical resonance of the vocal tract in vocal exercises that apply tubes

    Czech Academy of Sciences Publication Activity Database

    Horáček, Jaromír; Radolf, Vojtěch; Laukkanen, A. M.

    2017-01-01

    Vol. 37, August (2017), pp. 39-49. ISSN 1746-8094. R&D Projects: GA ČR(CZ) GA16-01246S. Institutional support: RVO:61388998. Keywords: biomechanics of voice; vocal tract acoustics; phonation into tubes; water resistance voice therapy; bubbling frequency; formant frequencies. Subject RIV: BI - Acoustics. OECD field: Acoustics. Impact factor: 2.214 (2016).

  1. Construction site Voice Operated Information System (VOIS) test

    Science.gov (United States)

    Lawrence, Debbie J.; Hettchen, William

    1991-01-01

    The Voice Activated Information System (VAIS), developed by USACERL, allows inspectors to verbally log on-site inspection reports on a hand-held tape recorder. The tape is later processed by the VAIS, which enters the information into the system's database and produces a written report. The Voice Operated Information System (VOIS), developed by USACERL and Automated Sciences Group through a USACERL cooperative research and development agreement (CRDA), is an improved voice recognition system based on the concepts and function of the VAIS. To determine the applicability of the VOIS to Corps of Engineers construction projects, Technology Transfer Test Bed (T3B) funds were provided to the Corps of Engineers National Security Agency (NSA) Area Office (Fort Meade) to procure and implement the VOIS, and to train personnel in its use. This report summarizes the NSA application of the VOIS to quality assurance inspection of radio frequency shielding and to progress payment logs, and concludes that the VOIS is an easily implemented system that can offer improvements when applied to repetitive inspection procedures. Use of VOIS can save time during inspection, improve documentation storage, and provide flexible retrieval of stored information.

  2. Comparison of voice quality in patients with GERD-related dysphonia or chronic cough.

    Science.gov (United States)

    Domeracka-Kołodziej, Anna; Grabczak, Elżbieta M; Dąbrowska, Marta; Arcimowicz, Magdalena; Lachowska, Magdalena; Osuch-Wójcikiewicz, Ewa; Niemczyk, Kazimierz

    2014-01-01

    The aim was to compare voice quality in patients with GERD-related dysphonia or chronic cough and to determine whether there is a relationship between the main symptom reported and voice quality. 249 consecutive patients diagnosed with GERD-related chronic cough or dysphonia were involved in this retrospective study and were divided into two main groups of men and women, and furthermore into groups of chronic cough and dysphonia. Laryngeal lesions were evaluated with videolaryngostroboscopy using the Reflux Finding Score. Voice quality was assessed using the GRBAS scale, sonograms, and the Multi-Dimensional Voice Program (MDVP). All subjects were found to have vocal abnormalities in both subjective and objective voice analysis. Perceptual assessment of voice (GRBAS) did not reveal any differences between analyzed groups depending on the reported symptom. In MDVP analysis, the group of women with cough as the main symptom demonstrated significantly fewer abnormalities in VTI value. In men with cough as their main complaint, significantly fewer MDVP abnormalities were found in the Jita, Jitt, RAP, PPQ, and sPPQ parameters. The comparison of voice perceptual assessment in patients with GERD-related dysphonia or chronic cough revealed no differences between analyzed groups. In objective voice analysis, the latter group presented a lower degree of hoarseness on Yanagihara's scale. In objective MDVP analysis, the chronic cough group presented a lower degree of abnormalities only in one of the noise-related parameters in females and five frequency perturbation parameters in males. Copyright © 2013 Polish Otorhinolaryngology - Head and Neck Surgery Society. Published by Elsevier Urban & Partner Sp. z.o.o. All rights reserved.

  3. Improvement of electrolaryngeal speech quality using a supraglottal voice source with compensation of vocal tract characteristics.

    Science.gov (United States)

    Wu, Liang; Wan, Congying; Wang, Supin; Wan, Mingxi

    2013-07-01

    Electrolarynx (EL) is a medical speech-recovery device designed for patients who have lost their original voice box due to laryngeal cancer. As a substitute for the human larynx, the current commercial EL voice source cannot reconstruct natural EL speech under laryngectomy conditions. To eliminate the abnormal acoustic properties of EL speech, a supraglottal voice source with compensation of vocal tract characteristics was proposed and provided through an experimental EL (SGVS-EL) system. The acoustic analyses of simulated EL speech and reconstructed EL speech produced with different voice sources were performed in the normal subject and laryngectomee. The results indicated that the supraglottal voice source was successful in improving the acoustic properties of EL speech by enhancing low-frequency energy, correcting the shifted formants to the normal range, and eliminating the visible spectral zeros. Both the normal subject and the laryngectomee also produced more natural vowels using SGVS-EL than the commercial EL, even if the vocal tract parameter was substituted and the supraglottal voice source was biased to a certain degree. Therefore, a supraglottal voice source is a feasible and effective approach to improving the acoustic quality of EL speech.

  4. Pedagogic Voice: Student Voice in Teaching and Engagement Pedagogies

    Science.gov (United States)

    Baroutsis, Aspa; McGregor, Glenda; Mills, Martin

    2016-01-01

    In this paper, we are concerned with the notion of "pedagogic voice" as it relates to the presence of student "voice" in teaching, learning and curriculum matters at an alternative, or second chance, school in Australia. This school draws upon many of the principles of democratic schooling via its utilisation of student voice…

  5. The role of spectral and temporal cues in voice gender discrimination by normal-hearing listeners and cochlear implant users.

    Science.gov (United States)

    Fu, Qian-Jie; Chinchilla, Sherol; Galvin, John J

    2004-09-01

    The present study investigated the relative importance of temporal and spectral cues in voice gender discrimination and vowel recognition by normal-hearing subjects listening to an acoustic simulation of cochlear implant speech processing and by cochlear implant users. In the simulation, the number of speech processing channels ranged from 4 to 32, thereby varying the spectral resolution; the cutoff frequencies of the channels' envelope filters ranged from 20 to 320 Hz, thereby manipulating the available temporal cues. For normal-hearing subjects, results showed that both voice gender discrimination and vowel recognition scores improved as the number of spectral channels was increased. When only 4 spectral channels were available, voice gender discrimination significantly improved as the envelope filter cutoff frequency was increased from 20 to 320 Hz. For all spectral conditions, increasing the amount of temporal information had no significant effect on vowel recognition. Both voice gender discrimination and vowel recognition scores were highly variable among implant users. The performance of cochlear implant listeners was similar to that of normal-hearing subjects listening to comparable speech processing (4-8 spectral channels). The results suggest that both spectral and temporal cues contribute to voice gender discrimination and that temporal cues are especially important for cochlear implant users to identify the voice gender when there is reduced spectral resolution.
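
    The effect of the envelope filter cutoff described in this abstract can be sketched numerically. The code below is an illustration under stated assumptions, not the study's processing chain: the one-pole low-pass filter, the 2000 Hz carrier, the 100 Hz modulation standing in for voice periodicity, and all names are hypothetical choices. It shows that a 20 Hz cutoff smooths away most of a 100 Hz periodicity in a channel envelope, while a 320 Hz cutoff preserves it.

```python
import math
from typing import List, Sequence


def extract_envelope(signal: Sequence[float], cutoff_hz: float,
                     sample_rate_hz: float) -> List[float]:
    """Full-wave rectify, then smooth with a one-pole low-pass filter."""
    alpha = 1.0 - math.exp(-2.0 * math.pi * cutoff_hz / sample_rate_hz)
    state = 0.0
    envelope = []
    for sample in signal:
        state += alpha * (abs(sample) - state)
        envelope.append(state)
    return envelope


def modulation_depth(envelope: Sequence[float]) -> float:
    """(max - min) / (max + min) over the second half, after settling."""
    settled = envelope[len(envelope) // 2:]
    high, low = max(settled), min(settled)
    return (high - low) / (high + low) if high + low > 0.0 else 0.0


def am_tone(carrier_hz: float, modulation_hz: float,
            sample_rate_hz: int, duration_s: float) -> List[float]:
    """Sine carrier with 100% sinusoidal amplitude modulation."""
    count = int(sample_rate_hz * duration_s)
    return [
        0.5 * (1.0 + math.cos(2.0 * math.pi * modulation_hz * n / sample_rate_hz))
        * math.sin(2.0 * math.pi * carrier_hz * n / sample_rate_hz)
        for n in range(count)
    ]


SAMPLE_RATE_HZ = 16000
TEST_SIGNAL = am_tone(2000.0, 100.0, SAMPLE_RATE_HZ, 0.5)
DEPTH_20 = modulation_depth(extract_envelope(TEST_SIGNAL, 20.0, SAMPLE_RATE_HZ))
DEPTH_320 = modulation_depth(extract_envelope(TEST_SIGNAL, 320.0, SAMPLE_RATE_HZ))

if __name__ == "__main__":
    print(f"modulation depth with 20 Hz cutoff:  {DEPTH_20:.2f}")
    print(f"modulation depth with 320 Hz cutoff: {DEPTH_320:.2f}")
```

    Typical voice fundamental frequencies lie above 20 Hz and below 320 Hz, so the higher cutoff keeps the periodicity cue in each channel envelope. That is consistent with the reported gain in voice gender discrimination when only 4 spectral channels were available.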

  6. Voice Savers for Music Teachers

    Science.gov (United States)

    Cookman, Starr

    2012-01-01

    Music teachers are in a class all their own when it comes to voice use. These elite vocal athletes require stamina, strength, and flexibility from their voices day in, day out for hours at a time. Voice rehabilitation clinics and research show that music education ranks high among the professionals most commonly affected by voice problems.…

  7. Effects of flow gradients on directional radiation of human voice.

    Science.gov (United States)

    Pulkki, Ville; Lähivaara, Timo; Huhtakallio, Ilkka

    2018-02-01

    In voice communication in windy outdoor conditions, complex velocity gradients appear in the flow field around the source, the receiver, and also in the atmosphere. It is commonly known that voice emanates more strongly in the downstream direction than in the upstream direction. In the literature, atmospheric effects are used to explain the stronger emanation in the downstream direction. This work shows that the wind also affects the directivity of the voice, likewise favouring the downstream direction. The effect is addressed by measurements and simulations. Laboratory measurements are conducted by using a large pendulum with a loudspeaker mimicking the human head, whereas practical measurements utilizing the human voice are realized by placing a subject through the roof window of a moving car. The measurements and a simulation indicate congruent results in the speech frequency range: when the source faces the downstream direction, stronger radiation coinciding with the wind direction is observed, and when it faces the upstream direction, radiation is not notably affected. The simulated flow gradients show a wake region in the downstream direction, and the simulated acoustic field in the flow shows that this region causes a wave-guide effect focusing the sound in that direction.

  8. You're a What? Voice Actor

    Science.gov (United States)

    Liming, Drew

    2009-01-01

    This article talks about voice actors and features Tony Oliver, a professional voice actor. Voice actors help to bring one's favorite cartoon and video game characters to life. They also do voice-overs for radio and television commercials and movie trailers. These actors use the sound of their voice to sell a character's emotions--or an advertised…

  9. Voice - How humans communicate?

    Science.gov (United States)

    Tiwari, Manjul; Tiwari, Maneesha

    2012-01-01

    Voices are important things for humans. They are the medium through which we do a lot of communicating with the outside world: our ideas, of course, and also our emotions and our personality. The voice is the very emblem of the speaker, indelibly woven into the fabric of speech. In this sense, each of our utterances of spoken language carries not only its own message but is also, through accent, tone of voice and habitual voice quality, an audible declaration of our membership of particular social and regional groups, of our individual physical and psychological identity, and of our momentary mood. Voices are also one of the media through which we (successfully, most of the time) recognize other humans who are important to us: members of our family, media personalities, our friends, and enemies. Although evidence from DNA analysis is potentially vastly more eloquent in its power than evidence from voices, DNA cannot talk. It cannot be recorded planning, carrying out or confessing to a crime. It cannot be so apparently directly incriminating. As will quickly become evident, voices are extremely complex things, and some of the inherent limitations of the forensic-phonetic method are in part a consequence of the interaction between their complexity and the real world in which they are used. It is one of the aims of this article to explain how this comes about. This subject has unsolved questions, but there is no direct way to present the information that is necessary to understand how voices can be related, or not, to their owners.

  10. Speaker's voice as a memory cue.

    Science.gov (United States)

    Campeanu, Sandra; Craik, Fergus I M; Alain, Claude

    2015-02-01

    Speaker's voice occupies a central role as the cornerstone of auditory social interaction. Here, we review the evidence suggesting that speaker's voice constitutes an integral context cue in auditory memory. Investigation into the nature of voice representation as a memory cue is essential to understanding auditory memory and the neural correlates which underlie it. Evidence from behavioral and electrophysiological studies suggests that while specific voice reinstatement (i.e., same speaker) often appears to facilitate word memory even without attention to voice at study, the presence of a partial benefit of similar voices between study and test is less clear. In terms of explicit memory experiments utilizing unfamiliar voices, encoding methods appear to play a pivotal role. Voice congruency effects have been found when voice is specifically attended at study (i.e., when relatively shallow, perceptual encoding takes place). These behavioral findings coincide with neural indices of memory performance such as the parietal old/new recollection effect and the late right frontal effect. The former distinguishes between correctly identified old words and correctly identified new words, and reflects voice congruency only when voice is attended at study. Characterization of the latter likely depends upon voice memory, rather than word memory. There is also evidence to suggest that voice effects can be found in implicit memory paradigms. However, the presence of voice effects appears to depend greatly on the task employed. Using a word identification task, perceptual similarity between study and test conditions is, as for explicit memory tests, crucial. In addition, the type of noise employed appears to have a differential effect. While voice effects have been observed when white noise is used at both study and test, using multi-talker babble does not yield the same results. In terms of neuroimaging research modulations, characterization of an implicit memory effect

  11. Site-Specific Soundscape Design for the Creation of Sonic Architectures and the Emergent Voices of Buildings

    Directory of Open Access Journals (Sweden)

    Jordan Lacey

    2014-01-01

    Does a building contain its own Voice? And if so, can that Voice be discovered, transformed and augmented by soundscape design? Barry Blesser’s writings on acoustic space discuss reverberation and resonant frequencies as providing architectural spaces with characteristic listening conditions related to the architectural space’s dimensions and materiality. The paper argues that Blesser and Salter expand such discussion into pantheistic speculation when suggesting that humanity contains the imaginative capacity to experience spaces as “living spirits”. The argument builds on this speculation by discussing a soundscape design methodology that considers space as containing pantheistic qualities. Sonic architectures are created with electroacoustic sound installations that recompose existing architectural soundscapes, to create the conditions for the emergence of the Voices of buildings. This paper describes two soundscape designs, Revoicing the Striated Soundscape and Subterranean Voices, which transformed existing architectural soundscapes for the emergence of Voices in a laneway and a building located in the City of Melbourne, Australia.

  12. Integrating cues of social interest and voice pitch in men's preferences for women's voices

    OpenAIRE

    Jones, Benedict C; Feinberg, David R; DeBruine, Lisa M; Little, Anthony C; Vukovic, Jovana

    2008-01-01

    Most previous studies of vocal attractiveness have focused on preferences for physical characteristics of voices such as pitch. Here we examine the content of vocalizations in interaction with such physical traits, finding that vocal cues of social interest modulate the strength of men's preferences for raised pitch in women's voices. Men showed stronger preferences for raised pitch when judging the voices of women who appeared interested in the listener than when judging the voices of women ...

  13. Is the speaking fundamental frequency in females related to body height?

    Science.gov (United States)

    Barsties, Ben; Verfaillie, Rudi; Dicks, Peter; Maryn, Youri

    2016-01-01

    The aim of the study was to determine the impact of body height on speaking fundamental frequency (SF0) while controlling for as many influencing factors as possible, such as habits, biophysical conditions, medication, diseases, and others. Fifty-eight females were analyzed during spontaneous speech (i.e. explaining driving directions or a cooking recipe) of at least 60 seconds at comfortable pitch and loudness. The subjects showed a moderate negative and significant correlation between body height and SF0 (r = -0.40, P = 0.002). With r² = 0.16, however, a reasonable portion (16%) of the variance in SF0 is explained by the variance in body height. In comparison with other factors for which a correlation with SF0 was mentioned in the literature (hypothyroidism, hemodialysis, auditory-maleness after female-to-male transsexualism, body weight, body mass index, and body fat), body height accounted for the largest proportion of the variance in SF0 in females. It is therefore possible to validate body height as a factor to account for in clinical F0 measurement.
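
    As a quick check of the figures quoted in this abstract, and assuming the reported value is the coefficient of determination of a simple linear correlation, the explained share of variance is the square of the correlation coefficient:

\[
r^2 = (-0.40)^2 = 0.16, \qquad 1 - r^2 = 0.84
\]

    On that reading, body height accounts for about 16% of the variance in SF0 in this sample, and about 84% remains attributable to other factors or to measurement noise.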

  14. Large angle and high linearity two-dimensional laser scanner based on voice coil actuators

    Science.gov (United States)

    Wu, Xin; Chen, Sihai; Chen, Wei; Yang, Minghui; Fu, Wen

    2011-10-01

    A large-angle, high-linearity two-dimensional laser scanner with an in-house deflection angle detecting system is developed, based on a direct driving mechanism using voice coil actuators. The specially designed voice coil actuators allow the steering mirror to move through a sufficiently large angle. A frequency sweep method based on virtual instruments is employed to obtain the natural frequency of the laser scanner. The response shows that the performance of the laser scanner is limited by the mechanical resonances. A closed-loop controller based on a mathematical model is used to reduce the oscillation of the laser scanner at the resonance frequency. To design a qualified controller, a model of the laser scanner is set up. The transfer function of the model is identified with MATLAB from the test data. After introducing the controller, the nonlinearity decreases from 13.75% to 2.67% at 50 Hz. The laser scanner also has other advantages such as a large deflection mirror, a small mechanical structure, and a high scanning speed.

  15. Foetal response to music and voice.

    Science.gov (United States)

    Al-Qahtani, Noura H

    2005-10-01

    To examine whether prenatal exposure to music and voice alters foetal behaviour and whether foetal response to music differs from human voice. A prospective observational study was conducted in 20 normal term pregnant mothers. Ten foetuses were exposed to music and voice for 15 s at different sound pressure levels to find out the optimal setting for the auditory stimulation. Music, voice and sham were played to another 10 foetuses via a headphone on the maternal abdomen. The sound pressure level was 105 dB and 94 dB for music and voice, respectively. Computerised assessments of foetal heart rate and activity were recorded. 90 actocardiograms were obtained for the whole group. One-way ANOVA followed by post hoc (Student-Newman-Keuls method) analysis was used to test for a significant difference in foetal response to music and voice versus sham. Foetuses responded with heart rate acceleration and motor response to both music and voice. This was statistically significant compared to sham. There was no significant difference between the foetal heart rate acceleration to music and voice. Prenatal exposure to music and voice alters the foetal behaviour. No difference was detected in foetal response to music and voice.

  16. Influence of classroom acoustics on the voice levels of teachers with and without voice problems: a field study

    DEFF Research Database (Denmark)

    Pelegrin Garcia, David; Lyberg-Åhlander, Viveka; Rydell, Roland

    2010-01-01

    of the classroom. The results thus suggest that teachers with voice problems are more aware of classroom acoustic conditions than their healthy colleagues and make use of the more supportive rooms to lower their voice levels. This behavior may result from an adaptation process of the teachers with voice problems...... of the voice problems was made with a questionnaire and a laryngological examination. During teaching, the sound pressure level at the teacher’s position was monitored. The teacher’s voice level and the activity noise level were separated using mixed Gaussians. In addition, objective acoustic parameters...... of Reverberation Time and Voice Support were measured in the 30 empty classrooms of the study. An empirical model shows that the measured voice levels depended on the activity noise levels and the voice support. Teachers with and without voice problems were differently affected by the voice support...

  17. Glottal volume velocity waveform characteristics in subjects with and without vocal training, related to gender, sound intensity, fundamental frequency, and age

    NARCIS (Netherlands)

    Sulter, AM; Wit, HP

    Glottal volume velocity waveform characteristics of 224 subjects, categorized in four groups according to gender and vocal training, were determined, and their relations to sound-pressure level, fundamental frequency, intra-oral pressure, and age were analyzed. Subjects phonated at three intensity

  18. Glottal volume velocity waveform characteristics in subjects with and without vocal training, related to gender, sound intensity, fundamental frequency, and age

    NARCIS (Netherlands)

    Sulter, AM; Wit, HP

    1996-01-01

    Glottal volume velocity waveform characteristics of 224 subjects, categorized in four groups according to gender and vocal training, were determined, and their relations to sound-pressure level, fundamental frequency, intra-oral pressure, and age were analyzed. Subjects phonated at three intensity

  19. Voice Range Profiles of Singing Students: The Effects of Training Duration and Institution.

    Science.gov (United States)

    Lycke, Hugo; Siupsinskiene, Nora

    2016-01-01

    The aim of the study was to assess differences in voice parameters measured by the physiological voice range profile (VRP) in groups of vocally healthy subjects differentiated by the duration of vocal training and the training institution. Six basic frequency- and intensity-related VRP parameters and the frequency dip of the register transition zone were determined from VRP recordings of 162 females studying in individual singing lessons (1st-5th level) in Dutch, Belgian, English, and French public or private training facilities. Sixty-seven nonsinging female students served as controls. Singing students in more advanced singing classes demonstrated a significantly greater frequency range, particularly at high frequencies, than did first-year students. Students with private training showed a significantly increased mean intensity range in comparison to those in group classes, while students with musical theater training exhibited significantly increased frequency- and intensity-related VRP parameters in comparison to the students with classical training. When compared to nonsingers, all singing student subgroups showed significant increases in all basic VRP parameters. However, the register transition parameter was not influenced by training duration or institution. Our study suggests that the extension of physiological vocal limits might depend on training duration and institution. © 2016 S. Karger AG, Basel.

  20. Understanding the 'Anorexic Voice' in Anorexia Nervosa.

    Science.gov (United States)

    Pugh, Matthew; Waller, Glenn

    2017-05-01

    In common with individuals experiencing a number of disorders, people with anorexia nervosa report experiencing an internal 'voice'. The anorexic voice comments on the individual's eating, weight and shape and instructs the individual to restrict or compensate. However, the core characteristics of the anorexic voice are not known. This study aimed to develop a parsimonious model of the voice characteristics that are related to key features of eating disorder pathology and to determine whether patients with anorexia nervosa fall into groups with different voice experiences. The participants were 49 women with full diagnoses of anorexia nervosa. Each completed validated measures of the power and nature of their voice experience and of their responses to the voice. Different voice characteristics were associated with current body mass index, duration of disorder and eating cognitions. Two subgroups emerged, with 'weaker' and 'stronger' voice experiences. Those with stronger voices were characterized by having more negative eating attitudes, more severe compensatory behaviours, a longer duration of illness and a greater likelihood of having the binge-purge subtype of anorexia nervosa. The findings indicate that the anorexic voice is an important element of the psychopathology of anorexia nervosa. Addressing the anorexic voice might be helpful in enhancing outcomes of treatments for anorexia nervosa, but that conclusion might apply only to patients with more severe eating psychopathology. Copyright © 2016 John Wiley & Sons, Ltd. Experiences of an internal 'anorexic voice' are common in anorexia nervosa. Clinicians should consider the role of the voice when formulating eating pathology in anorexia nervosa, including how individuals perceive and relate to that voice. Addressing the voice may be beneficial, particularly in more severe and enduring forms of anorexia nervosa. 
    When working with the voice, clinicians should aim to address both the content of the voice and how

  1. The voice conveys specific emotions: evidence from vocal burst displays.

    Science.gov (United States)

    Simon-Thomas, Emiliana R; Keltner, Dacher J; Sauter, Disa; Sinicropi-Yao, Lara; Abramson, Anna

    2009-12-01

    Studies of emotion signaling inform claims about the taxonomic structure, evolutionary origins, and physiological correlates of emotions. Emotion vocalization research has tended to focus on a limited set of emotions: anger, disgust, fear, sadness, surprise, happiness, and for the voice, also tenderness. Here, we examine how well brief vocal bursts can communicate 22 different emotions: 9 negative (Study 1) and 13 positive (Study 2), and whether prototypical vocal bursts convey emotions more reliably than heterogeneous vocal bursts (Study 3). Results show that vocal bursts communicate emotions like anger, fear, and sadness, as well as seldom-studied states like awe, compassion, interest, and embarrassment. Ancillary analyses reveal family-wise patterns of vocal burst expression. Errors in classification were more common within emotion families (e.g., 'self-conscious,' 'pro-social') than between emotion families. The three studies reported highlight the voice as a rich modality for emotion display that can inform fundamental constructs about emotion.

  2. Voice application development for Android

    CERN Document Server

    McTear, Michael

    2013-01-01

    This book will give beginners an introduction to building voice-based applications on Android. It will begin by covering the basic concepts and will build up to creating a voice-based personal assistant. By the end of this book, you should be in a position to create your own voice-based applications on Android from scratch in next to no time. Voice Application Development for Android is for all those who are interested in speech technology and for those who, as owners of Android devices, are keen to experiment with developing voice apps for their devices. It will also be useful as a starting po

  3. DolphinAttack: Inaudible Voice Commands

    OpenAIRE

    Zhang, Guoming; Yan, Chen; Ji, Xiaoyu; Zhang, Taimin; Zhang, Tianchen; Xu, Wenyuan

    2017-01-01

    Speech recognition (SR) systems such as Siri or Google Now have become an increasingly popular human-computer interaction method, and have turned various systems into voice controllable systems (VCS). Prior work on attacking VCS shows that hidden voice commands that are incomprehensible to people can control the systems. Hidden voice commands, though hidden, are nonetheless audible. In this work, we design a completely inaudible attack, DolphinAttack, that modulates voice commands on ultra...

  4. Voice-to-Phoneme Conversion Algorithms for Voice-Tag Applications in Embedded Platforms

    Directory of Open Access Journals (Sweden)

    Yan Ming Cheng

    2008-08-01

    We describe two voice-to-phoneme conversion algorithms for speaker-independent voice-tag creation specifically targeted at applications on embedded platforms. These algorithms (batch mode and sequential) are compared in speech recognition experiments where they are first applied in a same-language context in which both acoustic model training and voice-tag creation and application are performed on the same language. Then, their performance is tested in a cross-language setting where the acoustic models are trained on a particular source language while the voice-tags are created and applied on a different target language. In the same-language environment, both algorithms either perform comparably to or significantly better than the baseline where utterances are manually transcribed by a phonetician. In the cross-language context, the voice-tag performances vary depending on the source-target language pair, with the variation reflecting predicted phonological similarity between the source and target languages. Among the most similar languages, performance nears that of the native-trained models and surpasses the native reference baseline.

  5. Risk factors for voice problems in teachers.

    NARCIS (Netherlands)

    Kooijman, P.G.C.; Jong, F.I.C.R.S. de; Thomas, G.; Huinck, W.J.; Donders, A.R.T.; Graamans, K.; Schutte, H.K.

    2006-01-01

    In order to identify factors that are associated with voice problems and voice-related absenteeism in teachers, 1,878 questionnaires were analysed. The questionnaires inquired about personal data, voice complaints, voice-related absenteeism from work and conditions that may lead to voice complaints

  6. Risk factors for voice problems in teachers

    NARCIS (Netherlands)

    Kooijman, P. G. C.; de Jong, F. I. C. R. S.; Thomas, G.; Huinck, W.; Donders, R.; Graamans, K.; Schutte, H. K.

    2006-01-01

    In order to identify factors that are associated with voice problems and voice-related absenteeism in teachers, 1,878 questionnaires were analysed. The questionnaires inquired about personal data, voice complaints, voice-related absenteeism from work and conditions that may lead to voice complaints

  7. Voice quality in relation to voice complaints and vocal fold condition during the screening of female student teachers.

    Science.gov (United States)

    Meulenbroek, Leo F P; de Jong, Felix I C R S

    2011-07-01

    The purpose of this study was to compare the perceptual examination of voice quality with the condition of the vocal folds and voice complaints during voice screening in female student teachers. This research was a cross-sectional study in 214 starting student teachers using the four-point grade scale of the GRBAS and laryngostroboscopic assessment of the vocal folds. The voice quality was assessed by speech pathologists using the ordinal 4-point G-scale (overall dysphonia) of the GRBAS method in a running speech sample. Glottal closure and vocal fold lesions were recorded. A questionnaire was used for assessing voice complaints. More students with an insufficient glottal closure (89%) were rated dysphonic compared with students with sufficient glottal closure (80%). Students with sufficient glottal closure had a significantly lower mean G-score (1.21) compared with the group with insufficient glottal closure (1.52) (P = 0.038). This study showed that a larger percentage of students with vocal fold lesions (96%) were rated dysphonic compared with students with no vocal fold problems (81%). Students with no vocal fold lesions had a significantly lower mean G-score (1.20) compared with the group with vocal fold lesions (2.05) (P = 0.002). A dysphonic voice (G≥1) was rated in 76% of the students without voice complaints compared with 86% of the students with voice complaints. Students with no voice complaints had a lower mean G-score (1.07) compared with the group with voice complaints (1.41) (P = 0.090). The present study showed that perceptual assessment of the voice and voice complaints is not sufficient to check if the future professional is at risk. Therefore, preventive measures are needed to detect students at risk early in their education and this depends on broader assessment: on the one hand, assessing voice quality and voice complaints and on the other hand, examination of the vocal folds of all starting students. Copyright © 2011 The Voice Foundation

  8. Functionalization of polydimethylsiloxane membranes to be used in the production of voice prostheses

    Directory of Open Access Journals (Sweden)

    Paula Ferreira, Álvaro Carvalho, Tiago Ruivo Correia, Bernardo Paiva Antunes, Ilídio Joaquim Correia and Patrícia Alves

    2013-01-01

    The voice is produced by the vibration of the vocal cords, which are located in the larynx. Therefore, one of the major consequences for patients subjected to laryngectomy is losing their voice. In these cases, a synthetic one-way valve set (voice prosthesis) can be implanted in order to allow restoration of speech. Most voice prostheses are produced with silicone-based materials such as polydimethylsiloxane (PDMS). This material has excellent properties, such as optical transparency, chemical and biological inertness, non-toxicity, permeability to gases and excellent mechanical resistance, that are fundamental for its application in the biomedical field. However, PDMS is very hydrophobic, and this property causes protein adsorption, which is followed by microbial adhesion and biofilm formation. To overcome these problems, surface modification of materials has been proposed in this study. A commercial silicone elastomer, Sylgard™ 184, was used to prepare membranes whose surface was modified by grafting 2-hydroxyethylmethacrylate and methacrylic acid by low-pressure plasma treatment. The hydrophilicity, hydrophobic recovery and surface energy of the produced materials were determined. Furthermore, the cytotoxicity and antibacterial activity of the materials were also assessed. The results obtained revealed that the PDMS surface modification performed did not affect the material's biocompatibility, but decreased its hydrophobic character and bacterial adhesion and growth on its surface.

  9. VOICE QUALITY BEFORE AND AFTER THYROIDECTOMY

    Directory of Open Access Journals (Sweden)

    Dora CVELBAR

    2016-04-01

    Introduction: Voice disorders are a well-known complication often associated with thyroid gland diseases, and because voice is still the basic means of communication, it is very important to keep its quality healthy. Objectives: This study asked whether there is a statistically significant difference between the results of voice self-assessment, perceptual voice assessment and acoustic voice analysis before and after thyroidectomy, and whether there are statistically significant correlations between the variables of voice self-assessment, perceptual assessment and acoustic analysis before and after thyroidectomy. Methods: The study included 12 participants aged between 41 and 76. Voice self-assessment was conducted with the Croatian version of the Voice Handicap Index (VHI). Recorded reading samples were used for perceptual assessment and were evaluated by two clinical speech and language therapists. Recorded samples of phonation were used for acoustic analysis, which was conducted with the acoustic program Praat. All of the data were processed with descriptive statistics and nonparametric statistical methods. Results: There were statistically significant differences between the results of voice self-assessments and the results of acoustic analysis before and after thyroidectomy. Statistically significant correlations were found between variables of perceptual assessment and acoustic analysis. Conclusion: The results indicate the importance of multidimensional preoperative and postoperative assessment. This kind of assessment allows the clinician to describe all of the voice features and to give the patient appropriate recommendations for further rehabilitation in order to optimize voice outcomes.

  10. Aerodynamic and sound intensity measurements in tracheoesophageal voice

    NARCIS (Netherlands)

    Grolman, Wilko; Eerenstein, Simone E. J.; Tan, Frédérique M. L.; Tange, Rinze A.; Schouwenburg, Paul F.

    2007-01-01

    BACKGROUND: In laryngectomized patients, tracheoesophageal voice generally provides a better voice quality than esophageal voice. Understanding the aerodynamics of voice production in patients with a voice prosthesis is important for optimizing prosthetic designs and successful voice rehabilitation.

  11. Crossing Cultures with Multi-Voiced Journals

    Science.gov (United States)

    Styslinger, Mary E.; Whisenant, Alison

    2004-01-01

    In this article, the authors discuss the benefits of using multi-voiced journals as a teaching strategy in reading instruction. Multi-voiced journals, an adaptation of dual-voiced journals, encourage responses to reading in varied, cultured voices of characters. It is similar to reading journals in that they prod students to connect to the lives…

  12. [Applicability of Voice Handicap Index to the evaluation of voice therapy effectiveness in teachers].

    Science.gov (United States)

    Niebudek-Bogusz, Ewa; Kuzańska, Anna; Błoch, Piotr; Domańska, Maja; Woźnicka, Ewelina; Politański, Piotr; Sliwińska-Kowalska, Mariola

    2007-01-01

    The aim of this study was to assess the applicability of the Voice Handicap Index (VHI) to the evaluation of the effectiveness of functional voice disorder treatment in teachers. The subjects were 45 female teachers with functional dysphonia who evaluated their voice problems according to the subjective VHI scale before and after phoniatric management. Group I (29 patients) were subjected to vocal training, whereas group II (16 patients) received only voice hygiene instructions. The results demonstrated that differences in the mean VHI score before and after phoniatric treatment were significantly higher in group I than in group II (p … teacher's dysphonia.

  13. Interactive Augmentation of Voice Quality and Reduction of Breath Airflow in the Soprano Voice.

    Science.gov (United States)

    Rothenberg, Martin; Schutte, Harm K

    2016-11-01

    In 1985, at a conference sponsored by the National Institutes of Health, Martin Rothenberg first described a nonlinear source-tract acoustic interaction mechanism that some sopranos, singing in their high range, can use to reduce the total airflow, hold the note longer, and simultaneously enrich the quality of the voice without straining it. (M. Rothenberg, "Source-Tract Acoustic Interaction in the Soprano Voice and Implications for Vocal Efficiency," Fourth International Conference on Vocal Fold Physiology, New Haven, Connecticut, June 3-6, 1985.) In this paper, we describe additional evidence for this type of nonlinear source-tract interaction in some soprano singing and describe an analogous interaction phenomenon in communication engineering. We also present some implications for voice research and pedagogy. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  14. Interventions for preventing voice disorders in adults.

    Science.gov (United States)

    Ruotsalainen, J H; Sellman, J; Lehto, L; Jauhiainen, M; Verbeek, J H

    2007-10-17

    Poor voice quality due to a voice disorder can lead to a reduced quality of life. In occupations where voice use is substantial it can lead to periods of absence from work. To evaluate the effectiveness of interventions to prevent voice disorders in adults. We searched MEDLINE (PubMed, 1950 to 2006), EMBASE (1974 to 2006), CENTRAL (The Cochrane Library, Issue 2 2006), CINAHL (1983 to 2006), PsycINFO (1967 to 2006), Science Citation Index (1986 to 2006) and the Occupational Health databases OSH-ROM (to 2006). The date of the last search was 05/04/06. Randomised controlled clinical trials (RCTs) of interventions evaluating the effectiveness of treatments to prevent voice disorders in adults. For work-directed interventions, interrupted time series and prospective cohort studies were also eligible. Two authors independently extracted data and assessed trial quality. Meta-analysis was performed where appropriate. We identified two randomised controlled trials including a total of 53 participants in intervention groups and 43 controls. One study was conducted with teachers and the other with student teachers. Both trials were of poor quality. Interventions were grouped into 1) direct voice training, 2) indirect voice training and 3) direct and indirect voice training combined. 1) Direct voice training: One study did not find a significant decrease of the Voice Handicap Index for direct voice training compared to no intervention. 2) Indirect voice training: One study did not find a significant decrease of the Voice Handicap Index for indirect voice training when compared to no intervention. 3) Direct and indirect voice training combined: One study did not find a decrease of the Voice Handicap Index for direct and indirect voice training combined when compared to no intervention. The same study did however find an improvement in maximum phonation time (Mean Difference -3.18 sec; 95% CI -4.43 to -1.93) for direct and indirect voice training combined when compared to no

  15. Designing a Voice Controlled Interface For Radio : Guidelines for The First Generation of Voice Controlled Public Radio

    OpenAIRE

    Päärni, Anna

    2017-01-01

    From being a fictional element in sci-fi, voice control has become a reality, with inventions such as Apple's Siri, and interactive voice response (IVR) when calling your doctor's office. The combination of radio’s strength as a hands-free medium, public radio’s mission to reach across all platforms and the rise of voice makes up a relevant intersection; voice controlled public radio in Sweden. This thesis has aimed to investigate how radio listeners wish to interact using voice control to li...

  16. Masking release with changing fundamental frequency: Electric acoustic stimulation resembles normal hearing subjects.

    Science.gov (United States)

    Auinger, Alice Barbara; Riss, Dominik; Liepins, Rudolfs; Rader, Tobias; Keck, Tilman; Keintzel, Thomas; Kaider, Alexandra; Baumgartner, Wolf-Dieter; Gstoettner, Wolfgang; Arnoldner, Christoph

    2017-07-01

    It has been shown that patients with electric acoustic stimulation (EAS) perform better in noisy environments than patients with a cochlear implant (CI). One reason for this could be the preserved access to acoustic low-frequency cues including the fundamental frequency (F0). Therefore, our primary aim was to investigate whether users of EAS experience a release from masking with increasing F0 difference between target talker and masking talker. The study comprised 29 patients and consisted of three groups of subjects: EAS users, CI users and normal-hearing listeners (NH). All CI and EAS users were implanted with a MED-EL cochlear implant and had at least 12 months of experience with the implant. Speech perception was assessed with the Oldenburg sentence test (OlSa) using one sentence from the test corpus as speech masker. The F0 in this masking sentence was shifted upwards by 4, 8, or 12 semitones. For each of these masker conditions the speech reception threshold (SRT) was assessed by adaptively varying the masker level while presenting the target sentences at a fixed level. A statistically significant improvement in speech perception was found for increasing difference in F0 between target sentence and masker sentence in EAS users (p = 0.038) and in NH listeners (p = 0.003). In CI users (classic CI or EAS users with electrical stimulation only) speech perception was independent from differences in F0 between target and masker. A release from masking with increasing difference in F0 between target and masking speech was only observed in listeners and configurations in which the low-frequency region was presented acoustically. Thus, the speech information contained in the low frequencies seems to be crucial for allowing listeners to separate multiple sources. By combining acoustic and electric information, EAS users even manage tasks as complicated as segregating the audio streams from multiple talkers. Preserving the natural code, like fine-structure cues in
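
    The upward masker shifts of 4, 8, and 12 semitones described above correspond to fixed frequency ratios under equal temperament (one semitone is a factor of 2^(1/12)). A minimal sketch of that arithmetic follows; the function name `shift_f0` and the 120 Hz starting value are illustrative assumptions, not figures from the study.

```python
def shift_f0(f0_hz: float, semitones: float) -> float:
    """Return f0_hz shifted by a number of equal-tempered semitones."""
    # Each semitone multiplies the frequency by 2 ** (1 / 12).
    return f0_hz * 2.0 ** (semitones / 12.0)


# Hypothetical masker F0; the abstract does not report the actual value.
BASE_F0_HZ = 120.0

for n in (4, 8, 12):
    print(f"+{n} semitones: {shift_f0(BASE_F0_HZ, n):.1f} Hz")
```

    A shift of 12 semitones is one octave, so it doubles the F0 whatever the starting value.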

  17. Application of computer voice input/output

    International Nuclear Information System (INIS)

    Ford, W.; Shirk, D.G.

    1981-01-01

    The advent of microprocessors and other large-scale integration (LSI) circuits is making voice input and output for computers and instruments practical; specialized LSI chips for speech processing are appearing on the market. Voice can be used to input data or to issue instrument commands; this allows the operator to engage in other tasks, move about, and to use standard data entry systems. Voice synthesizers can generate audible, easily understood instructions. Using voice characteristics, a control system can verify speaker identity for security purposes. Two simple voice-controlled systems have been designed at Los Alamos for nuclear safeguards applications. Each can easily be expanded as time allows. The first system is for instrument control that accepts voice commands and issues audible operator prompts. The second system is for access control. The speaker's voice is used to verify his identity and to actuate external devices.

  18. Evaluation of voice acoustic parameters related to the vocal-loading test in professionally active teachers with dysphonia.

    Science.gov (United States)

    Niebudek-Bogusz, Ewa; Kotyło, Piotr; Sliwińska-Kowalska, Mariola

    2007-01-01

    Teachers are at risk of developing voice disorders. A clinical battery of vocal function tests should include non-invasive and accurate measurements. The quantitative methods (e.g., voice acoustic analysis) make it possible to objectively evaluate voice efficiency and outcomes of dysphonia treatment. To identify possible signs of vocal fatigue, acoustic waveform perturbations during sustained phonation were measured before and after the vocal-loading test in 51 professionally active female teachers with functional voice disorders, using IRIS software. All the participants were also subjected to laryngological/phoniatric examination involving videostroboscopy combined with self-estimation by voice handicap index (VHI)-based scale. The phoniatric examination revealed glottal insufficiency with bowed vocal folds in 35.2%, soft vocal nodules in 31.4%, and hyperfunctional dysphonia with a tendency towards vestibular phonation in 19.6% of the patients. In the VHI scale, 66% of the female teachers estimated their own voice problems as moderate disability. An acoustic analysis performed after the vocal-loading test showed an increased rate of abnormal frequency perturbation parameters (pitch perturbation quotient (Jitter), relative average perturbation (RAP), and pitch period perturbation quotient (PPQ)) compared to the pre-test outcomes. The same was true of pitch-intensity contour of vowel /a:/, an indication of voice instability during sustained phonation. The recorded impairments of voice acoustic parameters related to vocal loading provide further evidence of dysphonia. The voice acoustic analysis performed before and after the vocal-loading test can significantly contribute to objective voice examinations useful in diagnosis of dysphonia among teachers.
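
    The frequency perturbation parameters named above (Jitter and RAP) are computed from the sequence of glottal cycle durations. The sketch below uses commonly cited definitions (mean absolute difference between consecutive periods, and mean absolute deviation from a three-point running average, each divided by the mean period). These formulas and the function names are assumptions for illustration; the abstract does not state how the IRIS software defines them.

```python
def jitter_percent(periods):
    """Mean absolute difference of consecutive periods, as % of mean period."""
    diffs = [abs(b - a) for a, b in zip(periods[:-1], periods[1:])]
    mean_period = sum(periods) / len(periods)
    return 100.0 * (sum(diffs) / len(diffs)) / mean_period


def rap_percent(periods):
    """Relative average perturbation with a three-point smoothing window."""
    devs = [
        abs(periods[i] - (periods[i - 1] + periods[i] + periods[i + 1]) / 3.0)
        for i in range(1, len(periods) - 1)
    ]
    mean_period = sum(periods) / len(periods)
    return 100.0 * (sum(devs) / len(devs)) / mean_period


# Hypothetical cycle durations in milliseconds (not data from the study).
example_periods_ms = [5.0, 5.1, 5.0, 5.1]
print(jitter_percent(example_periods_ms), rap_percent(example_periods_ms))
```

    A perfectly periodic signal gives zero for both measures, so rising values after vocal loading indicate greater cycle-to-cycle instability.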

  19. Analysis And Voice Recognition In Indonesian Language Using MFCC And SVM Method

    Directory of Open Access Journals (Sweden)

    Harvianto Harvianto

    2016-06-01

    Voice recognition is a biometric technology. The voice is a unique human characteristic by which individuals can easily be distinguished from one another. Voice can also provide information such as the gender, emotion, and identity of the speaker. This research records human voices pronouncing the digits 0 to 9, with and without noise. Features of these recordings are extracted using Mel Frequency Cepstral Coefficients (MFCC). The mean, standard deviation, maximum, minimum, and combinations of them are used to construct the feature vectors. These feature vectors are then classified using a Support Vector Machine (SVM). There are two classification models: the first is based on the speaker and the other on the digits pronounced. The classification models are validated by 10-fold cross-validation. The best average accuracy of the two classification models is 91.83%, achieved using Mean + Standard deviation + Min + Max as features.
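
    The feature-vector construction described above (per-coefficient mean, standard deviation, minimum, and maximum across frames) can be sketched as follows. This is an illustration only: the function name `summarize_frames`, the use of the population standard deviation, and the toy input are assumptions, and MFCC extraction and SVM training are left out.

```python
from statistics import mean, pstdev


def summarize_frames(frames):
    """Collapse a frames-by-coefficients matrix into one fixed-length vector.

    The result concatenates the per-coefficient mean, standard deviation,
    minimum, and maximum, in that order.
    """
    columns = list(zip(*frames))  # one tuple per coefficient
    features = []
    for stat in (mean, pstdev, min, max):
        features.extend(stat(col) for col in columns)
    return features


# Toy input: 2 frames x 2 coefficients (hypothetical values, not real MFCCs).
toy_frames = [[1.0, 2.0], [3.0, 6.0]]
print(summarize_frames(toy_frames))
```

    Summarizing over frames gives every recording a vector of the same length regardless of its duration, which is what a standard SVM classifier expects.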

  20. Voice and silence in organizations

    Directory of Open Access Journals (Sweden)

    Moaşa, H.

    2011-01-01

    Unlike previous research on voice and silence, this article breaks the distance between the two and declines to treat them as opposites. Voice and silence are interrelated and intertwined strategic forms of communication which presuppose each other in such a way that the absence of one would minimize completely the other’s presence. Social actors are not voice, or silence. Social actors can have voice or silence; they can do both because they operate at multiple levels and deal with multiple issues at different moments in time.

  1. Why worship? Revisiting a fundamental liturgical question

    Directory of Open Access Journals (Sweden)

    Johan Cilliers

    2009-04-01

    In this article the fundamental liturgical question as to the motive and intention of worship is addressed within the framework of four related liturgical tensions, namely between being and becoming, between time and space, between awe and expression, and between laughter and lament. In order to do this, some classical voices from the past are listened to, for instance, Schleiermacher, Kierkegaard, Moltmann, Tillich, Otto, Bakhtin and Buber, but more contemporary views are also considered. These four tensions are described in the light of the key terms: ‘already’ and ‘not yet’, and some implications for present-day liturgical practices are drawn.

  2. ACHIEVING THE NATURAL VOICE: THE ANALYSIS OF THE LINKLATER METHOD FROM A TRAINING PERSPECTIVE

    Directory of Open Access Journals (Sweden)

    Asli YILMAZ DAVUTOGLU

    2015-09-01

    In the 20th century, one of the most widespread voice training methods that start from the concepts of “natural voice” and “the rediscovery of voice” is the Linklater Method. The primary target group of this method is actors. With its exercises designed for the "reconstruction of the body, the voice and the mind", this method aims at utilizing the innate voice capacity. As a multidisciplinary method fostered by many a scientific discipline and Eastern teaching, the Linklater Method comes with a language that is imbued with sophisticated and metaphorical expressions, scientific terminology and acting jargon, which makes the method prone to false and/or superficial references. The target of the present study is to explicate from a “trainer's perspective” the fundamental concepts and propositions of the Linklater Method, most notably the “natural voice”. The present study also aims at analysing the relation between the basic practices of the method and recent scientific data, thus examining the mental substructure these practices are based on and their physical/technical goals. To this end, the present study adopts a general framework with respect to the inclinations and scientific sources of voice training in the 20th century that affected Linklater's propositions, gives a simplified summary of the neuro-anatomic process producing the voice, selects exercises in which the principles and goals of the method can be seen concretely, and groups these exercises under titles pertaining to the four basic steps of voice production. In the conclusion part of the study, it is argued that this method, despite being regarded as “alternative/experimental” when compared to conventional methods in Turkey, is one of the mainstream methods in contemporary voice training and that it is shaped through a multi-purpose system whose aim is not only voice training but also to develop the creativity and

  3. Immediate effects of the Finnish resonance tube method on behavioral dysphonia.

    Science.gov (United States)

    Paes, Sabrina Mazzer; Zambon, Fabiana; Yamasaki, Rosiane; Simberg, Susanna; Behlau, Mara

    2013-11-01

    To investigate the immediate effects of the Finnish resonance tube method for teachers with behavioral dysphonia. Twenty-five female teachers (mean age 39.9 years) with at least a 5-year history of dysphonia were included. Additional inclusion criteria were the diagnosis of chronic behavioral dysphonia with an indication for speech therapy and the absence of any prior speech therapy. Subjects produced three sets of 10 tokens of sustained phonation with a 1-minute rest interval between tokens into a 27-cm glass tube immersed in at least 2 cm of water. Voice samples were recorded before and after these sets. The effects of these exercises were evaluated by self-assessment, auditory perceptual analysis, and acoustic evaluation involving extraction of fundamental frequency and visual spectrographic analysis. Sixty-eight percent of the teachers reported increased phonatory comfort and 52% reported improved voice quality after performing the exercises. Perceptual analysis indicated improved voice quality in the samples of counting numbers, confirmed by decreased instability, subharmonics, noise in high frequencies, and the tendency for reduced low frequency noise on spectrographic evaluation. Additionally, mean fundamental frequency decreased. The Finnish resonance tube method increased phonatory comfort and produced vocal changes suggestive of diminished hyperfunction. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.

  4. Voice Biometrics for Information Assurance Applications

    National Research Council Canada - National Science Library

    Kang, George

    2002-01-01

    .... The ultimate goal of voice biometrics is to enable the use of voice as a password. Voice biometrics are "man-in-the-loop" systems in which system performance is significantly dependent on human performance...

  5. Brain Maturation, Cognition and Voice Pattern in a Gender Dysphoria Case under Pubertal Suppression.

    Science.gov (United States)

    Schneider, Maiko A; Spritzer, Poli M; Soll, Bianca Machado Borba; Fontanari, Anna M V; Carneiro, Marina; Tovar-Moll, Fernanda; Costa, Angelo B; da Silva, Dhiordan C; Schwarz, Karine; Anes, Maurício; Tramontina, Silza; Lobato, Maria I R

    2017-01-01

    Introduction: Gender dysphoria (GD) (DSM-5) is a condition marked by increasing psychological suffering that accompanies the incongruence between one's experienced or expressed gender and one's assigned gender. Manifestation of GD can be seen early on during childhood and adolescence. During this period, the development of undesirable sexual characteristics marks an acute suffering of being opposite to the sex of birth. Pubertal suppression with gonadotropin releasing hormone analogs (GnRHa) has been proposed for these individuals as a reversible treatment for postponing the pubertal development and attenuating psychological suffering. Recently, increased interest has been observed in the impact of this treatment on brain maturation, cognition and psychological performance. Objectives: The aim of this clinical report is to review the effects of puberty suppression on the brain white matter (WM) during adolescence. WM fractional anisotropy, voice and cognitive functions were assessed before and during the treatment. MRI scans were acquired before, and after 22 and 28 months of hormonal suppression. Methods: We performed a longitudinal evaluation of a pubertal transgender girl undergoing hormonal treatment with GnRH analog. Three longitudinal magnetic resonance imaging (MRI) scans were performed for diffusion tensor imaging (DTI), regarding fractional anisotropy (FA) for regions of interest analysis. In parallel, voice samples for acoustic analysis as well as executive functioning with the Wechsler Intelligence Scale (WISC-IV) were performed. Results: During the follow-up, white matter fractional anisotropy did not increase, compared to normal male puberty effects on the brain. After 22 months of pubertal suppression, operational memory dropped 9 points and remained stable after 28 months of follow-up. The fundamental frequency of voice varied during the first year; however, it remained in the female range. Conclusion: Brain white matter fractional anisotropy

  6. Analysis of failure of voice production by a sound-producing voice prosthesis

    NARCIS (Netherlands)

    van der Torn, M.; van Gogh, C.D.L.; Verdonck-de Leeuw, I M; Festen, J.M.; Mahieu, H.F.

    OBJECTIVE: To analyse the cause of failing voice production by a sound-producing voice prosthesis (SPVP). METHODS: The functioning of a prototype SPVP is described in a female laryngectomee before and after its sound-producing mechanism was impeded by tracheal phlegm. This assessment included:

  7. The relation of vocal fold lesions and voice quality to voice handicap and psychosomatic well-being

    NARCIS (Netherlands)

    Smits, R.; Marres, H.A.; de Jong, F.

    2012-01-01

    BACKGROUND: Voice disorders have a multifactorial genesis and may be present in various ways. They can cause a significant communication handicap and impaired quality of life. OBJECTIVE: To assess the effect of vocal fold lesions and voice quality on voice handicap and psychosomatic well-being.

  8. Acoustic and capacity analysis of voice academic teachers with diagnosed hyperfunctional dysphonia by using DiagnoScope Specialist software.

    Science.gov (United States)

    Zielińska-Bliźniewska, Hanna; Pietkiewicz, Piotr; Miłoński, Jarosław; Urbaniak, Joanna; Olszewski, Jurek

    2013-01-01

    The aim of the study was to assess the acoustic and capacity analyses of voice in academic teachers with hyperfunctional dysphonia using DiagnoScope Specialist software. The study covered 46 female academic teachers aged 34-48 years. The women were diagnosed with hyperfunctional dysphonia (with absence of organic pathologies). After informed consent was obtained, a primary medical history was taken, videolaryngoscopic and stroboscopic examinations were performed and diagnostic voice acoustic and capacity analyses were carried out using DiagnoScope Specialist software. The acoustic analysis of academic teachers with diagnosed hyperfunctional dysphonia showed increases in the following parameters: fundamental frequency (F0) by 1.2%; relative average perturbation (Jitter by 100.0% and RAP by 81.8%); relative amplitude perturbation quotient (APQ) by 2.9%; non-harmonic to harmonic ratio (U2H) by 16.0%; and noise to harmonic ratio (NHR) by 13.4%. A decrease of 2.5% from normal values was noted in relative amplitude perturbation (Shimmer). Formant frequencies also showed reduction (F1 by 10.7%, F2 by 5.1%, F3 by 2.2%, and F4 by 3.5%). The harmonic perturbation quotient (HPQ) was 0.8% lower and the residual harmonic perturbation quotient (RHPQ) 16.8% lower, with the residual to harmonic (R2H) decreasing by 35.1%; the sub-harmonic to harmonic (S2H) by 2.4%; and the Yanagihara coefficient by 20.2%. The capacity analysis with the DiagnoScope Specialist software showed figures significantly lower than normal values for the following parameters: phonation time, true phonation time, phonation break coefficients, vocal capacity coefficient and mean vocal capacity. Copyright © 2013 Polish Otorhinolaryngology - Head and Neck Surgery Society. Published by Elsevier Urban & Partner Sp. z.o.o. All rights reserved.

  9. Voice Onset Time in Azerbaijani Consonants

    Directory of Open Access Journals (Sweden)

    Ali Jahan

    2009-10-01

    Objective: Voice onset time (VOT) is known to be a cue for the distinction between voiced and voiceless stops, and it can be used to describe or categorize a range of developmental, neuromotor and linguistic disorders. The aim of this study was to determine standard values of voice onset time for the Azerbaijani language (Tabriz dialect). Materials & Methods: In this descriptive-analytical study, 30 Azeri speakers, selected by convenience sampling, uttered 46 monosyllabic words beginning with 6 Azerbaijani stops twice. Using Praat software, the voice onset time values were measured in milliseconds from the waveform and wideband spectrogram. The vowel effect, sex differences and the effect of place of articulation on VOT were evaluated, and data were analyzed by one-way ANOVA. Results: There was no significant difference in voice onset time between male and female Azeri speakers (P<0.05). Vowel and place of articulation had a significant correlation with voice onset time (P<0.001). Voice onset time values for /b/, /p/, /d/, /t/, /g/, /k/, and the [c], [ɟ] allophones were 10.64, 86.88, 13.35, 87.09, 26.25, 100.62, 131.19, and 63.18 milliseconds, respectively. Conclusion: Voice onset time values are the same for Azerbaijani men and women. However, as in many other languages, back and high vowels and a back place of articulation lengthen VOT. Also, voiceless stops are aspirated in this language and voiced stops have positive VOT values.

  10. Does CPAP treatment affect the voice?

    Science.gov (United States)

    Saylam, Güleser; Şahin, Mustafa; Demiral, Dilek; Bayır, Ömer; Yüceege, Melike Bağnu; Çadallı Tatar, Emel; Korkmaz, Mehmet Hakan

    2016-12-20

    The aim of this study was to investigate alterations in voice parameters among patients using continuous positive airway pressure (CPAP) for the treatment of obstructive sleep apnea syndrome. Patients with an indication for CPAP treatment without any voice problems and with normal laryngeal findings were included and voice parameters were evaluated before and 1 and 6 months after CPAP. Videolaryngostroboscopic findings, a self-rated scale (Voice Handicap Index-10, VHI-10), perceptual voice quality assessment (GRBAS: grade, roughness, breathiness, asthenia, strain), and acoustic parameters were compared. Data from 70 subjects (48 men and 22 women) with a mean age of 44.2 ± 6.0 years were evaluated. When compared with the pre-CPAP treatment period, there was a significant increase in the VHI-10 score after 1 month of treatment and in VHI-10 and total GRBAS scores, jitter percent (P = 0.01), shimmer percent, noise-to-harmonic ratio, and voice turbulence index after 6 months of treatment. Vague negative effects on voice parameters after the first month of CPAP treatment became more evident after 6 months. We demonstrated nonsevere alterations in the voice quality of patients under CPAP treatment. Given that CPAP is a long-term treatment it is important to keep these alterations in mind.

  11. Managing dysphonia in occupational voice users.

    Science.gov (United States)

    Behlau, Mara; Zambon, Fabiana; Madazio, Glaucya

    2014-06-01

    Recent advances with regard to occupational voice disorders are highlighted with emphasis on issues warranting consideration when assessing, training, and treating professional voice users. Findings include the many particularities between the various categories of professional voice users, the concept that the environment plays a major role in occupational voice disorders, and that biopsychosocial influences should be analyzed on an individual basis. Assessment via self-evaluation protocols to quantify the impact of these disorders is mandatory as a component of an evaluation and to document treatment outcomes. Discomfort or odynophonia has evolved as a critical symptom in this population. Clinical trials are limited and the complexity of the environment may be a limitation in experiment design. This review reinforced the need for large population studies of professional voice users; new data highlighted important factors specific to each group of voice users. Interventions directed at student teachers are necessary not only to improve the quality of future professionals, but also to avoid the frustration and limitations associated with chronic voice problems. The causative relationship between the work environment and voice disorders has not yet been established. Randomized controlled trials are lacking and must be a focus to enhance treatment paradigms for this population.

  12. Fundamentals of Acoustics. Psychoacoustics and Hearing. Acoustical Measurements

    Science.gov (United States)

    Begault, Durand R.; Ahumada, Al (Technical Monitor)

    1997-01-01

    These are 3 chapters that will appear in a book titled "Building Acoustical Design", edited by Charles Salter. They are designed to introduce the reader to fundamental concepts of acoustics, particularly as they relate to the built environment. "Fundamentals of Acoustics" reviews basic concepts of sound waveform frequency, pressure, and phase. "Psychoacoustics and Hearing" discusses the human interpretation of sound pressure as loudness, particularly as a function of frequency. "Acoustical Measurements" gives a simple overview of the time and frequency weightings for sound pressure measurements that are used in acoustical work.

  13. Epidemiology of Voice Disorders in Latvian School Teachers.

    Science.gov (United States)

    Trinite, Baiba

    2017-07-01

    The prevalence of voice disorders in the teacher population in Latvia has not been studied so far, and this is the first epidemiological study whose goal is to investigate the prevalence of voice disorders and their risk factors in this professional group. A wide cross-sectional study using stratified sampling methodology was implemented in the general education schools of Latvia. The self-administered voice risk factor questionnaire and the Voice Handicap Index were completed by 522 teachers. Two teacher groups were formed: the voice disorders group, which included 235 teachers with actual voice problems or problems during the last 9 months; and the control group, which included 174 teachers without voice disorders. Sixty-six percent of teachers gave a positive answer to the following question: Have you ever had problems with your voice? Voice problems are more often found in female than male teachers (68.2% vs 48.8%). Music teachers suffer from voice disorders more often than teachers of other subjects. Eighty-two percent of teachers first faced voice problems in their professional career. The odds of voice disorders increase if the following risk factors exist: extra vocal load, shouting, throat clearing, neglecting of personal health, background noise, chronic illnesses of the upper respiratory tract, allergy, job dissatisfaction, and regular stress in the working place. The study findings indicated a high risk of voice disorders among Latvian teachers. The study confirmed data concerning the multifactorial etiology of voice disorders. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  14. Singing in groups for Parkinson's disease (SING-PD): a pilot study of group singing therapy for PD-related voice/speech disorders.

    Science.gov (United States)

    Shih, Ludy C; Piel, Jordan; Warren, Amanda; Kraics, Lauren; Silver, Althea; Vanderhorst, Veronique; Simon, David K; Tarsy, Daniel

    2012-06-01

    Parkinson's disease-related speech and voice impairment has a significant impact on quality-of-life measures. LSVT® LOUD voice and speech therapy (Lee Silverman Voice Therapy) has demonstrated scientific efficacy and clinical effectiveness, but musically based voice and speech therapy has been underexplored as a potentially useful method of rehabilitation. We undertook a pilot, open-label study of a group-based singing intervention, consisting of twelve 90-min weekly sessions led by a voice and speech therapist/singing instructor. The primary outcome measure of vocal loudness as measured by sound pressure level (SPL) at 50 cm during connected speech was not significantly different one week after the intervention or at 13 weeks after the intervention. A number of secondary measures reflecting pitch range, phonation time and maximum loudness also were unchanged. Voice related quality of life (VRQOL) and voice handicap index (VHI) also were unchanged. This study suggests that a group singing therapy intervention at this intensity and frequency does not result in significant improvement in objective and subject-rated measures of voice and speech impairment. Copyright © 2012 Elsevier Ltd. All rights reserved.

  15. Influence of Noise Resulting From the Location and Conditions of Classrooms and Schools in Upper Egypt on Teachers' Voices.

    Science.gov (United States)

    Phadke, Ketaki Vasant; Abo-Hasseba, Ahmed; Švec, Jan G; Geneid, Ahmed

    2018-05-03

    Teachers are professional voice users, always at high risk of developing voice disorders due to high vocal demand and unfavorable environmental conditions. This study aimed at identifying possible correlations between teachers' voice symptoms and their perception of noise, the location of schools, as well as the location and conditions of their classrooms. One hundred forty teachers (ages 21-56) from schools in Upper Egypt participated in this study. They filled out a questionnaire including questions about the severity and frequency of their voice symptoms, noise perception, and the location and conditions of their schools and classrooms. Questionnaire responses were statistically analyzed to identify possible correlations. There were significant correlations (P Egyptian schools. This study may help future studies that focus on developing guidelines for the better planning of Egyptian schools in terms of improved infrastructure and architecture, thus considering the general and vocal health of teachers. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  16. Lower Vocal Tract Morphologic Adjustments Are Relevant for Voice Timbre in Singing.

    Science.gov (United States)

    Mainka, Alexander; Poznyakovskiy, Anton; Platzek, Ivan; Fleischer, Mario; Sundberg, Johan; Mürbe, Dirk

    2015-01-01

    The vocal tract shape is crucial to voice production. Its lower part seems particularly relevant for voice timbre. This study analyzes the detailed morphology of parts of the epilaryngeal tube and the hypopharynx for the sustained German vowels /a/, /e/, /i/, /o/, and /u/ by thirteen male singer subjects who were at the beginning of their academic singing studies. Analysis was based on two different phonatory conditions: a natural, speech-like phonation and a singing phonation, like in classical singing. 3D models of the vocal tract were derived from magnetic resonance imaging and compared with long-term average spectrum analysis of audio recordings from the same subjects. Comparison of singing to the speech-like phonation, which served as reference, showed significant adjustments of the lower vocal tract: an average lowering of the larynx by 8 mm and an increase of the hypopharyngeal cross-sectional area (+21.9%) and volume (+16.8%). Changes in the analyzed epilaryngeal portion of the vocal tract were not significant. Consequently, lower larynx-to-hypopharynx area and volume ratios were found in singing compared to the speech-like phonation. All evaluated measures of the lower vocal tract varied significantly with vowel quality. Acoustically, an increase of high frequency energy in singing correlated with a wider hypopharyngeal area. The findings offer an explanation of how classical male singers might succeed in producing a voice timbre with increased high frequency energy, creating a singer's formant cluster.

  17. Identifying hidden voice and video streams

    Science.gov (United States)

    Fan, Jieyan; Wu, Dapeng; Nucci, Antonio; Keralapura, Ram; Gao, Lixin

    2009-04-01

    Given the rising popularity of voice and video services over the Internet, accurately identifying voice and video traffic that traverse their networks has become a critical task for Internet service providers (ISPs). As the number of proprietary applications that deliver voice and video services to end users increases over time, the search for the one methodology that can accurately detect such services while being application independent still remains open. This problem becomes even more complicated when voice and video service providers like Skype, Microsoft, and Google bundle their voice and video services with other services like file transfer and chat. For example, a bundled Skype session can contain both voice stream and file transfer stream in the same layer-3/layer-4 flow. In this context, traditional techniques to identify voice and video streams do not work. In this paper, we propose a novel self-learning classifier, called VVS-I, that detects the presence of voice and video streams in flows with minimum manual intervention. Our classifier works in two phases: training phase and detection phase. In the training phase, VVS-I first extracts the relevant features, and subsequently constructs a fingerprint of a flow using the power spectral density (PSD) analysis. In the detection phase, it compares the fingerprint of a flow to the existing fingerprints learned during the training phase, and subsequently classifies the flow. Our classifier is not only capable of detecting voice and video streams that are hidden in different flows, but is also capable of detecting different applications (like Skype, MSN, etc.) that generate these voice/video streams. We show that our classifier can achieve close to 100% detection rate while keeping the false positive rate to less than 1%.
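
    The two-phase idea in this abstract (build a PSD fingerprint per flow, then match new flows against stored fingerprints) can be illustrated with a toy sketch. This is not the VVS-I algorithm: the abstract does not say which features are extracted or how fingerprints are compared, so the per-interval feature series, the periodogram-based fingerprint, the Euclidean nearest-fingerprint rule, and all names below are assumptions chosen only for illustration, and the data are synthetic.

```python
import cmath
import math


def psd_fingerprint(series):
    """Normalized periodogram of a feature series (assumed fingerprint form)."""
    n = len(series)
    mean = sum(series) / n
    centered = [x - mean for x in series]  # drop the DC component
    power = []
    for k in range(1, n // 2 + 1):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(centered))
        power.append(abs(coeff) ** 2)
    total = sum(power) or 1.0  # avoid division by zero for a constant series
    return [p / total for p in power]


def classify(series, fingerprints):
    """Return the label whose stored fingerprint is closest (Euclidean)."""
    fp = psd_fingerprint(series)
    return min(fingerprints, key=lambda label: math.dist(fp, fingerprints[label]))


def toy_series(period, phase=0.0, amplitude=10.0, n=64):
    """Synthetic feature series with one dominant periodicity (hypothetical data)."""
    return [100.0 + amplitude * math.sin(2 * math.pi * i / period + phase)
            for i in range(n)]


# "Training phase": one stored fingerprint per (hypothetical) traffic class.
fingerprints = {
    "voice": psd_fingerprint(toy_series(4)),
    "video": psd_fingerprint(toy_series(16)),
}

# "Detection phase": an unseen flow with a different phase and amplitude.
label = classify(toy_series(4, phase=1.0, amplitude=25.0), fingerprints)
```

    Because the periodogram is normalized and discards phase, the toy fingerprint depends only on where the spectral energy sits, which is why a shifted and rescaled series still matches its class here.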

  18. Optical voice encryption based on digital holography.

    Science.gov (United States)

    Rajput, Sudheesh K; Matoba, Osamu

    2017-11-15

    We propose an optical voice encryption scheme based on digital holography (DH). An off-axis DH is employed to acquire voice information by obtaining phase retardation occurring in the object wave due to sound wave propagation. The acquired hologram, including voice information, is encrypted using optical image encryption. The DH reconstruction and decryption with all the correct parameters can retrieve an original voice. The scheme has the capability to record the human voice in holograms and encrypt it directly. These aspects make the scheme suitable for other security applications and help to use the voice as a potential security tool. We present experimental results and partial simulation results.

  19. Can temporal fine structure represent the fundamental frequency of unresolved harmonics?

    Science.gov (United States)

    Oxenham, Andrew J; Micheyl, Christophe; Keebler, Michael V

    2009-04-01

    At least two modes of pitch perception exist: in one, the fundamental frequency (F0) of harmonic complex tones is estimated using the temporal fine structure (TFS) of individual low-order resolved harmonics; in the other, F0 is derived from the temporal envelope of high-order unresolved harmonics that interact in the auditory periphery. Pitch is typically more accurate in the former than in the latter mode. Another possibility is that pitch can sometimes be coded via the TFS from unresolved harmonics. A recent study supporting this third possibility [Moore et al. (2006a). J. Acoust. Soc. Am. 119, 480-490] based its conclusion on a condition where phase interaction effects (implying unresolved harmonics) accompanied accurate F0 discrimination (implying TFS processing). The present study tests whether these results were influenced by audible distortion products. Experiment 1 replicated the original results, obtained using a low-level background noise. However, experiments 2-4 found no evidence for the use of TFS cues with unresolved harmonics when the background noise level was raised, or the stimulus level was lowered, to render distortion inaudible. Experiment 5 measured the presence and phase dependence of audible distortion products. The results provide no evidence that TFS cues are used to code the F0 of unresolved harmonics.

  20. Performance of Phonatory Deviation Diagrams in Synthesized Voice Analysis.

    Science.gov (United States)

    Lopes, Leonardo Wanderley; da Silva, Karoline Evangelista; da Silva Evangelista, Deyverson; Almeida, Anna Alice; Silva, Priscila Oliveira Costa; Lucero, Jorge; Behlau, Mara

    2018-05-02

    To analyze the performance of a phonatory deviation diagram (PDD) in discriminating the presence and severity of voice deviation and the predominant voice quality of synthesized voices. A speech-language pathologist performed the auditory-perceptual analysis of the synthesized voice (n = 871). The PDD distribution of voice signals was analyzed according to area, quadrant, shape, and density. Differences in signal distribution regarding the PDD area and quadrant were detected when differentiating the signals with and without voice deviation and with different predominant voice quality. Differences in signal distribution were found in all PDD parameters as a function of the severity of voice disorder. The PDD area and quadrant can differentiate normal voices from deviant synthesized voices. There are differences in signal distribution in PDD area and quadrant as a function of the severity of voice disorder and the predominant voice quality. However, the PDD area and quadrant do not differentiate the signals as a function of severity of voice disorder and differentiated only the breathy and rough voices from the normal and strained voices. PDD density is able to differentiate only signals with moderate and severe deviation. PDD shape shows differences between signals with different severities of voice deviation. © 2018 S. Karger AG, Basel.

  1. Occupational risk factors and voice disorders.

    Science.gov (United States)

    Vilkman, E

    1996-01-01

    From the point of view of occupational health, the field of voice disorders is very poorly developed as compared, for instance, to the prevention and diagnostics of occupational hearing disorders. In fact, voice disorders have not even been recognized in the field of occupational medicine. Hence, it is obviously very rare in most countries that the voice disorder of a professional voice user, e.g. a teacher, a singer or an actor, is accepted as an occupational disease by insurance companies. However, occupational voice problems do not lack significance from the point of view of the patient. We also know from questionnaires and clinical studies that voice complaints are very common. Another example of job-related health problems, which has proved more successful in terms of its occupational health status, is the repetition strain injury of the elbow, i.e. the "tennis elbow". Its textbook definition could be used as such to describe an occupational voice disorder ("dysphonia professionalis"). In the present paper the effects of such risk factors as vocal loading itself, background noise and room acoustics and low relative humidity of the air are discussed. Due to individual factors underlying the development of professional voice disorders, recommendations rather than regulations are called for. There are many simple and even relatively low-cost methods available for the prevention of vocal problems as well as for supporting rehabilitation.

  2. Spectral distribution of solo voice and accompaniment in pop music.

    Science.gov (United States)

    Borch, Daniel Zangger; Sundberg, Johan

    2002-01-01

    Singers performing in popular styles of music mostly rely on feedback provided by monitor loudspeakers on the stage. The highest sound level that these loudspeakers can provide without feedback noise is often too low to be heard over the ambient sound level on the stage. Long-term-average spectra of some orchestral accompaniments typically used in pop music are compared with those of classical symphonic orchestras. In loud pop accompaniment the sound level difference between 0.5 and 2.5 kHz is similar to that of a Wagner orchestra. Long-term-average spectra of pop singers' voices showed no signs of a singer's formant but a peak near 3.5 kHz. It is suggested that pop singers' difficulties to hear their own voices may be reduced if the frequency range 3-4 kHz is boosted in the monitor sound.

  3. Idiopathic Parkinson's disease: vocal and quality of life analysis

    Directory of Open Access Journals (Sweden)

    Luiza Furtado e Silva

    2012-09-01

    OBJECTIVE: To compare voice and life quality of male patients with idiopathic Parkinson's disease, with individuals without disease (Control Group). METHODS: A cross-sectional study that evaluated the voice of individuals with Parkinson's disease, the group was composed of 27 subjects, aged from 39 to 79 years-old (average 59.96). The Control Group was matched on sex and age. Participants underwent voice recording. Perceptual evaluation was made using GRBASI scale, which considers G as the overall degree of dysphonia, R as roughness, B as breathiness, A as asthenia, S as strain and I as instability. The acoustic parameters analyzed were: fundamental frequency, jitter, shimmer, and harmonic to noise ratio (NHR). For vocal self-perception analysis, we used the Voice Related Quality of Life protocol. RESULTS: Fundamental frequency and jitter presented higher values in the Parkinson's group. NHR values were higher in the Control Group. Perceptual analysis showed a deviation ranging. The vocal disorder self-perception demonstrated a worse impact on quality of life. CONCLUSIONS: Individuals with Parkinson's disease have an altered voice quality and a negative impact on quality of life.

  4. Voice Response Systems Technology.

    Science.gov (United States)

    Gerald, Jeanette

    1984-01-01

    Examines two methods of generating synthetic speech in voice response systems, which allow computers to communicate in human terms (speech), using human interface devices (ears): phoneme and reconstructed voice systems. Considerations prior to implementation, current and potential applications, glossary, directory, and introduction to Input Output…

  5. Clinical Voices - an update

    DEFF Research Database (Denmark)

    Fusaroli, Riccardo; Weed, Ethan

    Anomalous aspects of speech and voice, including pitch, fluency, and voice quality, are reported to characterise many mental disorders. However, it has proven difficult to quantify and explain this oddness of speech by employing traditional statistical methods. In this talk we will show how...

  6. Changes after voice therapy in objective and subjective voice measurements of pediatric patients with vocal nodules.

    Science.gov (United States)

    Tezcaner, Ciler Zahide; Karatayli Ozgursoy, Selmin; Ozgursoy, Selmin Karatayli; Sati, Isil; Dursun, Gursel

    2009-12-01

    The aim of this study was to analyze the efficiency of the voice therapy in children with vocal nodules by using the acoustic analysis and subjective assessment. Thirty-nine patients with vocal fold nodules, aged between 7 and 14, were included in the study. Each subject had voice therapy led by an experienced voice therapist once a week. All diagnostic and follow-up workouts were performed before the voice therapy and after the third or the sixth month. Transoral and/or transnasal videostroboscopic examination and acoustic analysis were achieved using multi-dimensional voice program (MDVP) and subjective analysis with GRBAS scale. As for the perceptual assessment, the difference was significant for four parameters out of five. A significant improvement was found in the acoustic analysis parameters of jitter, shimmer, and noise-to-harmonic ratio. The voice therapy which was planned according to patients' needs, age, compliance and response to therapy had positive effects on pediatric patients with vocal nodules. Acoustic analysis and GRBAS may be used successfully in the follow-up of pediatric vocal nodule treatment.

  7. Interpretations of Frequency Domain Analyses of Neural Entrainment: Periodicity, Fundamental Frequency, and Harmonics.

    Science.gov (United States)

    Zhou, Hong; Melloni, Lucia; Poeppel, David; Ding, Nai

    2016-01-01

    Brain activity can follow the rhythms of dynamic sensory stimuli, such as speech and music, a phenomenon called neural entrainment. It has been hypothesized that low-frequency neural entrainment in the neural delta and theta bands provides a potential mechanism to represent and integrate temporal information. Low-frequency neural entrainment is often studied using periodically changing stimuli and is analyzed in the frequency domain using the Fourier analysis. The Fourier analysis decomposes a periodic signal into harmonically related sinusoids. However, it is not intuitive how these harmonically related components are related to the response waveform. Here, we explain the interpretation of response harmonics, with a special focus on very low-frequency neural entrainment near 1 Hz. It is illustrated why neural responses repeating at f Hz do not necessarily generate any neural response at f Hz in the Fourier spectrum. A strong neural response at f Hz indicates that the time scales of the neural response waveform within each cycle match the time scales of the stimulus rhythm. Therefore, neural entrainment at very low frequency implies not only that the neural response repeats at f Hz but also that each period of the neural response is a slow wave matching the time scale of a f Hz sinusoid.
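
    The abstract's central point, that a response repeating at f Hz need not show any energy at f Hz in its Fourier spectrum, can be checked numerically. The sketch below is only an illustration under stated assumptions, not the authors' analysis: the waveform (a sum of the 2nd and 3rd harmonics of f = 1 Hz), the sampling rate, the duration, and all names are hypothetical choices.

```python
import cmath
import math


def dft_amplitude(samples, sample_rate, freq_hz):
    """Single-sided amplitude at freq_hz (assumed to fall exactly on a DFT bin)."""
    n = len(samples)
    k = round(freq_hz * n / sample_rate)  # index of the DFT bin for freq_hz
    coeff = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                for i, x in enumerate(samples))
    return 2 * abs(coeff) / n


F = 1.0           # repetition rate of the toy "response", in Hz (assumed)
SAMPLE_RATE = 64  # samples per second (assumed)
DURATION_S = 4    # whole number of periods, so 1, 2 and 3 Hz are exact bins
N = SAMPLE_RATE * DURATION_S

# Sum of the 2nd and 3rd harmonics only: the waveform repeats every 1/F seconds
# (the greatest common divisor of 2F and 3F is F), yet has no component at F.
response = [math.cos(2 * math.pi * 2 * F * i / SAMPLE_RATE)
            + math.cos(2 * math.pi * 3 * F * i / SAMPLE_RATE)
            for i in range(N)]

period = int(SAMPLE_RATE / F)
repeats_at_f = all(abs(response[i] - response[i + period]) < 1e-9
                   for i in range(N - period))

amplitudes = {h * F: dft_amplitude(response, SAMPLE_RATE, h * F) for h in (1, 2, 3)}
```

    Here `repeats_at_f` is true while `amplitudes[1.0]` is numerically zero and the amplitudes at 2 Hz and 3 Hz are each about 1, so a peak at f Hz reflects the shape of the waveform within each cycle, not merely its repetition rate.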

  8. Fundamental-frequency and load-varying thermal cycles effects on lifetime estimation of DFIG power converter

    DEFF Research Database (Denmark)

    Zhang, G.; Zhou, D.; Yang, J.

    2017-01-01

    In respect to a Doubly-Fed Induction Generator (DFIG) system, its corresponding time scale varies from microsecond level of power semiconductor switching to second level of the mechanical response. In order to map annual thermal profile of the power semiconductors, different approaches have been adopted to handle the fundamental-frequency thermal cycles and load-varying thermal cycles. Their effects on lifetime estimation of the power device in the Back-to-Back (BTB) power converter are evaluated.

  9. Comparison of Acceleration and Impact Stress as Possible Loading Factors in Phonation: A Computer Modeling Study

    Czech Academy of Sciences Publication Activity Database

    Horáček, Jaromír; Laukkanen, A. M.; Šidlof, Petr; Murphy, P.; Švec, J. G.

    2009-01-01

    Vol. 61, No. 3 (2009), pp. 137-145. ISSN 1021-7762. R&D Projects: GA ČR(CZ) GA101/08/1155. Institutional research plan: CEZ:AV0Z20760514. Keywords: biomechanics of voice modeling * fundamental frequency * phonation type * gender differences in voice. Subject RIV: BI - Acoustics. Impact factor: 1.439, year: 2007

  10. Voice Quality Estimation in Wireless Networks

    Directory of Open Access Journals (Sweden)

    Petr Zach

    2015-01-01

    This article deals with the impact of Wireless (Wi-Fi) networks on the perceived quality of voice services. The Quality of Service (QoS) metrics must be monitored in the computer network during the voice data transmission to ensure proper voice service quality the end-user has paid for, especially in the wireless networks. In addition to the QoS, research area called Quality of Experience (QoE) provides metrics and methods for quality evaluation from the end-user’s perspective. This article focuses on a QoE estimation of Voice over IP (VoIP) calls in the wireless networks using a network simulator. Results contribute to voice quality estimation based on characteristics of the wireless network and location of a wireless client.

  11. The Influence of Sleep Disorders on Voice Quality.

    Science.gov (United States)

    Rocha, Bruna Rainho; Behlau, Mara

    2017-09-19

    To verify the influence of sleep quality on the voice. Descriptive and analytical cross-sectional study. Data were collected by an online or printed survey divided in three parts: (1) demographic data and vocal health aspects; (2) self-assessment of sleep and vocal quality, and the influence that sleep has on voice; and (3) sleep and voice self-assessment inventories-the Epworth Sleepiness Scale (ESS), the Pittsburgh Sleep Quality Index (PSQI), and the Voice Handicap Index reduced version (VHI-10). A total of 862 people were included (493 women, 369 men), with a mean age of 32 years old (maximum age of 79 and minimum age of 18 years old). The perception of the influence that sleep has on voice showed a difference (P influence a voice handicap are vocal self-assessment, ESS total score, and self-assessment of the influence that sleep has on voice. The absence of daytime sleepiness is a protective factor (odds ratio [OR] > 1) against perceived voice handicap; the presence of daytime sleepiness is a damaging factor (OR influences voice. Perceived poor sleep quality is related to perceived poor vocal quality. Individuals with a voice handicap observe a greater influence of sleep on voice than those without. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  12. The Sound of Voice: Voice-Based Categorization of Speakers' Sexual Orientation within and across Languages.

    Directory of Open Access Journals (Sweden)

    Simone Sulpizio

    Empirical research had initially shown that English listeners are able to identify the speakers' sexual orientation based on voice cues alone. However, the accuracy of this voice-based categorization, as well as its generalizability to other languages (language-dependency) and to non-native speakers (language-specificity), has been questioned recently. Consequently, we address these open issues in 5 experiments: First, we tested whether Italian and German listeners are able to correctly identify sexual orientation of same-language male speakers. Then, participants of both nationalities listened to voice samples and rated the sexual orientation of both Italian and German male speakers. We found that listeners were unable to identify the speakers' sexual orientation correctly. However, speakers were consistently categorized as either heterosexual or gay on the basis of how they sounded. Moreover, a similar pattern of results emerged when listeners judged the sexual orientation of speakers of their own and of the foreign language. Overall, this research suggests that voice-based categorization of sexual orientation reflects the listeners' expectations of how gay voices sound rather than being an accurate detector of the speakers' actual sexual identity. Results are discussed with regard to accuracy, acoustic features of voices, language dependency and language specificity.

  13. Eccentric Voices and the Representation of Vocal Virtuosity in Fiction: James McCourt’s Mawrdew Czgowchwz

    Directory of Open Access Journals (Sweden)

    Marcin Stawiarski

    2016-06-01

    This paper examines the representation of vocal virtuosity in fiction. It focuses on the concept of voice as it is represented in a work of fiction through musical eccentricity. The paper centres on James McCourt’s Mawrdew Czgowchwz (1975). James McCourt’s novel tells the story of an opera singer, Mawrdew Czgowchwz. In the novel, the voice is related to extravagance and fanaticism, so that it relates to violence and conflict. In McCourt’s novel, the stylistic features of the text show a hyperbolic use of language resorting to Rabelaisian lists, foreign vocabulary, neologisms, or nonce-words, which create tongue-twister cornucopia effects of linguistic musicality. The paper aims to demonstrate that (a) the mode of eccentricity is a fundamental mode of representing music in literature; (b) eccentricity rubs off on the very structure of the text, so that it leads to singular forms of operatic musicalization of fiction and musicalized writing; (c) the voice ends up turning into a fetish object.

  14. [Diagnostics and therapy in professional voice-users].

    Science.gov (United States)

    Richter, B; Echternach, M

    2010-04-01

    Voice is one of the most important instruments for expression and communication in humans. Dysphonia remains very frequent. Generally people in voice-intensive professions, such as teachers, call center employees, singers and actors suffer from these complaints. In recent years methods have been developed which facilitate appropriate diagnosis and therapy, based on the criteria of evidence based medicine, in voice patients appropriate to their degree of disease. The basic protocol of the European Laryngological Society offers a standardized evaluation of multidimensional voice parameters. In our own patient collective there were statistically significant improvements in voice quality, according to a pre/post mean value comparison, in both phonomicrosurgical (n=45) and voice therapy (n=30) patients in relation to RBH, DSI and VHI.

  15. Effects of muscle tension dysphonia on tone phonation: acoustic and perceptual studies in Vietnamese female teachers.

    Science.gov (United States)

    Nguyen, Duong Duy; Kenny, Dianna T

    2009-07-01

    Muscle tension dysphonia (MTD) is a hyperfunctional voice disorder commonly seen in professional voice users. To date, published acoustic studies of this disorder have mainly focused on nontonal language speakers, and no publication has documented its impact on lexical tone characteristics. In this study, we examined whether and how this voice disorder affected acoustically and perceptually the characteristics of tones in Vietnamese teachers. Voice data were obtained from 42 Vietnamese female primary school teachers diagnosed with MTD and 30 vocally healthy teachers. Tonal data were analyzed using Computerized Speech Lab (CSL-4300B) and Speech Analyzer. Parameters analyzed included the two most important acoustic cues in Vietnamese tones, that is, tonal fundamental frequency (F0) and laryngealization. Tonal F0 was assessed using a factorial analysis of variance with group and career durations as independent variables. Tonal samples were also perceptually assessed by a panel of native speakers of the same dialect. The results showed that MTD lowered tonal F0 in high tones and tones with extensive fundamental frequency variation. There was also a significant main effect for career duration; in the MTD group, tonal F0 was lower in teachers with longer career duration. The teachers with MTD showed different patterns of laryngealization compared with the control group. Tone perception was poorer for tones with extensive fundamental frequency variation and without a typical phonation type. The results in this group of teachers supported our hypothesis that MTD impairs lexical tone phonation.

  16. [Design of standard voice sample text for subjective auditory perceptual evaluation of voice disorders].

    Science.gov (United States)

    Li, Jin-rang; Sun, Yan-yan; Xu, Wen

    2010-09-01

    To design a speech voice sample text with all phonemes in Mandarin for subjective auditory perceptual evaluation of voice disorders. The principles for design of a speech voice sample text are: The short text should include the 21 initials and 39 finals, this may cover all the phonemes in Mandarin. Also, the short text should have some meanings. A short text was made out. It had 155 Chinese words, and included 21 initials and 38 finals (the final, ê, was not included because it was rarely used in Mandarin). Also, the text covered 17 light tones and one "Erhua". The constituent ratios of the initials and finals presented in this short text were statistically similar as those in Mandarin according to the method of similarity of the sample and population (r = 0.742, P text were statistically not similar as those in Mandarin (r = 0.731, P > 0.05). A speech voice sample text with all phonemes in Mandarin was made out. The constituent ratios of the initials and finals presented in this short text are similar as those in Mandarin. Its value for subjective auditory perceptual evaluation of voice disorders need further study.

  17. Voice pedagogy-what do we need?

    Science.gov (United States)

    Gill, Brian P; Herbst, Christian T

    2016-12-01

    The final keynote panel of the 10th Pan-European Voice Conference (PEVOC) was concerned with the topic 'Voice pedagogy-what do we need?' In this communication the panel discussion is summarized, and the authors provide a deepening discussion on one of the key questions, addressing the roles and tasks of people working with voice students. In particular, a distinction is made between (1) voice building (derived from the German term 'Stimmbildung'), primarily comprising the functional and physiological aspects of singing; (2) coaching, mostly concerned with performance skills; and (3) singing voice rehabilitation. Both public and private educators are encouraged to apply this distinction to their curricula, in order to arrive at more efficient singing teaching and to reduce the risk of vocal injury to the singers concerned.

  18. Analyzing the mediated voice - a datasession

    DEFF Research Database (Denmark)

    Lawaetz, Anna

    Broadcast voices are technologically manipulated. To achieve a certain authenticity or sound of “reality”, the voices are, paradoxically, filtered and trained in order to reach the listeners. This “mise-en-scène” is important knowledge when it comes to the development of a consistent method of analysis of the mediated voice...

  19. Can blind persons accurately assess body size from the voice?

    Science.gov (United States)

    Pisanski, Katarzyna; Oleszkiewicz, Anna; Sorokowska, Agnieszka

    2016-04-01

    Vocal tract resonances provide reliable information about a speaker's body size that human listeners use for biosocial judgements as well as speech recognition. Although humans can accurately assess men's relative body size from the voice alone, how this ability is acquired remains unknown. In this study, we test the prediction that accurate voice-based size estimation is possible without prior audiovisual experience linking low frequencies to large bodies. Ninety-one healthy congenitally or early blind, late blind and sighted adults (aged 20-65) participated in the study. On the basis of vowel sounds alone, participants assessed the relative body sizes of male pairs of varying heights. Accuracy of voice-based body size assessments significantly exceeded chance and did not differ among participants who were sighted, or congenitally blind or who had lost their sight later in life. Accuracy increased significantly with relative differences in physical height between men, suggesting that both blind and sighted participants used reliable vocal cues to size (i.e. vocal tract resonances). Our findings demonstrate that prior visual experience is not necessary for accurate body size estimation. This capacity, integral to both nonverbal communication and speech perception, may be present at birth or may generalize from broader cross-modal correspondences. © 2016 The Author(s).

  20. Voice Quality in Mobile Telecommunication System

    Directory of Open Access Journals (Sweden)

    Evaldas Stankevičius

    2013-05-01

    The article deals with methods for measuring the quality of voice transmitted over the mobile network, as well as related problems, algorithms and options. It presents the created voice quality measurement system and discusses its adequacy as well as efficiency. In addition, the author presents the results of system application under the optimal hardware configuration. Under almost ideal conditions, the system evaluates the voice quality with an average MOS estimate of 3.85, while the standardized TEMS Investigation 9.0 has an average MOS estimate of 4.05. Next, the article discusses the implementation of a voice quality predictor and investigates the predictor using nonlinear and linear methods for predicting the dependence of voice quality on the mobile network settings. Nonlinear prediction using an artificial neural network resulted in a correlation coefficient of 0.62, while the linear prediction method using least mean squares resulted in a correlation coefficient of 0.57. The analytical expression of voice quality features from the three network parameters BER, C/I and RSSI is given as well. Article in Lithuanian.

  1. Comparing the accuracy of perturbative and variational calculations for predicting fundamental vibrational frequencies of dihalomethanes

    Science.gov (United States)

    Krasnoshchekov, Sergey V.; Schutski, Roman S.; Craig, Norman C.; Sibaev, Marat; Crittenden, Deborah L.

    2018-02-01

    Three dihalogenated methane derivatives (CH2F2, CH2FCl, and CH2Cl2) were used as model systems to compare and assess the accuracy of two different approaches for predicting observed fundamental frequencies: canonical operator Van Vleck vibrational perturbation theory (CVPT) and vibrational configuration interaction (VCI). For convenience and consistency, both methods employ the Watson Hamiltonian in rectilinear normal coordinates, expanding the potential energy surface (PES) as a Taylor series about equilibrium and constructing the wavefunction from a harmonic oscillator product basis. At the highest levels of theory considered here, fourth-order CVPT and VCI in a harmonic oscillator basis with up to 10 quanta of vibrational excitation in conjunction with a 4-mode representation sextic force field (SFF-4MR) computed at MP2/cc-pVTZ with replacement CCSD(T)/aug-cc-pVQZ harmonic force constants, the agreement between computed fundamentals is closer to 0.3 cm-1 on average, with a maximum difference of 1.7 cm-1. The major remaining accuracy-limiting factors are the accuracy of the underlying electronic structure model, followed by the incompleteness of the PES expansion. Nonetheless, computed and experimental fundamentals agree to within 5 cm-1, with an average difference of 2 cm-1, confirming the utility and accuracy of both theoretical models. One exception to this rule is the formally IR-inactive but weakly allowed through Coriolis-coupling H-C-H out-of-plane twisting mode of dichloromethane, whose spectrum we therefore revisit and reassign. We also investigate convergence with respect to order of CVPT, VCI excitation level, and order of PES expansion, concluding that premature truncation substantially decreases accuracy, although VCI(6)/SFF-4MR results are still of acceptable accuracy, and some error cancellation is observed with CVPT2 using a quartic force field.

  2. F0 Characteristics of Newsreaders on Varied Emotional Texts in Tamil Language.

    Science.gov (United States)

    Gunasekaran, Nishanthi; Boominathan, Prakash; Seethapathy, Jayashree

    2017-12-26

    The objective of this study was to profile speaking F0 and its variations in newsreaders on varied emotional texts. This study has a prospective, case-control study design. Fifteen professional newsreaders and 15 non-newsreaders were the participants. The participants read the news bulletin that conveyed different emotions (shock, neutral, happy, and sad) in a habitual and "newsreading" voice. Speaking fundamental frequency (SFF) and F0 variations were extracted from 1620 tokens using Praat software (version 5.2.32) on the opening lines, headlines, news stories, and closing lines of each news item. Paired t test, independent t test, and Friedman test were used for statistical analysis. Both male and female newsreaders had significantly (P ≤ 0.05) higher SFFs and standard deviations (SDs) of SFF in newsreading voice than speaking voice. Female non-newsreaders demonstrated significantly higher SFF and SD of SFF in newsreading voice, whereas no significant differences were noticed in the frequency parameters for male non-newsreaders. No significant difference was noted in the frequency parameters of speaking and newsreading voice between male newsreaders and male non-newsreaders. A significant difference in the SD of SFF was noticed between female newsreaders and female non-newsreaders in newsreading voice. Female newsreaders had a higher frequency range in both speaking voice and newsreading voice when compared with non-newsreaders. F0 characteristics and frequency range determine the amount of frequency changes exercised by newsreaders while reading bulletins. This information is highly pedagogic for training voices in this profession. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  3. Smartphone App for Voice Disorders

    Science.gov (United States)

    ... Fall 2013 ... developed a mobile monitoring device that relies on smartphone technology to gather a week's worth of talking, ...

  4. Hearing Voices and Seeing Things

    Science.gov (United States)

    ... No. 102; Updated October ... delusions (a fixed, false, and often bizarre belief). Hearing voices or seeing things that are not there ...

  5. Clinical Features of Psychogenic Voice Disorder and the Efficiency of Voice Therapy and Psychological Evaluation.

    Science.gov (United States)

    Tezcaner, Zahide Çiler; Gökmen, Muhammed Fatih; Yıldırım, Sibel; Dursun, Gürsel

    2017-11-06

    The aim of this study was to define the clinical features of psychogenic voice disorder (PVD) and explore the treatment efficiency of voice therapy and psychological evaluation. Fifty-eight patients who received treatment following the PVD diagnosis and had no organic or other functional voice disorders were assessed retrospectively based on laryngoscopic examinations and subjective and objective assessments. Epidemiological characteristics, accompanying organic and psychological disorders, preferred methods of treatment, and previous treatment outcomes were examined for each patient. A comparison was made based on voice disorders and responses to treatment between patients who received psychotherapy and patients who did not. Participants in this study comprised 58 patients, 10 male and 48 female. Voice therapy was applied in all patients, 54 (93.1%) of whom had improvement in their voice. Although all patients were advised to undergo psychological assessment, only 60.3% (35/58) of them did so. No statistically significant difference in treatment response was found between patients who received psychological support and patients who did not. Relapse occurred in 14.7% (5/34) of the patients who applied for psychological assessment and in 50% (10/20) of those who did not. There was a statistically significant difference in relapse rates, which were higher among patients who did not receive psychological support (P < ...). Voice therapy is an efficient treatment method for PVD. However, in the long-term follow-up, relapse of the disease is observed to be higher among patients who failed to follow up on the recommendation for psychological assessment. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
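
    The relapse comparison reported in this abstract (5 of 34 versus 10 of 20) can be checked with a short calculation. The sketch below is only an illustration: the abstract does not say which statistical test the authors used, so the two-sided Fisher exact test here is an assumption, and the helper name `fisher_exact_two_sided` is hypothetical.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p-value for the 2x2 table [[a, b], [c, d]].

    Assumption: this test is chosen for illustration; the abstract does not
    name the test that was actually applied.
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    denom = comb(row1 + row2, col1)

    def prob(k):
        # Hypergeometric probability of k events in row 1 with margins fixed
        return comb(row1, k) * comb(row2, col1 - k) / denom

    p_obs = prob(a)
    lo = max(0, col1 - row2)
    hi = min(row1, col1)
    # Sum the probabilities of every table no more likely than the observed one
    return sum(p for p in map(prob, range(lo, hi + 1)) if p <= p_obs * (1 + 1e-9))


# Counts taken from the abstract, as (relapse, no relapse)
assessed = (5, 29)        # 5 of 34 relapsed
not_assessed = (10, 10)   # 10 of 20 relapsed

rate_assessed = assessed[0] / sum(assessed)
rate_not = not_assessed[0] / sum(not_assessed)
p_value = fisher_exact_two_sided(*assessed, *not_assessed)
print(f"{rate_assessed:.1%} vs {rate_not:.1%}, p = {p_value:.4f}")
```

    Under this assumed test the p-value comes out below 0.05, which is consistent with the abstract's statement that the difference in relapse rates was statistically significant.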

  6. Reliability in perceptual analysis of voice quality.

    Science.gov (United States)

    Bele, Irene Velsvik

    2005-12-01

    This study focuses on speaking voice quality in male teachers (n = 35) and male actors (n = 36), who represent untrained and trained voice users, because we wanted to investigate normal and supranormal voices. In this study, both substantial and methodologic aspects were considered. It includes a method for perceptual voice evaluation, and a basic issue was rater reliability. A listening group of 10 listeners, 7 experienced speech-language therapists, and 3 speech-language therapist students evaluated the voices by 15 vocal characteristics using VA scales. Two sets of voice signals were investigated: text reading (2 loudness levels) and sustained vowel (3 levels). The results indicated a high interrater reliability for most perceptual characteristics. Connected speech was evaluated more reliably, especially at the normal level, but both types of voice signals were evaluated reliably, although the reliability for connected speech was somewhat higher than for vowels. Experienced listeners tended to be more consistent in their ratings than did the student raters. Some vocal characteristics achieved acceptable reliability even with a smaller panel of listeners. The perceptual characteristics grouped in 4 factors reflected perceptual dimensions.

  7. [Hearing voices does not always constitute a psychosis].

    Science.gov (United States)

    Sommer, I E C; van der Spek, D W

    2016-01-01

    Hearing voices (i.e. auditory verbal hallucinations) is mainly known as part of schizophrenia and other psychotic disorders. However, hearing voices is a symptom that can occur in many psychiatric, neurological and general medical conditions. We present three cases of non-psychotic patients with auditory verbal hallucinations caused by different disorders. The first patient is a 74-year-old male with voices due to hearing loss, the second is a 20-year-old woman with voices due to traumatisation. The third patient is a 27-year-old woman with voices caused by temporal lobe epilepsy. Hearing voices is a phenomenon that occurs in a variety of disorders. Therefore, identification of the underlying disorder is essential to indicate treatment. Improvement of coping with the voices can reduce their impact on a patient. Antipsychotic drugs are especially effective when hearing voices is accompanied by delusions or disorganization. When this is not the case, the efficacy of antipsychotic drugs will probably not outweigh the side-effects.

  8. Permanent Quadriplegia Following Replacement of Voice Prosthesis.

    Science.gov (United States)

    Ozturk, Kayhan; Erdur, Omer; Kibar, Ertugrul

    2016-11-01

    The authors present a patient with quadriplegia caused by a cervical spine abscess following voice prosthesis replacement, the first reported case of permanent quadriplegia caused by this procedure. The authors emphasize that life-threatening complications may arise during the replacement of a voice prosthesis. Care should be taken during the replacement, and patients must be followed closely if problems occur during the procedure.

  9. Updating signal typing in voice: addition of type 4 signals.

    Science.gov (United States)

    Sprecher, Alicia; Olszewski, Aleksandra; Jiang, Jack J; Zhang, Yu

    2010-06-01

    The addition of a fourth type of voice to Titze's voice classification scheme is proposed. This fourth voice type is characterized by primarily stochastic noise behavior and is therefore unsuitable for both perturbation and correlation dimension analysis. Forty voice samples were classified into the proposed four types using narrowband spectrograms. Acoustic, perceptual, and correlation dimension analyses were completed for all voice samples. Perturbation measures tended to increase with voice type. Based on reliability cutoffs, the type 1 and type 2 voices were considered suitable for perturbation analysis. Measures of unreliability were higher for type 3 and 4 voices. Correlation dimension analyses increased significantly with signal type as indicated by a one-way analysis of variance. Notably, correlation dimension analysis could not quantify the type 4 voices. The proposed fourth voice type represents a subset of voices dominated by noise behavior. Current measures capable of evaluating type 4 voices provide only qualitative data (spectrograms, perceptual analysis, and an infinite correlation dimension). Type 4 voices are highly complex and the development of objective measures capable of analyzing these voices remains a topic of future investigation.

  10. Voice amplification for primary school teachers with voice disorders: a randomized clinical trial.

    Science.gov (United States)

    Bovo, Roberto; Trevisi, Patrizia; Emanuelli, Enzo; Martini, Alessandro

    2013-06-01

    Several studies have demonstrated a high prevalence of voice disorders in teachers, together with the personal, professional and economical consequences of the problem. Good primary prevention should be based on 3 aspects: 1) amelioration of classroom acoustics, 2) voice care programs for future professional voice users, including teachers and 3) classroom or portable amplification systems. The aim of the study was to assess the benefit obtained from the use of portable amplification systems by female primary school teachers in their occupational setting. Forty female primary school teachers attended a course about professional voice care, which comprised two theoretical lectures, each 60 min long. Thereafter, they were randomized into 2 groups: the teachers of the first group were asked to use a portable vocal amplifier for 3 months, till the end of school-year. The other 20 teachers were part of the control group, matched for age and years of employment. All subjects had a grade 1 of dysphonia with no significant organic lesion of the vocal folds. Most teachers of the experimental group used the amplifier consistently for the whole duration of the experiment and found it very useful in reducing the symptoms of vocal fatigue. In fact, after 3 months, Voice Handicap Index (VHI) scores in "course + amplifier" group demonstrated a significant amelioration (p = 0.003). The perceptual grade of dysphonia also improved significantly (p = 0.0005). The same parameters changed favourably also in the "course only" group, but the results were not statistically significant (p = 0.4 for VHI and p = 0.03 for perceptual grade). In teachers, and particularly in those with a constitutional weak voice and/or those who are prone to vocal fold pathology, vocal amplifiers may be an effective and low-cost intervention to decrease potentially damaging vocal loads and may represent a necessary form of prevention.

  11. Measurement of voice onset time in maxillectomy patients.

    Science.gov (United States)

    Hattori, Mariko; Sumita, Yuka I; Taniguchi, Hisashi

    2014-01-01

    Objective speech evaluation using acoustic measurement is needed for the proper rehabilitation of maxillectomy patients. For digital evaluation of consonants, measurement of voice onset time is one option. However, voice onset time has not been measured in maxillectomy patients as their consonant sound spectra exhibit unique characteristics that make the measurement of voice onset time challenging. In this study, we established criteria for measuring voice onset time in maxillectomy patients for objective speech evaluation. We examined voice onset time for /ka/ and /ta/ in 13 maxillectomy patients by calculating the number of valid measurements of voice onset time out of three trials for each syllable. Wilcoxon's signed rank test showed that voice onset time measurements were more successful for /ka/ and /ta/ when a prosthesis was used (Z = -2.232, P = 0.026 and Z = -2.401, P = 0.016, resp.) than when a prosthesis was not used. These results indicate a prosthesis affected voice onset measurement in these patients. Although more research in this area is needed, measurement of voice onset time has the potential to be used to evaluate consonant production in maxillectomy patients wearing a prosthesis.

  12. Measurement of Voice Onset Time in Maxillectomy Patients

    Directory of Open Access Journals (Sweden)

    Mariko Hattori

    2014-01-01

    Objective speech evaluation using acoustic measurement is needed for the proper rehabilitation of maxillectomy patients. For digital evaluation of consonants, measurement of voice onset time is one option. However, voice onset time has not been measured in maxillectomy patients as their consonant sound spectra exhibit unique characteristics that make the measurement of voice onset time challenging. In this study, we established criteria for measuring voice onset time in maxillectomy patients for objective speech evaluation. We examined voice onset time for /ka/ and /ta/ in 13 maxillectomy patients by calculating the number of valid measurements of voice onset time out of three trials for each syllable. Wilcoxon’s signed rank test showed that voice onset time measurements were more successful for /ka/ and /ta/ when a prosthesis was used (Z = −2.232, P = 0.026 and Z = −2.401, P = 0.016, resp.) than when a prosthesis was not used. These results indicate a prosthesis affected voice onset measurement in these patients. Although more research in this area is needed, measurement of voice onset time has the potential to be used to evaluate consonant production in maxillectomy patients wearing a prosthesis.

  13. Multidimensional assessment of strongly irregular voices such as in substitution voicing and spasmodic dysphonia: a compilation of own research.

    Science.gov (United States)

    Moerman, Mieke; Martens, Jean-Pierre; Dejonckere, Philippe

    2015-04-01

    This article is a compilation of the authors' own research performed during the European COoperation in Science and Technology (COST) action 2103: 'Advanced Voice Function Assessment', an initiative of voice and speech processing teams consisting of physicists, engineers, and clinicians. This manuscript concerns the analysis of highly irregular voicing types, namely substitution voicing (SV) and adductor spasmodic dysphonia (AdSD). A specific perceptual rating scale (IINFVo) was developed, and the Auditory Model Based Pitch Extractor (AMPEX), a piece of software that automatically analyses running speech and generates pitch values in background noise, was applied. The IINFVo perceptual rating scale has been shown to be useful in evaluating SV. The analysis of strongly irregular voices stimulated a modification of the European Laryngological Society's assessment protocol, which was originally designed for the common types of (less severe) dysphonia. Acoustic analysis with AMPEX demonstrates that the most informative features are, for SV, the voicing-related acoustic features and, for AdSD, the perturbation measures. Poor correlations between self-assessment and acoustic and perceptual dimensions in the assessment of highly irregular voices argue for a multidimensional approach.

  14. Voice and choice by delegation.

    Science.gov (United States)

    van de Bovenkamp, Hester; Vollaard, Hans; Trappenburg, Margo; Grit, Kor

    2013-02-01

    In many Western countries, options for citizens to influence public services are increased to improve the quality of services and democratize decision making. Possibilities to influence are often cast into Albert Hirschman's taxonomy of exit (choice), voice, and loyalty. In this article we identify delegation as an important addition to this framework. Delegation gives individuals the chance to practice exit/choice or voice without all the hard work that is usually involved in these options. Empirical research shows that not many people use their individual options of exit and voice, which could lead to inequality between users and nonusers. We identify delegation as a possible solution to this problem, using Dutch health care as a case study to explore this option. Notwithstanding various advantages, we show that voice and choice by delegation also entail problems of inequality and representativeness.

  15. Natural asynchronies in audiovisual communication signals regulate neuronal multisensory interactions in voice-sensitive cortex.

    Science.gov (United States)

    Perrodin, Catherine; Kayser, Christoph; Logothetis, Nikos K; Petkov, Christopher I

    2015-01-06

    When social animals communicate, the onset of informative content in one modality varies considerably relative to the other, such as when visual orofacial movements precede a vocalization. These naturally occurring asynchronies do not disrupt intelligibility or perceptual coherence. However, they occur on time scales where they likely affect integrative neuronal activity in ways that have remained unclear, especially for hierarchically downstream regions in which neurons exhibit temporally imprecise but highly selective responses to communication signals. To address this, we exploited naturally occurring face- and voice-onset asynchronies in primate vocalizations. Using these as stimuli we recorded cortical oscillations and neuronal spiking responses from functional MRI (fMRI)-localized voice-sensitive cortex in the anterior temporal lobe of macaques. We show that the onset of the visual face stimulus resets the phase of low-frequency oscillations, and that the face-voice asynchrony affects the prominence of two key types of neuronal multisensory responses: enhancement or suppression. Our findings show a three-way association between temporal delays in audiovisual communication signals, phase-resetting of ongoing oscillations, and the sign of multisensory responses. The results reveal how natural onset asynchronies in cross-sensory inputs regulate network oscillations and neuronal excitability in the voice-sensitive cortex of macaques, a suggested animal model for human voice areas. These findings also advance predictions on the impact of multisensory input on neuronal processes in face areas and other brain regions.

  16. Singing Voice Analysis, Synthesis, and Modeling

    Science.gov (United States)

    Kim, Youngmoo E.

    The singing voice is the oldest musical instrument, but its versatility and emotional power are unmatched. Through the combination of music, lyrics, and expression, the voice is able to affect us in ways that no other instrument can. The fact that vocal music is prevalent in almost all cultures is indicative of its innate appeal to the human aesthetic. Singing also permeates most genres of music, attesting to the wide range of sounds the human voice is capable of producing. As listeners we are naturally drawn to the sound of the human voice, and, when present, it immediately becomes the focus of our attention.

  17. Voice stress analysis and evaluation

    Science.gov (United States)

    Haddad, Darren M.; Ratley, Roy J.

    2001-02-01

    Voice Stress Analysis (VSA) systems are marketed as computer-based systems capable of measuring stress in a person's voice as an indicator of deception. They are advertised as being less expensive, easier to use, less invasive in use, and less constrained in their operation than polygraph technology. The National Institute of Justice has asked the Air Force Research Laboratory for assistance in evaluating voice stress analysis technology. Law enforcement officials have also been asking questions about this technology. If VSA technology proves to be effective, its value for military and law enforcement application is tremendous.

  18. Effects of Medications on Voice

    Science.gov (United States)

    ... replacement therapy post-menopause may have a variable effect. An inadequate level of thyroid replacement medication in ...

  19. Technological Fundamentalism? The Use of Unmanned Aerial Vehicles in the Conduct of War

    OpenAIRE

    Futrell, Doris J.

    2004-01-01

    There is an ongoing battle in the Department of Defense between reason and the faith in technology. Those subscribing to technological fundamentalism are blind to the empirical evidence that their faith in technology is obscuring the technological limitations that are evident. The desire for information dominance to reach the state of total transparency of the opponent in order to win the war is untenable. The reasoning voiced by skeptics should be heeded but the technological fundamentalis...

  20. The Voices of the Documentarist

    Science.gov (United States)

    Utterback, Ann S.

    1977-01-01

    Discusses T. S. Eliot's essay, "The Three Voices of Poetry," which conceptualizes the position taken by the poet or creator. Suggests that an examination of documentary film, within the three voices concept, expands the critical framework of the film genre. (MH)

  1. Effects of Variability in Fundamental Frequency on L2 Vocabulary Learning: A Comparison between Learners Who Do and Do Not Speak a Tone Language

    Science.gov (United States)

    Barcroft, Joe; Sommers, Mitchell S.

    2014-01-01

    Previous studies (Barcroft & Sommers, 2005; Sommers & Barcroft, 2007) have demonstrated that variability in talker, speaking style, and speaking rate positively affect second language vocabulary learning, whereas variability in overall amplitude and fundamental frequency (F0) do not, at least for native English speakers. Sommers and…

  2. Obligatory and facultative brain regions for voice-identity recognition

    Science.gov (United States)

    Roswandowitz, Claudia; Kappes, Claudia; Obrig, Hellmuth; von Kriegstein, Katharina

    2018-01-01

    Recognizing the identity of others by their voice is an important skill for social interactions. To date, it remains controversial which parts of the brain are critical structures for this skill. Based on neuroimaging findings, standard models of person-identity recognition suggest that the right temporal lobe is the hub for voice-identity recognition. Neuropsychological case studies, however, reported selective deficits of voice-identity recognition in patients predominantly with right inferior parietal lobe lesions. Here, our aim was to work towards resolving the discrepancy between neuroimaging studies and neuropsychological case studies to find out which brain structures are critical for voice-identity recognition in humans. We performed a voxel-based lesion-behaviour mapping study in a cohort of patients (n = 58) with unilateral focal brain lesions. The study included a comprehensive behavioural test battery on voice-identity recognition of newly learned (voice-name, voice-face association learning) and familiar voices (famous voice recognition) as well as visual (face-identity recognition) and acoustic control tests (vocal-pitch and vocal-timbre discrimination). The study also comprised clinically established tests (neuropsychological assessment, audiometry) and high-resolution structural brain images. The three key findings were: (i) a strong association between voice-identity recognition performance and right posterior/mid temporal and right inferior parietal lobe lesions; (ii) a selective association between right posterior/mid temporal lobe lesions and voice-identity recognition performance when face-identity recognition performance was factored out; and (iii) an association of right inferior parietal lobe lesions with tasks requiring the association between voices and faces but not voices and names. The results imply that the right posterior/mid temporal lobe is an obligatory structure for voice-identity recognition, while the inferior parietal

  3. The effect of voice quality on hiring decisions

    Directory of Open Access Journals (Sweden)

    Lea Tylečková

    2017-09-01

    This paper examines the effect of voice quality on hiring decisions. Voice quality is an important tool in an individual’s self-presentation in the job market: it may very well enhance his/her job prospects, while some voice qualities may affect employers’ judgments in a negative way. Five men and five women were recorded reading four different utterances representing answers to job interviewers’ questions in four different phonation guises: modal, breathy, creaky and pressed. Thirty-eight professional employment interviewers rated the speakers’ hireability and personality (likeability, self-confidence and trustworthiness) on 7-point semantic differential scales based on the speakers’ voice. The results revealed a significant effect of the phonation guises on the speakers’ ratings, with the modal voice being superior to the cluster of non-modal voices. Interestingly, the non-modal guises were evaluated in a very similar way, except for the self-confidence category, where the breathy voice received the lowest scores and the pressed voice correlated with high self-confidence ratings.

  4. Can a voice disorder be an occupational disease?

    Directory of Open Access Journals (Sweden)

    Daša Gluvajić

    2012-11-01

    Voice disorders are all changes in voice quality that can be detected by hearing. Some etiological factors that contribute to the development of voice disorders are related to occupation, working environment and working conditions. In modern societies one third of the labour force works in professions with vocal loading. In such professions, voice disorders influence work ability and quality of life. For an occupational disease, exposure to harmful factors in the workplace is essential and causes the development of a disorder in a previously healthy individual. In some European countries, voice disorders in teachers that do not improve after proper treatment are recognized as occupational diseases. In Slovenia, no organic or functional voice disorder is included in the current list of occupational diseases. Prevention and cure of occupational voice disorders can contribute to better safety at the workplace and improve the workers’ health. Voice professionals must also know that they are responsible for their own health and that they must actively take care of it.

  5. The Voice as Computer Interface: A Look at Tomorrow's Technologies.

    Science.gov (United States)

    Lange, Holley R.

    1991-01-01

    Discussion of voice as the communications device for computer-human interaction focuses on voice recognition systems for use within a library environment. Voice technologies are described, including voice response and voice recognition; examples of voice systems in use in libraries are examined; and further possibilities, including use with…

  6. Probing echoic memory with different voices.

    Science.gov (United States)

    Madden, D J; Bastian, J

    1977-05-01

    Considerable evidence has indicated that some acoustical properties of spoken items are preserved in an "echoic" memory for approximately 2 sec. However, some of this evidence has also shown that changing the voice speaking the stimulus items has a disruptive effect on memory which persists longer than that of other acoustical variables. The present experiment examined the effect of voice changes on response bias as well as on accuracy in a recognition memory task. The task involved judging recognition probes as being present in or absent from sets of dichotically presented digits. Recognition of probes spoken in the same voice as that of the dichotic items was more accurate than recognition of different-voice probes at each of three retention intervals of up to 4 sec. Different-voice probes increased the likelihood of "absent" responses, but only up to a 1.4-sec delay. These shifts in response bias may represent a property of echoic memory which should be investigated further.

  7. Voice disorders in teachers. A review.

    Science.gov (United States)

    Martins, Regina Helena Garcia; Pereira, Eny Regina Bóia Neves; Hidalgo, Caio Bosque; Tavares, Elaine Lara Mendes

    2014-11-01

    Voice disorders are very prevalent among teachers, and the consequences are serious. Although the literature is extensive, there are differences in the concepts and methodology related to voice problems; most studies are restricted to analyzing the responses of teachers to questionnaires, and only a few studies include vocal assessments and videolaryngoscopic examinations to obtain a definitive diagnosis. The aim was to review demographic studies related to vocal disorders in teachers in order to analyze the diverse methodologies, the prevalence rates reported by the authors, the main risk factors, the most prevalent laryngeal lesions, and the repercussions of dysphonias on professional activities. The available literature (from 1997 to 2013) was narratively reviewed based on the Medline, PubMed, Lilacs, SciELO, and Cochrane library databases. Articles that specifically analyzed treatment modalities and those whose abstracts were not available in those databases were excluded. The keywords were teacher, dysphonia, voice disorders, and professional voice. Copyright © 2014 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  8. Fundamentals of electro-engineering I

    International Nuclear Information System (INIS)

    Rapsik, M.; Smola, M.; Bohac, M.; Mucha, M.

    2004-01-01

    This is a textbook on the fundamentals of electro-engineering. It contains the following chapters: (1) Selected terms in electro-engineering; (2) Fundamental electric values; (3) Energy and its transformations; (4) Water, hydro-energy and hydro-energetic potential of the Slovak Republic; (5) Nuclear power engineering; (6) Conventional thermal power plants; (7) Heating and cogeneration of electric power and heat production; (8) Equipment of electricity supply system; (9) Measurements in electro-engineering; (10) Regulation of frequency and voltage, electric power quality.

  9. Noise Source Visualization Using a Digital Voice Recorder and Low-Cost Sensors.

    Science.gov (United States)

    Cho, Yong Thung

    2018-04-03

    Accurate sound visualization of noise sources is required for optimal noise control. Typically, noise measurement systems require microphones, an analog-digital converter, cables, a data acquisition system, etc., which may not be affordable for potential users. Also, many such systems are not highly portable and may not be convenient for travel. Handheld personal electronic devices such as smartphones and digital voice recorders with relatively lower costs and higher performance have become widely available recently. Even though such devices are highly portable, directly implementing them for noise measurement may lead to erroneous results since such equipment was originally designed for voice recording. In this study, external microphones were connected to a digital voice recorder to conduct measurements and the input received was processed for noise visualization. In this way, a low cost, compact sound visualization system was designed and introduced to visualize two actual noise sources for verification with different characteristics: an enclosed loud speaker and a small air compressor. Reasonable accuracy of noise visualization for these two sources was shown over a relatively wide frequency range. This very affordable and compact sound visualization system can be used for many actual noise visualization applications in addition to educational purposes.

  10. Efeitos agudos laringológicos e vocais da radioiodoterapia em pacientes com hipertireoidismo por doença de Basedow Graves [Acute effects of radioiodine therapy on the voice and larynx of Basedow-Graves patients]

    Directory of Open Access Journals (Sweden)

    Roberta Werlang Isolan-Cury

    2008-04-01

    Graves' disease is the most common form of hyperthyroidism, and three therapeutic approaches are currently used: antithyroid medication, surgery, and radioactive iodine (I-131). The effects of I-131 and the early induction of hypothyroidism are consequences of the destruction that I-131 induces in the thyroid parenchyma. Few reports in the literature address the effects of radioiodine therapy on the larynx and, consequently, on voice production. AIM: To assess the acute effects of radioiodine therapy on the voice of patients with hyperthyroidism due to Basedow-Graves disease. MATERIAL AND METHOD: Prospective longitudinal cohort study. Procedures: voice investigation, measurement of maximum phonation time for /a/ and the s/z ratio, fundamental frequency analysis (Praat software), laryngoscopy, and auditory-perceptual analysis at three time points: pre-dose, 4 days post-dose, and 20 days post-dose. The time points were based on the inflammatory profile of the thyroid tissue. RESULTS: There were no statistically significant changes in vocal or laryngological aspects across the three time points. CONCLUSION: Radioiodine therapy does not affect voice quality.

  11. Optimal dose-response relationships in voice therapy.

    Science.gov (United States)

    Roy, Nelson

    2012-10-01

    Like other areas of speech-language pathology, the behavioural management of voice disorders lacks precision regarding optimal dose-response relationships. In voice therapy, dosing can presumably vary from no measurable effect (i.e., no observable benefit or adverse effect), to ideal dose (maximum benefit with no adverse effects), to doses that produce toxic or harmful effects on voice production. Practicing specific vocal exercises will inevitably increase vocal load. At ideal doses, these exercises may be non-toxic and beneficial, while at intermediate or high doses, the same exercises may actually be toxic or damaging to vocal fold tissues. In pharmacology, toxicity is a critical concept, yet it is rarely considered in voice therapy, with little known regarding "effective" concentrations of specific voice therapies vs "toxic" concentrations. The potential for vocal fold tissue damage related to overdosing on specific vocal exercises has been under-studied. In this commentary, the issue of dosing will be explored within the context of voice therapy, with particular emphasis placed on possible "overdosing".

  12. The recognition of female voice based on voice registers in singing techniques in real-time using hankel transform method and macdonald function

    Science.gov (United States)

    Meiyanti, R.; Subandi, A.; Fuqara, N.; Budiman, M. A.; Siahaan, A. P. U.

    2018-03-01

    A singer does not simply recite the lyrics of a song but also uses particular vocal techniques to make it more beautiful. In singing technique, female singers tend to have more diverse voice registers than male singers. The human voice has many registers; those used in singing include chest voice, head voice, falsetto, and vocal fry. A speech recognition system based on female voice registers in singing technique was built using Borland Delphi 7.0. Recognition is performed on recorded voice samples and also in real time. Voice input yields weighted energy values calculated using the Hankel transform method and Macdonald functions. The results showed that the accuracy of the system depends on the accuracy of the vocal techniques that were trained and tested; the average recognition success rate was 48.75 percent for recorded voice registers and 57 percent for real-time input.

  13. Human voice perception.

    Science.gov (United States)

    Latinus, Marianne; Belin, Pascal

    2011-02-22

    We are all voice experts. First and foremost, we can produce and understand speech, and this makes us a unique species. But in addition to speech perception, we routinely extract from voices a wealth of socially-relevant information in what constitutes a more primitive, and probably more universal, non-linguistic mode of communication. Consider the following example: you are sitting in a plane, and you can hear a conversation in a foreign language in the row behind you. You do not see the speakers' faces, and you cannot understand the speech content because you do not know the language. Yet, an amazing amount of information is available to you. You can evaluate the physical characteristics of the different protagonists, including their gender, approximate age and size, and associate an identity to the different voices. You can form a good idea of the different speaker's mood and affective state, as well as more subtle cues as the perceived attractiveness or dominance of the protagonists. In brief, you can form a fairly detailed picture of the type of social interaction unfolding, which a brief glance backwards can on the occasion help refine - sometimes surprisingly so. What are the acoustical cues that carry these different types of vocal information? How does our brain process and analyse this information? Here we briefly review an emerging field and the main tools used in voice perception research. Copyright © 2011 Elsevier Ltd. All rights reserved.

  14. How to help teachers' voices.

    Science.gov (United States)

    Saatweber, Margarete

    2008-01-01

    It has been shown that teachers are at high risk of developing occupational dysphonia, and it has been widely accepted that the vocal characteristics of a speaker play an important role in determining the reactions of listeners. The functions of breathing, breathing movement, breathing tonus, voice vibrations and articulation tonus are transmitted to the listener. So we may conclude that listening to the teacher's voice at school influences children's behavior and the perception of spoken language. This paper presents the concept of Schlaffhorst-Andersen including exercises to help teachers improve their voice, breathing, movement and their posture. Copyright 2008 S. Karger AG, Basel.

  15. Implicit multisensory associations influence voice recognition.

    Directory of Open Access Journals (Sweden)

    Katharina von Kriegstein

    2006-10-01

    Full Text Available Natural objects provide partially redundant information to the brain through different sensory modalities. For example, voices and faces both give information about the speech content, age, and gender of a person. Thanks to this redundancy, multimodal recognition is fast, robust, and automatic. In unimodal perception, however, only part of the information about an object is available. Here, we addressed whether, even under conditions of unimodal sensory input, crossmodal neural circuits that have been shaped by previous associative learning become activated and underpin a performance benefit. We measured brain activity with functional magnetic resonance imaging before, while, and after participants learned to associate either sensory redundant stimuli, i.e. voices and faces, or arbitrary multimodal combinations, i.e. voices and written names, ring tones, and cell phones or brand names of these cell phones. After learning, participants were better at recognizing unimodal auditory voices that had been paired with faces than those paired with written names, and association of voices with faces resulted in an increased functional coupling between voice and face areas. No such effects were observed for ring tones that had been paired with cell phones or names. These findings demonstrate that brief exposure to ecologically valid and sensory redundant stimulus pairs, such as voices and faces, induces specific multisensory associations. Consistent with predictive coding theories, associative representations become thereafter available for unimodal perception and facilitate object recognition. These data suggest that for natural objects effective predictive signals can be generated across sensory systems and proceed by optimization of functional connectivity between specialized cortical sensory modules.

  16. Obligatory and facultative brain regions for voice-identity recognition.

    Science.gov (United States)

    Roswandowitz, Claudia; Kappes, Claudia; Obrig, Hellmuth; von Kriegstein, Katharina

    2018-01-01

    Recognizing the identity of others by their voice is an important skill for social interactions. To date, it remains controversial which parts of the brain are critical structures for this skill. Based on neuroimaging findings, standard models of person-identity recognition suggest that the right temporal lobe is the hub for voice-identity recognition. Neuropsychological case studies, however, reported selective deficits of voice-identity recognition in patients predominantly with right inferior parietal lobe lesions. Here, our aim was to work towards resolving the discrepancy between neuroimaging studies and neuropsychological case studies to find out which brain structures are critical for voice-identity recognition in humans. We performed a voxel-based lesion-behaviour mapping study in a cohort of patients (n = 58) with unilateral focal brain lesions. The study included a comprehensive behavioural test battery on voice-identity recognition of newly learned (voice-name, voice-face association learning) and familiar voices (famous voice recognition) as well as visual (face-identity recognition) and acoustic control tests (vocal-pitch and vocal-timbre discrimination). The study also comprised clinically established tests (neuropsychological assessment, audiometry) and high-resolution structural brain images. The three key findings were: (i) a strong association between voice-identity recognition performance and right posterior/mid temporal and right inferior parietal lobe lesions; (ii) a selective association between right posterior/mid temporal lobe lesions and voice-identity recognition performance when face-identity recognition performance was factored out; and (iii) an association of right inferior parietal lobe lesions with tasks requiring the association between voices and faces but not voices and names. The results imply that the right posterior/mid temporal lobe is an obligatory structure for voice-identity recognition, while the inferior parietal lobe is

  17. Voice amplification for primary school teachers with voice disorders: A randomized clinical trial

    Directory of Open Access Journals (Sweden)

    Roberto Bovo

    2013-06-01

    Full Text Available Objectives: Several studies have demonstrated a high prevalence of voice disorders in teachers, together with the personal, professional and economic consequences of the problem. Good primary prevention should be based on 3 aspects: (1) amelioration of classroom acoustics, (2) voice care programs for future professional voice users, including teachers, and (3) classroom or portable amplification systems. The aim of the study was to assess the benefit obtained from the use of portable amplification systems by female primary school teachers in their occupational setting. Materials and Methods: Forty female primary school teachers attended a course about professional voice care, which comprised two theoretical lectures, each 60 min long. Thereafter, they were randomized into 2 groups: the teachers of the first group were asked to use a portable vocal amplifier for 3 months, until the end of the school year. The other 20 teachers were part of the control group, matched for age and years of employment. All subjects had grade 1 dysphonia with no significant organic lesion of the vocal folds. Results: Most teachers of the experimental group used the amplifier consistently for the whole duration of the experiment and found it very useful in reducing the symptoms of vocal fatigue. In fact, after 3 months, Voice Handicap Index (VHI) scores in the "course + amplifier" group demonstrated a significant amelioration (p = 0.003). The perceptual grade of dysphonia also improved significantly (p = 0.0005). The same parameters also changed favourably in the "course only" group, but the results were not statistically significant (p = 0.4 for VHI and p = 0.03 for perceptual grade). Conclusions: In teachers, and particularly in those with a constitutionally weak voice and/or those who are prone to vocal fold pathology, vocal amplifiers may be an effective and low-cost intervention to decrease potentially damaging vocal loads and may represent a necessary form of prevention.

  18. Test-retest reliability for aerodynamic measures of voice.

    Science.gov (United States)

    Awan, Shaheen N; Novaleski, Carolyn K; Yingling, Julie R

    2013-11-01

    The purpose of this study was to investigate the intrasubject reliability of aerodynamic characteristics of the voice within typical/normal speakers across testing sessions using the Phonatory Aerodynamic System (PAS 6600; KayPENTAX, Montvale, NJ). Participants were 60 healthy young adults (30 males and 30 females) between the ages of 18 and 31 years with perceptually typical voice. Participants were tested using the PAS 6600 on two separate days, approximately 1 week apart, at approximately the same time of day. Four PAS protocols were conducted (vital capacity, maximum sustained phonation, comfortable sustained phonation, and voicing efficiency), and measures of expiratory volume, maximum phonation time, mean expiratory airflow (during vowel production) and target airflow (obtained via syllable repetition), peak air pressure, aerodynamic power, aerodynamic resistance, and aerodynamic efficiency were obtained during each testing session. Associated acoustic measures of vocal intensity and frequency were also collected. All phonations were elicited at comfortable pitch and loudness. All aerodynamic and associated variables evaluated in this study showed useable test-retest reliability (i.e., intraclass correlation coefficients [ICCs] ≥ 0.60). A high degree of mean test-retest reliability was found across all subjects for aerodynamic and associated acoustic measurements of vital capacity, maximum sustained phonation, glottal resistance, and vocal intensity (all with ICCs > 0.75). Although strong ICCs were observed for measures of glottal power and mean expiratory airflow in males, weaker overall results for these measures (ICC range: 0.60-0.67) were observed in female subjects, and sizable coefficients of variation were observed for measures of power, resistance, and efficiency in both men and women. Differences in degree of reliability from measure to measure were revealed in greater detail using methods such as ICCs and
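
    As an illustration of the test-retest statistic named in this abstract, the following is a minimal sketch of an intraclass correlation computed from two sessions per subject. The abstract does not say which ICC form was used; the two-way, absolute-agreement, single-measure form (often written ICC(2,1)) is an assumption here, and the function name `icc_2_1` and the sample values are hypothetical.

```python
def icc_2_1(data):
    """Two-way random-effects, absolute-agreement, single-measure ICC.

    data: list of subjects, each a list of k session scores.
    The choice of this ICC form is an assumption, not taken from the study.
    """
    n = len(data)          # number of subjects
    k = len(data[0])       # number of sessions
    grand = sum(sum(row) for row in data) / (n * k)
    row_means = [sum(row) / k for row in data]
    col_means = [sum(col) / n for col in zip(*data)]

    # Sums of squares for subjects (rows), sessions (columns), and residual.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in data for x in row)
    ss_err = ss_total - ss_rows - ss_cols

    msr = ss_rows / (n - 1)
    msc = ss_cols / (k - 1)
    mse = ss_err / ((n - 1) * (k - 1))
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)


# Hypothetical maximum phonation times (s) for three subjects, two sessions.
sessions = [[18.0, 19.0], [22.0, 23.0], [26.0, 27.0]]
print(round(icc_2_1(sessions), 3))
```

    Because this form measures absolute agreement, a constant shift between sessions lowers the coefficient even when subjects keep the same rank order.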

  19. Voiced Excitations

    National Research Council Canada - National Science Library

    Holzricher, John

    2004-01-01

    To more easily obtain a voiced excitation function for speech characterization, measurements of skin, tracheal tube, and vocal fold motions were made and compared to EM sensor-glottal derived...

  20. Voice Disorders in Occupations with Vocal Load in Slovenia.

    Science.gov (United States)

    Boltežar, Lučka; Šereg Bahar, Maja

    2014-12-01

    The aim of this paper is to compare the prevalence of voice disorders and the risk factors for them in different occupations with a vocal load in Slovenia. A meta-analysis of six different Slovenian studies involving teachers, physicians, salespeople, Catholic priests, nurses and speech-and-language therapists (SLTs) was performed. In all six studies, similar questions about the prevalence of voice disorders and the causes for them were included. The comparison of the six studies showed that more than 82% of the 2347 included subjects had voice problems at some time during their career. The teachers were the most affected by voice problems. The prevalent cause of voice problems was the vocal load in teachers and salespeople and respiratory-tract infections in all the other occupational groups. When the occupational groups were compared, it was found that the teachers had more voice problems and showed less care for their voices than the priests. The physicians had more voice problems and showed better consideration of vocal hygiene rules than the SLTs. The majority of all the included subjects did not receive instructions about voice care during education. In order to decrease the prevalence of voice disorders in vocal professionals, a screening program is recommended before the beginning of their studies. Regular courses on voice care and proper vocal technique should be obligatory for all professional voice users during their career. The inclusion of dysphonia in the list of occupational diseases should be considered in Slovenia as it is in some European countries.

  1. Comparison of fundamental, second harmonic, and superharmonic imaging: a simulation study.

    Science.gov (United States)

    van Neer, Paul L M J; Danilouchkine, Mikhail G; Verweij, Martin D; Demi, Libertario; Voormolen, Marco M; van der Steen, Anton F W; de Jong, Nico

    2011-11-01

    In medical ultrasound, fundamental imaging (FI) uses the reflected echoes from the same spectral band as that of the emitted pulse. The transmission frequency determines the trade-off between penetration depth and spatial resolution. Tissue harmonic imaging (THI) employs the second harmonic of the emitted frequency band to construct images. Recently, superharmonic imaging (SHI) has been introduced, which uses the third to the fifth (super) harmonics. The harmonic level is determined by two competing phenomena: nonlinear propagation and frequency dependent attenuation. Thus, the transmission frequency yielding the optimal trade-off between the spatial resolution and the penetration depth differs for THI and SHI. This paper quantitatively compares the concepts of fundamental, second harmonic, and superharmonic echocardiography at their optimal transmission frequencies. Forward propagation is modeled using a 3D-KZK implementation and the iterative nonlinear contrast source (INCS) method. Backpropagation is assumed to be linear. Results show that the fundamental lateral beamwidth is the narrowest at focus, while the superharmonic one is narrower outside the focus. The lateral superharmonic roll-off exceeds the fundamental and second harmonic roll-off. Also, the axial resolution of SHI exceeds that of FI and THI. The far-field pulse-echo superharmonic pressure is lower than that of the fundamental and second harmonic. SHI appears suited for echocardiography and is expected to improve its image quality at the cost of a slight reduction in depth-of-field.

  2. Measurement of Voice Onset Time in Maxillectomy Patients

    OpenAIRE

    Hattori, Mariko; Sumita, Yuka I.; Taniguchi, Hisashi

    2014-01-01

    Objective speech evaluation using acoustic measurement is needed for the proper rehabilitation of maxillectomy patients. For digital evaluation of consonants, measurement of voice onset time is one option. However, voice onset time has not been measured in maxillectomy patients as their consonant sound spectra exhibit unique characteristics that make the measurement of voice onset time challenging. In this study, we established criteria for measuring voice onset time in maxillectomy patients ...

  3. Voice, Schooling, Inequality, and Scale

    Science.gov (United States)

    Collins, James

    2013-01-01

    The rich studies in this collection show that the investigation of voice requires analysis of "recognition" across layered spatial-temporal and sociolinguistic scales. I argue that the concepts of voice, recognition, and scale provide insight into contemporary educational inequality and that their study benefits, in turn, from paying attention to…

  4. Voice restoration following total laryngectomy by tracheoesophageal prosthesis: Effect on patients' quality of life and voice handicap in Jordan

    Directory of Open Access Journals (Sweden)

    Wreikat Mahmoud M

    2008-03-01

    Full Text Available Background: Little has been reported about the impact of tracheoesophageal (TE) speech on individuals in the Middle East, where the procedure has been gaining in popularity. After total laryngectomy, individuals in Europe and North America have rated their quality of life as being lower than non-laryngectomized individuals. The purpose of this study was to evaluate changes in quality of life and degree of voice handicap reported by laryngectomized speakers from Jordan before and after establishment of TE speech. Methods: Twelve male Jordanian laryngectomees completed the University of Michigan Head & Neck Quality of Life instrument and the Voice Handicap Index pre- and post-TE puncture. Results: All subjects showed significant improvements in their quality of life following successful prosthetic voice restoration. In addition, voice handicap scores were significantly reduced from pre- to post-TE puncture. Conclusion: Tracheoesophageal speech significantly improved the quality of life and limited the voice handicap imposed by total laryngectomy. This method of voice restoration has been used for a number of years in other countries and now appears to be a viable alternative within Jordan.

  5. The acoustic and perceptual differences to the non-singer's singing voice before and after a singing vocal warm-up

    Science.gov (United States)

    DeRosa, Angela

    The present study analyzed the acoustic and perceptual differences in non-singer's singing voice before and after a vocal warm-up. Experiments were conducted with 12 females who had no singing experience and considered themselves to be non-singers. Participants were recorded performing 3 tasks: a musical scale stretching to their most comfortable high and low pitches, sustained productions of the vowels /a/ and /i/, and singing performance of the "Star Spangled Banner." Participants were recorded performing these three tasks before a vocal warm-up, after a vocal warm-up, and then again 2-3 weeks later after 2-3 weeks of practice. Acoustical analysis consisted of formant frequency analysis, singer's formant/singing power ratio analysis, maximum phonation frequency range analysis, and an analysis of jitter, noise to harmonic ratio (NHR), relative average perturbation (RAP), and voice turbulence index (VTI). A perceptual analysis was also conducted with 12 listeners rating comparison performances of before vs. after the vocal warm-up, before vs. after the second vocal warm-up, and after both vocal warm-ups. There were no significant findings for the formant frequency analysis of the vowel /a/, but there was significance for the 1st formant frequency analysis of the vowel /i/. Singer's formant analyzed via Singing Power Ratio analysis showed significance only for the vowel /i/. Maximum phonation frequency range analysis showed a significant increase after the vocal warm-ups. There were no significant findings for the acoustic measures of jitter, NHR, RAP, and VTI. Perceptual analysis showed a significant difference after a vocal warm-up. The results indicate that a singing vocal warm-up can have a significant positive influence on the singing voice of non-singers.

  6. Self-Reported Acute and Chronic Voice Disorders in Teachers.

    Science.gov (United States)

    Rossi-Barbosa, Luiza Augusta Rosa; Barbosa, Mirna Rossi; Morais, Renata Martins; de Sousa, Kamilla Ferreira; Silveira, Marise Fagundes; Gama, Ana Cristina Côrtes; Caldeira, Antônio Prates

    2016-11-01

    The present study aimed to identify factors associated with self-reported acute and chronic voice disorders among municipal elementary school teachers in the city of Montes Claros, in the State of Minas Gerais, Brazil. The dependent variable, self-reported dysphonia, was determined via a single question, "Have you noticed changes in your voice quality?" and if so, a follow-up question queried the duration of this change, acute or chronic. The independent variables were dichotomized and divided into five categories: sociodemographic and economic data; lifestyle; organizational and environmental data; health-disease processes; and voice. Analyses of associated factors were performed via a hierarchical multiple logistic regression model. The present study included 226 teachers, of whom 38.9% reported no voice disorders, 35.4% reported an acute disorder, and 25.7% reported a chronic disorder. Excessive voice use daily, consuming more than one alcoholic drink per time, and seeking medical treatment because of voice disorders were associated factors for acute and chronic voice disorders. Consuming up to three glasses of water per day was associated with acute voice disorders. Among teachers who reported chronic voice disorders, teaching for over 15 years and the perception of disturbing or unbearable noise outside the school were both associated factors. Identification of organizational, environmental, and predisposing risk factors for voice disorders is critical, and furthermore, a vocal health promotion program may address these issues. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  7. Common diagnoses and treatments in professional voice users.

    Science.gov (United States)

    Franco, Ramon A; Andrus, Jennifer G

    2007-10-01

    Common problems among all patients seen by the laryngologist are also common among professional voice users. These include laryngopharyngeal reflux, muscle tension dysphonia, fibrovascular vocal fold lesions (eg, nodules and polyps), cysts, vocal fold scarring, changes in vocal fold mobility, and age-related changes. Microvascular lesions and their associated sequelae of vocal fold hemorrhage and laryngitis due to voice overuse are more common among professional voice users. Much more common among professional voice users is the negative impact that voice problems have on their ability to work, on their overall sense of well-being, and sometimes on their very sense of self. This article reviews the diagnosis and treatment options for these and other problems among professional voice users, describing the relevant roles of medical treatment, voice therapy, and surgery. The common scenario of multiple concomitant entities contributing to a symptom complex is underscored. Emphasis is placed on gaining insight into the "whole" patient so that individualized management plans can be developed. Videos of select diagnoses accompany this content online.

  8. Engaging retailers: giving them voice or controlling their voice, a supplier's perspective

    OpenAIRE

    Jackson, Keith; Jackson, Jacqui; Hopkinson, Gillian

    2013-01-01

    This full paper from the Marketing and Retail track of BAM 2013 investigates the relationships between suppliers and retailers in the UK convenience store sector in terms of Hirschman's model whereby members of a group can influence it by either expressing their opinions (voice) or leaving it in protest (exit). Suppliers may create loyalty among retailers by raising exit costs and/or allowing them to express their voices. The investigation was carried out using the recorded turnover of the to...

  9. Comparison of vocal tract discomfort scale results with objective and instrumental phoniatric parameters among teacher rehabilitees from voice disorders

    Directory of Open Access Journals (Sweden)

    Ewelina Woźnicka

    2013-04-01

    Full Text Available Background: In the diagnostic and therapeutic procedures for occupational dysphonia, a major role is played by voice self-assessment, which is one of the elements of a comprehensive evaluation of voice disorders. The aim of the study was to assess the applicability of the Vocal Tract Discomfort (VTD) scale to monitor the effectiveness of voice rehabilitation and to compare the VTD results with objective and instrumental methods of phoniatric diagnosis. Materials and Methods: The study included 55 teachers (mean age, 47.2) with occupational dysphonia. A comprehensive diagnosis took into account self-assessment by the VTD scale, phoniatric examination, including laryngovideostroboscopy (LVSS), and objective measurement of an aerodynamic parameter, the maximum phonation time (MPT). After 4 months of intense rehabilitation, post-therapy examination was performed using the methods specified above. Results: After the treatment, a significant improvement was obtained in the subjective symptoms measured on the VTD scale, assessed for both the frequency (p = 0.000) and the severity (p = 0.000) subscales. Positive effects of the therapy were also observed for the parameters evaluated in the phoniatric study (p < 0.01) and laryngovideostroboscopy (p < 0.01). After voice therapy, there was also an improvement in the objective parameter MPT, which was about 5 seconds longer. Studies have shown that the VTD scale is characterized by high reliability: Cronbach's alpha coefficient in the preliminary test was 0.826 for the symptom frequency subscale and 0.845 for the severity subscale; similarly high reliability was achieved in the control test, 0.908 and 0.923, respectively. Conclusions: The Vocal Tract Discomfort scale can be a valuable tool for assessing voice, and can also be used to monitor the effectiveness of therapy for occupational dysphonia. Med Pr 2013;64(2):199–206
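
    The reliability figures quoted in this abstract are Cronbach's alpha coefficients. As a minimal sketch of how such a coefficient is computed from item-level questionnaire scores, the following uses the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name `cronbach_alpha` and the sample responses are hypothetical and are not data from the study.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    if k < 2:
        raise ValueError("Cronbach's alpha needs at least two items")
    # Sample variance of each item (column) across respondents.
    item_var_sum = sum(variance(col) for col in zip(*scores))
    # Sample variance of each respondent's total score.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_var_sum / total_var)


# Hypothetical responses: four respondents, three subscale items each.
responses = [[2, 3, 3], [4, 4, 5], [1, 2, 1], [5, 4, 5]]
print(round(cronbach_alpha(responses), 3))
```

    Values near 1 indicate that the items vary together across respondents, which is the sense in which the abstract describes the scale as highly reliable.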

  10. A Voice Processing Technology for Rural Specific Context

    Science.gov (United States)

    He, Zhiyong; Zhang, Zhengguang; Zhao, Chunshen

    During the promotion and application of rural information, voice interaction across different regional dialects is a very complex issue. Through in-depth analysis of core TTS technologies, this paper presents methods for intelligent segmentation, a word segmentation algorithm, and intelligent voice thesaurus construction in the context of different dialects. It then presents a COM-based development methodology and programming method for implementing a context-specific voice processing system. The method has some reference value for rural dialect and voice processing applications.

  11. Collaboration and conquest: MTD as viewed by voice teacher (singing voice specialist) and speech-language pathologist.

    Science.gov (United States)

    Goffi-Fynn, Jeanne C; Carroll, Linda M

    2013-05-01

    This study was designed as a qualitative case study to demonstrate the process of diagnosis and treatment between a voice team to manage a singer diagnosed with muscular tension dysphonia (MTD). Traditionally, literature suggests that MTD is challenging to treat and little in the literature directly addresses singers with MTD. Data collected included initial medical screening with laryngologist, referral to speech-language pathologist (SLP) specializing in voice disorders among singers, and adjunctive voice training with voice teacher trained in vocology (singing voice specialist or SVS). Initial target goals with SLP included reducing extrinsic laryngeal tension, using a relaxed laryngeal posture, and effective abdominal-diaphragmatic support for all phonation events. Balance of respiratory forces, laryngeal coordination, and use of optimum filtering of the source signal through resonance and articulatory awareness was emphasized. Further work with SVS included three main goals including a lowered breathing pattern to aid in decreasing subglottic air pressure, vertical laryngeal position to lower to allow for a relaxed laryngeal position, and a top-down singing approach to encourage an easier, more balanced registration, and better resonance. Initial results also emphasize the retraining of subject toward a sensory rather than auditory mode of monitoring. Other areas of consideration include singers' training and vocal use, the psychological effects of MTD, the personalities potentially associated with it, and its relationship with stress. Finally, the results emphasize that a positive rapport with the subject and collaboration between all professionals involved in a singer's care are essential for recovery. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.

  12. Multivariate sensitivity to voice during auditory categorization.

    Science.gov (United States)

    Lee, Yune Sang; Peelle, Jonathan E; Kraemer, David; Lloyd, Samuel; Granger, Richard

    2015-09-01

    Past neuroimaging studies have documented discrete regions of human temporal cortex that are more strongly activated by conspecific voice sounds than by nonvoice sounds. However, the mechanisms underlying this voice sensitivity remain unclear. In the present functional MRI study, we took a novel approach to examining voice sensitivity, in which we applied a signal detection paradigm to the assessment of multivariate pattern classification among several living and nonliving categories of auditory stimuli. Within this framework, voice sensitivity can be interpreted as a distinct neural representation of brain activity that correctly distinguishes human vocalizations from other auditory object categories. Across a series of auditory categorization tests, we found that bilateral superior and middle temporal cortex consistently exhibited robust sensitivity to human vocal sounds. Although the strongest categorization was in distinguishing human voice from other categories, subsets of these regions were also able to distinguish reliably between nonhuman categories, suggesting a general role in auditory object categorization. Our findings complement the current evidence of cortical sensitivity to human vocal sounds by revealing that the greatest sensitivity during categorization tasks is devoted to distinguishing voice from nonvoice categories within human temporal cortex. Copyright © 2015 the American Physiological Society.
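
    The abstract describes applying a signal detection paradigm to classifier output. A common signal detection summary is the sensitivity index d', the difference between the z-transformed hit rate and false-alarm rate. The sketch below assumes that measure purely for illustration; the abstract does not state which index the authors computed, and the function name `d_prime` and the rates are hypothetical.

```python
from statistics import NormalDist


def d_prime(hit_rate, false_alarm_rate):
    """Sensitivity index d' = z(hit rate) - z(false-alarm rate).

    Both rates must lie strictly between 0 and 1, since the inverse
    normal CDF is undefined at the endpoints.
    """
    z = NormalDist().inv_cdf
    return z(hit_rate) - z(false_alarm_rate)


# Hypothetical classifier: labels 90% of voice trials as voice (hits)
# and 10% of nonvoice trials as voice (false alarms).
print(round(d_prime(0.9, 0.1), 3))
```

    A d' of zero means voice and nonvoice trials are indistinguishable, and larger values mean better separation independent of response bias.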

  13. Trailblazers and Cassandras: Other Voices in Northern Ireland

    DEFF Research Database (Denmark)

    McQuaid, Sara Dybris

    2012-01-01

    voices and alternative positions in the process of conflict interpretation and resolution. This essay will outline a ‘thumbnail’ sketch of three areas in which ‘other’ voices are sidelined or silenced: in terms of political discourses; community discourses; and wider academic and public discourses......’ and ‘Cassandras’ the essay concludes that the arguments forwarded by other voices are not disappeared but adapted and realigned to the reigning discourses, and that there is not so much a culture of silence surrounding ‘other’ voices as a certain selective and sectarian hearing in picking them up. Whilst...... it follows that ‘other’ voices have failed to dissolve the magnetic field of Northern Irish politics, the essay suggests that in order to rise to current political challenges in Northern Ireland it is worthwhile sounding out the historical and contemporary ‘other’ voices for carefully thought out and non...

  14. The Show with the Voice: An [Au]/-[o]-tophonographic Parody

    Directory of Open Access Journals (Sweden)

    David D.J. Sander Scheidt

    2008-05-01

    Full Text Available According to my claim that voice as a phenomenon cannot be materialised or located, neither in the (voice) organ of the self nor in the (ear of the) other, I coined the term [au]/[o]-tophonography for my examination of the possibilities of performing subjectivity in writing and in sound productions. Drawing on the theory of performativity in its deconstructive senses (see BUTLER, 1993, 1997, 1999/1990; DERRIDA, 1988/1972, 1997/1967, 2002/1981; SMITH, 1995) my performative epistemology reaches beyond the theoretical, including the practical and the aesthetical, aiming at questioning notions of "self", "audience", "voice", "writing" and "communication". "The show with the voice" (http://www.qualitative-research.net/fqs-texte/2-08/08-2-27_audio.mp3) is an example of this practice. It parodies the medico-scientific approach to the human voice by presenting some of its possible appearances (the "normal", the "disordered", the "homosexual" and the "transsexual" voice) in an audio collage that takes the shape of a mock tutorial. Through re-contextualising and re-compiling voice samples from different sources that are usually kept apart (e.g. the lecturer's voice, the researcher's voice, the artist's voice, the autobiographer's voice) I open a space for a multidisciplinary and creative perspective to the examination of voice. URN: urn:nbn:de:0114-fqs0802279

  15. Muted 'voice': The writing of two groups of postgraduate ...

    African Journals Online (AJOL)

    The purpose of this article is to demonstrate and account for the weak emergence of 'voice' in the writing of students embarking upon their postgraduate studies in Geosciences. The two elements of 'voice' that are emphasised are 'voice' as style of expression and 'voice' as the ability to write distinctly, yet building upon ...

  16. 3D simulation of an audible ultrasonic electrolarynx using difference waves.

    Science.gov (United States)

    Mills, Patrick; Zara, Jason

    2014-01-01

    A total laryngectomy removes the vocal folds which are fundamental in forming voiced sounds that make speech possible. Although implanted prosthetics are commonly used in developed countries, simple handheld vibrating electrolarynxes are still common worldwide. These devices are easy to use but suffer from many drawbacks including dedication of a hand, mechanical sounding voice, and sound leakage. To address some of these drawbacks, we introduce a novel electrolarynx that uses vibro-acoustic interference of dual ultrasonic waves to generate an audible fundamental frequency. A 3D simulation of the principles of the device is presented in this paper.

  17. Noise Source Visualization Using a Digital Voice Recorder and Low-Cost Sensors

    Directory of Open Access Journals (Sweden)

    Yong Thung Cho

    2018-04-01

    Full Text Available Accurate sound visualization of noise sources is required for optimal noise control. Typically, noise measurement systems require microphones, an analog-digital converter, cables, a data acquisition system, etc., which may not be affordable for potential users. Also, many such systems are not highly portable and may not be convenient for travel. Handheld personal electronic devices such as smartphones and digital voice recorders with relatively lower costs and higher performance have become widely available recently. Even though such devices are highly portable, directly implementing them for noise measurement may lead to erroneous results since such equipment was originally designed for voice recording. In this study, external microphones were connected to a digital voice recorder to conduct measurements and the input received was processed for noise visualization. In this way, a low cost, compact sound visualization system was designed and introduced to visualize two actual noise sources for verification with different characteristics: an enclosed loud speaker and a small air compressor. Reasonable accuracy of noise visualization for these two sources was shown over a relatively wide frequency range. This very affordable and compact sound visualization system can be used for many actual noise visualization applications in addition to educational purposes.

  18. Disability: a voice in Australian bioethics?

    Science.gov (United States)

    Newell, Christopher

    2003-06-01

    The rise of research and advocacy over the years to establish a disability voice in Australia with regard to bioethical issues is explored. This includes an analysis of some of the political processes and engagement in mainstream bioethical debate. An understanding of the politics of rejected knowledge is vital in understanding the muted disability voices in Australian bioethics and public policy. It is also suggested that the voices of those who are marginalised or oppressed in society, such as people with disability, have particular contribution to make in fostering critical bioethics.

  19. Investigating the Effects of Glottal Stop Productions on Voice in Children With Cleft Palate Using Multidimensional Voice Assessment Methods.

    Science.gov (United States)

    Aydınlı, Fatma Esen; Özcebe, Esra; Kulak Kayıkçı, Maviş E; Yılmaz, Taner; Özgür, Fatma F

    2016-11-01

    The aim was to investigate the effects of glottal stop productions (GS) on voice in children with cleft palate using multidimensional voice assessment methods. This is a prospective case-control study. Children with repaired cleft palate (n = 34) who did not have any vocal fold lesions were separated into two groups based on the results of the articulation test. The glottal stop group (GSG) consisted of 17 children who had GS. The control group (CG) consisted of an equal number of age- and gender-matched children who did not have GS. The voice evaluation protocol included acoustic analysis, Pediatric Voice Handicap Index (pVHI), and perceptual analysis (Grade, Roughness, Breathiness, Asthenia, Strain method). The velopharyngeal statuses of the groups were compared using the nasopharyngoscopy and the nasometer. The total pVHI score and the subscales of the pVHI were found to be significantly higher in the GSG. The F0, jitter, and shimmer were found to be numerically higher in the GSG with the difference being statistically significant in jitter (P speech and language pathology intervention including voice therapy techniques. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  20. Psychological effects of dysphonia in voice professionals.

    Science.gov (United States)

    Salturk, Ziya; Kumral, Tolgar Lutfi; Aydoğdu, Imran; Arslanoğlu, Ahmet; Berkiten, Güler; Yildirim, Güven; Uyar, Yavuz

    2015-08-01

    To evaluate the psychological effects of dysphonia in voice professionals compared to non-voice professionals and in both genders. Cross-sectional analysis. Forty-eight voice professionals and 52 non-voice professionals with dysphonia were included in this study. All participants underwent a complete ear, nose, and throat examination and an evaluation for pathologies that might affect vocal quality. Participants were asked to complete the Turkish versions of the Voice Handicap Index-30 (VHI-30), Perceived Stress Scale (PSS), and the Hospital Anxiety and Depression Scale (HADS). HADS scores were evaluated as HADS-A (anxiety) and HADS-D (depression). Dysphonia status was evaluated by grade, roughness, breathiness, asthenia, and strain (GRBAS) scale perceptually. The results were compared statistically. Significant differences between the two groups were evident when the VHI-30 and PSS data were compared (P = .00001 and P = .00001, respectively). However, neither HADS score (HADS-A and HADS-D) differed between groups. An analysis of the scores in terms of sex revealed that females had significantly higher PSS scores (P = .006). The GRBAS scale revealed no difference between groups (P = .819, .931, .803, .655, and .803, respectively). No between-sex differences in the VHI-30 or HADS scores were evident. We found that voice professionals and females experienced more stress and were more dissatisfied with their voices. 4. © 2015 The American Laryngological, Rhinological and Otological Society, Inc.

  1. Dielectric properties of agricultural products – fundamental principles, influencing factors, and measurement techniques. Chapter 4. Electrotechnologies for Food Processing: Book Series. Volume 3. Radio-Frequency Heating

    Science.gov (United States)

    In this chapter, definitions of dielectric properties, or permittivity, of materials and a brief discussion of the fundamental principles governing their behavior with respect to influencing factors are presented. The basic physics of the influence of frequency of the electric fields and temperatur...

  2. ATC/pilot voice communications: A survey of the literature

    Science.gov (United States)

    Prinzo, O. Veronika; Britton, Thomas W.

    1993-11-01

    The first radio-equipped control tower in the United States opened at the Cleveland Municipal Airport in 1930. From that time to the present, voice radio communications have played a primary role in air safety. Verbal communications in air traffic control (ATC) operations have been frequently cited as causal factors in operational errors and pilot deviations in the FAA Operational Error and Deviation System, the NASA Aviation Safety Reporting System (ASRS), and reports derived from government sponsored research projects. Collectively, the data provided by these programs indicate that communications constitute a significant problem for pilots and controllers. Although the communications problem was well known the research literature was fragmented, making it difficult to appreciate the various types of verbal communications problems that existed and their unique influence on the quality of ATC/pilot communications. This is a survey of the voice radio communications literature. The 43 reports in the review represent survey data, field studies, laboratory studies, narrative reports, and reviews. The survey topics pertain to communications taxonomies, acoustical correlates and cognitive/psycholinguistic perspectives. Communications taxonomies were used to identify the frequency and types of information that constitute routine communications, as well as those communications involved in operational errors, pilot deviations, and other safety-related events. Acoustical correlate methodologies identified some qualities of a speaker's voice, such as loudness, pitch, and speech rate, which might be used potentially to monitor stress, mental workload, and other forms of psychological or physiological factors that affect performance. Cognitive/psycho-linguistic research offered an information processing perspective for understanding how pilots' and controllers' memory and language comprehension processes affect their ability to communicate effectively with one another. This

  3. Auditory word recognition: extrinsic and intrinsic effects of word frequency.

    Science.gov (United States)

    Connine, C M; Titone, D; Wang, J

    1993-01-01

    Two experiments investigated the influence of word frequency in a phoneme identification task. Speech voicing continua were constructed so that one endpoint was a high-frequency word and the other endpoint was a low-frequency word (e.g., best-pest). Experiment 1 demonstrated that ambiguous tokens were labeled such that a high-frequency word was formed (intrinsic frequency effect). Experiment 2 manipulated the frequency composition of the list (extrinsic frequency effect). A high-frequency list bias produced an exaggerated influence of frequency; a low-frequency list bias showed a reverse frequency effect. Reaction time effects were discussed in terms of activation and postaccess decision models of frequency coding. The results support a late use of frequency in auditory word recognition.

  4. The Influence of High-Frequency Envelope Information on Low-Frequency Vowel Identification in Noise.

    Directory of Open Access Journals (Sweden)

    Wiebke Schubotz

    Full Text Available Vowel identification in noise using consonant-vowel-consonant (CVC) logatomes was used to investigate a possible interplay of speech information from different frequency regions. It was hypothesized that the periodicity conveyed by the temporal envelope of a high frequency stimulus can enhance the use of the information carried by auditory channels in the low-frequency region that share the same periodicity. It was further hypothesized that this acts as a strobe-like mechanism and would increase the signal-to-noise ratio for the voiced parts of the CVCs. In a first experiment, different high-frequency cues were provided to test this hypothesis, whereas a second experiment examined more closely the role of amplitude modulations and intact phase information within the high-frequency region (4-8 kHz). CVCs were either natural or vocoded speech (both limited to a low-pass cutoff-frequency of 2.5 kHz) and were presented in stationary 3-kHz low-pass filtered masking noise. The experimental results did not support the hypothesized use of periodicity information for aiding low-frequency perception.

  5. The Influence of High-Frequency Envelope Information on Low-Frequency Vowel Identification in Noise.

    Science.gov (United States)

    Schubotz, Wiebke; Brand, Thomas; Kollmeier, Birger; Ewert, Stephan D

    2016-01-01

    Vowel identification in noise using consonant-vowel-consonant (CVC) logatomes was used to investigate a possible interplay of speech information from different frequency regions. It was hypothesized that the periodicity conveyed by the temporal envelope of a high frequency stimulus can enhance the use of the information carried by auditory channels in the low-frequency region that share the same periodicity. It was further hypothesized that this acts as a strobe-like mechanism and would increase the signal-to-noise ratio for the voiced parts of the CVCs. In a first experiment, different high-frequency cues were provided to test this hypothesis, whereas a second experiment examined more closely the role of amplitude modulations and intact phase information within the high-frequency region (4-8 kHz). CVCs were either natural or vocoded speech (both limited to a low-pass cutoff-frequency of 2.5 kHz) and were presented in stationary 3-kHz low-pass filtered masking noise. The experimental results did not support the hypothesized use of periodicity information for aiding low-frequency perception.

  6. Voice disorders in Nigerian primary school teachers.

    Science.gov (United States)

    Akinbode, R; Lam, K B H; Ayres, J G; Sadhra, S

    2014-07-01

    The prolonged use or abuse of voice may lead to vocal fatigue and vocal fold tissue damage. School teachers routinely use their voices intensively at work and are therefore at a higher risk of dysphonia. To determine the prevalence of voice disorders among primary school teachers in Lagos, Nigeria, and to explore associated risk factors. Teaching and non-teaching staff from 19 public and private primary schools completed a self-administered questionnaire to obtain information on personal lifestyles, work experience and environment, and voice disorder symptoms. Dysphonia was defined as the presence of at least one of the following: hoarseness, repetitive throat clearing, tired voice or straining to speak. A total of 341 teaching and 155 non-teaching staff participated. The prevalence of dysphonia in teachers was 42% compared with 18% in non-teaching staff. A significantly higher proportion of the teachers reported that voice symptoms had affected their ability to communicate effectively. School type (public/private) did not predict the presence of dysphonia. Statistically significant associations were found for regular caffeinated drink intake (odds ratio [OR] = 3.07; 95% confidence interval [CI]: 1.51-6.62), frequent upper respiratory tract infection (OR = 3.60; 95% CI: 1.39-9.33) and raised voice while teaching (OR = 10.1; 95% CI: 5.07-20.2). Nigerian primary school teachers were at risk for dysphonia. Important environment and personal factors were upper respiratory infection, the need to frequently raise the voice when teaching and regular intake of caffeinated drinks. Dysphonia was not associated with age or years of teaching. © The Author 2014. Published by Oxford University Press on behalf of the Society of Occupational Medicine. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  7. Teachers’ voice use in teaching environment. Aspects on speakers’ comfort

    DEFF Research Database (Denmark)

    Lyberg-Åhlander, Viveka; Rydell, Roland; Löfqvist, Anders

    2015-01-01

    use and prevalence of voice problems in teachers and to explore their ratings of vocally loading aspects of their working environment. Method: A questionnaire-survey in 467 teachers aiming to explore the prevalence of voice problems in teaching staff identified teachers with voice problems and vocally...... in the teaching environment and aspects of the classroom environment were also measured. Results: Teachers with voice problems were more affected by any loading factor in the work-environment and were more perceptive of the room acoustics. Differences between the groups were found during field......-measurements of the voice, while there were no differences in the findings from the clinical examinations of larynx and voice. Conclusion: Teachers suffering from voice problems react stronger to loading factors in the teaching environment. It is in the interplay between the individual and the work environment that voice...

  8. Stimulus variability and the phonetic relevance hypothesis: effects of variability in speaking style, fundamental frequency, and speaking rate on spoken word identification.

    Science.gov (United States)

    Sommers, Mitchell S; Barcroft, Joe

    2006-04-01

    Three experiments were conducted to examine the effects of trial-to-trial variations in speaking style, fundamental frequency, and speaking rate on identification of spoken words. In addition, the experiments investigated whether any effects of stimulus variability would be modulated by phonetic confusability (i.e., lexical difficulty). In Experiment 1, trial-to-trial variations in speaking style reduced the overall identification performance compared with conditions containing no speaking-style variability. In addition, the effects of variability were greater for phonetically confusable words than for phonetically distinct words. In Experiment 2, variations in fundamental frequency were found to have no significant effects on spoken word identification and did not interact with lexical difficulty. In Experiment 3, two different methods for varying speaking rate were found to have equivalent negative effects on spoken word recognition and similar interactions with lexical difficulty. Overall, the findings are consistent with a phonetic-relevance hypothesis, in which accommodating sources of acoustic-phonetic variability that affect phonetically relevant properties of speech signals can impair spoken word identification. In contrast, variability in parameters of the speech signal that do not affect phonetically relevant properties are not expected to affect overall identification performance. Implications of these findings for the nature and development of lexical representations are discussed.

  9. Voice pitch influences perceptions of sexual infidelity.

    Science.gov (United States)

    O'Connor, Jillian J M; Re, Daniel E; Feinberg, David R

    2011-02-28

    Sexual infidelity can be costly to members of both the extra-pair and the paired couple. Thus, detecting infidelity risk is potentially adaptive if it aids in avoiding cuckoldry or loss of parental and relationship investment. Among men, testosterone is inversely related to voice pitch, relationship and offspring investment, and is positively related to the pursuit of short-term relationships, including extra-pair sex. Among women, estrogen is positively related to voice pitch, attractiveness, and the likelihood of extra-pair involvement. Although prior work has demonstrated a positive relationship between men's testosterone levels and infidelity, this study is the first to investigate attributions of infidelity as a function of sexual dimorphism in male and female voices. We found that men attributed high infidelity risk to feminized women's voices, but not significantly more often than did women. Women attributed high infidelity risk to masculinized men's voices at significantly higher rates than did men. These data suggest that voice pitch is used as an indicator of sexual strategy in addition to underlying mate value. The aforementioned attributions may be adaptive if they prevent cuckoldry and/or loss of parental and relationship investment via avoidance of partners who may be more likely to be unfaithful.

  10. Voice Pitch Influences Perceptions of Sexual Infidelity

    Directory of Open Access Journals (Sweden)

    Jillian J.M. O'Connor

    2011-01-01

    Full Text Available Sexual infidelity can be costly to members of both the extra-pair and the paired couple. Thus, detecting infidelity risk is potentially adaptive if it aids in avoiding cuckoldry or loss of parental and relationship investment. Among men, testosterone is inversely related to voice pitch, relationship and offspring investment, and is positively related to the pursuit of short-term relationships, including extra-pair sex. Among women, estrogen is positively related to voice pitch, attractiveness, and the likelihood of extra-pair involvement. Although prior work has demonstrated a positive relationship between men's testosterone levels and infidelity, this study is the first to investigate attributions of infidelity as a function of sexual dimorphism in male and female voices. We found that men attributed high infidelity risk to feminized women's voices, but not significantly more often than did women. Women attributed high infidelity risk to masculinized men's voices at significantly higher rates than did men. These data suggest that voice pitch is used as an indicator of sexual strategy in addition to underlying mate value. The aforementioned attributions may be adaptive if they prevent cuckoldry and/or loss of parental and relationship investment via avoidance of partners who may be more likely to be unfaithful.

  11. The development of the Spanish verb ir into auxiliary of voice

    DEFF Research Database (Denmark)

    Vinther, Thora

    2005-01-01

    Spanish, syntax, grammaticalisation, past participle, passive voice, middle voice, language development

  12. Overgeneral autobiographical memory bias in clinical and non-clinical voice hearers.

    Science.gov (United States)

    Jacobsen, Pamela; Peters, Emmanuelle; Ward, Thomas; Garety, Philippa A; Jackson, Mike; Chadwick, Paul

    2018-03-14

    Hearing voices can be a distressing and disabling experience for some, whilst it is a valued experience for others, so-called 'healthy voice-hearers'. Cognitive models of psychosis highlight the role of memory, appraisal and cognitive biases in determining emotional and behavioural responses to voices. A memory bias potentially associated with distressing voices is the overgeneral memory bias (OGM), namely the tendency to recall a summary of events rather than specific occasions. It may limit access to autobiographical information that could be helpful in re-appraising distressing experiences, including voices. We investigated the possible links between OGM and distressing voices in psychosis by comparing three groups: (1) clinical voice-hearers (N = 39), (2) non-clinical voice-hearers (N = 35) and (3) controls without voices (N = 77) on a standard version of the autobiographical memory test (AMT). Clinical and non-clinical voice-hearers also completed a newly adapted version of the task, designed to assess voices-related memories (vAMT). As hypothesised, the clinical group displayed an OGM bias by retrieving fewer specific autobiographical memories on the AMT compared with both the non-clinical and control groups, who did not differ from each other. The clinical group also showed an OGM bias in recall of voice-related memories on the vAMT, compared with the non-clinical group. Clinical voice-hearers display an OGM bias when compared with non-clinical voice-hearers on both general and voices-specific recall tasks. These findings have implications for the refinement and targeting of psychological interventions for psychosis.

  13. Speaking with the voice of authority

    CERN Multimedia

    2002-01-01

    GPB Consulting has developed a scientific approach to voice coaching. A digital recording of the voice is sent to a lab in Switzerland and analyzed by a computer programme designed by a doctor of psychology and linguistics and a scientist at CERN (1 page).

  14. Risk factors for voice quality after radiotherapy for early glottic cancer

    International Nuclear Information System (INIS)

    Hocevar-Boltezar, Irena; Zargi, Miha; Strojan, Primoz

    2009-01-01

    Background and purpose: In the majority of patients irradiated for early glottic cancer an abnormal voice was reported. The purpose of the study was to determine the factors influencing voice quality after radiotherapy for T1 glottic cancer. Methods: The voices of 75 male patients irradiated for T1 glottic carcinoma were assessed subjectively and objectively by acoustic analyses and aerodynamic measurements. The laryngeal function and morphology were evaluated by videolaryngostroboscopy. The data on smoking habits, the associated diseases influencing voice quality, the extent of the tumor, the type of biopsy, and the irradiation technique were collected from the medical records. The data on the factors influencing voice quality were compared for patients with a normal/near-normal voice and those with a hoarse voice. Results: Voice quality was at least slightly abnormal in 94.7% and 81.3% of patients, when assessed perceptively and objectively, respectively. Smoking after the completed treatment, more severe morphologic alterations of the vocal folds, dryness of the throat, incomplete closure of the vocal folds and functional voice disorders expressed as supraglottic activity adversely influenced the voice quality. A good correlation between the perceptive voice assessment and the acoustic analyses was established. Conclusions: After the successful irradiation for T1 glottic carcinoma, the great majority of the patients have at least a slightly hoarse voice. A better voice outcome could be achieved if radiotherapy was followed by the patient's cessation of smoking and the appropriate voice therapy.

  15. Different Vocal Parameters Predict Perceptions of Dominance and Attractiveness

    OpenAIRE

    Hodges-Simeon, Carolyn R.; Gaulin, Steven J. C.; Puts, David A.

    2010-01-01

    Low mean fundamental frequency (F0) in men’s voices has been found to positively influence perceptions of dominance by men and attractiveness by women using standardized speech. Using natural speech obtained during an ecologically valid social interaction, we examined relationships between multiple vocal parameters and dominance and attractiveness judgments. Male voices from an unscripted dating game were judged by men for physical and social dominance and by women in fert...

  16. Influence of Smartphones and Software on Acoustic Voice Measures.

    Directory of Open Access Journals (Sweden)

    Elizabeth U. Grillo

    2016-12-01

    Full Text Available This study assessed the within-subject variability of voice measures captured using different recording devices (i.e., smartphones and head mounted microphone) and software programs (i.e., Analysis of Dysphonia in Speech and Voice (ADSV), Multi-dimensional Voice Program (MDVP), and Praat). Correlations between the software programs that calculated the voice measures were also analyzed. Results demonstrated no significant within-subject variability across devices and software and that some of the measures were highly correlated across software programs. The study suggests that certain smartphones may be appropriate to record daily voice measures representing the effects of vocal loading within individuals. In addition, even though different algorithms are used to compute voice measures across software programs, some of the programs and measures share a similar relationship.

  17. Psychosocial risk factors which may differentiate between women with Functional Voice Disorder, Organic Voice Disorder and a Control group.

    Science.gov (United States)

    Baker, Janet; Ben-Tovim, David; Butcher, Andrew; Esterman, Adrian; McLaughlin, Kristin

    2013-12-01

    This study aimed to explore psychosocial factors contributing to the development of functional voice disorders (FVD) and those differentiating between organic voice disorders (OVD) and a non-voice-disordered control group. A case-control study was undertaken of 194 women aged 18-80 years diagnosed with FVD (n = 73), OVD (n = 55), and controls (n = 66). FVD women were allocated into psychogenic voice disorder (PVD) (n = 37) and muscle tension voice disorder (MTVD) (n = 36) for sub-group analysis. Dependent variables included biographical and voice assessment data, the number and severity of life events and difficulties and conflict over speaking out (COSO) situations derived from the Life Events and Difficulties Schedule (LEDS), and psychological traits including emotional expressiveness scales. Four psychosocial components differentiated between the FVD and control group accounting for 84.9% of the variance: severe events, moderate events, severe COSO, and mild COSO difficulties. Severe events, severe and mild COSO difficulties differentiated between FVD and OVD groups, accounting for 80.5% of the variance. Moderate events differentiated between PVD and MTVD sub-groups, accounting for 58.9% of the variance. Psychological traits did not differentiate between groups. Stressful life events and COSO situations best differentiated FVD from OVD and control groups. More refined aetiological studies are needed to differentiate between PVD and MTVD.

  18. Benefits for Voice Learning Caused by Concurrent Faces Develop over Time.

    Science.gov (United States)

    Zäske, Romi; Mühl, Constanze; Schweinberger, Stefan R

    2015-01-01

    Recognition of personally familiar voices benefits from the concurrent presentation of the corresponding speakers' faces. This effect of audiovisual integration is most pronounced for voices combined with dynamic articulating faces. However, it is unclear if learning unfamiliar voices also benefits from audiovisual face-voice integration or, alternatively, is hampered by attentional capture of faces, i.e., "face-overshadowing". In six study-test cycles we compared the recognition of newly-learned voices following unimodal voice learning vs. bimodal face-voice learning with either static (Exp. 1) or dynamic articulating faces (Exp. 2). Voice recognition accuracies significantly increased for bimodal learning across study-test cycles while remaining stable for unimodal learning, as reflected in numerical costs of bimodal relative to unimodal voice learning in the first two study-test cycles and benefits in the last two cycles. This was independent of whether faces were static images (Exp. 1) or dynamic videos (Exp. 2). In both experiments, slower reaction times to voices previously studied with faces compared to voices only may result from visual search for faces during memory retrieval. A general decrease of reaction times across study-test cycles suggests facilitated recognition with more speaker repetitions. Overall, our data suggest two simultaneous and opposing mechanisms during bimodal face-voice learning: while attentional capture of faces may initially impede voice learning, audiovisual integration may facilitate it thereafter.

  19. The role of fundamental frequency and formants in the perception of speaker sex

    Science.gov (United States)

    Hillenbrand, James M.

    2005-09-01

    The purpose of this study was to determine the relative contributions of fundamental frequency (F0) and formants in controlling the speaker-sex percept. A source-filter synthesizer was used to create four versions of 25 sentences spoken by men: (1) unmodified synthesis; (2) F0 only shifted up toward values typical of women; (3) formants only shifted up toward values typical of women; and (4) both F0 and formants shifted up. Identical methods were used to generate four comparable versions of 25 sentences spoken by women (e.g., unmodified synthesis, F0 only shifted down toward values typical of men, etc.). Listening tests showed: (1) perceived talker sex for the unmodified synthesis conditions was nearly always correct; (2) shifting both F0 and formants was usually effective (~82%) in changing the perceived sex of the utterance; (3) shifting either F0 or formants alone was usually ineffective in changing the perceived sex of the utterance. Both F0 and formants are apparently needed to specify speaker sex, though even together these cues are not entirely effective. Results also suggested that F0 is just slightly more important than formants, despite the fact that the male-female difference in F0 is proportionally much larger than the difference in formants. [Work supported by NIH.]

  20. Fundamental Limitations for Imaging GEO Satellites

    Science.gov (United States)

    2015-10-18

    Fundamental limitations for imaging GEO satellites D. Mozurkewich Seabrook Engineering, Seabrook, MD 20706 USA H. R. Schmitt, J. T. Armstrong Naval...higher spatial frequency. Send correspondence to David Mozurkewich, Seabrook Engineering, 9310 Dubarry Ave., Seabrook MD 20706 E-mail: dave

  1. Analog voicing detector responds to pitch

    Science.gov (United States)

    Abel, R. S.; Watkins, H. E.

    1967-01-01

    A modified electronic voice encoder (Vocoder) includes an independent analog mode of operation in addition to the conventional digital mode. The Vocoder is bandwidth-compression equipment that permits voice transmission over channels having only a fraction of the bandwidth required for conventional telephone-quality speech transmission.

  2. Speaking in Character: Voice Communication in Virtual Worlds

    Science.gov (United States)

    Wadley, Greg; Gibbs, Martin R.

    This chapter summarizes 5 years of research on the implications of introducing voice communication systems to virtual worlds. Voice introduces both benefits and problems for players of fast-paced team games, from better coordination of groups and greater social presence of fellow players on the positive side, to negative features such as channel congestion, transmission of noise, and an unwillingness by some to use voice with strangers online. Similarly, in non-game worlds like Second Life, issues related to identity and impression management play important roles, as voice may build greater trust that is especially important for business users, yet it erodes the anonymity and ability to conceal social attributes like gender that are important for other users. A very different mixture of problems and opportunities exists when users conduct several simultaneous conversations in multiple text and voice channels. Technical difficulties still exist with current systems, including the challenge of debugging and harmonizing all the participants' voice setups. Different groups use virtual worlds for very different purposes, so a single modality may not suit all.

  3. Voice Based City Panic Button System

    Science.gov (United States)

    Febriansyah; Zainuddin, Zahir; Bachtiar Nappu, M.

    2018-03-01

    This work develops a voice-activated panic button application that gives the nearest police station faster early notification of hazardous conditions in the community by using speech as the trigger. The current application still relies on an on-screen touch combination and on orders coordinated through a control center, so early notification takes longer. The method uses voice recognition to detect the user's voice and the haversine formula to find the police station closest to the user. The application also sends an automatic SMS notification to the victim’s relatives and is integrated with the Google Maps application (GMaps) to show the victim’s location. The results show that voice registration in the application reaches 100%, incident detection by speech recognition while the application is running averages 94.67%, and automatic SMS delivery to the victim’s relatives reaches 100%.
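
    The haversine step mentioned in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the station tuple layout, and the Earth radius constant are assumptions made for illustration only.

```python
import math

EARTH_RADIUS_KM = 6371.0  # assumed mean Earth radius, not taken from the paper


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two points given in decimal degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def nearest_station(user, stations):
    """Return the (name, lat, lon) station closest to the user's (lat, lon).

    The tuple layout is hypothetical; a real application would use its own records.
    """
    return min(stations, key=lambda s: haversine_km(user[0], user[1], s[1], s[2]))
```

    Calling nearest_station with the user's coordinates and a list of station records returns the record with the smallest great-circle distance, which is the comparison the abstract describes.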

  4. Voice Recognition in Face-Blind Patients

    Science.gov (United States)

    Liu, Ran R.; Pancaroglu, Raika; Hills, Charlotte S.; Duchaine, Brad; Barton, Jason J. S.

    2016-01-01

    Right or bilateral anterior temporal damage can impair face recognition, but whether this is an associative variant of prosopagnosia or part of a multimodal disorder of person recognition is an unsettled question, with implications for cognitive and neuroanatomic models of person recognition. We assessed voice perception and short-term recognition of recently heard voices in 10 subjects with impaired face recognition acquired after cerebral lesions. All 4 subjects with apperceptive prosopagnosia due to lesions limited to fusiform cortex had intact voice discrimination and recognition. One subject with bilateral fusiform and anterior temporal lesions had a combined apperceptive prosopagnosia and apperceptive phonagnosia, the first such described case. Deficits indicating a multimodal syndrome of person recognition were found only in 2 subjects with bilateral anterior temporal lesions. All 3 subjects with right anterior temporal lesions had normal voice perception and recognition, 2 of whom performed normally on perceptual discrimination of faces. This confirms that such lesions can cause a modality-specific associative prosopagnosia. PMID:25349193

  5. Acoustic Correlates of Compensatory Adjustments to the Glottic and Supraglottic Structures in Patients with Unilateral Vocal Fold Paralysis

    Directory of Open Access Journals (Sweden)

    Luis M. T. Jesus

    2015-01-01

    The goal of this study was to analyse perceptually and acoustically the voices of patients with Unilateral Vocal Fold Paralysis (UVFP) and compare them to the voices of normal subjects. These voices were analysed perceptually with the GRBAS scale and acoustically using the following parameters: mean fundamental frequency (F0), standard-deviation of F0, jitter (ppq5), shimmer (apq11), mean harmonics-to-noise ratio (HNR), mean first (F1) and second (F2) formants frequency, and standard-deviation of F1 and F2 frequencies. Statistically significant differences were found in all of the perceptual parameters. Also the jitter, shimmer, HNR, standard-deviation of F0, and standard-deviation of the frequency of F2 were statistically different between groups, for both genders. In the male data differences were also found in F1 and F2 frequencies values and in the standard-deviation of the frequency of F1. This study allowed the documentation of the alterations resulting from UVFP and addressed the exploration of parameters with limited information for this pathology.
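
    The ppq5 and apq11 labels in this abstract refer to 5-point and 11-point perturbation quotients. As a rough sketch only, assuming the common definition (mean absolute deviation of each value from the average of its k-point neighbourhood, divided by the overall mean), the measure can be computed as below. The abstract does not say which analysis tool was used, and tools differ in details, so the function name and edge handling here are assumptions.

```python
def perturbation_quotient(values, k):
    """k-point perturbation quotient of a sequence of glottal periods (jitter)
    or peak amplitudes (shimmer), returned as a fraction (multiply by 100 for %).

    Only positions with a full k-point window are used; k must be odd.
    """
    if k < 1 or k % 2 == 0:
        raise ValueError("k must be a positive odd integer")
    n = len(values)
    if n < k:
        raise ValueError("need at least k values")
    half = k // 2
    deviations = []
    for i in range(half, n - half):
        window = values[i - half:i + half + 1]
        deviations.append(abs(values[i] - sum(window) / k))
    mean_value = sum(values) / n
    return (sum(deviations) / len(deviations)) / mean_value
```

    Under these assumptions, perturbation_quotient(periods, 5) corresponds to a ppq5-style jitter value and perturbation_quotient(amplitudes, 11) to an apq11-style shimmer value.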

  6. Hemispheric association and dissociation of voice and speech information processing in stroke.

    Science.gov (United States)

    Jones, Anna B; Farrall, Andrew J; Belin, Pascal; Pernet, Cyril R

    2015-10-01

    As we listen to someone speaking, we extract both linguistic and non-linguistic information. Knowing how these two sets of information are processed in the brain is fundamental for the general understanding of social communication, speech recognition and therapy of language impairments. We investigated the pattern of performances in phoneme versus gender categorization in left and right hemisphere stroke patients, and found an anatomo-functional dissociation in the right frontal cortex, establishing a new syndrome in voice discrimination abilities. In addition, phoneme and gender performances were more often associated than dissociated in the left hemisphere patients, suggesting common neural underpinnings. Copyright © 2015 Elsevier Ltd. All rights reserved.

  7. Voice- and swallow-related quality of life in idiopathic Parkinson's disease.

    Science.gov (United States)

    van Hooren, Michel R A; Baijens, Laura W J; Vos, Rein; Pilz, Walmari; Kuijpers, Laura M F; Kremer, Bernd; Michou, Emilia

    2016-02-01

    This study explores whether changes in voice- and swallow-related QoL are associated with progression of idiopathic Parkinson's disease (IPD). Furthermore, it examines the relationship between patients' perception of both voice and swallowing disorders in IPD. Prospective clinical study, quality of life (QoL). One-hundred mentally competent IPD patients with voice and swallowing complaints were asked to answer four QoL questionnaires (Voice Handicap Index, MD Anderson Dysphagia Inventory, Visual Analog Scale [VAS] voice, and Dysphagia Severity Scale [DSS]). Differences in means for the QoL questionnaires and their subscales within Hoehn and Yahr stage groups were calculated using one-way analysis of variance. The relationship between voice- and swallow-related QoL questionnaires was determined with the Spearman correlation coefficient. Scores on both voice and swallow questionnaires suggest an overall decrease in QoL with progression of IPD. A plateau in QoL for VAS voice and the DSS was seen in the early Hoehn and Yahr stages. Finally, scores on voice-related QoL questionnaires were significantly correlated with swallow-related QoL outcomes. Voice- and swallow-related QoL decreases with progression of IPD. A significant association was found between voice- and swallow-related QoL questionnaires. Healthcare professionals can benefit from voice- and swallow-related QoL questionnaires in a multidimensional voice- or swallow-assessment protocol. The patient's perception of his/her voice and swallowing disorders and its impact on QoL in IPD should not be disregarded. 2b. © 2015 The American Laryngological, Rhinological and Otological Society, Inc.

  8. Fundamental length

    International Nuclear Information System (INIS)

    Pradhan, T.

    1975-01-01

    The concept of fundamental length was first put forward by Heisenberg from purely dimensional reasons. From a study of the observed masses of the elementary particles known at that time, it is surmised that this length should be of the order of magnitude $l \approx 10^{-13}$ cm. It was Heisenberg's belief that introduction of such a fundamental length would eliminate the divergence difficulties from relativistic quantum field theory by cutting off the high energy regions of the 'proper fields'. Since the divergence difficulties arise primarily due to infinite number of degrees of freedom, one simple remedy would be the introduction of a principle that limits these degrees of freedom by removing the effectiveness of the waves with a frequency exceeding a certain limit without destroying the relativistic invariance of the theory. The principle can be stated as follows: It is in principle impossible to invent an experiment of any kind that will permit a distinction between the positions of two particles at rest, the distance between which is below a certain limit. A more elegant way of introducing fundamental length into quantum theory is through commutation relations between two position operators. In quantum field theory such as quantum electrodynamics, it can be introduced through the commutation relation between two interpolating photon fields (vector potentials). (K.B.)

  9. Voice and Narrative in L1 Writing

    DEFF Research Database (Denmark)

    Krogh, Ellen; Piekut, Anke

    2015-01-01

    This paper investigates issues of voice and narrative in L1 writing. Three branches of research are initially discussed: research on narratives as resources for identity work, research on writer identity and voice as an essential aspect of identity, and research on Bildung in L1 writing. Subsequ...... training of voice and narratives as a resource for academic writing, and that the Bildung potential of L1 writing may be tied to this issue...... in lower secondary L1, she found that her previous writing strategies were not rewarded in upper secondary school. In the second empirical study, two upper-secondary exam papers are investigated, with a focus on their approaches to exam genres and their use of narrative resources to address issues...

  10. Analysis of the Auditory Feedback and Phonation in Normal Voices.

    Science.gov (United States)

    Arbeiter, Mareike; Petermann, Simon; Hoppe, Ulrich; Bohr, Christopher; Doellinger, Michael; Ziethe, Anke

    2018-02-01

    The aim of this study was to investigate the auditory feedback mechanisms and voice quality during phonation in response to a spontaneous pitch change in the auditory feedback. Does the pitch shift reflex (PSR) change voice pitch and voice quality? Quantitative and qualitative voice characteristics were analyzed during the PSR. Twenty-eight healthy subjects underwent transnasal high-speed video endoscopy (HSV) at 8000 fps during sustained phonation of [a]. While phonating, the subjects heard their own voice shifted up by 700 cents (the interval of a fifth) for 300 milliseconds in their auditory feedback. The electroencephalography (EEG), acoustic voice signal, electroglottography (EGG), and HSV recordings were analyzed to compare feedback mechanisms statistically between the pitch-shifted and unshifted conditions of the phonation paradigm. Furthermore, quantitative and qualitative voice characteristics were analyzed. The PSR was successfully detected within all signals of the experimental tools (EEG, EGG, acoustic voice signal, HSV). A significant increase of the perturbation measures and an increase of the values of the acoustic parameters during the PSR were observed, especially for the audio signal. The auditory feedback mechanism seems to control not only voice pitch but also aspects of voice quality.
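
    For readers unfamiliar with cents, the 700-cent shift in this abstract can be converted to a frequency ratio with the standard relation ratio = 2^(cents/1200). The sketch below is illustrative only; the function names and the 200 Hz example value are assumptions, not data from the study.

```python
def cents_to_ratio(cents):
    """Frequency ratio corresponding to a pitch interval given in cents."""
    return 2.0 ** (cents / 1200.0)


def shift_frequency(f0_hz, cents):
    """Frequency heard after shifting f0_hz by the given number of cents."""
    return f0_hz * cents_to_ratio(cents)


# Hypothetical example: a 200 Hz voice shifted up by 700 cents
shifted = shift_frequency(200.0, 700.0)  # roughly 1.5 times the original frequency
```

    A 700-cent shift multiplies the frequency by about 1.498, close to the 3:2 ratio of a just perfect fifth, which is why the abstract describes it as the interval of a fifth.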

  11. Translators’ voices in Norwegian retranslations of Bob Dylan’s songs

    OpenAIRE

    Greenall, Annjo Klungervik

    2015-01-01

    This chapter tackles several questions relating to the issue of the translator’s voice in retranslation: how do others’ voices (including other (re)translations) interact with the translator’s voice in the production of a translation? How does the intersubjectively constituted voice of the translator manifest itself in paratexts, in the translated text and, in the case of singer-translators, in the translator’s physical, performing voice? The case discussed is that of Bob Dylan in (re)transl...

  12. Speaking comfort and voice use of teachers in classrooms

    DEFF Research Database (Denmark)

    Brunskog, Jonas; Pelegrin Garcia, David

    2010-01-01

    Teachers suffer from voice problems more often than the rest of the population, as a consequence of the intensive use of their voices during teaching. Noise and classroom acoustics have been defined as hazards eventually leading to voice problems. In order to make a good classroom acoustic design...... to preserve the teachers’ voices and maximize their comfort, it is necessary to understand the underlying relationship between classroom acoustics and teachers’ voice production. This paper presents a brief summary of investigations looking into this relationship. A pilot study, carried out in different...... located at various distances, in rooms with very different acoustics. A field study in schools of southern Sweden found that teachers with and without voice problems, during actual teaching, are affected differently by the support of the classroom. A last laboratory experiment was carried out...

  13. Influence of Smartphones and Software on Acoustic Voice Measures.

    OpenAIRE

    Elizabeth U. Grillo; Jenna N. Brosious; Staci L. Sorrell; Supraja Anand

    2016-01-01

    This study assessed the within-subject variability of voice measures captured using different recording devices (i.e., smartphones and head mounted microphone) and software programs (i.e., Analysis of Dysphonia in Speech and Voice (ADSV), Multi-dimensional Voice Program (MDVP), and Praat). Correlations between the software programs that calculated the voice measures were also analyzed. Results demonstrated no significant within-subject variability across devices and software and that some o...

  14. Massed versus Spaced Practice in Vocology: Effect of a Short-Term Intensive Voice Training versus a Longer-Term Traditional Voice Training

    Science.gov (United States)

    Meerschman, Iris; Van Lierde, Kristiane; Van Puyvelde, Caro; Bostyn, Astrid; Claeys, Sofie; D'haeseleer, Evelien

    2018-01-01

    Background: In contrast with most medical and pharmaceutical therapies, the optimal dosage for voice therapy or training is unknown. Aims: The aim of this study was to compare the effect of a short-term intensive voice training (IVT) with a longer-term traditional voice training (TVT) on the vocal quality and vocal capacities of vocally healthy…

  15. Fundamental investigations of capacitive radio frequency plasmas: simulations and experiments

    International Nuclear Information System (INIS)

    Donkó, Z; Derzsi, A; Hartmann, P; Korolov, I; Schulze, J; Czarnetzki, U; Schüngel, E

    2012-01-01

    Capacitive radio frequency (RF) discharge plasmas have been serving hi-tech industry (e.g. chip and solar cell manufacturing, realization of biocompatible surfaces) for several years. Nonetheless, their complex modes of operation are not fully understood and represent topics of high interest. The understanding of these phenomena is aided by modern diagnostic techniques and computer simulations. From the industrial point of view the control of ion properties is of particular interest; possibilities of independent control of the ion flux and the ion energy have been utilized via excitation of the discharges with multiple frequencies. ‘Classical’ dual-frequency (DF) discharges (where two significantly different driving frequencies are used), as well as discharges driven by a base frequency and its higher harmonic(s) have been analyzed thoroughly. It has been recognized that the second solution results in an electrically induced asymmetry (electrical asymmetry effect), which provides the basis for the control of the mean ion energy. This paper reviews recent advances on studies of the different electron heating mechanisms, on the possibilities of the separate control of ion energy and ion flux in DF discharges, on the effects of secondary electrons, as well as on the non-linear behavior (self-generated resonant current oscillations) of capacitive RF plasmas. The work is based on a synergistic approach of theoretical modeling, experiments and kinetic simulations based on the particle-in-cell approach. (paper)

  16. Alexa, Siri, Cortana, and More: An Introduction to Voice Assistants.

    Science.gov (United States)

    Hoy, Matthew B

    2018-01-01

    Voice assistants are software agents that can interpret human speech and respond via synthesized voices. Apple's Siri, Amazon's Alexa, Microsoft's Cortana, and Google's Assistant are the most popular voice assistants and are embedded in smartphones or dedicated home speakers. Users can ask their assistants questions, control home automation devices and media playback via voice, and manage other basic tasks such as email, to-do lists, and calendars with verbal commands. This column will explore the basic workings and common features of today's voice assistants. It will also discuss some of the privacy and security issues inherent to voice assistants and some potential future uses for these devices. As voice assistants become more widely used, librarians will want to be familiar with their operation and perhaps consider them as a means to deliver library services and materials.

  17. [Psychological effects of preventive voice care training in student teachers].

    Science.gov (United States)

    Nusseck, M; Richter, B; Echternach, M; Spahn, C

    2017-07-01

    Studies on the effectiveness of preventive voice care programs have focused mainly on voice parameters. Psychological parameters, however, have not been investigated in detail so far. The effect of a voice training program for German student teachers on psychological health parameters was investigated in a longitudinal study. The sample of 204 student teachers was divided into the intervention group (n = 123), who participated in the voice training program, and the control group (n = 81), who received no voice training. Voice training contained ten 90-min group courses and an individual visit by the voice trainer in a teaching situation with feedback afterwards. Participants were asked to fill out questionnaires (self-efficacy, Short-Form Health Survey, self-consciousness, voice self-concept, work-related behaviour and experience patterns) at the beginning and the end of their student teacher training period. The training program showed significant positive influences on psychological health, voice self-concept (i.e. more positive perception and increased awareness of one's own voice) and work-related coping behaviour in the intervention group. On average, the mental health status of all participants declined over time, whereas the status in the trained group diminished significantly less than in the control group. Furthermore, the trained student teachers gained abilities to cope with work-related stress better than those without training. The training program clearly showed a positive impact on mental health. The results underline the importance of such a training program not only for voice health, but also for wide-ranging aspects of constitutional health.

  18. Guided self-help cognitive-behaviour Intervention for VoicEs (GiVE): Results from a pilot randomised controlled trial in a transdiagnostic sample.

    Science.gov (United States)

    Hazell, Cassie M; Hayward, Mark; Cavanagh, Kate; Jones, Anna-Marie; Strauss, Clara

    2017-10-12

    Few patients have access to cognitive behaviour therapy for psychosis (CBTp) even though at least 16 sessions of CBTp is recommended in treatment guidelines. Briefer CBTp could improve access as the same number of therapists could see more patients. In addition, focusing on single psychotic symptoms, such as auditory hallucinations ('voices'), rather than on psychosis more broadly, may yield greater benefits. This pilot RCT recruited 28 participants (with a range of diagnoses) from NHS mental health services who were distressed by hearing voices. The study compared an 8-session guided self-help CBT intervention for distressing voices with a wait-list control. Data were collected at baseline and at 12 weeks with post-therapy assessments conducted blind to allocation. Voice-impact was the pre-determined primary outcome. Secondary outcomes were depression, anxiety, wellbeing and recovery. Mechanism measures were self-esteem, beliefs about self, beliefs about voices and voice-relating. Recruitment and retention were feasible with low study (3.6%) and therapy (14.3%) dropout. There were large, statistically significant between-group effects on the primary outcome of voice-impact (d=1.78; 95% CIs: 0.86-2.70), which exceeded the minimum clinically important difference. Large, statistically significant effects were found on a number of secondary and mechanism measures. Large effects on the pre-determined primary outcome of voice-impact are encouraging, and criteria for progressing to a definitive trial are met. Significant between-group effects on measures of self-esteem, negative beliefs about self and beliefs about voice omnipotence are consistent with these being mechanisms of change and this requires testing in a future trial. Copyright © 2017. Published by Elsevier B.V.

  19. The impact of rate reduction and increased loudness on fundamental frequency characteristics in dysarthria.

    Science.gov (United States)

    Tjaden, Kris; Wilding, Greg

    2011-01-01

    This study examined the extent to which articulatory rate reduction and increased loudness were associated with adjustments in utterance-level measures of fundamental frequency (F(0)) variability for speakers with dysarthria and healthy controls that have been shown to impact on intelligibility in previously published studies. More generally, the current study sought to compare and contrast how a slower-than-normal rate and increased vocal loudness impact on a variety of utterance-level F(0) characteristics for speakers with dysarthria and healthy controls. Eleven speakers with Parkinson's disease, 15 speakers with multiple sclerosis, and 14 healthy control speakers were audio recorded while reading a passage in habitual, loud, and slow conditions. Magnitude production was used to elicit variations in rate and loudness. Acoustic measures of duration, intensity and F(0) were obtained. For all speaker groups, a slower-than-normal articulatory rate and increased vocal loudness had distinct effects on F(0) relative to the habitual condition, including a tendency for measures of F(0) variation to be greater in the loud condition and reduced in the slow condition. These results suggest implications for the treatment of dysarthria. Copyright © 2010 S. Karger AG, Basel.

  20. Gender and vocal production mode discrimination using the high frequencies for speech and singing

    Science.gov (United States)

    Monson, Brian B.; Lotto, Andrew J.; Story, Brad H.

    2014-01-01

    Humans routinely produce acoustical energy at frequencies above 6 kHz during vocalization, but this frequency range is often not represented in communication devices and speech perception research. Recent advancements toward high-definition (HD) voice and extended bandwidth hearing aids have increased the interest in the high frequencies. The potential perceptual information provided by high-frequency energy (HFE) is not well characterized. We found that humans can accomplish tasks of gender discrimination and vocal production mode discrimination (speech vs. singing) when presented with acoustic stimuli containing only HFE at both amplified and normal levels. Performance in these tasks was robust in the presence of low-frequency masking noise. No substantial learning effect was observed. Listeners also were able to identify the sung and spoken text (excerpts from “The Star-Spangled Banner”) with very few exposures. These results add to the increasing evidence that the high frequencies provide at least redundant information about the vocal signal, suggesting that its representation in communication devices (e.g., cell phones, hearing aids, and cochlear implants) and speech/voice synthesizers could improve these devices and benefit normal-hearing and hearing-impaired listeners. PMID:25400613

  1. Student Voice and the Common Core

    Science.gov (United States)

    Yonezawa, Susan

    2015-01-01

    Common Core proponents and detractors debate its merits, but students have voiced their opinion for years. Using a decade's worth of data gathered through design-research on youth voice, this article discusses what high school students have long described as more ideal learning environments for themselves--and how remarkably similar the Common…

  2. Student Voices in School-Based Assessment

    Science.gov (United States)

    Tong, Siu Yin Annie; Adamson, Bob

    2015-01-01

    The value of student voices in dialogues about learning improvement is acknowledged in the literature. This paper examines how the views of students regarding School-based Assessment (SBA), a significant shift in examination policy and practice in secondary schools in Hong Kong, have largely been ignored. The study captures student voices through…

  3. Measurement errors in voice-key naming latency for Hiragana.

    Science.gov (United States)

    Yamada, Jun; Tamaoka, Katsuo

    2003-12-01

    This study makes explicit the limitations and possibilities of voice-key naming latency research on single hiragana symbols (a Japanese syllabic script) by examining three sets of voice-key naming data against Sakuma, Fushimi, and Tatsumi's 1997 speech-analyzer voice-waveform data. Analysis showed that voice-key measurement errors can be substantial in standard procedures as they may conceal the true effects of significant variables involved in hiragana-naming behavior. While one can avoid voice-key measurement errors to some extent by applying Sakuma, et al.'s deltas and by excluding initial phonemes which induce measurement errors, such errors may be ignored when test items are words and other higher-level linguistic materials.

  4. Quick Statistics about Voice, Speech, and Language

    Science.gov (United States)

    Quick Statistics About Voice, Speech, Language ... no 205. Hyattsville, MD: National Center for Health Statistics. 2015. Hoffman HJ, Li C-M, Losonczy K, ...

  5. [Role of aerodynamic parameters in voice function assessment].

    Science.gov (United States)

    Guo, Yong-qing; Lin, Sheng-zhi; Xu, Xin-lin; Zhou, Li; Zhuang, Pei-yun; Jiang, Jack J

    2012-10-01

    To investigate the application and significance of aerodynamic parameters in voice function assessment. The phonatory aerodynamic system (PAS) was used to collect aerodynamic parameters from subjects with normal voice, vocal fold polyp, vocal fold cyst, and vocal fold immobility. Multivariate statistical analysis was used to compare measurements across groups. Phonation threshold flow (PTF), mean flow rate (MFR), maximum phonation time (MPT), and glottal resistance (GR) in one hundred normal subjects were significantly affected by sex (P efficiency (VE) were not (P > 0.05). PTP, PTF, MFR, SGP, and MPT were significantly different between normal voice and voice disorders (P 0.05). Receiver operating characteristic (ROC) analysis found that PTP, PTF, SGP, MFR, MPT, and VE in one hundred thirteen voice disorders had similar diagnostic utility (P aerodynamic parameters of the three degrees of voice dysfunction due to vocal cord polyps were compared and found to have no significant differences (P > 0.05). PTP, PTF, MFR, SGP and MPT in forty one patients with vocal polyps were significantly different after surgical resection of vocal cord polyps (P aerodynamic parameters can objectively and effectively evaluate the variations of vocal function, and have good auxiliary diagnostic value.

  6. English Voicing in Dimensional Theory*

    Science.gov (United States)

    Iverson, Gregory K.; Ahn, Sang-Cheol

    2007-01-01

    Assuming a framework of privative features, this paper interprets two apparently disparate phenomena in English phonology as structurally related: the lexically specific voicing of fricatives in plural nouns like wives or thieves and the prosodically governed “flapping” of medial /t/ (and /d/) in North American varieties, which we claim is itself not a rule per se, but rather a consequence of the laryngeal weakening of fortis /t/ in interaction with speech-rate determined segmental abbreviation. Taking as our point of departure the Dimensional Theory of laryngeal representation developed by Avery & Idsardi (2001), along with their assumption that English marks voiceless obstruents but not voiced ones (Iverson & Salmons 1995), we find that an unexpected connection between fricative voicing and coronal flapping emerges from the interplay of familiar phonemic and phonetic factors in the phonological system. PMID:18496590

  7. Voice-associated static face image releases speech from informational masking.

    Science.gov (United States)

    Gao, Yayue; Cao, Shuyang; Qu, Tianshu; Wu, Xihong; Li, Haifeng; Zhang, Jinsheng; Li, Liang

    2014-06-01

    In noisy, multipeople talking environments such as a cocktail party, listeners can use various perceptual and/or cognitive cues to improve recognition of target speech against masking, particularly informational masking. Previous studies have shown that temporally prepresented voice cues (voice primes) improve recognition of target speech against speech masking but not noise masking. This study investigated whether static face image primes that have become target-voice associated (i.e., facial images linked through associative learning with voices reciting the target speech) can be used by listeners to unmask speech. The results showed that in 32 normal-hearing younger adults, temporally prepresenting a voice-priming sentence with the same voice reciting the target sentence significantly improved the recognition of target speech that was masked by irrelevant two-talker speech. When a person's face photograph image became associated with the voice reciting the target speech by learning, temporally prepresenting the target-voice-associated face image significantly improved recognition of target speech against speech masking, particularly for the last two keywords in the target sentence. Moreover, speech-recognition performance under the voice-priming condition was significantly correlated to that under the face-priming condition. The results suggest that learned facial information on talker identity plays an important role in identifying the target-talker's voice and facilitating selective attention to the target-speech stream against the masking-speech stream. © 2014 The Institute of Psychology, Chinese Academy of Sciences and Wiley Publishing Asia Pty Ltd.

  8. A Wireless LAN and Voice Information System for Underground Coal Mine

    Directory of Open Access Journals (Sweden)

    Yu Zhang

    2014-06-01

    In this paper we constructed a wireless information system, and developed a wireless voice communication subsystem based on Wireless Local Area Networks (WLAN) for underground coal mine, which employs Voice over IP (VoIP) technology and Session Initiation Protocol (SIP) to achieve wireless voice dispatching communications. The master control voice dispatching interface and call terminal software are also developed on the WLAN ground server side to manage and implement the voice dispatching communication. A testing system for voice communication was constructed in tunnels of an underground coal mine and used to test the wireless voice communication subsystem with a network analysis tool named Clear Sight Analyzer. In tests, the actual flow charts of registration, call establishment and call removal were analyzed by capturing call signaling of SIP terminals, and the key performance indicators were evaluated in the coal mine, including average subjective value of voice quality, packet loss rate, delay jitter, disorder packet transmission and end-to-end delay. Experimental results and analysis demonstrate that the wireless voice communication subsystem communicates well in the underground coal mine environment, achieving the designed function of voice dispatching communication.

  9. The Voice of the Technical Writer.

    Science.gov (United States)

    Euler, James S.

    The author's voice is implicit in all writing, even technical writing. It is the expression of the writer's attitude toward audience, subject matter, and self. Effective use of voice is made possible by recognizing the three roles of the technical writer: transmitter, translator, and author. As a transmitter, the writer must consciously apply an…

  10. Predictors of Choral Directors' Voice Handicap

    Science.gov (United States)

    Schwartz, Sandra

    2013-01-01

    Vocal demands of teaching are considerable and these challenges are greater for choral directors who depend on the voice as a musical and instructive instrument. The purpose of this study was to (1) examine choral directors' vocal condition using a modified Voice Handicap Index (VHI), and (2) determine the extent to which the major variables…

  11. Electrothermal frequency reference

    NARCIS (Netherlands)

    Makinwa, K.A.A.; Kashmiri, S.M.

    2011-01-01

    An electrothermal frequency-locked loop (EFLL) circuit is described. This EFLL circuit includes an oscillator in a feedback loop. A drive circuit in the EFLL circuit generates a first signal having a fundamental frequency, and an electrothermal filter (ETF) in the EFLL circuit provides a second

  12. Back-and-Forth Methodology for Objective Voice Quality Assessment: From/to Expert Knowledge to/from Automatic Classification of Dysphonia

    Science.gov (United States)

    Fredouille, Corinne; Pouchoulin, Gilles; Ghio, Alain; Revis, Joana; Bonastre, Jean-François; Giovanni, Antoine

    2009-12-01

    This paper addresses voice disorder assessment. It proposes an original back-and-forth methodology involving an automatic classification system as well as knowledge of the human experts (machine learning experts, phoneticians, and pathologists). The goal of this methodology is to bring a better understanding of acoustic phenomena related to dysphonia. The automatic system was validated on a dysphonic corpus (80 female voices), rated according to the GRBAS perceptual scale by an expert jury. Firstly, focused on the frequency domain, the classification system showed the interest of 0-3000 Hz frequency band for the classification task based on the GRBAS scale. Later, an automatic phonemic analysis underlined the significance of consonants and more surprisingly of unvoiced consonants for the same classification task. Submitted to the human experts, these observations led to a manual analysis of unvoiced plosives, which highlighted a lengthening of VOT according to the dysphonia severity validated by a preliminary statistical analysis.

  13. Back-and-Forth Methodology for Objective Voice Quality Assessment: From/to Expert Knowledge to/from Automatic Classification of Dysphonia

    Directory of Open Access Journals (Sweden)

    Corinne Fredouille

    2009-01-01

    This paper addresses voice disorder assessment. It proposes an original back-and-forth methodology involving an automatic classification system as well as knowledge of the human experts (machine learning experts, phoneticians, and pathologists). The goal of this methodology is to bring a better understanding of acoustic phenomena related to dysphonia. The automatic system was validated on a dysphonic corpus (80 female voices), rated according to the GRBAS perceptual scale by an expert jury. Firstly, focused on the frequency domain, the classification system showed the interest of 0–3000 Hz frequency band for the classification task based on the GRBAS scale. Later, an automatic phonemic analysis underlined the significance of consonants and more surprisingly of unvoiced consonants for the same classification task. Submitted to the human experts, these observations led to a manual analysis of unvoiced plosives, which highlighted a lengthening of VOT according to the dysphonia severity validated by a preliminary statistical analysis.

  14. The Effects of Amplification on Vocal Dose in Teachers with Dysphonia.

    Science.gov (United States)

    Assad, Joana Perpetuo; Gama, Ana Cristina Côrtes; Santos, Juliana Nunes; de Castro Magalhães, Max

    2017-11-06

    The purpose of this study was to determine if voice amplification influenced vocal dose in female teachers with dysphonia. This was an experimental study with comparative intrasubjects in which 15 individuals were compared in two different moments: condition 1 (C1) without voice amplification and condition 2 (C2) with voice amplification. All of them were female, kindergarten and elementary school teachers who presented organic or functional dysphonia. The search was carried out at the school where the teachers work. The professional voice use was considered the teachers' activity for a continuous period of two classes (average recording time of 96 minutes, with no difference in time between C1 and C2). To measure the dose we used the vocal dosimeter composed of a microphone, an accelerometer fixed to the neck, and a portable unit that stores the vocal data. The phonation data (intensity, fundamental frequency, phonation percentage, cycle dose, and distance dose) were analyzed by the equipment software (VoxLog). The use of vocal amplification in teachers promotes a reduction of the fundamental frequency (295.6-267.7 Hz), the voice intensity (96.2-93.3 dB sound pressure level), the cycle doses (489.4-345.2 thousand cycles per second), and distance doses (3,800-2,300 m). The vocal amplification allows the teacher to maintain the same phonation time (phonation percentage) but decreases the number of vocal fold oscillations (cycle dose) and the total distance traveled by the vocal fold tissue during phonation (distance dose), reducing the exposure of the vocal folds to voice trauma. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  15. High quality voice synthesis middle ware; Kohinshitsu onsei gosei middle war

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    2000-03-01

    Toshiba Corp. newly developed a natural voice synthesis system, TOS Drive TTS (TOtally speaker Driven Text-To-Speech) system, in which natural high-quality read-aloud is greatly improved, and also developed as its application a voice synthesis middle ware. In the newly developed system, using as a model a narrator's voice recorded preliminarily, a metrical control dictionary is automatically learned that reproduces the characteristics of metrical patterns such as intonation or rhythm of a human voice, as is a voice bases dictionary that reproduces the characteristics of a voice quality, enabling natural voice synthesis to be realized that picks up human voice characteristics. The system is high quality and also very compact, while the voice synthesis middle ware utilizing this technology is adaptable to various platforms such as MPU or OS. The system is very suitable for audio response in the ITS field having car navigation systems as the core; besides, expanded application is expected to an audio response system that used to employ a sound recording and reproducing system. (translated by NEDO)

  16. Simultaneous negative refraction and focusing of fundamental frequency and second-harmonic fields by two-dimensional photonic crystals

    Energy Technology Data Exchange (ETDEWEB)

    Zhang, Jun [School of Physics, Beijing Institute of Technology and Beijing Key Laboratory of Fractional Signals and Systems, Beijing 100081 (China); College of Physics and Electronic Engineering, Henan Normal University, 453007 Xinxiang, Henan (China); Zhang, Xiangdong, E-mail: zhangxd@bit.edu.cn [School of Physics, Beijing Institute of Technology and Beijing Key Laboratory of Fractional Signals and Systems, Beijing 100081 (China)

    2015-09-28

    Simultaneous negative refraction for both the fundamental frequency (FF) and second-harmonic (SH) fields in two-dimensional nonlinear photonic crystals have been found through both the physical analysis and exact numerical simulation. By combining such a property with the phase-matching condition and strong second-order susceptibility, we have designed a SH lens to realize focusing for both the FF and SH fields at the same time. Good-quality non-near field images for both FF and SH fields have been observed. The physical mechanism for such SH focusing phenomena has been disclosed, which is different from the backward SH generation as has been pointed out in the previous investigations. In addition, the effect of absorption losses on the phenomena has also been discussed. Thus, potential applications of these phenomena to biphotonic microscopy technique are anticipated.

  17. AUTHORIAL VOICE IN ISLAMIC COLLEGE ENGLISH DEPARTMENT STUDENTS’ ARGUMENTATIVE WRITING

    Directory of Open Access Journals (Sweden)

    Nur Afifi

    2014-11-01

    While considered elusive and abstract, authorial voice is paramount in English writing. Unfortunately, many Indonesian EFL learners find it highly challenging to show their voice in their writing. The importance of voice is even more pronounced in argumentative writing, since this kind of writing needs an obvious stance from the writer. This study investigates the authorial voice students expressed in their argumentative writing. The purpose of this study is to gain a picture of students' writing ability, especially in authorial voice, to map the road in guiding the next writing classes. The object of the study is the argumentative writing produced by English department students at one Indonesian State College of Islamic Studies in their Writing III course. Using Hyland's interactional model of voice (2008), the data analysis shows that the authorial presence in the essays is at position 2 on a 0–4 scale, which means the reader feels a somewhat weak presence of the authorial voice in the essay. This result confirms the findings of some previous studies that EFL learners, especially from 'interdependent' cultural backgrounds, tend to find authorial voice difficult in writing English essays.

  18. Understanding the mechanisms of familiar voice-identity recognition in the human brain.

    Science.gov (United States)

    Maguinness, Corrina; Roswandowitz, Claudia; von Kriegstein, Katharina

    2018-03-31

    Humans have a remarkable skill for voice-identity recognition: most of us can remember many voices that surround us as 'unique'. In this review, we explore the computational and neural mechanisms which may support our ability to represent and recognise a unique voice-identity. We examine the functional architecture of voice-sensitive regions in the superior temporal gyrus/sulcus, and bring together findings on how these regions may interact with each other, and additional face-sensitive regions, to support voice-identity processing. We also contrast findings from studies on neurotypicals and clinical populations which have examined the processing of familiar and unfamiliar voices. Taken together, the findings suggest that representations of familiar and unfamiliar voices might dissociate in the human brain. Such an observation does not fit well with current models for voice-identity processing, which by-and-large assume a common sequential analysis of the incoming voice signal, regardless of voice familiarity. We provide a revised audio-visual integrative model of voice-identity processing which brings together traditional and prototype models of identity processing. This revised model includes a mechanism of how voice-identity representations are established and provides a novel framework for understanding and examining the potential differences in familiar and unfamiliar voice processing in the human brain. Copyright © 2018 Elsevier Ltd. All rights reserved.

  19. Voice Activated Cockpit Management Systems: Voice-Flight NexGen, Phase I

    Data.gov (United States)

    National Aeronautics and Space Administration — Speaking to the cockpit as a method of system management in flight can become an effective interaction method, since voice communication is very efficient. Automated...

  20. Voices Falling Through the Air

    Directory of Open Access Journals (Sweden)

    Paul Elliman

    2012-11-01

    Where am I? Or as the young boy in Jules Verne’s Journey to the Centre of the Earth calls back to his distant-voiced companions: ‘Lost… in the most intense darkness.’ ‘Then I understood it,’ says the boy, Axel, ‘To make them hear me, all I had to do was to speak with my mouth close to the wall, which would serve to conduct my voice, as the wire conducts the electric fluid’ (Verne 1864). By timing their calls, the group of explorers work out that Axel is separated from them by a distance of four miles, held in a cavernous vertical gallery of smooth rock. Feeling his way down towards the others, the boy ends up falling, along with his voice, through the space. Losing consciousness he seems to give himself up to the space...

  1. Audiovisual speech facilitates voice learning.

    Science.gov (United States)

    Sheffert, Sonya M; Olson, Elizabeth

    2004-02-01

    In this research, we investigated the effects of voice and face information on the perceptual learning of talkers and on long-term memory for spoken words. In the first phase, listeners were trained over several days to identify voices from words presented auditorily or audiovisually. The training data showed that visual information about speakers enhanced voice learning, revealing cross-modal connections in talker processing akin to those observed in speech processing. In the second phase, the listeners completed an auditory or audiovisual word recognition memory test in which equal numbers of words were spoken by familiar and unfamiliar talkers. The data showed that words presented by familiar talkers were more likely to be retrieved from episodic memory, regardless of modality. Together, these findings provide new information about the representational code underlying familiar talker recognition and the role of stimulus familiarity in episodic word recognition.

  2. Who speaks for extinct nations? The Beothuk and narrative voice

    Directory of Open Access Journals (Sweden)

    C. Leggo

    1995-04-01

    The Beothuk of Newfoundland were among the first inhabitants of North America to encounter European explorers and settlers. By the first part of the nineteenth century the Beothuk were extinct, exterminated by the fishers and soldiers and settlers of western Europe. The last Beothuk was a woman named Shanadithit. She was captured and lived with white settlers for a few years before she died in 1829. Today all that remains of the Beothuk nation, which once numbered seven hundred to one thousand people, are some bones, arrowheads, tools, written records of explorers and settlers, and copies of drawings by Shanadithit in the Newfoundland Museum. In recent years several writers (all are white and male) have written fiction and poetry and drama about the Beothuk, including Peter Such (Riverrun, 1973), Paul O'Neill (Legends of a Lost Tribe, 1976), Sid Stephen (Beothuk Poems, 1976), Al Pittman ("Shanadithit," 1978), Geoffrey Ursell (The Running of the Deer; A Play, 1981), Donald Gale (Sooshewan: A Child of the Beothuk, 1988), and Kevin Major (Blood Red Ochre, 1990). A recurring theme in all these narratives is the theme of regret and guilt. These narrative accounts of the Beothuk raise significant questions about voice and narrative, including: Who can speak for Native peoples? Who can speak for extinct peoples? Are there peoples without voices? How is voice historically determined? What is the relationship between voice and power? How are the effects of voice generated? What is an authentic voice? How is voice related to the illusion of presence? What is the relation between voice and silence? In examining contemporary narrative accounts of the Beothuk my goal is to reveal the rhetorical ways in which the Beothuk are given voice(s) and to interrogate the ethical and pedagogical implications of contemporary authors revisiting and revisioning and re-voicing a nation of people long extinct.

  3. Voice disorders in teachers: occupational risk factors and psycho-emotional factors.

    Science.gov (United States)

    van Houtte, Evelyne; Claeys, Sofie; Wuyts, Floris; van Lierde, Kristiane

    2012-10-01

    Teaching is a high-risk occupation for developing voice disorders. The purpose of this study was to investigate previously described vocal risk factors as well as to identify new risk factors related to both the personal life of the teacher (fluid intake, voice-demanding activities, family history of voice disorders, and children at home) and to environmental factors (temperature changes, chalk use, presence of curtains, carpet, or air-conditioning, acoustics in the classroom, and noise in and outside the classroom). The study group comprised 994 teachers (response rate 46.6%). All participants completed a questionnaire. Chi-square tests and logistic regression analyses were performed. A total of 51.2% (509/994) of the teachers presented with voice disorders. Women reported more voice disorders compared to men (56.4% versus 40.4%, P history of voice disorders (P = 0.005), temperature changes in the classroom (P = 0.017), the number of pupils per classroom (P = 0.001), and noise level inside the classroom (P = 0.001). Teachers with voice disorders presented a higher level of psychological distress (P < 0.001) compared to teachers without voice problems. Voice disorders are frequent among teachers, especially in female teachers. The results of this study emphasize that multiple factors are involved in the development of voice disorders.

  4. Effect of voice therapy in sulcus vocalis: A single case study

    Directory of Open Access Journals (Sweden)

    R. Rajasudhakar

    2016-02-01

    Background: Sulcus vocalis is a structural deformity of the vocal ligament. It is the focal invagination of the epithelium deeply attaching to the vocal ligament. There is a dearth of literature on the outcome of voice therapy in sulcus vocalis condition. Objective: The primary objective of this study was to document voice characteristics of sulcus vocalis and the secondary objective was to establish the efficacy of voice therapy in a patient with sulcus vocalis. Method: A trial of voice therapy was given to the client who was diagnosed as having sulcus vocalis. Boon’s facilitation techniques were used in voice therapy along with other techniques such as breath holding and push and pull approach prior to surgery. Acoustic, aerodynamic, perceptual, quantitative measures of voice quality and self-rating measurements were performed before and after voice therapy. Results: Improvement was noticed in 10/10 acoustic, 4/4 aerodynamic, perceptual, dysphonia severity index and voice handicap index scores, which hinted that voice therapy can be an option critically for clients with sulcus vocalis in the initial stage. Conclusion: Voice therapy showed promising improvement in the study and it must be recommended as the initial treatment option before any surgical management.

  5. A Wireless LAN and Voice Information System for Underground Coal Mine

    OpenAIRE

    Yu Zhang; Wei Yang; Dongsheng Han; Young-Il Kim

    2014-01-01

    In this paper we constructed a wireless information system, and developed a wireless voice communication subsystem based on Wireless Local Area Networks (WLAN) for underground coal mine, which employs Voice over IP (VoIP) technology and Session Initiation Protocol (SIP) to achieve wireless voice dispatching communications. The master control voice dispatching interface and call terminal software are also developed on the WLAN ground server side to manage and implement the voice dispatching co...

  6. Why Is My Voice Changing? (For Teens)

    Science.gov (United States)

    ... enter puberty earlier or later than others. How Deep Will My Voice Get? How deep a guy's voice gets depends on his genes: ...

  7. Acoustic Analysis of Voice in Singers: A Systematic Review

    Science.gov (United States)

    Gunjawate, Dhanshree R.; Ravi, Rohit; Bellur, Rajashekhar

    2018-01-01

    Purpose: Singers are vocal athletes having specific demands from their voice and require special consideration during voice evaluation. Presently, there is a lack of standards for acoustic evaluation in them. The aim of the present study was to systematically review the available literature on the acoustic analysis of voice in singers. Method: A…

  8. Acoustic analysis with vocal loading test in occupational voice disorders: outcomes before and after voice therapy.

    Science.gov (United States)

    Niebudek-Bogusz, Ewa; Kotyło, Piotr; Politański, Piotr; Sliwińska-Kowalska, Mariola

    2008-01-01

    To assess the usefulness of acoustic analysis with vocal loading test for evaluating the treatment outcomes in occupational voice disorders. Fifty-one female teachers with dysphonia were examined (Voice Handicap Index--VHI, laryngovideostroboscopy and acoustic analysis with vocal loading) before and after treatment. The outcomes of teachers receiving vocal training (group I) were referred to outcomes of group II receiving only voice hygiene instructions. The results of subjective assessment (VHI score) and objective evaluation (acoustic analysis) improved more significantly in group I than in group II. The post-treatment examination revealed a decreased percentage of subjects with deteriorated jitter parameters after vocal loading, particularly in group I. Acoustic analysis with vocal loading test can be a helpful tool in the diagnosis and evaluation of treatment efficacy in occupational dysphonia.

  9. Voice Quality Measuring Setup with Automatic Voice over IP Call Generator and Lawful Interception Packet Analyzer

    Directory of Open Access Journals (Sweden)

    PLEVA Matus

    This paper describes a packet measuring laboratory setup, which could also be used for lawful interception applications, using a professional packet analyzer, a Voice over IP call generator, a free call server (Asterisk linux setup) and the appropriate software and hardware described below. This setup was used for measuring the quality of automatically generated VoIP calls under stressed network conditions, when the call manager server was flooded with high bandwidth traffic, near the bandwidth limit of the connected switch. The call generator realizes 30 calls simultaneously, and the packet capturer & analyzer could decode the VoIP traffic, extract RTP session data, automatically analyze the voice quality using standardized MOS (Mean Opinion Score) values and also describe the source of the voice degradation (jitter, packet loss, codec, delay, etc.).

  10. Factors influencing referral of patients with voice disorders from primary care to otolaryngology.

    Science.gov (United States)

    Cohen, Seth M; Kim, Jaewhan; Roy, Nelson; Courey, Mark

    2014-01-01

    To evaluate the frequency, timing, and factors that influence referral of patients with laryngeal/voice disorders to otolaryngology following initial evaluation by a primary care physician (PCP). Retrospective analysis of a large, national administrative US claims database. Patients with a laryngeal disorder based on International Classification of Diseases, Ninth Revision, Clinical Modification codes from January 1, 2004 to December 31, 2008, seen by a PCP as an outpatient (with or without otolaryngology involvement), and continuously enrolled for 12 months were included. Patient age, gender, geographic region, last PCP laryngeal diagnosis, comorbid conditions, time from first PCP visit to first otolaryngology visit, number of PCP outpatient visits, and number of PCP laryngeal diagnoses were collected. Cox and generalized linear regressions were performed. A total of 149,653 unique patients saw a PCP as an outpatient for a laryngeal/voice disorder, with 136,152 (90.9%) only seeing a PCP, 6,013 (4.0%) referred by a PCP to an otolaryngologist, and 3,820 (2.6%) self-referred to an otolaryngologist. Acute laryngitis had a lower hazard ratio (HR) for otolaryngology referral than chronic laryngitis, nonspecific dysphonia, and laryngeal cancer. Having multiple comorbid conditions was associated with a greater HR for otolaryngology referral than having no comorbidities. Patient age, gender, and geographic region also affected otolaryngology referral. The time to otolaryngology evaluation ranged from 3 months. PCP-referred patients had less time to the otolaryngology evaluation than self-referred patients. Multiple factors affected otolaryngology referral for patients with laryngeal/voice disorders. Further education of PCPs regarding appropriate otolaryngology referral for laryngeal/voice disorders is needed. © 2013 The American Laryngological, Rhinological and Otological Society, Inc.

  11. Reverberation impairs brainstem temporal representations of voiced vowel sounds: challenging periodicity-tagged segregation of competing speech in rooms

    Directory of Open Access Journals (Sweden)

    Mark Sayles

    2015-01-01

    The auditory system typically processes information from concurrently active sound sources (e.g., two voices speaking at once), in the presence of multiple delayed, attenuated and distorted sound-wave reflections (reverberation). Brainstem circuits help segregate these complex acoustic mixtures into auditory objects. Psychophysical studies demonstrate a strong interaction between reverberation and fundamental-frequency (F0) modulation, leading to impaired segregation of competing vowels when segregation is on the basis of F0 differences. Neurophysiological studies of complex-sound segregation have concentrated on sounds with steady F0s, in anechoic environments. However, F0 modulation and reverberation are quasi-ubiquitous. We examine the ability of 129 single units in the ventral cochlear nucleus of the anesthetized guinea pig to segregate the concurrent synthetic vowel sounds /a/ and /i/, based on temporal discharge patterns under closed-field conditions. We address the effects of added real-room reverberation, F0 modulation, and the interaction of these two factors, on brainstem neural segregation of voiced speech sounds. A firing-rate representation of single-vowels’ spectral envelopes is robust to the combination of F0 modulation and reverberation: local firing-rate maxima and minima across the tonotopic array code vowel-formant structure. However, single-vowel F0-related periodicity information in shuffled inter-spike interval distributions is significantly degraded in the combined presence of reverberation and F0 modulation. Hence, segregation of double-vowels’ spectral energy into two streams (corresponding to the two vowels), on the basis of temporal discharge patterns, is impaired by reverberation; specifically when F0 is modulated. All unit types (primary-like, chopper, onset) are similarly affected. These results offer neurophysiological insights to perceptual organization of complex acoustic scenes under realistically challenging

  12. Stated product formulation preferences for HIV pre-exposure prophylaxis among women in the VOICE-D (MTN-003D) study.

    Science.gov (United States)

    Luecke, Ellen H; Cheng, Helen; Woeber, Kubashni; Nakyanzi, Teopista; Mudekunye-Mahaka, Imelda C; van der Straten, Ariane

    2016-01-01

    The effectiveness of HIV pre-exposure prophylaxis (PrEP) requires consistent and correct product use, thus a deeper understanding of women's stated product formulation preferences, and the correlates of those preferences, can help guide future research. VOICE-D (MTN-003D), a qualitative ancillary study conducted after the VOICE trial, retrospectively explored participants' tablet and gel use, as well as their preferences for other potential PrEP product formulations. We conducted an analysis of quantitative and qualitative data from VOICE-D participants. During in-depth interviews, women were presented with pictures and descriptions of eight potential PrEP product formulations, including the oral tablet and vaginal gel tested in VOICE, and asked to discuss which product formulations they would prefer to use and why. Seven of the original product formulations displayed were combined into preferred product formulation categories based on exploratory factor and latent class analyses. We examined demographic and behavioural correlates of these preferred product formulation categories. In-depth interviews with participants were conducted, coded, and analysed for themes related to product preference. Of the 68 female participants who completed in-depth interviews (22 South Africa, 24 Zimbabwe, 22 Uganda), median age was 28 (range 21-41), 81% were HIV negative, and 49% were married or living with a partner. Four preferred product formulation categories were identified via exploratory factor analysis: 1) oral tablets; 2) vaginal gel; 3) injectable, implant, or vaginal ring; and 4) vaginal film or suppository. A majority of women (81%) expressed a preference for product formulations included in category 3. Characteristics significantly associated with each preferred product category differed. Attributes described by participants as being important in a preferred product formulation included duration of activity, ease of use, route of administration, clinic- versus self

  13. Student voice: An emerging discourse in Irish education policy

    Directory of Open Access Journals (Sweden)

    Domnall Fleming

    2015-09-01

    In positioning student voice within the Irish education policy discourse it is imperative that this emergent and complex concept is explored and theorized in the context of its definition and motivation. Student voice can then be positioned and critiqued as it emerged within Irish education policy primarily following Ireland’s ratification of the United Nations Charter on the Rights of the Child (UNCRC) in 1992. Initially emerging in policy from a rights-based and democratic citizenship perspective, the student council became the principal construct for student voice in Irish post-primary schools. While central to the policy discourse, the student council construct has become tokenistic and redundant in practice. School evaluation policy, both external and internal, became a further catalyst for student voice in Ireland. Both processes further challenge and contest the motivation for student voice and point to the concept as an instrument for school improvement and performativity that lacks any centrality for a person-centered, rights-based, dialogic and consultative student voice within an inclusive classroom and school culture.

  14. Perception of Paralinguistic Traits in Synthesized Voices

    DEFF Research Database (Denmark)

    Baird, Alice Emily; Hasse Jørgensen, Stina; Parada-Cabaleiro, Emilia

    2017-01-01

    Along with the rise of artificial intelligence and the internet-of-things, synthesized voices are now common in daily life, providing us with guidance, assistance, and even companionship. From formant to concatenative synthesis, the synthesized voice continues to be defined by the same traits we...

  15. Voice as Form of Life and Life Form

    Directory of Open Access Journals (Sweden)

    Sandra Laugier

    2015-10-01

    This paper studies the concept of form of life as central to ordinary language philosophy (as understood in Wittgenstein’s, Austin’s and Stanley Cavell’s work): philosophy of our language as spoken; pronounced by a human voice within a form of life. Such an approach to Wittgenstein’s later philosophy shifts the question of the common use of language – central to Wittgenstein’s Investigations – to the definition of the subject as voice, and to the reinvention of subjectivity in language. The voice is both a subjective and common expression: it is what makes it possible for my individual voice, or claim, to become shared and for our forms of life to be intertwined with a lifeform.

  16. Low-frequency noise complaints

    DEFF Research Database (Denmark)

    Pedersen, Christian Sejer; Møller, Henrik; Persson-Waye, Kerstin

    2006-01-01

    is only heard by a single person in the household. This raises the fundamental question whether the complainants are annoyed by an external physical sound, or if other explanations such as low-frequency tinnitus must be sought. The main aim of this study is to answer this fundamental question...

  17. The effect of singing training on voice quality for people with quadriplegia.

    Science.gov (United States)

    Tamplin, Jeanette; Baker, Felicity A; Buttifant, Mary; Berlowitz, David J

    2014-01-01

    Despite anecdotal reports of voice impairment in quadriplegia, the exact nature of these impairments is not well described in the literature. This article details objective and subjective voice assessments for people with quadriplegia at baseline and after a respiratory-targeted singing intervention. Randomized controlled trial. Twenty-four participants with quadriplegia were randomly assigned to a 12-week program of either a singing intervention or active music therapy control. Recordings of singing and speech were made at baseline, 6 weeks, 12 weeks, and 6 months postintervention. These deidentified recordings were used to measure sound pressure levels and assess voice quality using the Multidimensional Voice Profile and the Perceptual Voice Profile. Baseline voice quality data indicated deviation from normality in the areas of breathiness, strain, and roughness. A greater percentage of intervention participants moved toward more normal voice quality in terms of jitter, shimmer, and noise-to-harmonic ratio; however, the improvements failed to achieve statistical significance. Subjective and objective assessments of voice quality indicate that quadriplegia may have a detrimental effect on voice quality; in particular, causing a perception of roughness and breathiness in the voice. The results of this study suggest that singing training may have a role in ameliorating these voice impairments. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.

  18. Voice hearing within the context of hearers' social worlds: an interpretative phenomenological analysis.

    Science.gov (United States)

    Mawson, Amy; Berry, Katherine; Murray, Craig; Hayward, Mark

    2011-09-01

    Research has found relational qualities of power and intimacy to exist within hearer-voice interactions. The present study aimed to provide a deeper understanding of the interpersonal context of voice hearing by exploring participants' relationships with their voices and other people in their lives. This research was designed in consultation with service users and employed a qualitative, phenomenological, and idiographic design using semi-structured interviews. Ten participants, recruited via mental health services, and who reported hearing voices in the previous week, completed the interviews. These were transcribed verbatim and analysed using interpretative phenomenological analysis. Five themes resulted from the analysis. Theme 1: 'person and voice' demonstrated that participants' voices often reflected the identity, but not always the quality of social acquaintances. Theme 2: 'voices changing and confirming relationship with the self' explored the impact of voice hearing in producing an inferior sense-of-self in comparison to others. Theme 3: 'a battle for control' centred on issues of control and a dilemma of independence within voice relationships. Theme 4: 'friendships facilitating the ability to cope' and theme 5: 'voices creating distance in social relationships' explored experiences of social relationships within the context of voice hearing, and highlighted the impact of social isolation for voice hearers. The study demonstrated the potential role of qualitative research in developing theories of voice hearing. It extended previous research by highlighting the interface between voices and the social world of the hearer, including reciprocal influences of social relationships on voices and coping. Improving voice hearers' sense-of-self may be a key factor in reducing the distress caused by voices. ©2010 The British Psychological Society.

  19. Employee voice and engagement : Connections and consequences

    NARCIS (Netherlands)

    Rees, C.; Alfes, K.; Gatenby, M.

    2013-01-01

    This paper considers the relationship between employee voice and employee engagement. Employee perceptions of voice behaviour aimed at improving the functioning of the work group are found to have both a direct impact and an indirect impact on levels of employee engagement. Analysis of data from two

  20. Voices in Suicide. The Relationship between the Firestone Voice Scale for Self-Destructive Behavior and Self-Destructive Life-Styles.

    Science.gov (United States)

    Firestone, Robert W.

    This article presents findings from recent research demonstrating a significant relationship between parental introjects or "voices," and self-destructive behavior. The "voice" is defined as a systematized, integrated pattern of negative thoughts accompanied by angry affect, that is the basis of an individual's maladaptive…

  1. Controlling An Electric Car Starter System Through Voice

    Directory of Open Access Journals (Sweden)

    A.B. Muhammad Firdaus

    2015-04-01

    The automobile has become one of the most common modes of transportation, since many Malaysians can afford to own a car. Many in-car technologies are available on the market, and one of them is the voice-controlled system. Voice recognition is the process of automatically recognizing a certain utterance spoken by a particular speaker, based on individual information contained in speech waves. This paper aims to build a car controlled by the human voice. An essential pre-processing step in voice recognition systems is detecting the presence of noise. Sensitivity to speech variability, insufficient recognition accuracy, and susceptibility to mimicry are among the main technical obstacles that prevent the widespread adoption of speech-based recognition systems. Voice recognition systems work reasonably well in quiet conditions but poorly in noisy conditions or over distorted channels. The key focus of the project is to control an electric car starter system.

  2. Shielding voices: The modulation of binding processes between voice features and response features by task representations.

    Science.gov (United States)

    Bogon, Johanna; Eisenbarth, Hedwig; Landgraf, Steffen; Dreisbach, Gesine

    2017-09-01

    Vocal events offer not only semantic-linguistic content but also information about the identity and the emotional-motivational state of the speaker. Furthermore, most vocal events have implications for our actions and therefore include action-related features. But the relevance and irrelevance of vocal features varies from task to task. The present study investigates binding processes for perceptual and action-related features of spoken words and their modulation by the task representation of the listener. Participants reacted with two response keys to eight different words spoken by a male or a female voice (Experiment 1) or spoken by an angry or neutral male voice (Experiment 2). There were two instruction conditions: half of participants learned eight stimulus-response mappings by rote (SR), and half of participants applied a binary task rule (TR). In both experiments, SR instructed participants showed clear evidence for binding processes between voice and response features indicated by an interaction between the irrelevant voice feature and the response. By contrast, as indicated by a three-way interaction with instruction, no such binding was found in the TR instructed group. These results are suggestive of binding and shielding as two adaptive mechanisms that ensure successful communication and action in a dynamic social environment.

  3. Spatial stability of jets - the nonaxisymmetric fundamental and reflection modes

    International Nuclear Information System (INIS)

    Hardee, P.E.

    1987-01-01

    A spatial stability analysis of the relativistic dispersion relation governing the growth and propagation of harmonic components comprising a perturbation to the surface of a cylindrical jet is performed. The spatial growth of harmonic components associated with the nonaxisymmetric fundamental solution and reflection solutions of several Fourier modes is analyzed. Approximate analytical expressions describing resonant frequencies and wavelengths, and maximum growth rates at resonance applicable to relativistic jets are found from the dispersion relation, and the nature of the resonances is explored. On transonic jets there is only a fundamental solution for each Fourier mode with no resonance or maximum growth rate. On supersonic jets there is a fundamental solution and reflection solutions for each Fourier mode, and each solution contains a resonance at which the growth rate is a maximum. A numerical analysis of the fundamental and first three reflection solutions of the axisymmetric and first three nonaxisymmetric Fourier modes is performed. The numerical analysis is restricted to nonrelativistic flows but otherwise covers a broad range of Mach numbers and jet densities. The numerical results are used along with the analytical results to obtain accurate expressions for resonant frequencies, wavelengths, and growth rates as a function of Mach number and jet density. In all cases the fastest spatial growth rate at a given frequency is of harmonic components associated with the fundamental solution of one of the nonaxisymmetric Fourier modes. The application of these results to jet structure and implication of these results for jet structure in extragalactic radio sources are considered. 23 references

  4. Speech-Language Pathology production regarding voice in popular singing.

    Science.gov (United States)

    Drumond, Lorena Badaró; Vieira, Naymme Barbosa; Oliveira, Domingos Sávio Ferreira de

    2011-12-01

    To present a literature review about the Brazilian scientific production in Speech-Language Pathology and Audiology regarding voice in popular singing in the last decade, as for number of publications, musical styles studied, focus of the researches, and instruments used for data collection. Cross-sectional descriptive study carried out in two stages: search in databases and publications encompassing the last decade of researches in this area in Brazil, and reading of the material obtained for posterior categorization. The databases LILACS and SciELO, the Database of Dissertations and Theses organized by CAPES, the online version of Acta ORL, and the online version of OPUS were searched, using the following uniterms: voice, professional voice, singing voice, dysphonia, voice disorders, voice training, music, dysodia. Articles published between the years 2000 and 2010 were selected. The researches found were classified and categorized after reading their abstracts and, when necessary, the whole study. Twenty researches within the proposed theme were selected, all of which were descriptive, involving several musical styles. Twelve studies focused on the evaluation of the popular singer's voice, and the most frequently used data collection instrument was the auditory-perceptual evaluation. The results of the publications found corroborate the objectives proposed by the authors and the different methodologies. The number of studies published is still restricted when compared to the diversity of musical genres and the uniqueness of the popular singer.

  5. Violence in schools and the voice of teachers.

    Science.gov (United States)

    Dornelas, Rodrigo; Santos, Thaynara Alves Dos; Oliveira, Daniela Sena de; Irineu, Roxane de Alencar; Brito, Aline; Silva, Kelly

    2017-08-10

    To correlate self-reporting of voice disorders with habits that impact voice production and situations of violence experienced by teachers. The study involved 41 elementary-school teachers of rural and urban areas. Two instruments were used for data collection: The Vocal Production Condition - Teacher (CPV-P) questionnaire and the Screening Index for Voice Disorders - ITDV. The chi-square test was used to verify association among variables with a significance level of 5%. The sample consisted of 8 men and 33 women aged 25-66 years with a median of 39 years. Regarding vocal habits, 33 people (80.5%) mentioned screaming as a usual practice, and 40 people (97.5%) declared that they talk a lot. As for voice care, 31 people (73.1%) reported drinking water while using their voice. As for the ITDV total score, 30 teachers (73.1%) were above the score threshold set for predisposition to vocal disorders. Statistical analysis revealed a significant association between female participants and complaint of graffiti writings as a type of violence. No significant correlation between the ITDV results with gender and the ITDV with forms of violence evaluated in the study was indicated. Self-reporting of voice disorders showed no significant relationship with acts of violence. However, analysis of the context of violence in schools and vocal problems are issues worthy of attention, particularly the observed naturalization of gender issues, which is seldom problematized.

  6. Musicians do not benefit from differences in fundamental frequency when listening to speech in competing speech backgrounds

    DEFF Research Database (Denmark)

    Madsen, Sara Miay Kim; Whiteford, Kelly L.; Oxenham, Andrew J.

    2017-01-01

    Recent studies disagree on whether musicians have an advantage over non-musicians in understanding speech in noise. However, it has been suggested that musicians may be able to use differences in fundamental frequency (F0) to better understand target speech in the presence of interfering talkers... Here we studied a relatively large (N=60) cohort of young adults, equally divided between nonmusicians and highly trained musicians, to test whether the musicians were better able to understand speech either in noise or in a two-talker competing speech masker. The target speech and competing speech... were presented with either their natural F0 contours or on a monotone F0, and the F0 difference between the target and masker was systematically varied. As expected, speech intelligibility improved with increasing F0 difference between the target and the two-talker masker for both natural and monotone...

  7. [The prevalence, causes and specific features of voice disturbances in teachers].

    Science.gov (United States)

    Orlova, O S; Vasilenko, Iu S; Zakharova, A F; Samokhvalova, L O; Kozlova, P A

    2000-01-01

    The paper analyzes voice disturbances, their causes and specific features in teachers based on the questionnaires filled by 934 general educational school teachers. The teachers have been found to associate voice disturbances not only with changes in the voice timbre, but with different subjective feelings that make their professional activity difficult. The major factors that cause voice disturbances are the voice overloads that differ in teachers of different specialities, their inability to use the voice, psychoemotional stresses, and frequent colds, as well as a combination of several factors. The incidence of vocal apparatus diseases does not tend to decrease, which makes it necessary to implement combined medical and pedagogical prophylactic measures to prevent dysphonia.

  8. Smartphone-based ecological momentary assessment and intervention in a coping-focused intervention for hearing voices (SAVVy): study protocol for a pilot randomised controlled trial.

    Science.gov (United States)

    Bell, Imogen H; Fielding-Smith, Sarah F; Hayward, Mark; Rossell, Susan L; Lim, Michelle H; Farhall, John; Thomas, Neil

    2018-05-02

    Smartphone-based ecological momentary assessment and intervention (EMA/I) show promise for enhancing psychological treatments for psychosis. EMA has the potential to improve assessment and formulation of experiences which fluctuate day-to-day, and EMI may be used to prompt use of therapeutic strategies in daily life. The current study is an examination of these capabilities in the context of a brief, coping-focused intervention for distressing voice hearing experiences. This is a rater-blinded, pilot randomised controlled trial comparing a four-session intervention in conjunction with use of smartphone EMA/I between sessions, versus treatment-as-usual. The recruitment target is 34 participants with persisting and distressing voice hearing experiences, recruited through a Voices Clinic based in Melbourne, Australia, and via wider advertising. Allocation will be made using a minimisation procedure, balancing the frequency of voices between groups. Assessments are completed at baseline and 8 weeks post-baseline. The primary outcomes of this trial will focus on feasibility and acceptability of the intervention and trial methodology, with secondary outcomes examining preliminary clinical effects related to overall voice severity, the emotional and functional impact of the voices, and emotional distress. This study offers a highly novel examination of specific smartphone capabilities and their integration with traditional psychological treatment for distressing voices. Such technology has potential to enhance psychological interventions and promote adaptation to distressing experiences. Australian New Zealand Clinical Trial Registry, ACTRN12617000348358. Registered on 7 March 2017.
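
The allocation step named in the abstract above (minimisation that balances voice frequency between groups) can be illustrated with a minimal sketch. The abstract does not describe the trial's actual procedure, so everything here is an assumption: the arm labels, the two hypothetical frequency levels ("high" and "low"), and the deterministic tie-break. Real minimisation schemes usually add a random element.

```python
from collections import Counter

# Hypothetical arm labels; the abstract only names the two conditions.
ARMS = ("intervention", "treatment_as_usual")

def assign_arm(level, history):
    """Pick the arm with the fewest prior participants at this factor level.

    `history` is a list of (level, arm) pairs for participants already
    allocated. Ties go to the first arm in ARMS (a simplification).
    """
    counts = Counter(arm for lvl, arm in history if lvl == level)
    return min(ARMS, key=lambda arm: counts[arm])

# Hypothetical sequence of voice-frequency levels for six participants.
history = []
for level in ["high", "high", "low", "high", "low", "low"]:
    history.append((level, assign_arm(level, history)))
```

Within each frequency level, the two arms never differ by more than one participant under this rule.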

  9. Performer's attitudes toward seeking health care for voice issues: understanding the barriers.

    Science.gov (United States)

    Gilman, Marina; Merati, Albert L; Klein, Adam M; Hapner, Edie R; Johns, Michael M

    2009-03-01

    Contemporary commercial music (CCM) performers rely heavily on their voice, yet may not be aware of the importance of proactive voice care. This investigation intends to identify perceptions and barriers to seeking voice care among CCM artists. This cross-sectional observational study used a 10-item Likert-based response questionnaire to assess current perceptions regarding voice care in a population of randomly selected participants of a professional CCM conference. Subjects (n=78) were queried regarding their likelihood to seek medical care for minor medical problems and specifically problems with their voice. Additional questions investigated anxiety about seeking voice care from a physician specialist, speech language pathologist, or voice coach; apprehension regarding findings of laryngeal examination, laryngeal imaging procedures; and the effect of medical insurance on the likelihood of seeking medical care. Eighty-two percent of subjects reported that their voice was a critical part of their profession; 41% stated that they were not likely to seek medical care for problems with their voice; and only 19% were reluctant to seek care for general medical problems (Peducation about the importance of voice care is needed in this population of vocal performers.

  10. Initial Progress Toward Development of a Voice-Based Computer-Delivered Motivational Intervention for Heavy Drinking College Students: An Experimental Study

    Science.gov (United States)

    Lechner, William J; MacGlashan, James; Wray, Tyler B; Littman, Michael L

    2017-01-01

    Background Computer-delivered interventions have been shown to be effective in reducing alcohol consumption in heavy drinking college students. However, these computer-delivered interventions rely on mouse, keyboard, or touchscreen responses for interactions between the users and the computer-delivered intervention. The principles of motivational interviewing suggest that in-person interventions may be effective, in part, because they encourage individuals to think through and speak aloud their motivations for changing a health behavior, which current computer-delivered interventions do not allow. Objective The objective of this study was to take the initial steps toward development of a voice-based computer-delivered intervention that can ask open-ended questions and respond appropriately to users’ verbal responses, more closely mirroring a human-delivered motivational intervention. Methods We developed (1) a voice-based computer-delivered intervention that was run by a human controller and that allowed participants to speak their responses to scripted prompts delivered by speech generation software and (2) a text-based computer-delivered intervention that relied on the mouse, keyboard, and computer screen for all interactions. We randomized 60 heavy drinking college students to interact with the voice-based computer-delivered intervention and 30 to interact with the text-based computer-delivered intervention and compared their ratings of the systems as well as their motivation to change drinking and their drinking behavior at 1-month follow-up. Results Participants reported that the voice-based computer-delivered intervention engaged positively with them in the session and delivered content in a manner consistent with motivational interviewing principles. At 1-month follow-up, participants in the voice-based computer-delivered intervention condition reported significant decreases in quantity, frequency, and problems associated with drinking, and increased

  11. Compliance and quality of life in patients on prescribed voice rest.

    Science.gov (United States)

    Rousseau, Bernard; Cohen, Seth M; Zeller, Amy S; Scearce, Leda; Tritter, Andrew G; Garrett, C Gaelyn

    2011-01-01

    To determine patient compliance with voice rest and the impact of voice rest on quality of life (QOL). Prospective. University hospital. Demographics, self-reported compliance, QOL impact on a 100-mm visual analog scale (VAS), and communication methods were collected from 84 participants from 2 academic voice centers. Of 84 participants, 36.9% were men, 63.1% were women, and 64.3% were singers. The mean age of participants was 47.2 years. The mean duration of voice rest was 8.8 days (range, 3-28), and the median was 7 days. Overall compliance was 34.5%. Postoperative voice rest patients were more compliant than non-postoperative patients (42.4% vs 16.0%, P = .04, χ² test). Voice rest had an impact on QOL (mean ± SD, 68.5 ± 27.7). Voice rest also had a greater impact on singers than nonsingers (mean VAS 77.2 vs 63.6, P = .03, t test) and on those age <60 years than those age ≥ 60 years (mean VAS 74.4 vs 46.7, P < .001, t test). More talkative patients and those with longer periods of voice rest had worse QOL scores (Spearman correlation = 0.35, P = .001 and Spearman correlation = 0.24, P = .03, respectively). Restrictions in personal and social life were noted in 36.9% of patients, 46.4% were unable to work, 44.0% felt frustrated, and 38.1% reported feeling handicapped while on voice rest. Given poor patient compliance and the significant impact of voice rest on QOL, further studies are warranted to examine the efficacy of voice rest and factors that may contribute to patient noncompliance with treatment.

  12. Robotic vehicle uses acoustic sensors for voice detection and diagnostics

    Science.gov (United States)

    Young, Stuart H.; Scanlon, Michael V.

    2000-07-01

    An acoustic sensor array that cues an imaging system on a small tele-operated robotic vehicle was used to detect human voice and activity inside a building. The advantage of acoustic sensing is that it is a non-line-of-sight (NLOS) technology that can augment traditional LOS sensors such as visible and IR cameras. Acoustic energy emitted from a target, such as from a person, weapon, or radio, will travel through walls and smoke, around corners, and down corridors, whereas these obstructions would cripple an imaging detection system. The hardware developed and tested used an array of eight microphones to detect the loudest direction and automatically steer a camera's pan/tilt toward the noise centroid. This type of system has applicability for counter-sniper applications, building clearing, and search/rescue. Data presented will be time-frequency representations showing voice detected within rooms and down hallways at various ranges. Another benefit of acoustics is that it provides the tele-operator some situational awareness clues via low-bandwidth transmission of raw audio data for the operator to interpret with either headphones or through time-frequency analysis. This data can be useful to recognize familiar sounds that might indicate the presence of personnel, such as talking, equipment, movement noise, etc. The same array also detects the sounds of the robot it is mounted on, and can be useful for engine diagnostics and troubleshooting, or for self-noise emanations for stealthy travel. Data presented will characterize vehicle self noise over various surfaces such as tiles, carpets, pavement, sidewalk, and grass. Vehicle diagnostic sounds will indicate a slipping clutch and repeated unexpected application of emergency braking mechanism.
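
The idea of pointing a camera at the "noise centroid" of an eight-microphone array can be sketched as follows. This is not the authors' processing, which the abstract does not describe. It is a simple energy-weighted mean of microphone bearings, where a real array would more likely use time-delay or beamforming methods. The circular layout with microphones every 45 degrees and the synthetic signals are assumptions made for illustration.

```python
import math

def mean_power(samples):
    """Mean squared amplitude of one microphone channel."""
    return sum(s * s for s in samples) / len(samples)

def noise_centroid_bearing(channels, bearings_deg):
    """Energy-weighted circular mean of microphone bearings, in [0, 360)."""
    x = y = 0.0
    for samples, bearing in zip(channels, bearings_deg):
        power = mean_power(samples)
        x += power * math.cos(math.radians(bearing))
        y += power * math.sin(math.radians(bearing))
    return math.degrees(math.atan2(y, x)) % 360.0

# Assumed geometry: eight microphones evenly spaced around a circle.
bearings = [i * 45.0 for i in range(8)]

# Synthetic data: quiet background everywhere, loud source at 90 degrees.
channels = [[0.1, -0.1, 0.1, -0.1] for _ in range(8)]
channels[2] = [1.0, -1.0, 1.0, -1.0]

pan_angle = noise_centroid_bearing(channels, bearings)  # camera pan target
```

Averaging on the unit circle rather than averaging the angles directly avoids the wrap-around problem at 0/360 degrees.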

  13. CONVERSATIONS -- AND NEGOTIATED INTERACTION -- IN TEXT AND VOICE CHAT ROOMS

    Directory of Open Access Journals (Sweden)

    Kevin Jepson

    2005-09-01

    Despite the expanded use of the Internet for language learning and practice, little attention if any has been given to the quality of interaction among English L2 speakers in conversational text or voice chat rooms. This study explored the patterns of repair moves in synchronous non-native speaker (NNS) text chat rooms in comparison to voice chat rooms on the Internet. The following questions were posed: (a) Which types of repair moves occur in text and voice chats; and (b) what are the differences, if any, between the repair moves in text chats and voice chats when time is held constant? Repair moves made by anonymous NNSs in 10, 5-minute, synchronous chat room sessions (5 text-chat sessions, 5 voice-chat sessions) were counted and analyzed using chi-square with alpha set at .05. Significant differences were found between the higher number of total repair moves made in voice chats and the smaller number in text chats. Qualitative data analysis showed that repair work in voice chats was often pronunciation-related. The study includes discussion that may affect teachers' and learners' considerations of the value of NNS chat room interaction for second language development.
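
The comparison of repair-move counts described above can be illustrated with a small chi-square sketch. The abstract reports neither the counts nor the exact form of the test, so the counts below are hypothetical. The goodness-of-fit form with equal expected frequencies is an assumption, motivated by the equal session time in both conditions.

```python
def chi_square_two_counts(count_a, count_b):
    """Goodness-of-fit chi-square for two counts with equal expected values."""
    expected = (count_a + count_b) / 2.0
    return ((count_a - expected) ** 2 + (count_b - expected) ** 2) / expected

# Chi-square critical value for alpha = .05 with 1 degree of freedom.
CRITICAL_05_DF1 = 3.841

# Hypothetical repair-move totals, NOT the study's data.
voice_repairs = 60
text_repairs = 30

statistic = chi_square_two_counts(voice_repairs, text_repairs)
significant = statistic > CRITICAL_05_DF1
```

With these made-up counts the expected value per condition is 45, and the statistic exceeds the critical value, so the difference would be declared significant at the .05 level.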

  14. Comparing the experience of voices in borderline personality disorder with the experience of voices in a psychotic disorder: A systematic review.

    Science.gov (United States)

    Merrett, Zalie; Rossell, Susan L; Castle, David J

    2016-07-01

    In clinical settings, there is substantial evidence both clinically and empirically to suggest that approximately 50% of individuals with borderline personality disorder experience auditory verbal hallucinations. However, there is limited research investigating the phenomenology of these voices. The aim of this study was to review and compare our current understanding of auditory verbal hallucinations in borderline personality disorder with auditory verbal hallucinations in patients with a psychotic disorder, to critically analyse existing studies investigating auditory verbal hallucinations in borderline personality disorder and to identify gaps in current knowledge, which will help direct future research. The literature was searched using the electronic databases Scopus, PubMed and MEDLINE. Relevant studies were included if they were written in English, were empirical studies specifically addressing auditory verbal hallucinations and borderline personality disorder, were peer reviewed, used only adult humans and a sample comprising borderline personality disorder as the primary diagnosis, and included a comparison group with a primary psychotic disorder such as schizophrenia. Our search strategy revealed a total of 16 articles investigating the phenomenology of auditory verbal hallucinations in borderline personality disorder. Some studies provided evidence to suggest that the voice experiences in borderline personality disorder are similar to those experienced by people with schizophrenia, for example, occur inside the head, and often involved persecutory voices. Other studies revealed some differences between schizophrenia and borderline personality disorder voice experiences, with the borderline personality disorder voices sounding more derogatory and self-critical in nature and the voice-hearers' response to the voices were more emotionally resistive. Furthermore, in one study, the schizophrenia group's voices resulted in more disruption in daily functioning

  15. Adductor spasmodic dysphonia: Relationships between acoustic indices and perceptual judgments

    Science.gov (United States)

    Cannito, Michael P.; Sapienza, Christine M.; Woodson, Gayle; Murry, Thomas

    2003-04-01

    This study investigated relationships between acoustical indices of spasmodic dysphonia and perceptual scaling judgments of voice attributes made by expert listeners. Audio-recordings of The Rainbow Passage were obtained from thirty-one speakers with spasmodic dysphonia before and after a BOTOX injection of the vocal folds. Six temporal acoustic measures were obtained across 15 words excerpted from each reading sample, including both frequency of occurrence and percent time for (1) aperiodic phonation, (2) phonation breaks, and (3) fundamental frequency shifts. Visual analog scaling judgments were also obtained from six voice experts using an interactive computer interface to quantify four voice attributes (i.e., overall quality, roughness, brokenness, breathiness) in a carefully psychoacoustically controlled environment, using the same reading passages as stimuli. Number and percent aperiodicity and phonation breaks correlated significantly with perceived overall voice quality, roughness, and brokenness before and after the BOTOX injection. Breathiness was correlated with aperiodicity only prior to injection, while roughness also correlated with frequency shifts following injection. Factor analysis reduced perceived attributes to two principal components: glottal squeezing and breathiness. The acoustic measures demonstrated a strong regression relationship with perceived glottal squeezing, but no regression relationship with breathiness was observed. Implications for an analysis of pathologic voices will be discussed.

  16. The Voice Pump: an Affectively Engaging Interface for Changing Attachments

    DEFF Research Database (Denmark)

    Fritsch, Jonas; Jacobsen, Mogens

    2017-01-01

    In this paper, we present the preliminary results from an ongoing interaction design experiment, the Voice Pump. The Voice Pump is an affectively engaging air-based interface for attuning to the differential qualities of voices in order to change attachments between native Danish speakers and non-native...

  17. ANALYZING THE SPEECH EXPRESSIVENESS USING PROSODIC DYNAMIC CONTROL

    Directory of Open Access Journals (Sweden)

    Valentin Eugen Ghisa

    2018-04-01

    At the level of verbal communication, the prosodic support and emotional space are modelled as a nonlinear system described through parameters extracted from the spectral model of the vocal wave, namely the outline of the fundamental frequency, the time and energy of sonorous segments, the duration of non-acoustic segments and breaks, the voice timbre, etc. Through the discretised addressing of the spectral model, the aim is to optimise the prosodic characteristics extracted from local variations of the fundamental frequency by a method of dynamic control.

  18. Bodies and Voices

    DEFF Research Database (Denmark)

    A wide-ranging collection of essays centred on readings of the body in contemporary literary and socio-anthropological discourse, from slavery and rape to female genital mutilation, from clothing, ocular pornography, voice, deformation and transmutation to the imprisoned, dismembered, remembered...

  19. 47 CFR 90.233 - Base/mobile non-voice operations.

    Science.gov (United States)

    2010-10-01

    ... 47 Telecommunication 5 2010-10-01 2010-10-01 false Base/mobile non-voice operations. 90.233... SERVICES PRIVATE LAND MOBILE RADIO SERVICES Non-Voice and Other Specialized Operations § 90.233 Base/mobile non-voice operations. The use of A1D, A2D, F1D, F2D, G1D, or G2D emission may be authorized to base...

  20. Giving Voice to Emotion: Voice Analysis Technology Uncovering Mental States is Playing a Growing Role in Medicine, Business, and Law Enforcement.

    Science.gov (United States)

    Allen, Summer

    2016-01-01

    It's tough to imagine anything more frustrating than interacting with a call center. Generally, people don't reach out to call centers when they're happy; they're usually trying to get help with a problem or gearing up to do battle over a billing error. Add in an automatic phone tree, and you have a recipe for annoyance. But what if that robotic voice offering you a smorgasbord of numbered choices could tell that you were frustrated and then funnel you to an actual human being? This type of voice analysis technology exists, and it's just one example of the many ways that computers can use your voice to extract information about your mental and emotional state, including information you may not think of as being accessible through your voice alone.

  1. Speaker-Sex Discrimination for Voiced and Whispered Vowels at Short Durations.

    Science.gov (United States)

    Smith, David R R

    2016-01-01

    Whispered vowels, produced with no vocal fold vibration, lack the periodic temporal fine structure which in voiced vowels underlies the perceptual attribute of pitch (a salient auditory cue to speaker sex). Voiced vowels possess no temporal fine structure at very short durations (below two glottal cycles). The prediction was that speaker-sex discrimination performance for whispered and voiced vowels would be similar for very short durations but, as stimulus duration increases, voiced vowel performance would improve relative to whispered vowel performance as pitch information becomes available. This pattern of results was shown for women's but not for men's voices. A whispered vowel needs to have a duration three times longer than a voiced vowel before listeners can reliably tell whether it's spoken by a man or woman (∼30 ms vs. ∼10 ms). Listeners were half as sensitive to information about speaker-sex when it is carried by whispered compared with voiced vowels.

  2. Phonological experience modulates voice discrimination: Evidence from functional brain networks analysis.

    Science.gov (United States)

    Hu, Xueping; Wang, Xiangpeng; Gu, Yan; Luo, Pei; Yin, Shouhang; Wang, Lijun; Fu, Chao; Qiao, Lei; Du, Yi; Chen, Antao

    2017-10-01

    Numerous behavioral studies have found a modulation effect of phonological experience on voice discrimination. However, the neural substrates underpinning this phenomenon are poorly understood. Here we manipulated language familiarity to test the hypothesis that phonological experience affects voice discrimination via mediating the engagement of multiple perceptual and cognitive resources. The results showed that during voice discrimination, the activation of several prefrontal regions was modulated by language familiarity. More importantly, the same effect was observed concerning the functional connectivity from the fronto-parietal network to the voice-identity network (VIN), and from the default mode network to the VIN. Our findings indicate that phonological experience could bias the recruitment of cognitive control and information retrieval/comparison processes during voice discrimination. Therefore, the study unravels the neural substrates subserving the modulation effect of phonological experience on voice discrimination, and provides new insights into studying voice discrimination from the perspective of network interactions. Copyright © 2017. Published by Elsevier Inc.

  3. Temporal control and compensation for perturbed voicing feedback

    DEFF Research Database (Denmark)

    Mitsuya, Takashi; MacDonald, Ewen; Munhall, Kevin G.

    2014-01-01

    Previous research employing a real-time auditory perturbation paradigm has shown that talkers monitor their own speech attributes such as fundamental frequency, vowel intensity, vowel formants, and fricative noise as part of speech motor control. In the case of vowel formants or fricative noise...

  4. A pilot study of the relations within which hearing voices participates: Towards a functional distinction between voice hearers and controls

    NARCIS (Netherlands)

    McEnteggart, C.; Barnes-Holmes, Y.; Egger, J.I.M.; Barnes-Holmes, D.

    2016-01-01

    The current research used the Implicit Relational Assessment Procedure (IRAP) as a preliminary step toward bringing a broad, functional approach to understanding psychosis, by focusing on the specific phenomenon of auditory hallucinations of voices and sounds (often referred to as hearing voices).

  5. A Robust Multimodal Biometric Authentication Scheme with Voice and Face Recognition

    International Nuclear Information System (INIS)

    Kasban, H.

    2017-01-01

    This paper proposes a multimodal biometric scheme for human authentication based on fusion of voice and face recognition. For voice recognition, three categories of features (statistical coefficients, cepstral coefficients and voice timbre) are used and compared. The voice identification modality is carried out using a Gaussian Mixture Model (GMM). For face recognition, three recognition methods (Eigenface, Linear Discriminant Analysis (LDA), and Gabor filter) are used and compared. The combination of the voice and face biometric systems into a single multimodal biometric system is performed using feature fusion and score fusion. This study shows that the best results are obtained using all the features (cepstral coefficients, statistical coefficients and voice timbre features) for voice recognition, the LDA face recognition method, and score fusion for the multimodal biometric system.
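
    As a rough illustration of what score-level fusion can look like, the sketch below min-max normalises two lists of match scores and combines them with a weighted sum. The abstract does not state which fusion rule the authors used, so the weighted-sum rule, the function names and the sample scores are all assumptions made for illustration.

```python
def min_max_normalize(scores):
    """Rescale scores to [0, 1]; a constant list maps to all zeros."""
    lo, hi = min(scores), max(scores)
    if hi == lo:
        return [0.0 for _ in scores]
    return [(s - lo) / (hi - lo) for s in scores]


def fuse_scores(voice_scores, face_scores, voice_weight=0.5):
    """Weighted-sum fusion of two normalised score lists (hypothetical rule)."""
    v = min_max_normalize(voice_scores)
    f = min_max_normalize(face_scores)
    return [voice_weight * a + (1.0 - voice_weight) * b for a, b in zip(v, f)]


# Made-up match scores for three enrolled identities.
fused = fuse_scores([1, 2, 3], [30, 10, 20], voice_weight=0.5)
best_identity = max(range(len(fused)), key=fused.__getitem__)
```

    Normalising first keeps one modality from dominating simply because its raw scores live on a larger scale.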

  6. Fundamentals of Coherent Synchrotron Radiation in Storage Rings

    International Nuclear Information System (INIS)

    Sannibale, F.; Byrd, J.M.; Loftsdottir, A.; Martin, M.C.; Venturini, M.

    2004-01-01

    We present the fundamental concepts for producing stable broadband coherent synchrotron radiation (CSR) in the terahertz frequency region in an electron storage ring. The analysis includes distortion of bunch shape from the synchrotron radiation (SR), enhancing higher frequency coherent emission and limits to stable emission due to a microbunching instability excited by the SR. We use these concepts to optimize the performance of a source for CSR emission

  7. Voice-activated intelligent radiologic image display

    International Nuclear Information System (INIS)

    Fisher, P.

    1989-01-01

    The authors present a computer-based expert system called Mammo-Icon, which automatically assists the radiologist's case analysis by reviewing the trigger phrase output of a commercially available voice transcription system in the domain of mammography. A commercially available PC-based voice dictation system is coupled to an expert system implemented on a microcomputer. Software employs the LISP and C computer languages. Mammo-Icon responds to the trigger phrase output of a voice dictation system with a textual discussion of the potential significance of the findings that have been described and a display of reference images that may help the radiologist to confirm a suspected diagnosis or consider additional diagnoses. This results in automatic availability of potentially useful computer-based expert advice, making such systems much more likely to be used in routine clinical practice

  8. Frequency control modelling - basics

    DEFF Research Database (Denmark)

    Hansen, Anca Daniela; Sørensen, Poul Ejnar; Zeni, Lorenzo

    2016-01-01

    The purpose of this report is to provide an introduction on how the system balance in an island system can be maintained by controlling the frequency. The power balance differential equation, which is fundamental in understanding the effect on the system frequency of the unbalance between...

  9. Engaging with voices: rethinking the clinical treatment of psychosis.

    Science.gov (United States)

    Jones, Nev; Shattell, Mona

    2013-07-01

    Although the hearing voices movement (HVM) has yet to take root in the US to the extent it has in the UK (and parts of Australia and Europe), recent publications and events, including a keynote presentation by UK hearing voices trainer Ron Coleman at the 2012 Annual NAMI convention and a TED 2013 talk in Los Angeles by British voice hearer and psychologist Eleanor Longden, suggest that the tide is starting to turn (Arenella, 2012; Grantham, 2012; Thomas, 2012). At its core, the HVM emphasizes a few basic, but important, points: that antipsychotic pharmacotherapy and various forms of psychotherapy that aim to suppress psychotic experiences are often--for too many people--ineffective or insufficient; that voices and other extreme experiences and beliefs carry important messages that need to be explored rather than silenced, and that voices themselves are often less of the problem than the difficulties individuals have in coping and negotiating with them (Corstens, Escher, & Romme, 2008; Longden, Corstens, Escher, & Romme, 2012; Place, Foxcroft, & Shaw, 2011).

  10. Perception of a Sung Vowel as a Function of Frequency-Modulation Rate and Excursion in Normal-Hearing and Hearing-Impaired Listeners

    DEFF Research Database (Denmark)

    Vatti, Marianna; Santurette, Sébastien; Pontoppidan, Niels Henrik

    2014-01-01

    Purpose: Frequency fluctuations in human voices can usually be described as coherent frequency modulation (FM). As listeners with hearing impairment (HI listeners) are typically less sensitive to FM than listeners with normal hearing (NH listeners), this study investigated whether hearing loss affects the perception of a sung vowel based on FM cues. Method: Vibrato maps were obtained in 14 NH and 12 HI listeners with different degrees of musical experience. The FM rate and FM excursion of a synthesized vowel, to which coherent FM was applied, were adjusted until a singing voice emerged. Results: In NH listeners, adding FM to the steady vowel components produced perception of a singing voice for FM rates between 4.1 and 7.5 Hz and FM excursions between 17 and 83 cents on average. In contrast, HI listeners showed substantially broader vibrato maps. Individual differences in map boundaries were...
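
    Because the excursions are reported in cents, a short conversion sketch may help relate them to frequencies in hertz. It uses the standard definition of 1200 cents per octave. The 220 Hz centre frequency and the assumption that the excursion is a symmetric peak deviation around that centre are illustrative choices, not details from the abstract.

```python
def cents_to_ratio(cents: float) -> float:
    """Frequency ratio for an interval in cents (1200 cents = one octave)."""
    return 2.0 ** (cents / 1200.0)


def fm_extremes(center_hz: float, excursion_cents: float):
    """Lowest and highest instantaneous frequency, assuming the excursion
    is a symmetric peak deviation (in cents) around center_hz."""
    ratio = cents_to_ratio(excursion_cents)
    return center_hz / ratio, center_hz * ratio


# Hypothetical 220 Hz vowel with the two excursion bounds quoted above.
for excursion in (17, 83):
    low, high = fm_extremes(220.0, excursion)
    print(f"{excursion} cents: {low:.1f} Hz to {high:.1f} Hz")
```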

  11. Voice preprocessing system incorporating a real-time spectrum analyzer with programmable switched-capacitor filters

    Science.gov (United States)

    Knapp, G.

    1984-01-01

    As part of a speaker verification program for BISS (Base Installation Security System), a test system is being designed with a flexible preprocessing system for the evaluation of voice spectrum/verification algorithm related problems. The main part of this report covers the design, construction, and testing of a voice analyzer with 16 integrating real-time frequency channels ranging from 300 Hz to 3 kHz. The bandpass filter response of each channel is programmable by NMOS switched capacitor quad filter arrays. Presently, the accuracy of these units is limited to a moderate precision by the finite steps of programming. However, repeatability of characteristics between filter units and sections seems to be excellent for the implemented fourth-order Butterworth bandpass responses. We obtained a 0.1 dB linearity error of signal detection and measured a signal-to-noise ratio of approximately 70 dB. The preprocessing system discussed includes preemphasis filter design, gain normalizer design, and data acquisition system design as well as test results.
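
    To make the channel layout concrete, the sketch below computes band edges for 16 contiguous channels between 300 Hz and 3 kHz. The abstract does not say how the channels are spaced, so the logarithmic spacing and the function name are assumptions. The sketch covers only the band-edge arithmetic, not the switched-capacitor filters themselves.

```python
def log_spaced_bands(f_low: float, f_high: float, n_channels: int):
    """Return (lower, upper) edges in Hz for contiguous bands with equal
    width on a logarithmic frequency axis (assumed spacing)."""
    step = (f_high / f_low) ** (1.0 / n_channels)
    edges = [f_low * step ** i for i in range(n_channels + 1)]
    return list(zip(edges[:-1], edges[1:]))


bands = log_spaced_bands(300.0, 3000.0, 16)
for index, (lower, upper) in enumerate(bands, start=1):
    # The geometric mean is the natural centre of a log-spaced band.
    centre = (lower * upper) ** 0.5
    print(f"channel {index:2d}: {lower:7.1f} to {upper:7.1f} Hz, centre {centre:7.1f} Hz")
```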

  12. Keep Your Voice Sound: How to Prevent and Avoid Voice Problems

    Science.gov (United States)

    ... Wise Choices: Avoid Voice Problems. Drink 6 to 8 glasses of water a day. This helps keep your vocal folds moist and healthy. Limit intake of caffeinated or alcoholic drinks. These can dehydrate your body and make the ...

  13. Original Knowledge, Gender and the Word's Mythology: Voicing the Doctorate

    Science.gov (United States)

    Carter, Susan

    2012-01-01

    Using mythology as a generative matrix, this article investigates the relationship between knowledge, words, embodiment and gender as they play out in academic writing's voice and, in particular, in doctoral voice. The doctoral thesis is defensive, a performance seeking admittance into discipline scholarship. Yet in finding its scholarly voice,…

  14. Educational Technology and Student Voice: Examining Teacher Candidates' Perceptions

    Science.gov (United States)

    Byker, Erik Jon; Putman, S. Michael; Handler, Laura; Polly, Drew

    2017-01-01

    Student Voice is a term that honors the participatory roles that students have when they enter learning spaces like classrooms. Student Voice is the recognition of students' choice, creativity, and freedom. Seminal educationists--like Dewey and Montessori--centered the purposes of education in the flourishing and valuing of Student Voice. This…

  15. Stage Voice Training in the London Schools.

    Science.gov (United States)

    Rubin, Lucille S.

    This report is the result of a six-week study in which the voice training offerings at four schools of drama in London were examined using interviews of teachers and directors, observation of voice classes, and attendance at studio presentations and public performances. The report covers such topics as: textbooks and references being used; courses…

  16. Inspired Leadership: Engaging the Voice and Embodying Advocacy

    OpenAIRE

    Jacobs, Kamra Angelica

    2017-01-01

    The journey of finding my voice has forced me to show up and be seen in my work. I silenced my own voice at a dehumanizing call center, as a faceless target for frustrated customers. I discovered the power of connection by embodying advocacy and engaging my voice and body in my work. Primarily, I listen to my gut and trust my intuition. Secondly, I advocate by speaking up for those who cannot advocate for themselves. During the Streamers production process, when I felt the twinge in my gut,...

  17. A FRAMEWORK FOR INTELLIGENT VOICE-ENABLED E-EDUCATION SYSTEMS

    Directory of Open Access Journals (Sweden)

    Azeta A. A.

    2009-07-01

    Although the Internet has received significant attention in recent years, voice is still the most convenient and natural way of communicating between humans or between humans and computers. In voice applications, users may have different needs, which require the system to reason, make decisions, be flexible and adapt to requests during interaction. These needs have placed new requirements on voice application development, such as the use of advanced models, techniques and methodologies which take into account the needs of different users and environments. The ability of a system to behave close to human reasoning is often mentioned as one of the major requirements for the development of voice applications. In this paper, we present a framework for an intelligent voice-enabled e-Education application and an adaptation of the framework for the development of a prototype Course Registration and Examination (CourseRegExamOnline) module. This study is a preliminary report of an ongoing e-Education project containing the following modules: enrollment, course registration and examination, enquiries/information, messaging/collaboration, e-Learning and library. The CourseRegExamOnline module was developed using VoiceXML for the voice user interface (VUI), PHP for the web user interface (WUI), Apache as the middleware and a MySQL database as the back-end. The system would offer dual access modes using the VUI and WUI. The framework would serve as a reference model for developing voice-based e-Education applications. The e-Education system, when fully developed, would meet the needs of students who are normal users and those with certain forms of disabilities, such as visual impairment, repetitive strain injury (RSI), etc., that make reading and writing difficult.

  18. Lactobacilli : Important in biofilm formation on voice prostheses

    NARCIS (Netherlands)

    Buijssen, Kevin J. D. A.; Harmsen, Hermie J. M.; van der Mei, Henny C.; Busscher, Henk J.; van der Laan, Bernard F. A. M.

    OBJECTIVE: We sought to identify bacterial strains responsible for biofilm formation on silicone rubber voice prostheses. STUDY DESIGN: We conducted an analysis of the bacterial population in biofilms on used silicone rubber voice prostheses by using new microbiological methods. METHODS: Two

  19. Combined Functional Voice Therapy in Singers With Muscle Tension Dysphonia in Singing.

    Science.gov (United States)

    Sielska-Badurek, Ewelina; Osuch-Wójcikiewicz, Ewa; Sobol, Maria; Kazanecka, Ewa; Rzepakowska, Anna; Niemczyk, Kazimierz

    2017-07-01

    The purpose of this study was to evaluate vocal tract function and the voice quality in singers with muscle tension dysphonia (MTD) after undergoing combined functional voice therapy of the singing voice. This is a prospective, randomized study. Forty singers (29 females and 11 males, mean age: 24.6 ± 8.8 years) with MTD were enrolled in the study. The study group consisted of 20 singers who underwent combined functional voice therapy (10-15 individual sessions, 30-40 minutes each). Singers who did not opt for vocal rehabilitation made up the control group. Effects of rehabilitation were assessed with videolaryngostroboscopy, palpation of the vocal tract structures, flexible fiberoptic evaluation of the pharynx and the larynx, perceptual speaking and singing voice assessment, acoustic analysis, maximal phonation time, and the Voice Handicap Index. After combined functional voice therapy in the study group, great improvement was noticed in palpation of the vocal tract structures (P singing range obtained from acoustic analysis of glissando (P singing. Development of palpation and perceptual singing voice examination protocols enables one to compare results before and after rehabilitation in clinics. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  20. Using Hierarchical Time Series Clustering Algorithm and Wavelet Classifier for Biometric Voice Classification

    Directory of Open Access Journals (Sweden)

    Simon Fong

    2012-01-01

    Voice biometrics has a long history in biosecurity applications such as verification and identification based on characteristics of the human voice. Another application, voice classification, which plays an important role in grouping unlabelled voice samples, has not been widely studied. Lately, voice classification has been found useful in phone monitoring, in classifying speakers' gender, ethnicity and emotional states, and so forth. In this paper, a collection of computational algorithms is proposed to support voice classification; the algorithms are a combination of hierarchical clustering, dynamic time warp transform, discrete wavelet transform, and decision tree. The proposed algorithms are relatively more transparent and interpretable than the existing ones, though many techniques such as Artificial Neural Networks, Support Vector Machine, and Hidden Markov Model (which inherently function like a black box) have been applied for voice verification and voice identification. Two datasets, one generated synthetically and the other collected empirically from a past voice recognition experiment, are used to verify and demonstrate the effectiveness of our proposed voice classification algorithm.
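
    One of the listed building blocks is a time-warping step, read here as classic dynamic time warping (DTW). The sketch below is the textbook dynamic-programming form of DTW with an absolute-difference local cost. It is not the authors' implementation, and the sample sequences are made up.

```python
import math


def dtw_distance(a, b):
    """Textbook DTW distance between two numeric sequences."""
    n, m = len(a), len(b)
    # cost[i][j] holds the best alignment cost of a[:i] against b[:j].
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances, b repeats its sample
                cost[i][j - 1],      # b advances, a repeats its sample
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


# A time-stretched copy aligns at zero cost, unlike a sample-by-sample comparison.
stretched = dtw_distance([1, 2, 3], [1, 2, 2, 3])
```

    A pairwise distance of this kind is one possible input to hierarchical clustering of time series, since it tolerates differences in speaking rate between samples.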

  1. Type 3 Thyroplasty for a Patient with Female-to-Male Gender Identity Disorder

    OpenAIRE

    Yu Saito; Kazuhiro Nakamura; Shigeto Itani; Kiyoaki Tsukahara

    2018-01-01

    Objective. In most patients with female-to-male gender identity disorder (FTM/GID), hormone therapy lowers the pitch of the voice. In successful cases, there is no need for phonosurgery. However, hormone therapy is not effective in some cases. We perform type 3 thyroplasty (TP3) in these cases. Method. Hormone therapy was started in 2008 but did not lower the speaking fundamental frequencies (SFFs). We therefore performed TP3 under local anesthesia. Results. In our case, the...

  2. Voice disorders in teachers and their associations with work-related factors: a systematic review.

    Science.gov (United States)

    Cantor Cutiva, Lady Catherine; Vogel, Ineke; Burdorf, Alex

    2013-01-01

    To provide a quantitative assessment of the occurrence of voice disorders among teachers and to identify associated work-related and individual factors in the teaching profession. A systematic review was conducted using three computerized databases on the occurrence of voice disorders among teachers and their associations with work-related and individual factors. Some of the keywords used were: "teacher", "voice disorder", "voice problem", and "dysphonia". Information regarding the occurrence of voice disorders and associations between work-related and individual factors and voice disorders were extracted from each paper. Occurrence and associations were expressed in prevalence and odds ratios, respectively. In total, 23 publications met the criteria for inclusion. All publications were cross-sectional studies. Prevalence estimates varied widely, reflecting disparity in definitions of "voice problem". Teachers had a significantly increased occurrence of voice disorders compared to other occupations. Several work-related and individual factors were consistently associated with voice disorders, most notably high levels of noise in classrooms, being a physical education instructor, and habitual use of a loud speaking voice. This review shows that teachers report voice disorders more often than non-teachers. Various work-related and individual factors are associated with reported voice disorders. Longitudinal studies are urgently required to get more insight into the development of voice disorders, their work-related determinants, and the consequences of these voice disorders for functioning and work performance among teachers. Describe the occurrence of voice disorders among teachers. Identify some work-related factors of voice disorders among teachers. Interpret the quality of the publications to describe or analyze the relationship between working conditions and voice disorders among teachers. Copyright © 2013 Elsevier Inc. All rights reserved.

  3. Functional outcome of vocal fold medialization thyroplasty with a hydroxyapatite implant.

    Science.gov (United States)

    Storck, Claudio; Brockmann, Meike; Schnellmann, Elvira; Stoeckli, Sandro J; Schmid, Stephan

    2007-06-01

    Unilateral vocal fold paralysis can cause a persistent incomplete glottal closure during phonation, resulting in impaired voice function. The aim of this study was to evaluate functional results of medialization thyroplasty using a hydroxyapatite implant (VoCoM). Prospective observational cohort study. Between 1999 and 2003, a total of 26 patients (19 men, 7 women) undergoing medialization thyroplasty using a hydroxyapatite implant because of unilateral vocal fold paralysis were enrolled in the study. To evaluate voice function, the following parameters were measured preoperatively and postoperatively: mean fundamental frequency, mean sound pressure level, frequency and amplitude range (voice range profile), and maximum phonation time. A perceptual assessment of hoarseness was conducted using the Roughness, Breathiness, Hoarseness scale. Furthermore, the magnitude of voice related impairment of the patient's communication skills was rated on a 7-point scale. A combined parameter called the Voice Dysfunction Index (VDI) was used to rate vocal performance. All patients showed a statistically significant improvement in the VDI, in perceptual voice analysis, in maximum phonation time, and in the dynamic range of voice. One patient experienced a postoperative wound hemorrhage as a minor complication. No further complications or implant extrusions were observed. Medialization thyroplasty using a hydroxyapatite implant is a secure and efficient phonosurgical procedure. Voice quality and patient satisfaction improve significantly after treatment.

  4. Predicting Voice Disorder Status From Smoothed Measures of Cepstral Peak Prominence Using Praat and Analysis of Dysphonia in Speech and Voice (ADSV).

    Science.gov (United States)

    Sauder, Cara; Bretl, Michelle; Eadie, Tanya

    2017-09-01

    The purposes of this study were to (1) determine and compare the diagnostic accuracy of a single acoustic measure, smoothed cepstral peak prominence (CPPS), to predict voice disorder status from connected speech samples using two software systems: Analysis of Dysphonia in Speech and Voice (ADSV) and Praat; and (2) to determine the relationship between measures of CPPS generated from these programs. This is a retrospective cross-sectional study. Measures of CPPS were obtained from connected speech recordings of 100 subjects with voice disorders and 70 nondysphonic subjects without vocal complaints using commercially available ADSV and freely downloadable Praat software programs. Logistic regression and receiver operating characteristic (ROC) analyses were used to evaluate and compare the diagnostic accuracy of CPPS measures. Relationships between CPPS measures from the programs were determined. Results showed acceptable overall accuracy rates (75% accuracy, ADSV; 82% accuracy, Praat) and area under the ROC curves (area under the curve [AUC] = 0.81, ADSV; AUC = 0.91, Praat) for predicting voice disorder status, with slight differences in sensitivity and specificity. CPPS measures derived from Praat were uniquely predictive of disorder status above and beyond CPPS measures from ADSV (χ²(1) = 40.71, P disorder status using either program. Clinicians may consider using CPPS to complement clinical voice evaluation and screening protocols. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
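
    For readers unfamiliar with the AUC figures quoted above, the sketch below computes the area under the ROC curve through its rank interpretation: the probability that a randomly chosen positive case scores higher than a randomly chosen negative case, with ties counted as one half. The scores are made-up values, not data from the study, and they are assumed to be oriented so that a larger score means "more likely disordered".

```python
def roc_auc(scores_pos, scores_neg):
    """AUC as the fraction of (positive, negative) pairs ranked correctly,
    counting ties as half a correct pair."""
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


# Hypothetical classifier scores for disordered and non-disordered speakers.
auc = roc_auc([0.9, 0.7, 0.6, 0.4], [0.5, 0.3, 0.2])
```

    The pairwise form is quadratic in the number of cases, which is fine for samples of this size. A rank-based formula gives the same value more efficiently for large datasets.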

  5. Sustainable Consumer Voices

    DEFF Research Database (Denmark)

    Klitmøller, Anders; Rask, Morten; Jensen, Nevena

    2011-01-01

    Aiming to explore how user driven innovation can inform high level design strategies, an in-depth empirical study was carried out, based on data from 50 observations of private vehicle users. This paper reports the resulting 5 consumer voices: Technology Enthusiast, Environmentalist, Design Lover...

  6. Voice problems among Slovenian physicians compared to the teachers: Prevalence and risk factors

    Directory of Open Access Journals (Sweden)

    Maja Šereh Bahar

    2012-09-01

    Conclusions: The prevalence of voice disorders among outpatients’ physicians in Slovenia is high and is comparable to the incidence of voice problems in Slovenian teachers. URI is the most common cause of these voice problems. GERD, allergies and an age over 40 years were stated as the risk factors for voice disorders. In order to reduce the extent of voice problems, lessons on vocal hygiene, and additional information about diseases causing voice disorders should be included in their postgraduate education.

  7. Representing Voices from the Life-World in Evidence-Based Practice

    Science.gov (United States)

    Kovarsky, Dana

    2008-01-01

    Background: Current models of evidence-based practice marginalize and even silence the voices of those who are the potential beneficiaries of assessment and intervention. These missing voices can be found in the reflections of clients on their own life-world experiences. Aims: This paper examines how voices from the life-world are silenced in…

  8. Some objective measures indicative of perceived voice robustness in student teachers.

    Science.gov (United States)

    Orr, Rosemary; de Jong, Felix; Cranen, Bert

    2002-01-01

    One of the problems confronted in the teaching profession is the maintenance of a healthy voice. This basic pedagogical tool is subjected to extensive use, and frequently suffers from overload, with some teachers having to give up their profession altogether. In some teacher training schools, it is the current practice to examine the student's voice, and to refer any perceived susceptibility to strain to voice specialists. For this study, a group of vocally healthy students were examined first at the teacher training schools, and then at the ENT clinic at the University Hospital of Nijmegen. The aim was to predict whether the subject's voice might be at risk for occupational dysphonia as a result of the vocal load of the teaching profession. We tried to find objective measures of voice quality in student teachers, used in current clinical practice, which reflect the judgements of the therapists and phoniatricians. We tried to explain such measures physiologically in terms of robustness of, and control over voicing. Objective measures used included video-laryngostroboscopy, phonetography and spectrography. Maximum phonation time, melodic range in conjunction with maximum intensity range, and the production of soft voice are suggested as possible predictive parameters for the risk of occupational voice strain.

  9. Flow Glottogram and Subglottal Pressure Relationship in Singers and Untrained Voices.

    Science.gov (United States)

    Sundberg, Johan

    2018-01-01

    This article combines results from three earlier investigations of the glottal voice source during phonation at varying degrees of vocal loudness (1) in five classically trained baritone singers (Sundberg et al., 1999), (2) in 15 female and 14 male untrained voices (Sundberg et al., 2005), and (3) in voices rated as hyperfunctional by an expert panel (Millgård et al., 2015). Voice source data were obtained by inverse filtering. Associated subglottal pressures were estimated from oral pressure during the occlusion for the consonant /p/. Five flow glottogram parameters, (1) maximum flow declination rate (MFDR), (2) peak-to-peak pulse amplitude, (3) level difference between the first and the second harmonics of the voice source, (4) closed quotient, and (5) normalized amplitude quotient, were averaged across the singer subjects and related to associated MFDR values. Strong, quantitative relations, expressed as equations, are found between subglottal pressure and MFDR and between MFDR and each of the other flow glottogram parameters. The values for the untrained voices, as well as those for the voices rated as hyperfunctional, deviate systematically from the values derived from the equations. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  10. The Supercontinuum Laser Source Fundamentals with Updated References

    CERN Document Server

    Alfano, Robert R

    2006-01-01

    Photonics and nonlinear optics are important areas of science, engineering and technology. One of the most important ultrafast nonlinear optical processes is the supercontinuum (SC): the production of intense white-light pulses covering the UV, visible, NIR, MIR, and IR. It is produced using ultrashort laser pulses (ps/fs) to generate an ultrabroad band of frequencies. This book covers the fundamental principles and surveys the research of current thinkers and experts in the field, with updated references to the key breakthroughs of the past decade and a half. The applications of SC include time-resolved pump-SC probe absorption and excitation spectroscopy for fundamental processes in chemistry, biology and physics; optical coherence tomography; ultrashort pulse generation in the femtosecond and attosecond regions; frequency clocks; phase stabilization; optical communication; atmospheric science; lightning control; optical medical imaging; biological cell imaging; and metrology standards.

  11. Voice and choice in health care in England: understanding citizen responses to dissatisfaction.

    Science.gov (United States)

    Dowding, Keith; John, Peter

    2011-01-01

    Using data from a five-year online survey, the paper examines the effects of relative satisfaction with health services on individuals' voice-and-choice activity in the English public health care system. Voice is considered in three parts: individual voice (complaints), collective voice (voting), and participation (collective action). Exercising choice is seen in terms of complete exit (not using health care), internal exit (choosing another public service provider) and private exit (using private health care). The interaction of satisfaction with forms of voice and choice is analysed over time. Both voice and choice are correlated with dissatisfaction, with those who are unhappy with the NHS more likely to voice privately and to plan to take up private health care. Those unable to choose private provision are likely to use private voice. These factors are not affected by items associated with social capital; indeed, being more trusting leads to lower voice activity.

  12. Challenging stereotyping and bias: a voice simulation study.

    Science.gov (United States)

    Dearing, Karen S; Steadman, Sheryl

    2008-02-01

    Stigma is a barrier to mental health care access for patients with schizophrenia and can interfere with developing therapeutic relationships. This study demonstrates the success of a voice simulation experience during orientation in changing the biases of nursing students, and its effect on the development of the nurse-patient relationship. Ninety-four individuals participated; 52 received a voice simulation experience during orientation, and 42 received orientation with no voice simulation experience. The Medical Condition Regard Scale was administered before and after orientation. Posttest paired t test results show significant differences in attitudes toward patients with voice hearing experiences between the two groups. The themes of personal growth from the postorientation focus groups include Affective Experience, Physical Experience, and Empathy. Findings demonstrate that the orientation process should include methods to challenge stereotyping and bias to decrease stigma, improve service access, and enhance the ability to develop therapeutic relationships.

  13. Lax Vox as a Voice Training Program for Teachers: A Pilot Study.

    Science.gov (United States)

    Mailänder, Eva; Mühre, Lea; Barsties, Ben

    2017-03-01

    The objective of this study was to explore the effectiveness of a 3-week training program with the voice therapy "Lax Vox" for teachers. Four healthy female teachers participated as volunteers for the study. Several voice measurements of perception, acoustics, aerodynamics, and self-evaluation were investigated. Furthermore, a survey to rate the applicability of Lax Vox was also part of the study. To assess the treatment effects of the Lax Vox training, an effect size analysis (d_unb) was conducted. After 3 weeks of training, medium and large improvements were found in some parameters of perceptual and acoustic voice quality assessments (d_unb > 0.50 and d_unb > 0.80, respectively). Furthermore, medium improvements were revealed in some parameters of self-evaluation (ie, physical and total scale of the Voice Handicap Index) and aerodynamic (ie, maximum phonation time) assessments (all d_unb > 0.50). Additionally, acoustic measures of vocal function showed an expansion in the upper contour of voice range profiles after training. In particular, the main improvements in the voice range profile were found in the modal register and the beginning of the falsetto register. There was an increase in intensity levels of about 4.6 dB. No changes were revealed in some acoustic measures of the voice range profile, self-evaluation measurements, and the perception of breathy voice quality (all d_unb [...] teachers appears to improve select measures of voice quality, maximum phonation time, vocal function, self-evaluation, and perceived applicability. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
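
    The abstract reports effect sizes as d_unb but does not give the formula. As a minimal sketch, the Python below assumes a common definition: Cohen's d with a pooled standard deviation, multiplied by the approximate small-sample correction 1 - 3/(4*df - 1). The function names, the two-sample layout, and the example data are hypothetical; only the 0.50 and 0.80 cut-offs come from the abstract.

```python
from math import sqrt
from statistics import mean, variance


def d_unbiased(before, after):
    """Bias-corrected standardized mean difference (assumed definition)."""
    n1, n2 = len(before), len(after)
    df = n1 + n2 - 2
    # Pooled standard deviation from the two sample variances.
    pooled_sd = sqrt(((n1 - 1) * variance(before) + (n2 - 1) * variance(after)) / df)
    d = (mean(after) - mean(before)) / pooled_sd
    # Approximate small-sample correction factor.
    return d * (1 - 3 / (4 * df - 1))


def label(d_unb):
    """Classify an effect size using the cut-offs quoted in the abstract."""
    size = abs(d_unb)
    if size > 0.80:
        return "large"
    if size > 0.50:
        return "medium"
    return "below medium"


# Made-up scores for four participants, for illustration only.
effect = d_unbiased([1, 2, 3, 4], [2, 3, 4, 5])
print(round(effect, 3), label(effect))
```

    With only four participants the correction is substantial, which is one reason a bias-corrected statistic is a natural choice for a pilot study of this size.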

  14. Speaker-Sex Discrimination for Voiced and Whispered Vowels at Short Durations

    OpenAIRE

    Smith, David R. R.

    2016-01-01

    Whispered vowels, produced with no vocal fold vibration, lack the periodic temporal fine structure which in voiced vowels underlies the perceptual attribute of pitch (a salient auditory cue to speaker sex). Voiced vowels possess no temporal fine structure at very short durations (below two glottal cycles). The prediction was that speaker-sex discrimination performance for whispered and voiced vowels would be similar for very short durations but, as stimulus duration increases, voiced vowel pe...

  15. Intra-oral pressure-based voicing control of electrolaryngeal speech with intra-oral vibrator.

    Science.gov (United States)

    Takahashi, Hirokazu; Nakao, Masayuki; Kikuchi, Yataro; Kaga, Kimitaka

    2008-07-01

    In normal speech, coordinated activity of the intrinsic laryngeal muscles suspends the glottal sound when voiceless consonants are uttered, automatically realizing voicing control. In electrolaryngeal speech, however, the lack of voicing control is one of the causes of unclear voice, with voiceless consonants tending to be misheard as the corresponding voiced consonants. In the present work, we developed an intra-oral vibrator with an intra-oral pressure sensor that detected the utterance of voiceless phonemes during intra-oral electrolaryngeal speech, and demonstrated that intra-oral pressure-based voicing control could improve the intelligibility of the speech. The test voices were obtained from one electrolaryngeal speaker and one normal speaker. We first used speech analysis software to investigate how the voice onset time (VOT) and first formant (F1) transition of the test consonant-vowel syllables contributed to voiceless/voiced contrasts, and developed an adequate voicing control strategy. We then compared the intelligibility of consonant-vowel syllables in intra-oral electrolaryngeal speech with and without online voicing control. The increase in intra-oral pressure, typically with a peak ranging from 10 to 50 gf/cm2, could reliably identify the utterance of voiceless consonants. The speech analysis and intelligibility test then demonstrated that a short VOT caused the misidentification of the voiced consonants due to a clear F1 transition. Finally, taking these results together, the online voicing control, which suspended the prosthetic tone while the intra-oral pressure exceeded 2.5 gf/cm2 and during the 35 milliseconds that followed, proved effective in improving the voiceless/voiced contrast.
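
    The control rule described at the end of this abstract (suspend the tone while pressure exceeds 2.5 gf/cm2 and for the following 35 ms) can be sketched as a simple gate over sampled pressure values. This is an illustration under stated assumptions, not the authors' implementation: the function name, the fixed sample period, and the sample data are hypothetical; only the two constants come from the abstract.

```python
from typing import Iterable, List

THRESHOLD_GF_CM2 = 2.5  # pressure threshold quoted in the abstract
HOLD_MS = 35.0          # hold time after the pressure drops, from the abstract


def tone_enabled(pressures: Iterable[float], sample_period_ms: float,
                 threshold: float = THRESHOLD_GF_CM2,
                 hold_ms: float = HOLD_MS) -> List[bool]:
    """Return, per sample, whether the prosthetic tone should be on."""
    out = []
    ms_since_above = None  # None until the pressure first exceeds the threshold
    for p in pressures:
        if p > threshold:
            ms_since_above = 0.0
        elif ms_since_above is not None:
            ms_since_above += sample_period_ms
        # Tone is suspended while above threshold and during the hold window.
        suspended = ms_since_above is not None and ms_since_above <= hold_ms
        out.append(not suspended)
    return out


# Hypothetical pressure trace sampled every 10 ms (values in gf/cm2).
print(tone_enabled([0, 3, 0, 0, 0, 0, 0], sample_period_ms=10.0))
```

    In this sketch the hold window restarts whenever the pressure exceeds the threshold again, so closely spaced pressure peaks keep the tone suspended continuously.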

  16. Voice Over Internet Protocol Testbed Design for Non-Intrusive, Objective Voice Quality Assessment

    National Research Council Canada - National Science Library

    Manka, David L

    2007-01-01

    Voice over Internet Protocol (VoIP) is an emerging technology with the potential to assist the United States Marine Corps in solving communication challenges stemming from modern operational concepts...

  17. CAMAC programmable-control frequency synthesizer

    International Nuclear Information System (INIS)

    Yumaguzin, T.Kh.; Vyazovkin, D.E.; Nazirov, Eh.P.; Tuktarov, R.F.

    1989-01-01

    The synthesizer allows the frequency to be set with 0.015% accuracy and to be scanned with a variable step. It uses a frequency-controlled divider, with the divided frequency then summed with the fundamental one; this has made it possible to use digit of the input code and to obtain a 3-4 MHz frequency range. The operating scheme can be varied for other frequency ranges. K-155 and K-531 series microcircuits were used in the development.

  18. Finite element modelling of vocal tract changes after voice therapy

    Czech Academy of Sciences Publication Activity Database

    Vampola, T.; Laukkanen, A. M.; Horáček, Jaromír; Švec, J. G.

    2011-01-01

    Vol. 5, No. 1 (2011), pp. 77-88. ISSN 1802-680X. R&D Projects: GA ČR GA101/08/1155. Institutional research plan: CEZ:AV0Z20760514. Keywords: biomechanics of human voice * voice production modelling * vocal exercising * voice training. Subject RIV: BI - Acoustics. http://www.kme.zcu.cz/acm/index.php/acm/article/view/138

  19. Fundamentals of electromagnetics 2 quasistatics and waves

    CERN Document Server

    Voltmer, David

    2007-01-01

    This book is the second of two volumes which have been created to provide an understanding of the basic principles and applications of electromagnetic fields for electrical engineering students. Fundamentals of Electromagnetics Vol 2: Quasistatics and Waves examines how the low-frequency models of lumped elements are modified to include parasitic elements. For even higher frequencies, wave behavior in space and on transmission lines is explained. Finally, the textbook concludes with details of transmission line properties and applications. Upon completion of this book and its companion Fundame

  20. [Comparison of vocal tract discomfort scale results with objective and instrumental phoniatric parameters among teacher rehabilitees from voice disorders].

    Science.gov (United States)

    Woźnicka, Ewelina; Niebudek-Bogusz, Ewa; Wiktorowicz, Justyna; Sliwińska-Kowalska, Mariola

    2013-01-01

    Voice self-assessment, one of the elements of a comprehensive evaluation of voice disorders, plays a major role in the diagnostic and therapeutic procedures for occupational dysphonia. The aim of the study was to assess the applicability of the Vocal Tract Discomfort (VTD) scale for monitoring the effectiveness of voice rehabilitation and to compare the VTD results with objective and instrumental methods of phoniatric diagnosis. The study included 55 teachers (mean age, 47.2) with occupational dysphonia. A comprehensive diagnosis took into account self-assessment on the VTD scale, phoniatric examination including laryngovideostroboscopy (LVSS), and objective measurement of an aerodynamic parameter, the maximum phonation time (MPT). After 4 months of intense rehabilitation, a post-therapy examination was performed using the methods specified above. After the treatment, a significant improvement was obtained in the subjective symptoms measured on the VTD scale, assessed for both the frequency (p < 0.001) and the severity (p < 0.001) subscales. Positive effects of the therapy were also observed for the parameters evaluated in the phoniatric study (p [...] dysphonia.