WorldWideScience

Sample records for vocal inflection speech

  1. Feeling backwards? How temporal order in speech affects the time course of vocal emotion recognition

    Directory of Open Access Journals (Sweden)

    Simon eRigoulot

    2013-06-01

    Full Text Available Recent studies suggest that the time course for recognizing vocal expressions of basic emotion in speech varies significantly by emotion type, implying that listeners uncover acoustic evidence about emotions at different rates in speech (e.g., fear is recognized most quickly whereas happiness and disgust are recognized relatively slowly; Pell and Kotz, 2011). To investigate whether vocal emotion recognition is largely dictated by the amount of time listeners are exposed to speech or the position of critical emotional cues in the utterance, 40 English participants judged the meaning of emotionally-inflected pseudo-utterances presented in a gating paradigm, where utterances were gated as a function of their syllable structure in segments of increasing duration from the end of the utterance (i.e., gated ‘backwards’). Accuracy for detecting six target emotions in each gate condition and the mean identification point for each emotion in milliseconds were analyzed and compared to results from Pell & Kotz (2011). We again found significant emotion-specific differences in the time needed to accurately recognize emotions from speech prosody, and new evidence that utterance-final syllables tended to facilitate listeners’ accuracy in many conditions when compared to utterance-initial syllables. The time needed to recognize fear, anger, sadness, and neutral from speech cues was not influenced by how utterances were gated, although happiness and disgust were recognized significantly faster when listeners heard the end of utterances first. Our data provide new clues about the relative time course for recognizing vocally-expressed emotions within the 400-1200 millisecond time window, while highlighting that emotion recognition from prosody can be shaped by the temporal properties of speech.

  2. Vocal Tract Representation in the Recognition of Cerebral Palsied Speech

    Science.gov (United States)

    Rudzicz, Frank; Hirst, Graeme; van Lieshout, Pascal

    2012-01-01

    Purpose: In this study, the authors explored articulatory information as a means of improving the recognition of dysarthric speech by machine. Method: Data were derived chiefly from the TORGO database of dysarthric articulation (Rudzicz, Namasivayam, & Wolff, 2011) in which motions of various points in the vocal tract are measured during speech.…

  3. Language modeling for automatic speech recognition of inflective languages an applications-oriented approach using lexical data

    CERN Document Server

    Donaj, Gregor

    2017-01-01

    This book covers language modeling and automatic speech recognition for inflective languages (e.g. Slavic languages), which represent roughly half of the languages spoken in Europe. These languages do not perform as well as English in speech recognition systems and it is therefore harder to develop an application with sufficient quality for the end user. The authors describe the most important language features for the development of a speech recognition system. This is then presented through the analysis of errors in the system and the development of language models and their inclusion in speech recognition systems, which specifically address the errors that are relevant for targeted applications. The error analysis is done with regard to morphological characteristics of the word in the recognized sentences. The book is oriented towards speech recognition with large vocabularies and continuous and even spontaneous speech. Today such applications work with a rather small number of languages compared to the nu...

  4. Control of vocal-tract length in speech

    Energy Technology Data Exchange (ETDEWEB)

    Riordan, C.J.

    1977-10-01

    Essential for the correct production of vowels is the accurate control of vocal-tract length. Perkell (Psychology of Speech Production (MIT, Cambridge, MA, 1969)) has suggested that two important determinants of vocal-tract length are vertical larynx position and lip spreading/protrusion, often acting together. The present study was designed to determine whether constraining lip spreading/protrusion induces compensatory vertical larynx displacements, particularly on rounded vowels. Upper lip and larynx movement were monitored photoelectrically while French and Mandarin native speakers produced the vowels /i,y,u/ first under normal-speech conditions and then with lip activity constrained. Significant differences were found in upper-lip protrusion and larynx position depending on the vowel uttered. Moreover, the generally low-larynx position of rounded vowels became even lower when lip protrusion was constrained. These results imply that compensatory articulations contribute to a contrast-preserving strategy in speech production.

  5. Modelling vocal anatomy's significant effect on speech

    NARCIS (Netherlands)

    de Boer, B.

    2010-01-01

    This paper investigates the effect of larynx position on the articulatory abilities of a humanlike vocal tract. Previous work has investigated models that were built to resemble the anatomy of existing species or fossil ancestors. This has led to conflicting conclusions about the relation between

  6. Dynamic 3-D visualization of vocal tract shaping during speech.

    Science.gov (United States)

    Zhu, Yinghua; Kim, Yoon-Chul; Proctor, Michael I; Narayanan, Shrikanth S; Nayak, Krishna S

    2013-05-01

    Noninvasive imaging is widely used in speech research as a means to investigate the shaping and dynamics of the vocal tract during speech production. 3-D dynamic MRI would be a major advance, as it would provide 3-D dynamic visualization of the entire vocal tract. We present a novel method for the creation of 3-D dynamic movies of vocal tract shaping based on the acquisition of 2-D dynamic data from parallel slices and temporal alignment of the image sequences using audio information. Multiple sagittal 2-D real-time movies with synchronized audio recordings are acquired for English vowel-consonant-vowel stimuli /ala/, /a.ιa/, /asa/, and /aʃa/. Audio data are aligned using mel-frequency cepstral coefficients (MFCC) extracted from windowed intervals of the speech signal. Sagittal image sequences acquired from all slices are then aligned using dynamic time warping (DTW). The aligned image sequences enable dynamic 3-D visualization by creating synthesized movies of the moving airway in the coronal planes, visualizing desired tissue surfaces and tube-shaped vocal tract airway after manual segmentation of targeted articulators and smoothing. The resulting volumes allow for dynamic 3-D visualization of salient aspects of lingual articulation, including the formation of tongue grooves and sublingual cavities, with a temporal resolution of 78 ms.
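
    The audio-based alignment described above (MFCCs computed from the audio recorded with each slice, then dynamic time warping to map frames across acquisitions) can be sketched as follows. This is a minimal illustration assuming librosa for MFCC extraction and DTW; the file names, sampling rate and frame settings are hypothetical, not the authors' parameters.

```python
# Hedged sketch of MFCC extraction plus DTW alignment of two audio recordings.
import librosa

def mfcc_features(path, sr=16000, n_mfcc=13, hop_length=160):
    """Load an audio file and return an (n_mfcc, n_frames) MFCC matrix."""
    y, sr = librosa.load(path, sr=sr)
    return librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc, hop_length=hop_length)

def align_audio(reference_wav, slice_wav):
    """Return (reference_frame, slice_frame) index pairs from a DTW alignment."""
    X = mfcc_features(reference_wav)
    Y = mfcc_features(slice_wav)
    D, wp = librosa.sequence.dtw(X=X, Y=Y, metric="euclidean")
    return wp[::-1]  # warping path in forward time order

# Hypothetical usage: map frames of one slice's recording onto a reference slice.
# path = align_audio("slice_reference.wav", "slice_03.wav")
```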

  7. Vocal tract resonances in speech, singing, and playing musical instruments.

    Science.gov (United States)

    Wolfe, Joe; Garnier, Maëva; Smith, John

    2009-01-01

    In both the voice and musical wind instruments, a valve (vocal folds, lips, or reed) lies between an upstream and a downstream duct: trachea and vocal tract for the voice; vocal tract and bore for the instrument. Examining the structural similarities and functional differences gives insight into their operation and the duct-valve interactions. In speech and singing, vocal tract resonances usually determine the spectral envelope and usually have a smaller influence on the operating frequency. The resonances are important not only for the phonemic information they produce, but also because of their contribution to voice timbre, loudness, and efficiency. The role of the tract resonances is usually different in brass and some woodwind instruments, where they modify and to some extent compete or collaborate with resonances of the instrument to control the vibration of a reed or the player's lips, and/or the spectrum of air flow into the instrument. We give a brief overview of oscillator mechanisms and vocal tract acoustics. We discuss recent and current research on how the acoustical resonances of the vocal tract are involved in singing and the playing of musical wind instruments. Finally, we compare techniques used in determining tract resonances and suggest some future developments.

  8. Speech intelligibility measure for vocal control of an automaton

    Science.gov (United States)

    Naranjo, Michel; Tsirigotis, Georgios

    1998-07-01

    The rapid progress of research in speech recognition suggests that vocal control systems will soon be widely established in production units. Communication between a human and a machine requires technical devices that emit, or are subjected to, significant noise perturbations. The vocal interface introduces a new problem: the control of a deterministic automaton using uncertain information. The purpose is to place the automaton exactly in a final state, ordered by voice, from an unknown initial state. The complete speech processing procedure presented in this paper takes as input the temporal speech signal of a word and produces as output a recognised word labelled with an intelligibility index given by the recognition quality. In the first part, we present the essential psychoacoustic concepts for the automatic calculation of the loudness of a speech signal. The architecture of a Time Delay Neural Network is presented in the second part, where we also give the recognition results. The theory of fuzzy subsets, in the third part, allows a recognised word and its intelligibility index to be extracted at the same time. In the fourth part, an Anticipatory System models the control of a Sequential Machine. A prediction phase and an updating phase involve data coming from the information system. A Bayesian decision strategy is used, and the criterion is a weighted sum of criteria defined from information, minimum path functions and the speech intelligibility measure.

  9. Voice fundamental frequency modulates vocal response to pitch perturbations during English speech

    OpenAIRE

    Liu, Hanjun; Auger, James; Charles R Larson

    2009-01-01

    Previous research has demonstrated task-dependent vocal responses to pitch perturbations during speech production. The present study investigated the effect of voice fundamental frequency (F0) on the modulation of vocal responses during English speech. Randomized pitch shifts of ±100 or 200 cents during speaking were presented to English speakers. Results indicated larger vocal responses and shorter latencies at a high voice F0 than at a low voice F0, but no significant differences were obse...

  10. Acoustic correlates of inflectional morphology in the speech of children with specific language impairment and their typically developing peers.

    Science.gov (United States)

    Owen, Amanda J; Goffman, Lisa

    2007-07-01

    The development of the use of the third-person singular -s in open syllable verbs in children with specific language impairment (SLI) and their typically developing peers was examined. Verbs that included overt productions of the third-person singular -s morpheme (e.g. Bobby plays ball everyday; Bear laughs when mommy buys popcorn) were contrasted with clearly bare stem contexts (e.g. Mommy, buy popcorn; I saw Bobby play ball) on both global and local measures of acoustic duration. A durational signature for verbs inflected with -s was identified separately from factors related to sentence length. These duration measures were also used to identify acoustic changes related to the omission of the -s morpheme. The omitted productions from the children with SLI were significantly longer than their correct third-person singular and bare stem productions. This result was unexpected given that the omitted productions have fewer phonemes than correctly inflected productions. Typically developing children did not show the same pattern, instead producing omitted productions that patterned most closely with bare stem forms. These results are discussed in relation to current theoretical approaches to SLI, with an emphasis on performance and speech-motor accounts.

  11. Primate vocal communication: a useful tool for understanding human speech and language evolution?

    Science.gov (United States)

    Fedurek, Pawel; Slocombe, Katie E

    2011-04-01

    Language is a uniquely human trait, and questions of how and why it evolved have been intriguing scientists for years. Nonhuman primates (primates) are our closest living relatives, and their behavior can be used to estimate the capacities of our extinct ancestors. As humans and many primate species rely on vocalizations as their primary mode of communication, the vocal behavior of primates has been an obvious target for studies investigating the evolutionary roots of human speech and language. By studying the similarities and differences between human and primate vocalizations, comparative research has the potential to clarify the evolutionary processes that shaped human speech and language. This review examines some of the seminal and recent studies that contribute to our knowledge regarding the link between primate calls and human language and speech. We focus on three main aspects of primate vocal behavior: functional reference, call combinations, and vocal learning. Studies in these areas indicate that despite important differences, primate vocal communication exhibits some key features characterizing human language. They also indicate, however, that some critical aspects of speech, such as vocal plasticity, are not shared with our primate cousins. We conclude that comparative research on primate vocal behavior is a very promising tool for deepening our understanding of the evolution of human speech and language, but much is still to be done as many aspects of monkey and ape vocalizations remain largely unexplored.

  12. Vocal Features of Song and Speech: Insights from Schoenberg's Pierrot Lunaire

    Directory of Open Access Journals (Sweden)

    Julia Merrill

    2017-07-01

    Full Text Available Similarities and differences between speech and song are often examined. However, the perceptual definition of these two types of vocalization is challenging. Indeed, the prototypical characteristics of speech or song support top-down processes, which influence listeners' perception of acoustic information. In order to examine vocal features associated with speaking and singing, we propose an innovative approach designed to facilitate bottom-up mechanisms in perceiving vocalizations by using material situated between speech and song: speechsong. Twenty-five participants were asked to evaluate 20 performances of a speechsong composition by Arnold Schoenberg, “Pierrot lunaire” op. 21 from 1912, evaluating 20 features of vocal-articulatory expression. Raters provided reliable judgments concerning the vocal features used by the performers and did not show strong appeal or specific expectations in reference to Schoenberg's piece. By examining the relationship between the vocal features and the impression of song or speech, the results confirm the importance of pitch (height, contour, range), but also point to the relevance of register, timbre, tension and faucal distance. Besides highlighting vocal features associated with speech and song, this study supports the relevance of the present approach of focusing on a theoretical middle category in order to better understand vocal expression in song and speech.

  13. VOCAL DEVELOPMENT AS A MAIN CONDITION IN EARLY SPEECH AND LANGUAGE ACQUISITION

    Directory of Open Access Journals (Sweden)

    Marianne HOLM

    2005-06-01

    Full Text Available The objective of this research is the evident positive vocal development of pre-lingually deaf children who underwent cochlear implantation at an early age. The research compares the vocal speech expressions of three hearing-impaired children and two children with normal hearing from 10 months to 5 years of age. Comparisons of the spontaneous vocal expressions were conducted by sonagraphic analyses. Awareness of one's own voice as well as the voices of others is essential for the child's continuous vocal development from crying to speech. Supra-segmental factors, such as rhythm, dynamics and melody, play a very important role in this development.

  14. Voice fundamental frequency modulates vocal response to pitch perturbations during English speech.

    Science.gov (United States)

    Liu, Hanjun; Auger, James; Larson, Charles R

    2010-01-01

    Previous research has demonstrated task-dependent vocal responses to pitch perturbations during speech production. The present study investigated the effect of voice fundamental frequency (F0) on the modulation of vocal responses during English speech. Randomized pitch shifts of ±100 or 200 cents during speaking were presented to English speakers. Results indicated larger vocal responses and shorter latencies at a high voice F0 than at a low voice F0, but no significant differences were observed for stimulus magnitude or direction. These findings suggest that the pitch-shift reflex during speech can be modulated as a function of voice F0.
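
    The perturbations above are specified in cents, i.e. hundredths of a semitone; a shift of c cents multiplies frequency by 2^(c/1200). A minimal worked sketch of that relationship, with illustrative values rather than the study's stimuli:

```python
# Minimal sketch of the cents/frequency relationship behind ±100 and 200 cent shifts.
import math

def cents_to_ratio(cents: float) -> float:
    """Frequency ratio corresponding to a pitch shift given in cents."""
    return 2.0 ** (cents / 1200.0)

def shift_f0(f0_hz: float, cents: float) -> float:
    """Apply a pitch shift of `cents` to a fundamental frequency in Hz."""
    return f0_hz * cents_to_ratio(cents)

print(round(shift_f0(200.0, +100), 1))  # 200 Hz raised by 100 cents -> ~211.9 Hz
print(round(shift_f0(200.0, -200), 1))  # 200 Hz lowered by 200 cents -> ~178.2 Hz
```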

  15. Adaptation to Delayed Speech Feedback Induces Temporal Recalibration between Vocal Sensory and Auditory Modalities

    Directory of Open Access Journals (Sweden)

    Kosuke Yamamoto

    2011-10-01

    Full Text Available We ordinarily perceive our voice sound as occurring simultaneously with vocal production, but the sense of simultaneity in vocalization can be easily interrupted by delayed auditory feedback (DAF). DAF causes normal people to have difficulty speaking fluently but helps people with stuttering to improve speech fluency. However, the underlying temporal mechanism for integrating the motor production of voice and the auditory perception of vocal sound remains unclear. In this study, we investigated the temporal tuning mechanism integrating vocal sensory and voice sounds under DAF with an adaptation technique. Participants read sentences under specific DAF delay times (0, 30, 75, or 120 ms) for three minutes to induce ‘Lag Adaptation’. After the adaptation, they judged the simultaneity between the motor sensation and the vocal sound fed back while producing a simple vocalization rather than speech. We found that speech production with lag adaptation induced a shift in simultaneity responses toward the adapted auditory delays. This indicates that the temporal tuning mechanism in vocalization can be temporally recalibrated after prolonged exposure to delayed vocal sounds. These findings suggest vocalization is finely tuned by the temporal recalibration mechanism, which acutely monitors the integration of temporal delays between motor sensation and vocal sound.

  16. Anticipatory Posturing of the Vocal Tract Reveals Dissociation of Speech Movement Plans from Linguistic Units

    OpenAIRE

    Sam Tilsen; Pascal Spincemaille; Bo Xu; Peter Doerschuk; Wen-Ming Luh; Elana Feldman; Yi Wang

    2016-01-01

    Models of speech production typically assume that control over the timing of speech movements is governed by the selection of higher-level linguistic units, such as segments or syllables. This study used real-time magnetic resonance imaging of the vocal tract to investigate the anticipatory movements speakers make prior to producing a vocal response. Two factors were varied: preparation (whether or not speakers had foreknowledge of the target response) and pre-response constraint (whether or ...

  17. The retrieval and inflection of verbs in the spontaneous speech of fluent aphasic speakers

    NARCIS (Netherlands)

    Bastiaanse, Y.R.M.

    Fluent aphasia of the anomic and Wernicke's type is characterized by word retrieval difficulties. However, in fluent aphasic speech, grammatical deviations have been observed as well. There is debate as to whether these grammatical problems are caused by the word retrieval deficit, by an additional

  18. Speech across species : on the mechanistic fundamentals of vocal production and perception

    NARCIS (Netherlands)

    Ohms, Verena Regina

    2011-01-01

    Birdsong and human speech are both complex behaviours which show striking similarities mainly thought to be present in the area of development and learning. The most important parameters in human speech are vocal tract resonances, called formants. Different formant patterns characterize different vo

  19. A Maximum Likelihood Estimation of Vocal-Tract-Related Filter Characteristics for Single Channel Speech Separation

    Directory of Open Access Journals (Sweden)

    Mohammad H. Radfar

    2006-11-01

    Full Text Available We present a new technique for separating two speech signals from a single recording. The proposed method bridges the gap between underdetermined blind source separation techniques and those techniques that model the human auditory system, that is, computational auditory scene analysis (CASA). For this purpose, we decompose the speech signal into the excitation signal and the vocal-tract-related filter and then estimate the components from the mixed speech using a hybrid model. We first express the probability density function (PDF) of the mixed speech's log spectral vectors in terms of the PDFs of the underlying speech signal's vocal-tract-related filters. Then, the mean vectors of the PDFs of the vocal-tract-related filters are obtained using a maximum likelihood estimator given the mixed signal. Finally, the estimated vocal-tract-related filters along with the extracted fundamental frequencies are used to reconstruct estimates of the individual speech signals. The proposed technique effectively adds vocal-tract-related filter characteristics as a new cue to CASA models using a new grouping technique based on an underdetermined blind source separation. We compare our model with both an underdetermined blind source separation and a CASA method. The experimental results show that our model outperforms both techniques in terms of SNR improvement and the percentage of crosstalk suppression.
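
    The first step the abstract describes, decomposing speech into an excitation signal and a vocal-tract-related filter, is commonly approximated with linear predictive coding (LPC) and inverse filtering. The sketch below illustrates only that decomposition step under assumed settings (file name, frame position, window and LPC order are hypothetical); it does not reproduce the paper's maximum likelihood estimation of the filters from the mixed signal.

```python
# Minimal sketch: LPC polynomial as the vocal-tract-related filter, inverse
# filtering of the frame as the excitation estimate.
import numpy as np
import librosa
import scipy.signal

def decompose_frame(frame, order=16):
    """Return (lpc_coefficients, excitation_estimate) for one windowed frame."""
    a = librosa.lpc(frame, order=order)                  # a[0] == 1
    excitation = scipy.signal.lfilter(a, [1.0], frame)   # inverse (whitening) filter
    return a, excitation

y, sr = librosa.load("speech.wav", sr=16000)             # hypothetical input file
frame = y[8000:8000 + 512] * np.hanning(512)
a, e = decompose_frame(frame)
print(a.shape, e.shape)
```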

  20. A Maximum Likelihood Estimation of Vocal-Tract-Related Filter Characteristics for Single Channel Speech Separation

    Directory of Open Access Journals (Sweden)

    Dansereau Richard M

    2007-01-01

    Full Text Available We present a new technique for separating two speech signals from a single recording. The proposed method bridges the gap between underdetermined blind source separation techniques and those techniques that model the human auditory system, that is, computational auditory scene analysis (CASA). For this purpose, we decompose the speech signal into the excitation signal and the vocal-tract-related filter and then estimate the components from the mixed speech using a hybrid model. We first express the probability density function (PDF) of the mixed speech's log spectral vectors in terms of the PDFs of the underlying speech signal's vocal-tract-related filters. Then, the mean vectors of the PDFs of the vocal-tract-related filters are obtained using a maximum likelihood estimator given the mixed signal. Finally, the estimated vocal-tract-related filters along with the extracted fundamental frequencies are used to reconstruct estimates of the individual speech signals. The proposed technique effectively adds vocal-tract-related filter characteristics as a new cue to CASA models using a new grouping technique based on an underdetermined blind source separation. We compare our model with both an underdetermined blind source separation and a CASA method. The experimental results show that our model outperforms both techniques in terms of SNR improvement and the percentage of crosstalk suppression.

  1. Animal Models of Speech and Vocal Communication Deficits Associated With Psychiatric Disorders.

    Science.gov (United States)

    Konopka, Genevieve; Roberts, Todd F

    2016-01-01

    Disruptions in speech, language, and vocal communication are hallmarks of several neuropsychiatric disorders, most notably autism spectrum disorders. Historically, the use of animal models to dissect molecular pathways and connect them to behavioral endophenotypes in cognitive disorders has proven to be an effective approach for developing and testing disease-relevant therapeutics. The unique aspects of human language compared with vocal behaviors in other animals make such an approach potentially more challenging. However, the study of vocal learning in species with analogous brain circuits to humans may provide entry points for understanding this human-specific phenotype and diseases. We review animal models of vocal learning and vocal communication and specifically link phenotypes of psychiatric disorders to relevant model systems. Evolutionary constraints in the organization of neural circuits and synaptic plasticity result in similarities in the brain mechanisms for vocal learning and vocal communication. Comparative approaches and careful consideration of the behavioral limitations among different animal models can provide critical avenues for dissecting the molecular pathways underlying cognitive disorders that disrupt speech, language, and vocal communication.

  2. Sub-vocal speech pattern recognition of Hindi alphabet with surface electromyography signal

    Directory of Open Access Journals (Sweden)

    Munna Khan

    2016-09-01

    Full Text Available Recently, electromyography (EMG)-based speech signals have been used for pattern recognition of phonemes, vocal frequency estimation, browser interfaces, and classification of speech-related problems. Attempts have been made to use the EMG signal for sub-vocal speech pattern recognition of the Hindi phonemes Ka, Kha, Ga, and Gha and of Hindi words, so that commands can be issued sub-vocally to control devices. Sub-vocal EMG data were collected from more than 10 healthy subjects aged between 25 and 30 years. The EMG-based sub-vocal database was acquired with a four-channel BIOPAC MP-30 acquisition system. Four pairs of Ag-AgCl electrodes were placed on the skin of the participants' neck area. AR coefficients and cepstral coefficients were computed as features of the EMG-based sub-vocal signal, and these features were classified with an HMM classifier. The H2M MATLAB toolbox was used to develop the HMM classifier for classification of phonemes. Results were averaged over 10 subjects. The average classification accuracy for Ka was found to be 85%, whereas the classification accuracy for Kha and Gha was between 88% and 90%. The classification accuracy for Ga was 78%, lower than that for Kha and Gha.
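
    A hedged sketch of the recognition pipeline outlined above: per-frame AR (LPC) and cepstral coefficients computed from one surface-EMG channel, one Gaussian HMM trained per phoneme class, and classification by maximum likelihood. It uses librosa and hmmlearn rather than the H2M MATLAB toolbox named in the abstract; feature orders, frame sizes and class labels are assumptions.

```python
# Sketch of EMG feature extraction and per-class HMM classification.
import numpy as np
import librosa
from hmmlearn import hmm

def emg_features(sig, frame_len=256, hop=128, ar_order=4, n_ceps=6):
    """Frame-wise AR coefficients plus real-cepstrum coefficients for one channel."""
    feats = []
    for start in range(0, len(sig) - frame_len, hop):
        frame = sig[start:start + frame_len] * np.hanning(frame_len)
        ar = librosa.lpc(frame, order=ar_order)[1:]        # AR coefficients
        spec = np.abs(np.fft.rfft(frame)) + 1e-10
        ceps = np.fft.irfft(np.log(spec))[:n_ceps]         # real cepstrum
        feats.append(np.concatenate([ar, ceps]))
    return np.array(feats)

def train_models(train_data):
    """train_data: dict mapping phoneme label -> list of raw EMG segments."""
    models = {}
    for label, segments in train_data.items():
        seqs = [emg_features(s) for s in segments]
        X, lengths = np.vstack(seqs), [len(s) for s in seqs]
        m = hmm.GaussianHMM(n_components=3, covariance_type="diag", n_iter=50)
        m.fit(X, lengths)
        models[label] = m
    return models

def classify(models, segment):
    """Return the label of the model with the highest log-likelihood."""
    feats = emg_features(segment)
    return max(models, key=lambda lab: models[lab].score(feats))
```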

  3. Articulatory Speech Synthesis from the Fluid Dynamics of the Vocal Apparatus

    CERN Document Server

    Levinson, Stephen; Slimon, Scott; Juang, BH

    2012-01-01

    This book addresses the problem of articulatory speech synthesis based on computed vocal tract geometries and the basic physics of sound production in it. Unlike conventional methods based on analysis/synthesis using the well-known source filter model, which assumes the independence of the excitation and filter, we treat the entire vocal apparatus as one mechanical system that produces sound by means of fluid dynamics. The vocal apparatus is represented as a three-dimensional time-varying mechanism and the sound propagation inside it is due to the non-planar propagation of acoustic waves throu

  4. The speech and vocalization patterns of boys with ADHD compared with boys with dyslexia and boys without learning disabilities.

    Science.gov (United States)

    Breznitz, Zvia

    2003-12-01

    This study examined the speech and vocalization patterns of boys with Attention Deficit Hyperactivity Disorder (ADHD) who were not under the influence of stimulant medication, compared with the speech and vocalization patterns of boys with reading disabilities and a control group of learners without learning disabilities. The voices of 105 participants were recorded during interviews and analyzed in the laboratory using equipment that examined the temporal speech patterns and physical features of vocalization. The speech patterns were examined in terms of frequency, speech unit duration, and the correlation between vocalization and pauses within the speech unit. The physical features of vocalization were examined with volume and frequency scales. The results indicated that the speech and vocalization patterns of boys with ADHD differed significantly from those of boys with reading disabilities and from those of the boys in the control group. The results support the assumption that speech and vocalization indicators can be used as objective indicators for the diagnosis of hyperactivity syndrome with attention and concentration difficulties.

  5. Vocal effectiveness of speech-language pathology students: Before and after voice use during service delivery

    Directory of Open Access Journals (Sweden)

    Stephanie Couch

    2015-02-01

    Full Text Available Background: As a professional voice user, it is imperative that a speech-language pathologist's (SLP's) vocal effectiveness remain consistent throughout the day. Many factors may contribute to reduced vocal effectiveness, including prolonged voice use, vocally abusive behaviours, poor vocal hygiene and environmental factors. Objectives: To determine the effect of service delivery on the perceptual and acoustic features of voice. Method: A quasi-experimental, pre-test–post-test research design was used. Participants included third- and final-year speech-language pathology students at the University of Pretoria (South Africa). Voice parameters were evaluated in a pre-test measurement, after which the participants provided two consecutive hours of therapy. A post-test measurement was then completed. Data analysis consisted of an instrumental analysis in which the multidimensional voice programme (MDVP) and the voice range profile (VRP) were used to measure vocal parameters and then calculate the dysphonia severity index (DSI). The GRBASI scale was used to conduct a perceptual analysis of voice quality. Data were processed using descriptive statistics to determine change in each measured parameter after service delivery. Results: A change of clinical significance was observed in the acoustic and perceptual parameters of voice. Conclusion: Guidelines for SLPs to maintain optimal vocal effectiveness were suggested.
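
    For reference, the dysphonia severity index (DSI) mentioned above is conventionally computed as a weighted combination of four measures taken from instruments such as the MDVP and VRP. The sketch below assumes the commonly reported weighting of maximum phonation time, highest F0, lowest intensity and jitter; the example values are illustrative, not data from this study.

```python
# Hedged sketch of the commonly reported DSI weighting (assumed, not study-specific).
def dsi(mpt_s: float, f0_high_hz: float, i_low_db: float, jitter_pct: float) -> float:
    """Dysphonia severity index from maximum phonation time (s), highest F0 (Hz),
    lowest intensity (dB) and jitter (%)."""
    return 0.13 * mpt_s + 0.0053 * f0_high_hz - 0.26 * i_low_db - 1.18 * jitter_pct + 12.4

# Values around +5 are usually associated with normal voices, around -5 with severe dysphonia.
print(round(dsi(25.0, 440.0, 50.0, 0.4), 2))  # ~4.51 for these illustrative inputs
```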

  6. Effects of speech noise on vocal fundamental frequency using power spectral analysis.

    Science.gov (United States)

    Lee, Guo-She; Hsiao, Tzu-Yu; Yang, Cheryl C H; Kuo, Terry B J

    2007-06-01

    To investigate the relationship between auditory function and vocal fundamental frequency (F0) using binaural masking with speech noise during sustained vowel vocalization. Eight healthy subjects were instructed to vocalize the sustained vowel /a/ at the intensities of 65 to 75 dBA and 90 to 100 dBA as steadily as possible. The phonations without noise masking were compared with the phonations under masking with 85-dBA speech noise presented to both ears through headphones. The F0s were obtained by using autocorrelation of the voice signals and were converted to cents to form a F0 sequence. The power spectrum of the F0 sequence was then acquired using fast Fourier transformation. A significant increase in the power spectrum in the frequency range of production by decreasing F0 modulation at production and auditory feedback.
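
    A minimal sketch of the analysis chain described above: frame-wise F0 estimation by autocorrelation, conversion of the F0 contour to cents, and a power spectrum of that contour via FFT. Frame sizes, the F0 search range and the reference frequency are assumptions for illustration.

```python
# Sketch of F0 extraction by autocorrelation and spectral analysis of the F0 sequence.
import numpy as np

def f0_autocorr(frame, sr, fmin=75.0, fmax=400.0):
    """Crude autocorrelation F0 estimate (Hz) for one voiced frame."""
    frame = frame - frame.mean()
    ac = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
    lo, hi = int(sr / fmax), int(sr / fmin)
    lag = lo + np.argmax(ac[lo:hi])
    return sr / lag

def f0_contour(sig, sr, frame_len=1024, hop=256):
    """F0 sequence sampled at the frame rate sr / hop."""
    return np.array([f0_autocorr(sig[i:i + frame_len], sr)
                     for i in range(0, len(sig) - frame_len, hop)])

def f0_power_spectrum(f0_hz, frame_rate):
    """Power spectrum of the F0 sequence expressed in cents re its mean."""
    cents = 1200.0 * np.log2(f0_hz / f0_hz.mean())
    cents = cents - cents.mean()
    spec = np.abs(np.fft.rfft(cents * np.hanning(len(cents)))) ** 2
    freqs = np.fft.rfftfreq(len(cents), d=1.0 / frame_rate)
    return freqs, spec
```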

  7. Priming of Non-Speech Vocalizations in Male Adults: The Influence of the Speaker's Gender

    Science.gov (United States)

    Fecteau, Shirley; Armony, Jorge L.; Joanette, Yves; Belin, Pascal

    2004-01-01

    Previous research reported a priming effect for voices. However, the type of information primed is still largely unknown. In this study, we examined the influence of speaker's gender and emotional category of the stimulus on priming of non-speech vocalizations in 10 male participants, who performed a gender identification task. We found a…

  8. Anticipatory Posturing of the Vocal Tract Reveals Dissociation of Speech Movement Plans from Linguistic Units.

    Directory of Open Access Journals (Sweden)

    Sam Tilsen

    Full Text Available Models of speech production typically assume that control over the timing of speech movements is governed by the selection of higher-level linguistic units, such as segments or syllables. This study used real-time magnetic resonance imaging of the vocal tract to investigate the anticipatory movements speakers make prior to producing a vocal response. Two factors were varied: preparation (whether or not speakers had foreknowledge of the target response) and pre-response constraint (whether or not speakers were required to maintain a specific vocal tract posture prior to the response). In prepared responses, many speakers were observed to produce pre-response anticipatory movements with a variety of articulators, showing that speech movements can be readily dissociated from higher-level linguistic units. Substantial variation was observed across speakers with regard to the articulators used for anticipatory posturing and the contexts in which anticipatory movements occurred. The findings of this study have important consequences for models of speech production and for our understanding of the normal range of variation in anticipatory speech behaviors.

  9. Anticipatory Posturing of the Vocal Tract Reveals Dissociation of Speech Movement Plans from Linguistic Units.

    Science.gov (United States)

    Tilsen, Sam; Spincemaille, Pascal; Xu, Bo; Doerschuk, Peter; Luh, Wen-Ming; Feldman, Elana; Wang, Yi

    2016-01-01

    Models of speech production typically assume that control over the timing of speech movements is governed by the selection of higher-level linguistic units, such as segments or syllables. This study used real-time magnetic resonance imaging of the vocal tract to investigate the anticipatory movements speakers make prior to producing a vocal response. Two factors were varied: preparation (whether or not speakers had foreknowledge of the target response) and pre-response constraint (whether or not speakers were required to maintain a specific vocal tract posture prior to the response). In prepared responses, many speakers were observed to produce pre-response anticipatory movements with a variety of articulators, showing that speech movements can be readily dissociated from higher-level linguistic units. Substantial variation was observed across speakers with regard to the articulators used for anticipatory posturing and the contexts in which anticipatory movements occurred. The findings of this study have important consequences for models of speech production and for our understanding of the normal range of variation in anticipatory speech behaviors.

  10. Gender and vocal production mode discrimination using the high frequencies for speech and singing.

    Science.gov (United States)

    Monson, Brian B; Lotto, Andrew J; Story, Brad H

    2014-01-01

    Humans routinely produce acoustical energy at frequencies above 6 kHz during vocalization, but this frequency range is often not represented in communication devices and speech perception research. Recent advancements toward high-definition (HD) voice and extended bandwidth hearing aids have increased the interest in the high frequencies. The potential perceptual information provided by high-frequency energy (HFE) is not well characterized. We found that humans can accomplish tasks of gender discrimination and vocal production mode discrimination (speech vs. singing) when presented with acoustic stimuli containing only HFE at both amplified and normal levels. Performance in these tasks was robust in the presence of low-frequency masking noise. No substantial learning effect was observed. Listeners also were able to identify the sung and spoken text (excerpts from "The Star-Spangled Banner") with very few exposures. These results add to the increasing evidence that the high frequencies provide at least redundant information about the vocal signal, suggesting that its representation in communication devices (e.g., cell phones, hearing aids, and cochlear implants) and speech/voice synthesizers could improve these devices and benefit normal-hearing and hearing-impaired listeners.
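
    One way to isolate the high-frequency energy (HFE) band referred to above is a zero-phase high-pass filter at roughly 6 kHz. The sketch below is illustrative only; the cutoff, filter order, sampling rate and use of synthetic noise are assumptions, not the authors' stimulus-generation procedure.

```python
# Hedged sketch: band-limit a signal to its high-frequency energy with a high-pass filter.
import numpy as np
from scipy.signal import butter, sosfiltfilt

def extract_hfe(signal, sr, cutoff_hz=6000.0, order=8):
    """Return the signal band-limited to frequencies above cutoff_hz (zero-phase)."""
    sos = butter(order, cutoff_hz, btype="highpass", fs=sr, output="sos")
    return sosfiltfilt(sos, signal)

# Example with synthetic data: one second of white noise reduced to its HFE portion.
sr = 44100
noise = np.random.randn(sr)
hfe_only = extract_hfe(noise, sr)
```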

  11. Gender and vocal production mode discrimination using the high frequencies for speech and singing

    Directory of Open Access Journals (Sweden)

    Brian B Monson

    2014-10-01

    Full Text Available Humans routinely create acoustical energy at frequencies above 6 kHz during vocalization, but this frequency range is often not represented in communication devices and speech perception research. Recent advancements toward HD voice and extended bandwidth hearing aids have increased the interest in the high frequencies. The potential perceptual information provided by high-frequency energy (HFE) is not well characterized. We found that humans can accomplish tasks of gender discrimination and vocal production mode discrimination (speech vs. singing) when presented with acoustic stimuli containing only HFE at both amplified and normal levels. Performance in these tasks was robust in the presence of low-frequency masking noise. No substantial learning effect was observed. Listeners also were able to identify the sung and spoken text (excerpts from The Star-Spangled Banner) with very few exposures. These results add to the increasing evidence that the high frequencies provide at least redundant information about the vocal signal, suggesting that its representation in communication devices (e.g., cell phones, hearing aids, and cochlear implants) and speech/voice synthesizers could improve these devices and benefit normal-hearing and hearing-impaired listeners.

  12. Retrospective Parent Report of Early Vocal Behaviours in Children with Suspected Childhood Apraxia of Speech (sCAS)

    Science.gov (United States)

    Highman, Chantelle; Hennessey, Neville; Sherwood, Mellanie; Leitao, Suze

    2008-01-01

    Parents of children with suspected Childhood Apraxia of Speech (sCAS, n = 20), Specific Language Impairment (SLI, n = 20), and typically developing speech and language skills (TD, n = 20) participated in this study, which aimed to quantify and compare reports of early vocal development. Via a questionnaire, parents reported on their child's early…

  13. ‘Rude am I in Meh Speech': Vocality and Victorian Shakespeare

    Directory of Open Access Journals (Sweden)

    Brian Willis

    2009-04-01

    Full Text Available This article refutes some of the assumptions, most often associated with Peter Hall and John Barton, that colour accounts of the vocality of verse actors before the emergence of contemporary stylisations. It focuses on the earliest recordings of late Victorian actors – in particular, Edwin Booth, Ellen Terry and Henry Irving – performing Shakespearean roles. By examining the social, political and cultural evaluations of the actor's voice from the period 1870-1901, it emerges that the actors who dominated the English-speaking stage of the period spoke with a voice unaffected by Received Pronunciation, which was becoming increasingly dominant through the education system. By timing the rate of speech of existing recordings it becomes clear that the application of pejorative terms – such as 'declamation' – to the actors' vocal styles is inaccurate because their voices progress through the text at a rate in the median of twentieth-century dramatic speech. Most importantly, their training – rooted in the repertory system and the late nineteenth-century forms of elocution – required the use of a different resonance centre than that used by contemporary performers, which lends to their voice a timbre more suited to the space of a large auditorium. The article asks for a reconsideration of those voices as more recognisably 'natural' to the actor and to the nineteenth-century audience than they are to their contemporary counterparts.

  14. Magnetic resonance imaging of the brain and vocal tract: Applications to the study of speech production and language learning.

    Science.gov (United States)

    Carey, Daniel; McGettigan, Carolyn

    2017-04-01

    The human vocal system is highly plastic, allowing for the flexible expression of language, mood and intentions. However, this plasticity is not stable throughout the life span, and it is well documented that adult learners encounter greater difficulty than children in acquiring the sounds of foreign languages. Researchers have used magnetic resonance imaging (MRI) to interrogate the neural substrates of vocal imitation and learning, and the correlates of individual differences in phonetic "talent". In parallel, a growing body of work using MR technology to directly image the vocal tract in real time during speech has offered primarily descriptive accounts of phonetic variation within and across languages. In this paper, we review the contribution of neural MRI to our understanding of vocal learning, and give an overview of vocal tract imaging and its potential to inform the field. We propose methods by which our understanding of speech production and learning could be advanced through the combined measurement of articulation and brain activity using MRI - specifically, we describe a novel paradigm, developed in our laboratory, that uses both MRI techniques to map directly, for the first time, between neural, articulatory and acoustic data in the investigation of vocalisation. This non-invasive, multimodal imaging method could be used to track central and peripheral correlates of spoken language learning, and speech recovery in clinical settings, as well as provide insights into potential sites for targeted neural interventions.

  15. Voice Outcomes of Adults Diagnosed with Pediatric Vocal Fold Nodules and Impact of Speech Therapy.

    Science.gov (United States)

    Song, Brian H; Merchant, Maqdooda; Schloegel, Luke

    2017-08-01

    Objective To evaluate the voice outcomes of adults diagnosed with vocal fold nodules (VFNs) as children and to assess the impact of speech therapy on long-term voice outcomes. Study Design Prospective cohort study. Setting Large health care system. Subjects and Methods Subjects diagnosed with VFNs as children between the years 1996 and 2008 were identified within a medical record database of a large health care system. Included subjects were 3 to 12 years old at the time of diagnosis, had a documented laryngeal examination within 90 days of diagnosis, and were ≥18 years as of December 31, 2014. Qualified subjects were contacted by telephone and administered the Voice Handicap Index-10 (VHI-10) and a 15-item questionnaire inquiring about confounding factors. Results A total of 155 subjects were included, with a mean age of 21.4 years (range, 18-29). The male:female ratio was 2.3:1. Mean VHI-10 score for the entire cohort was 5.4. Mean VHI-10 scores did not differ between those who received speech therapy (6.1) and those who did not (4.5; P = .08). Both groups were similar with respect to confounding risk factors that can contribute to dysphonia, although the no-therapy group had a disproportionately higher number of subjects who consumed >10 alcoholic drinks per week (P = .01). Conclusion The majority of adults with VFNs as children will achieve a close-to-normal voice quality when they reach adulthood. In our cohort, speech therapy did not appear to have an impact on the long-term voice outcomes.

  16. Cohesion and Joint Speech: Right Hemisphere Contributions to Synchronized Vocal Production.

    Science.gov (United States)

    Jasmin, Kyle M; McGettigan, Carolyn; Agnew, Zarinah K; Lavan, Nadine; Josephs, Oliver; Cummins, Fred; Scott, Sophie K

    2016-04-27

    Synchronized behavior (chanting, singing, praying, dancing) is found in all human cultures and is central to religious, military, and political activities, which require people to act collaboratively and cohesively; however, we know little about the neural underpinnings of many kinds of synchronous behavior (e.g., vocal behavior) or its role in establishing and maintaining group cohesion. In the present study, we measured neural activity using fMRI while participants spoke simultaneously with another person. We manipulated whether the couple spoke the same sentence (allowing synchrony) or different sentences (preventing synchrony), and also whether the voice the participant heard was "live" (allowing rich reciprocal interaction) or prerecorded (with no such mutual influence). Synchronous speech was associated with increased activity in posterior and anterior auditory fields. When, and only when, participants spoke with a partner who was both synchronous and "live," we observed a lack of the suppression of auditory cortex, which is commonly seen as a neural correlate of speech production. Instead, auditory cortex responded as though it were processing another talker's speech. Our results suggest that detecting synchrony leads to a change in the perceptual consequences of one's own actions: they are processed as though they were other-, rather than self-produced. This may contribute to our understanding of synchronized behavior as a group-bonding tool. Synchronized human behavior, such as chanting, dancing, and singing, are cultural universals with functional significance: these activities increase group cohesion and cause participants to like each other and behave more prosocially toward each other. Here we use fMRI brain imaging to investigate the neural basis of one common form of cohesive synchronized behavior: joint speaking (e.g., the synchronous speech seen in chants, prayers, pledges). Results showed that joint speech recruits additional right hemisphere

  17. The speech range profile (SRP): an easy and useful tool to assess vocal limits.

    Science.gov (United States)

    D'Alatri, L; Marchese, M R

    2014-08-01

    This study was carried out to compare the vocal limits obtained by the speech range profile (SRP) with those of the voice range profile (VRP) in untrained healthy and dysphonic females. Forty-six healthy voice volunteers (control group) and 148 dysphonic patients (dysphonic group) were evaluated using videolaryngostroboscopic assessment and phonetography for voice measurements. For the VRP, subjects were asked to sustain the vowel /a/ as softly and as loudly as possible from the lowest to the highest frequencies using an automated procedure. The SRP was obtained by recording the speaking voice (SV) and the shouting voice (ShV), asking subjects to read a list of sentences aloud and to shout /ehi/ as loud as they could, respectively. All subjects in the control and dysphonic groups were able to perform the SRP. Forty of 46 (85%) cases in the control group and 102 of 148 (68.91%) in the dysphonic group were able to perform the VRP. Most frequently, the VRP could not be recorded because of inability to perform the task or, especially in the dysphonic group, inadequacy of the vocal signal. In the control group, there were no significant differences between the mean values of Fmin, Fmax, Imin and number of semitones (st) of the VRP and those of the SRP (p > 0.05). In the dysphonic group, the mean values of Fmin, Fmax and st SV+ShV for the SRP were significantly higher than those of the VRP. Our preliminary results suggest that the SRP may be a useful alternative tool to assess vocal limits in both euphonic and dysphonic females.
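
    The semitone (st) ranges reported above follow from the standard logarithmic relation between two frequencies, 12·log2(Fmax/Fmin). A minimal worked example with illustrative values:

```python
# Minimal sketch of the semitone-range measure used in phonetography.
import math

def semitone_range(f_min_hz: float, f_max_hz: float) -> float:
    """Number of semitones spanned between two frequencies."""
    return 12.0 * math.log2(f_max_hz / f_min_hz)

print(round(semitone_range(150.0, 600.0), 1))  # two octaves = 24.0 st
```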

  18. Effects of vowel height and vocal intensity on anticipatory nasal airflow in individuals with normal speech.

    Science.gov (United States)

    Young, L H; Zajac, D J; Mayo, R; Hooper, C R

    2001-02-01

    The purpose of this study was to determine the effects of vowel height and vocal intensity on the magnitude of anticipatory nasal airflow in normal speakers when producing vowel-nasal-vowel (VNV) sequences. Measurements of nasal and oral airflow were obtained from 15 men and 12 women with normal speech during production of the VNV sequences /ini/ and /ana/ at low, medium, and high intensity levels. Ratios of nasal to oral-plus-nasal airflow were calculated for the initial vowel of both utterances at each of the intensity levels. Analysis of variance (ANOVA) procedures indicated a significant main effect of intensity level and a significant vowel-by-sex interaction effect (p < .05). Female speakers exhibited greater airflow ratios during production of /ini/ than during production of /ana/. Their airflow ratios were also greater during production of /ini/ than were those of male speakers. The results suggest that vocal intensity may affect velopharyngeal (VP) function in an assimilative nasal phonetic context. The results further suggest that anticipatory nasal airflow may be determined by the configuration of the oral cavity to a greater extent in women than in men. Theoretical and clinical implications are discussed.

  19. Advances in real-time magnetic resonance imaging of the vocal tract for speech science and technology research

    Science.gov (United States)

    TOUTIOS, ASTERIOS; NARAYANAN, SHRIKANTH S.

    2016-01-01

    Real-time magnetic resonance imaging (rtMRI) of the moving vocal tract during running speech production is an important emerging tool for speech production research providing dynamic information of a speaker's upper airway from the entire mid-sagittal plane or any other scan plane of interest. There have been several advances in the development of speech rtMRI and corresponding analysis tools, and their application to domains such as phonetics and phonological theory, articulatory modeling, and speaker characterization. An important recent development has been the open release of a database that includes speech rtMRI data from five male and five female speakers of American English each producing 460 phonetically balanced sentences. The purpose of the present paper is to give an overview and outlook of the advances in rtMRI as a tool for speech research and technology development. PMID:27833745

  20. Estimation of vocal dysperiodicities in disordered connected speech by means of distant-sample bidirectional linear predictive analysis.

    Science.gov (United States)

    Bettens, Frédéric; Grenez, Francis; Schoentgen, Jean

    2005-01-01

    The article presents an analysis of vocal dysperiodicities in connected speech produced by dysphonic speakers. The processing is based on a comparison of the present speech fragment with future and past fragments. The size of the dysperiodicity estimate is zero for periodic speech signals. A feeble increase of the vocal dysperiodicity is guaranteed to produce a feeble increase of the estimate. No spurious noise boosting occurs owing to cycle insertion and omission errors, or phonetic segment boundary artifacts. Additional objectives of the study have been investigating whether deviations from periodicity are larger or more commonplace in connected speech than in sustained vowels, and whether sentences that comprise frequent voice onsets and offsets are noisier than sentences that comprise few. The corpora contain sustained vowels as well as grammatically- and phonetically matched sentences. An acoustic marker that correlates with the perceived degree of hoarseness summarizes the size of the dysperiodicities. The marker values for sustained vowels have been highly correlated with those for connected speech, and the marker values for sentences that comprise few voiced/unvoiced transients have been highly correlated with the marker values for sentences that comprise many.
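
    A strongly simplified sketch of the idea behind a bidirectional dysperiodicity estimate: each frame is predicted, with a single optimal gain, from the best-matching fragment roughly one pitch period in the past and one in the future, and the smaller residual is kept, so that locally periodic speech yields values near zero. Frame length and the lag search range are assumptions; this is not the paper's exact estimator.

```python
# Simplified bidirectional dysperiodicity sketch (illustrative, not the published method).
import numpy as np

def frame_residual(x, y):
    """Residual energy after predicting x from y with an optimal scalar gain."""
    gain = np.dot(x, y) / (np.dot(y, y) + 1e-12)
    return np.sum((x - gain * y) ** 2)

def dysperiodicity(sig, sr, frame_len=256, lag_min=None, lag_max=None):
    lag_min = lag_min or int(sr / 400)   # ~400 Hz upper F0 bound (assumed)
    lag_max = lag_max or int(sr / 60)    # ~60 Hz lower F0 bound (assumed)
    estimates = []
    for i in range(lag_max, len(sig) - frame_len - lag_max, frame_len):
        frame = sig[i:i + frame_len]
        best = np.inf
        for lag in range(lag_min, lag_max):
            past = sig[i - lag:i - lag + frame_len]
            future = sig[i + lag:i + lag + frame_len]
            best = min(best, frame_residual(frame, past), frame_residual(frame, future))
        estimates.append(best / (np.sum(frame ** 2) + 1e-12))  # normalized per frame
    return np.array(estimates)  # near zero for (locally) periodic speech
```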

  1. Peripheral Mechanisms for Vocal Production in Birds--Differences and Similarities to Human Speech and Singing

    Science.gov (United States)

    Riede, Tobias; Goller, Franz

    2010-01-01

    Song production in songbirds is a model system for studying learned vocal behavior. As in humans, bird phonation involves three main motor systems (respiration, vocal organ and vocal tract). The avian respiratory mechanism uses pressure regulation in air sacs to ventilate a rigid lung. In songbirds sound is generated with two independently…

  2. The management of vocal fold nodules in children: a national survey of speech-language pathologists.

    Science.gov (United States)

    Signorelli, Monique E; Madill, Catherine J; McCabe, Patricia

    2011-06-01

    The purpose of this study was to determine the management options and voice therapy techniques currently being used by practicing speech-language pathologists (SLPs) to treat vocal fold nodules (VFNs) in children. The sources used by SLPs to inform and guide their clinical decisions when managing VFNs in children were also explored. Sixty-two SLPs completed a 23-item web-based survey. Data were analysed using frequency counts, content analyses, and supplementary analyses. SLPs reported using a range of management options and voice therapy techniques to treat VFNs in children. Voice therapy was reportedly the most frequently used management option across all respondents, with the majority of SLPs using a combination of indirect and direct voice therapy techniques. When selecting voice therapy techniques, the majority of SLPs reported that they did not use the limited external evidence available to guide their clinical decisions. Additionally, the majority of SLPs reported that they frequently relied on lower levels of evidence or non-evidence-based sources to guide clinical practice, both in the presence and absence of higher-quality evidence. Further research needs to investigate strategies to remove the barriers that impede SLPs' use of external evidence when managing VFNs in children.

  3. Evidence of a Vocalic Proto-System in the Baboon (Papio papio) Suggests Pre-Hominin Speech Precursors

    Science.gov (United States)

    Boë, Louis-Jean; Berthommier, Frédéric; Legou, Thierry; Captier, Guillaume; Kemp, Caralyn; Sawallis, Thomas R.; Becker, Yannick; Rey, Arnaud; Fagot, Joël

    2017-01-01

    Language is a distinguishing characteristic of our species, and the course of its evolution is one of the hardest problems in science. It has long been generally considered that human speech requires a low larynx, and that the high larynx of nonhuman primates should preclude their producing the vowel systems universally found in human language. Examining the vocalizations through acoustic analyses, tongue anatomy, and modeling of acoustic potential, we found that baboons (Papio papio) produce sounds sharing the F1/F2 formant structure of the human [ɨ æ ɑ ɔ u] vowels, and that similarly with humans those vocalic qualities are organized as a system on two acoustic-anatomic axes. This confirms that hominoids can produce contrasting vowel qualities despite a high larynx. It suggests that spoken languages evolved from ancient articulatory skills already present in our last common ancestor with Cercopithecoidea, about 25 MYA. PMID:28076426
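
    Formant frequencies such as the F1/F2 values discussed above are commonly estimated from the roots of a linear-prediction polynomial. The sketch below shows that generic LPC-based method under assumed analysis settings (frame, window, LPC order, sampling rate); it is not the authors' acoustic-analysis pipeline.

```python
# Generic LPC-root formant estimation sketch (assumed settings, e.g. sr around 10 kHz,
# with an LPC order of roughly sr/1000 + 2 as a common rule of thumb).
import numpy as np
import librosa

def estimate_formants(frame, sr, order=12, n_formants=2):
    """Return the lowest n_formants resonance frequencies (Hz) of one frame."""
    a = librosa.lpc(frame * np.hanning(len(frame)), order=order)
    roots = [r for r in np.roots(a) if np.imag(r) > 0]           # keep upper half-plane
    freqs = sorted(np.angle(r) * sr / (2 * np.pi) for r in roots)
    freqs = [f for f in freqs if f > 90.0]                       # discard near-DC roots
    return freqs[:n_formants]                                    # F1, F2 estimates
```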

  4. Perceptual cues in nonverbal vocal expressions of emotion.

    Science.gov (United States)

    Sauter, Disa A; Eisner, Frank; Calder, Andrew J; Scott, Sophie K

    2010-11-01

    Work on facial expressions of emotions (Calder, Burton, Miller, Young, & Akamatsu, 2001) and emotionally inflected speech (Banse & Scherer, 1996) has successfully delineated some of the physical properties that underlie emotion recognition. To identify the acoustic cues used in the perception of nonverbal emotional expressions like laughter and screams, an investigation was conducted into vocal expressions of emotion, using nonverbal vocal analogues of the "basic" emotions (anger, fear, disgust, sadness, and surprise; Ekman & Friesen, 1971; Scott et al., 1997), and of positive affective states (Ekman, 1992, 2003; Sauter & Scott, 2007). First, the emotional stimuli were categorized and rated to establish that listeners could identify and rate the sounds reliably and to provide confusion matrices. A principal components analysis of the rating data yielded two underlying dimensions, correlating with the perceived valence and arousal of the sounds. Second, acoustic properties of the amplitude, pitch, and spectral profile of the stimuli were measured. A discriminant analysis procedure established that these acoustic measures provided sufficient discrimination between expressions of emotional categories to permit accurate statistical classification. Multiple linear regressions with participants' subjective ratings of the acoustic stimuli showed that all classes of emotional ratings could be predicted by some combination of acoustic measures and that most emotion ratings were predicted by different constellations of acoustic features. The results demonstrate that, similarly to affective signals in facial expressions and emotionally inflected speech, the perceived emotional character of affective vocalizations can be predicted on the basis of their physical features.
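
    A hedged sketch of the analysis sequence the abstract describes (principal components analysis of the ratings, discriminant classification of emotion category from acoustic measures, and multiple linear regression of ratings on acoustic measures), written with scikit-learn on synthetic arrays; shapes and variable names are illustrative assumptions, and the original analyses were not run in this toolkit.

```python
# Illustrative PCA / discriminant-analysis / regression pipeline on synthetic data.
import numpy as np
from sklearn.decomposition import PCA
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(0)
n_stimuli = 100
ratings = rng.random((n_stimuli, 10))       # e.g., 10 emotion rating scales
acoustics = rng.random((n_stimuli, 8))      # e.g., amplitude, pitch, spectral measures
category = rng.integers(0, 5, n_stimuli)    # e.g., 5 emotion categories

# Two principal components of the ratings (interpretable as valence and arousal)
components = PCA(n_components=2).fit_transform(ratings)

# Discriminant classification of emotion category from the acoustic measures
lda_acc = cross_val_score(LinearDiscriminantAnalysis(), acoustics, category, cv=5).mean()

# Multiple linear regression: predict one rating scale from the acoustic measures
r2 = LinearRegression().fit(acoustics, ratings[:, 0]).score(acoustics, ratings[:, 0])
print(components.shape, round(lda_acc, 2), round(r2, 2))
```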

  5. A Foxp2 Mutation Implicated in Human Speech Deficits Alters Sequencing of Ultrasonic Vocalizations in Adult Male Mice

    Science.gov (United States)

    Chabout, Jonathan; Sarkar, Abhra; Patel, Sheel R.; Radden, Taylor; Dunson, David B.; Fisher, Simon E.; Jarvis, Erich D.

    2016-01-01

    Development of proficient spoken language skills is disrupted by mutations of the FOXP2 transcription factor. A heterozygous missense mutation in the KE family causes speech apraxia, involving difficulty producing words with complex learned sequences of syllables. Manipulations in songbirds have helped to elucidate the role of this gene in vocal learning, but findings in non-human mammals have been limited or inconclusive. Here, we performed a systematic study of ultrasonic vocalizations (USVs) of adult male mice carrying the KE family mutation. Using novel statistical tools, we found that Foxp2 heterozygous mice did not have detectable changes in USV syllable acoustic structure, but produced shorter sequences and did not shift to more complex syntax in social contexts where wildtype animals did. Heterozygous mice also displayed a shift in the position of their rudimentary laryngeal motor cortex (LMC) layer-5 neurons. Our findings indicate that although mouse USVs are mostly innate, the underlying contributions of FoxP2 to sequencing of vocalizations are conserved with humans.

  6. Effects of utterance length and vocal loudness on speech breathing in older adults.

    Science.gov (United States)

    Huber, Jessica E

    2008-12-31

    Age-related reductions in pulmonary elastic recoil and respiratory muscle strength can affect how older adults generate subglottal pressure required for speech production. The present study examined age-related changes in speech breathing by manipulating utterance length and loudness during a connected speech task (monologue). Twenty-three older adults and twenty-eight young adults produced a monologue at comfortable loudness and pitch and with multi-talker babble noise playing in the room to elicit louder speech. Dependent variables included sound pressure level, speech rate, and lung volume initiation, termination, and excursion. Older adults produced shorter utterances than young adults overall. Age-related effects were larger for longer utterances. Older adults demonstrated very different lung volume adjustments for loud speech than young adults. These results suggest that older adults have a more difficult time when the speech system is being taxed by both utterance length and loudness. The data were consistent with the hypothesis that both young and older adults use utterance length in premotor speech planning processes.

  7. ‘Rude am I in Meh Speech': Vocality and Victorian Shakespeare

    OpenAIRE

    Brian Willis

    2009-01-01

    This article refutes some of the assumptions, most often associated with Peter Hall and John Barton, that colour accounts of the vocality of verse actors before the emergence of contemporary stylisations. It focuses on the earliest recordings of late Victorian actors – in particular, Edwin Booth, Ellen Terry and Henry Irving – performing Shakespearean roles. By examining the social, political and cultural evaluations of the actor's voice from the period 1870-1901, it emerges that the actors ...

  8. The Effects of Vocalics and Nonverbal Sensitivity on Compliance: A Speech Accommodation Theory Explanation.

    Science.gov (United States)

    Buller, David B.; Aune, R. Kelly

    1988-01-01

    Offers a speech accommodation theory explanation of the interaction between a receiver's decoding ability and a speaker's voice tone in gaining compliance with requests for help. Good decoders complied more with the fast request, whereas poor decoders complied more with the slow request. (RAE)

  9. Inflectional frames in language production

    NARCIS (Netherlands)

    Janssen, D.P.; Roelofs, A.P.A.; Levelt, W.J.M.

    2002-01-01

    The authors report six implicit priming experiments that examined the production of inflected forms. Participants produced words out of small sets in response to prompts. The words differed in form or shared word-initial segments, which allowed for preparation. In constant inflectional sets, the words…

  10. Vocal Performance and Speech Intonation: Bob Dylan’s “Like a Rolling Stone”

    Directory of Open Access Journals (Sweden)

    Michael Daley

    2007-03-01

    Full Text Available This article proposes a linguistic analysis of a recorded performance of a single verse of one of Dylan’s most popular songs—the originally released studio recording of “Like A Rolling Stone”—and describes more specifically the ways in which intonation relates to lyrics and performance. This analysis is used as source material for a close reading of the semantic, affective, and “playful” meanings of the performance, and is compared with some published accounts of the song’s reception. The author has drawn on the linguistic methodology formulated by Michael Halliday, who has found speech intonation (which includes pitch movement, timbre, syllabic rhythm, and loudness) to be an integral part of English grammar and crucial to the transmission of certain kinds of meaning. Speech intonation is a deeply-rooted and powerfully meaningful aspect of human communication. This article argues that it is plausible that a system so powerful in speech might have some bearing on the communication of meaning in sung performance.

  11. When Infants Talk, Infants Listen: Pre-Babbling Infants Prefer Listening to Speech with Infant Vocal Properties

    Science.gov (United States)

    Masapollo, Matthew; Polka, Linda; Ménard, Lucie

    2016-01-01

    To learn to produce speech, infants must effectively monitor and assess their own speech output. Yet very little is known about how infants perceive speech produced by an infant, which has higher voice pitch and formant frequencies compared to adult or child speech. Here, we tested whether pre-babbling infants (at 4-6 months) prefer listening to…

  12. Tone and inflection: An introduction

    OpenAIRE

    Palancar, Enrique L.; Léonard, Jean-Léo

    2015-01-01

    Accepted in Enrique L. Palancar & Jean-Léo Léonard (eds.), Tone and Inflection: New facts under new perspectives (submitted to DeGruyter, Oct. 2014). Tone is about melody and meaning, inflection is about grammar, and this book is about a bit of both. The different papers in this book study the possible and sometimes very complex ways in which the melodies of a given language engage in the expression of grammatical meaning. In this light, the volume aims to broaden our understanding of the role of tone…

  13. Inflection point inflation and reheating

    Energy Technology Data Exchange (ETDEWEB)

    Choi, Soo-Min; Lee, Hyun Min [Chung-Ang University, Department of Physics, Seoul (Korea, Republic of)

    2016-06-15

    We revisit the inflection point inflation with an extended discussion to large field values and consider the reheating effects on the inflationary predictions. Parametrizing the reheating dynamics in terms of the reheating temperature and the equation of state during reheating, we show how the observationally favored parameter space of inflection point inflation is affected by reheating dynamics. Consequently, we apply the general results to the inflation models with non-minimal coupling, such as the SM Higgs inflation and the B - L Higgs inflation. (orig.)
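
    For orientation, the lines below restate the generic textbook inflection-point and slow-roll conditions that analyses of this kind build on; they are standard definitions, not equations taken from this paper.

      % Near an (approximate) inflection point \phi_0 the potential is locally flat,
      %   V'(\phi_0) \simeq 0, \qquad V''(\phi_0) \simeq 0,
      % so that the slow-roll parameters evaluated there remain small:
      \epsilon(\phi) = \frac{M_{\mathrm{Pl}}^{2}}{2}\left(\frac{V'(\phi)}{V(\phi)}\right)^{2} \ll 1,
      \qquad
      \eta(\phi) = M_{\mathrm{Pl}}^{2}\,\frac{V''(\phi)}{V(\phi)} \ll 1 .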

  14. Closed planar curves without inflections

    CERN Document Server

    Ohno, Shuntaro; Umehara, Masaaki

    2011-01-01

    We define a computable topological invariant $\mu(\gamma)$ for generic closed planar regular curves $\gamma$, which gives an effective lower bound for the number of inflection points on a given generic closed planar curve. Using it, we classify the topological types of locally convex curves (i.e. closed planar regular curves without inflections) whose numbers of crossings are less than or equal to five. Moreover, we discuss the relationship between the number of double tangents and the invariant $\mu(\gamma)$ on a given $\gamma$.

  15. Monitoring Progress in Vocal Development in Young Cochlear Implant Recipients: Relationships between Speech Samples and Scores from the Conditioned Assessment of Speech Production (CASP)

    Science.gov (United States)

    Ertmer, David J.; Jung, Jongmin

    2012-01-01

    Purpose: To determine the concurrent validity of the Conditioned Assessment of Speech Production (CASP; Ertmer & Stoel-Gammon, 2008) and data obtained from speech samples recorded at the same intervals. Method: Nineteen children who are deaf who received cochlear implants before their 3rd birthdays participated in the study. Speech samples and…

  16. Processos educativos em saúde vocal do professor: análise da literatura da Fonoaudiologia brasileira Educative processes in the vocal health of teachers: a literature review of Brazilian studies in Speech-Language Pathology and Audiology

    Directory of Open Access Journals (Sweden)

    Regina Zanella Penteado

    2011-06-01

    This literature review examines educative processes in the vocal health of teachers discussed in Speech-Language Pathology and Audiology literature produced in Brazil. Our corpus consisted of 63 studies on collective intervention published between 1994 and 2008. The analysis emphasizes the distribution of publications over time; the characterization of the type of educative process (unilateral or dialogic, democratic, participatory and problem-based); the themes/content addressed; the form of development (precise or procedural); and the organization of actions (individually centered or expanded towards working issues). It was observed that 74% of the actions were developed in processes such as courses, workshops or voice training. The average length of each meeting ranged from 20 minutes to four hours. Seventy-nine percent of the educative strategies employed were characterized as unilateral and inconsistent with proposals based on healthcare promotion. The most common themes and topics were: vocal habits/behaviors and vocal hygiene/health (71%); warming up and cooling down, vocal exercises and techniques (50%); anatomy and physiology of vocal production and the oral sensorimotor system (44%); vocal parameters (23%); work environment (22%); and use of voice, communication and expression (20%). The focus of the educative process is the individual (100%), and it is generally conducted without considering work conditions, health and quality of life. Work environment aspects were contemplated in only 17% of the publications, teachers' work organization in 6%, and the school community in 1%. The review identifies a need to organize and revise the forms of development, dynamics, strategies, themes and contents, and the type and focus of the educative processes in public healthcare actions aimed at teachers' vocal health, from the perspective of health promotion.

  17. Comparing different models of the development of verb inflection in early child Spanish.

    Directory of Open Access Journals (Sweden)

    Javier Aguado-Orea

    Full Text Available How children acquire knowledge of verb inflection is a long-standing question in language acquisition research. In the present study, we test the predictions of some current constructivist and generativist accounts of the development of verb inflection by focusing on data from two Spanish-speaking children between the ages of 2;0 and 2;6. The constructivist claim that children's early knowledge of verb inflection is only partially productive is tested by comparing the average number of different inflections per verb in matched samples of child and adult speech. The generativist claim that children's early use of verb inflection is essentially error-free is tested by investigating the rate at which the children made subject-verb agreement errors in different parts of the present tense paradigm. Our results show: 1 that, although even adults' use of verb inflection in Spanish tends to look somewhat lexically restricted, both children's use of verb inflection was significantly less flexible than that of their caregivers, and 2 that, although the rate at which the two children produced subject-verb agreement errors in their speech was very low, this overall error rate hid a consistent pattern of error in which error rates were substantially higher in low frequency than in high frequency contexts, and substantially higher for low frequency than for high frequency verbs. These results undermine the claim that children's use of verb inflection is fully productive from the earliest observable stages, and are consistent with the constructivist claim that knowledge of verb inflection develops only gradually.

  18. Comparing Different Models of the Development of Verb Inflection in Early Child Spanish

    Science.gov (United States)

    Aguado-Orea, Javier; Pine, Julian M.

    2015-01-01

    How children acquire knowledge of verb inflection is a long-standing question in language acquisition research. In the present study, we test the predictions of some current constructivist and generativist accounts of the development of verb inflection by focusing on data from two Spanish-speaking children between the ages of 2;0 and 2;6. The constructivist claim that children’s early knowledge of verb inflection is only partially productive is tested by comparing the average number of different inflections per verb in matched samples of child and adult speech. The generativist claim that children’s early use of verb inflection is essentially error-free is tested by investigating the rate at which the children made subject-verb agreement errors in different parts of the present tense paradigm. Our results show: 1) that, although even adults’ use of verb inflection in Spanish tends to look somewhat lexically restricted, both children’s use of verb inflection was significantly less flexible than that of their caregivers, and 2) that, although the rate at which the two children produced subject-verb agreement errors in their speech was very low, this overall error rate hid a consistent pattern of error in which error rates were substantially higher in low frequency than in high frequency contexts, and substantially higher for low frequency than for high frequency verbs. These results undermine the claim that children’s use of verb inflection is fully productive from the earliest observable stages, and are consistent with the constructivist claim that knowledge of verb inflection develops only gradually. PMID:25760035

  19. Inflection-point Higgs Inflation

    CERN Document Server

    Okada, Nobuchika

    2016-01-01

    Inflection-point inflation is an interesting possibility to realize a successful slow-roll inflation when inflation is driven by a single scalar field with its initial value below the Planck mass ($\phi_I \lesssim M_{Pl}$). In order for a renormalization group (RG) improved effective $\lambda \phi^4$ potential to develop an inflection point, the quartic coupling $\lambda(\phi)$ must exhibit a minimum with an almost vanishing value in its RG evolution, namely $\lambda(\phi_I) \simeq 0$ and $\beta_{\lambda}(\phi_I) \simeq 0$, where $\beta_{\lambda}$ is the beta-function of the quartic coupling. As an example, we consider the minimal gauged $B-L$ extended Standard Model at the TeV scale, where we identify the $B-L$ Higgs field as the inflaton field. For a successful inflection-point inflation, which is consistent with the current cosmological observations, the mass ratios among the $Z^{\prime}$ gauge boson, the right-handed neutrinos and the $B-L$ Higgs boson are fixed. Our scenario can be tested in future collider experiments such as the high-luminosity LHC and the SHiP experiments.
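
    Stated compactly, the inflection-point requirement quoted in this abstract can be written as follows (standard notation for the RG-improved quartic potential; the explicit form is the usual one and is not copied from the paper):

      V_{\mathrm{eff}}(\phi) \;\simeq\; \frac{\lambda(\phi)}{4}\,\phi^{4},
      \qquad
      \lambda(\phi_I) \simeq 0, \quad \beta_{\lambda}(\phi_I) \simeq 0 .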

  20. Speech Analysis and Synthesis and Man-Machine Speech Communications for Air Operations. (Synthese et Analyse de la Parole et Liaisons Vocales Homme- Machine dans les Operations Aeriennes)

    Science.gov (United States)

    1990-05-01

    …noticeable: it produces either nonsense or a different word; for instance, bâton (stick) would turn into bateau (boat) in standard French if the… information-content words, while real speech contains more short, common words. Many of these long words act as qualifiers piled up before 'solutions' to… A similar piling up of subsidiary information that seems to be unacceptable in speech but common in writing occurs when a main clause is preceded by…

  1. Inflectional marking in Hungarian aphasics.

    Science.gov (United States)

    MacWhinney, B; Osmán-Sági, J

    1991-08-01

    How do aphasics deal with the rich inflectional marking available in agglutinative languages like Hungarian? For the Hungarian noun alone, aphasics have to deal with over 15 basic case markings and dozens of possible combinations of these basic markings. Using the picture description task of MacWhinney and Bates (1978), this study examined the use of inflectional markings in nine Broca's and five Wernicke's aphasic speakers of Hungarian. The analysis focused on subject, direct object, indirect object, and locative nominal arguments. Compared to normals, both groups had a much higher rate of omission of all argument types. Subject ellipsis was particularly strong, as it is in normal Hungarian. There was a tendency for Broca's to omit the indirect object and for Wernicke's to omit the direct object. Across argument types, Wernicke's had a much higher level of pronoun usage than did Broca's. Broca's also showed a very high level of article omission. Compared to similar data reported by Slobin (this issue) for Turkish, the Hungarian aphasics showed an elevated level of omission of case markings. Addition errors were quite rare, but there were 14 substitutions of one case marking for another. These errors all involved the substitution of some close semantic competitor. There were no errors in the basic rules for vowel harmony or morpheme order. Overall the results paint a picture of a group of individuals whose grammatical abilities are damaged and noisy, but still largely functional. Neither the view of Broca's as agrammatic nor the view of Wernicke's as paragrammatic was strongly supported.

  2. Inflection Classes, Gender, and the Principle of Contrast.

    Science.gov (United States)

    Carstairs-McCarthy, Andrew

    1994-01-01

    Clark's 1987 Principle of Contrast seems inconsistent with the synonymy exhibited by inflectional affixes in languages with inflection classes. But if an inflectional affix identifies the inflection class of the lexemes to which it attaches, then inflectional affixation complies with this principle. Grammatical implications are suggested. (76…

  3. Two-Year-Olds' Sensitivity to Inflectional Plural Morphology: Allomorphic Effects

    Science.gov (United States)

    Davies, Benjamin; Xu Rattanasone, Nan; Demuth, Katherine

    2017-01-01

    Many English-speaking children use plural nominal forms in spontaneous speech before the age of two, and display some understanding of plural inflection in production tasks. However, results from an intermodal preferential study suggested a lack of "comprehension" of nominal plural morphology at 24 months of age (Kouider, Halberda, Wood,…

  4. Vocal Loading in Speaking a Foreign Language.

    Science.gov (United States)

    Järvinen, Kati; Laukkanen, Anne-Maria

    2015-01-01

    This study investigated whether speaking a foreign language affects the subjective notions of vocal fatigue, and whether acoustic measurements reveal a higher vocal loading. The speech samples of 20 native Finnish-speaking and 23 native English-speaking subjects were recorded in Finnish and in English. From the speech samples, fundamental frequency, equivalent sound level, total duration of voiced speech, speech rate, alpha ratio and L1-L0 level difference were analyzed. Vocal doses were calculated. According to subjective notions, the voice gets tired more quickly when speaking a foreign language. The mean fundamental frequency increased but the speech rate and total duration of voiced speech decreased significantly when speaking a foreign language. Thus, the vocal doses decreased. The subjective sensations of increased vocal fatigue may be due to increased mental stress rather than to higher vocal loading. However, a trend that speaking a foreign language may involve more loading was found in L1-L0 level difference and in the doses normalized to time dose. Longer speech samples should be studied. Voice quality-based indicators of vocal loading are worth testing in addition to the measures based on the amount of voicing in speech. © 2015 S. Karger AG, Basel.
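
    For reference, the sketch below computes one common definition of the alpha ratio (the level difference, in dB, between the 1-5 kHz and 50 Hz-1 kHz bands); the band edges, the synthetic signal, and the function name are illustrative assumptions rather than the exact analysis settings of this study.

      # Hedged sketch of an alpha-ratio computation on a mono speech signal.
      import numpy as np

      def alpha_ratio(x, sr, split_hz=1000.0, upper_hz=5000.0):
          """Level difference (dB) between the 1-5 kHz and 0.05-1 kHz bands."""
          spectrum = np.abs(np.fft.rfft(x)) ** 2
          freqs = np.fft.rfftfreq(len(x), d=1.0 / sr)
          low = spectrum[(freqs >= 50.0) & (freqs < split_hz)].sum()
          high = spectrum[(freqs >= split_hz) & (freqs <= upper_hz)].sum()
          return 10.0 * np.log10(high / low)

      # Synthetic harmonic signal standing in for a recorded speech sample.
      sr = 16000
      t = np.arange(sr) / sr
      x = sum(np.sin(2 * np.pi * 200 * k * t) / k for k in range(1, 20))
      print(f"alpha ratio: {alpha_ratio(np.asarray(x), sr):.1f} dB")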

  5. Regular/Irregular is Not the Whole Story: The Role of Frequency and Generalization in the Acquisition of German Past Participle Inflection

    Science.gov (United States)

    Szagun, Gisela

    2011-01-01

    The acquisition of German participle inflection was investigated using spontaneous speech samples from six children between 1 ; 4 and 3 ; 8 and ten children between 1 ; 4 and 2 ; 10 recorded longitudinally at regular intervals. Child-directed speech was also analyzed. In adult and child speech weak participles were significantly more frequent than…

  6. Lexical Semantics and Irregular Inflection

    Science.gov (United States)

    Huang, Yi Ting; Pinker, Steven

    2010-01-01

    Whether a word has an irregular inflection does not depend on its sound alone: compare lie-lay (recline) and lie-lied (prevaricate). Theories of morphology, particularly connectionist and symbolic models, disagree on which nonphonological factors are responsible. We test four possibilities: (1) Lexical effects, in which two lemmas differ in whether they specify an irregular form; (2) Semantic effects, in which the semantic features of a word become associated with regular or irregular forms; (3) Morphological structure effects, in which a word with a headless structure (e.g., a verb derived from a noun) blocks access to a stored irregular form; (4) Compositionality effects, in which the stored combination of an irregular word’s meaning (e.g., the verb’s inherent aspect) with the meaning of the inflection (e.g., pastness) doesn’t readily transfer to new senses with different combinations of such meanings. In four experiments, speakers were presented with existing and novel verbs and asked to rate their past-tense forms, semantic similarities, grammatical structure, and aspectual similarities. We found (1) an interaction between semantic and phonological similarity, coinciding with reported strategies of analogizing to known verbs and implicating lexical effects; (2) weak and inconsistent effects of semantic similarity; (3) robust effects of morphological structure, and (4) robust effects of aspectual compositionality. Results are consistent with theories of language that invoke lexical entries and morphological structure, and which differentiate the mode of storage of regular and irregular verbs. They also suggest how psycholinguistic processes have shaped vocabulary structure over history. PMID:21151703

  7. A 3D biomechanical vocal tract model to study speech production control: How to take into account the gravity?

    CERN Document Server

    Buchaillard, Stéphanie; Perrier, Pascal; Payan, Yohan

    2007-01-01

    This paper presents a modeling study of the way speech motor control can deal with gravity to achieve steady-state tongue positions. It is based on simulations carried out with the 3D biomechanical tongue model developed at ICP, which is now controlled with the Lambda model (Equilibrium-Point Hypothesis). The influence of short-delay orosensory feedback on posture stability is assessed by testing different muscle force/muscle length relationships (Invariant Characteristics). Muscle activation patterns necessary to maintain the tongue in a schwa position are proposed, and the relations of head position, tongue shape and muscle activations are analyzed.

  8. Individual differences in audio-vocal speech imitation aptitude in late bilinguals: functional neuro-imaging and brain morphology.

    Science.gov (United States)

    Reiterer, Susanne Maria; Hu, Xiaochen; Erb, Michael; Rota, Giuseppina; Nardo, Davide; Grodd, Wolfgang; Winkler, Susanne; Ackermann, Hermann

    2011-01-01

    An unanswered question in adult language learning or late bi- and multilingualism is why individuals show marked differences in their ability to imitate foreign accents. While recent research acknowledges that more adults than previously assumed can still acquire a "native" foreign accent, very little is known about the neuro-cognitive correlates of this special ability. We investigated 140 German-speaking individuals displaying varying degrees of "mimicking" capacity, based on natural language text, sentence, and word imitations either in their second language English or in Hindi and Tamil, languages they had never been exposed to. The large subject pool was strictly controlled for previous language experience prior to magnetic resonance imaging. The late-onset (around 10 years) bilinguals showed significant individual differences as to how they employed their left-hemisphere speech areas: higher hemodynamic activation in a distinct fronto-parietal network accompanied low ability, while high ability paralleled enhanced gray matter volume in these areas concomitant with decreased hemodynamic responses. Finally and unexpectedly, males were found to be more talented foreign speech mimics.

  9. Vocal cord dysfunction in children.

    Science.gov (United States)

    Noyes, Blakeslee E; Kemp, James S

    2007-06-01

    Vocal cord dysfunction is characterised by paradoxical vocal cord adduction that occurs during inspiration, resulting in symptoms of dyspnoea, wheeze, chest or throat tightness and cough. Although the condition is well described in children and adults, confusion with asthma often triggers the use of an aggressive treatment regimen directed against asthma. The laryngoscopic demonstration of vocal cord adduction during inspiration has been considered the gold standard for the diagnosis of vocal cord dysfunction, but historical factors and pulmonary function findings may provide adequate clues to the correct diagnosis. Speech therapy, and in some cases psychological counselling, is often beneficial in this disorder. The natural course and prognosis of vocal cord dysfunction are still not well described in adults or children.

  10. The Development and Validation of the Vocalic Sensitivity Test.

    Science.gov (United States)

    Villaume, William A.; Brown, Mary Helen

    1999-01-01

    Notes that presbycusis, hearing loss associated with aging, may be marked by a second dimension of hearing loss, a loss in vocalic sensitivity. Reports on the development of the Vocalic Sensitivity Test, which controls for the verbal elements in speech while also allowing for the vocalics to exercise their normal metacommunicative function of…

  11. Modal locking between vocal fold and vocal tract oscillations

    CERN Document Server

    Aalto, Atte; Malinen, Jarmo; Vainio, Martti

    2012-01-01

    The human vocal folds are known to interact with the vocal tract acoustics during voiced speech production; namely, a nonlinear source-filter coupling has been observed both by using models and in in vivo phonation. These phenomena are approached from two directions in this article. We first present a computational dynamical model of the speech apparatus that contains an explicit filter-source feedback mechanism from the vocal tract acoustics back to the vocal fold oscillations. The model was used to simulate vocal pitch glides where the trajectory was forced to cross the lowest vocal tract resonance, i.e., the lowest formant F1. Similar patterns produced by human participants were then studied. Both the simulations and the experimental results reveal an effect when the glides cross the first formant (as may happen in [i]). Conversely, this effect is not observed if there is no formant within the glide range (as is the case in [ɑ]). The experiments show a smaller effect…

  12. Audio-vocal responses of vocal fundamental frequency and formant during sustained vowel vocalizations in different noises.

    Science.gov (United States)

    Lee, Shao-Hsuan; Hsiao, Tzu-Yu; Lee, Guo-She

    2015-06-01

    Sustained vocalizations of the vowels [a], [i], and the syllable [mə] were collected from twenty normal-hearing individuals. During the vocalizations, five conditions of different audio-vocal feedback were introduced separately to the speakers: no masking, wearing supra-aural headphones only, speech-noise masking, high-pass noise masking, and broad-band-noise masking. Power spectral analysis of the vocal fundamental frequency (F0) was used to evaluate the modulations of F0, and linear predictive coding was used to acquire the first two formants. The results showed that, while the formant frequencies were not significantly shifted, the low-frequency modulations of F0 differed with the altered auditory feedback, suggesting that, in speech production, motor control of F0 may depend on a feedback mechanism while articulation relies more on a feedforward mechanism. Power spectral analysis of F0 might be applied to evaluate audio-vocal control in various hearing and neurological disorders in the future. Copyright © 2015 Elsevier B.V. All rights reserved.
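
    A minimal sketch of the kind of power spectral analysis of an F0 contour described above, assuming the contour has already been extracted; the contour sampling rate, the 3 Hz band edge, and the synthetic wobble are placeholders, not the study's parameters.

      # Hedged sketch: quantify low-frequency F0 modulation from an F0 contour.
      import numpy as np
      from scipy.signal import welch

      fs_contour = 100.0                       # F0 contour sampled at 100 Hz (assumed)
      t = np.arange(0, 5, 1 / fs_contour)
      rng = np.random.default_rng(1)
      # Synthetic contour: 200 Hz mean, slow 2 Hz wobble, small random perturbation.
      f0 = 200 + 3 * np.sin(2 * np.pi * 2 * t) + rng.normal(0, 0.5, t.size)

      freqs, psd = welch(f0 - f0.mean(), fs=fs_contour, nperseg=256)
      df = freqs[1] - freqs[0]
      low_band = (freqs > 0) & (freqs <= 3.0)  # assumed "low-frequency" band
      band_power = psd[low_band].sum() * df
      print(f"F0 modulation power below 3 Hz: {band_power:.2f} Hz^2")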

  13. A Selective Deficit for Inflection Production

    Science.gov (United States)

    Miozzo, Michele; Fischer-Baum, Simon; Postman, Jeffrey

    2010-01-01

    We report the case of an English-speaking aphasic patient (JP) with left posterior-frontal damage affecting the inferior frontal and precentral gyri. In speaking, JP was impaired with the regular inflections of nouns and pseudonouns, making errors like "pears" instead of "pear" or "door" for "doors", while the spoken production of noun stems and…

  14. Inflection-point B -L Higgs inflation

    Science.gov (United States)

    Okada, Nobuchika; Raut, Digesh

    2017-02-01

    Inflection-point inflation is an interesting possibility to realize a successful slow-roll inflation when inflation is driven by a single scalar field with its initial value below the Planck mass (ϕI ≲ MPl). In order for a renormalization group (RG) improved effective λϕ⁴ potential to develop an inflection point, the quartic coupling λ(ϕ) must exhibit a minimum with an almost vanishing value in its RG evolution, namely λ(ϕI) ≃ 0 and βλ(ϕI) ≃ 0, where βλ is the beta function of the quartic coupling. As an example, we consider the minimal gauged B-L extended Standard Model at the TeV scale, where we identify the B-L Higgs field as the inflaton field. For a successful inflection-point inflation, which is consistent with the current cosmological observations, the mass ratios among the Z' gauge boson, the right-handed neutrinos and the B-L Higgs boson are fixed. Our scenario can be tested in future collider experiments such as the high-luminosity LHC and the SHiP experiments. In addition, the inflection-point inflation provides a unique prediction for the running of the spectral index α ≃ −2.7 × 10⁻³ (60/N)² (N is the e-folding number), which can be tested in the near future.
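
    Evaluating the quoted running of the spectral index at two commonly used e-folding numbers (plain arithmetic, not an additional result of the paper):

      \alpha \;\simeq\; -2.7\times 10^{-3}\left(\frac{60}{N}\right)^{2}
      \;\;\Rightarrow\;\;
      \alpha(N{=}60) \simeq -2.7\times 10^{-3},
      \qquad
      \alpha(N{=}50) \simeq -3.9\times 10^{-3}.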

  15. Dutch Plural Inflection: The Exception that Proves the Analogy

    Science.gov (United States)

    Keuleers, Emmanuel; Sandra, Dominiek; Daelemans, Walter; Gillis, Steven; Durieux, Gert; Martens, Evelyn

    2007-01-01

    We develop the view that inflection is driven partly by non-phonological analogy and that non-phonological information is of particular importance to the inflection of non-canonical roots, which in the view of [Marcus, G. F., Brinkmann, U., Clahsen, H., Wiese, R., & Pinker, S. (1995). "German inflection: the exception that proves the rule."…

  16. Multilevel Analysis in Analyzing Speech Data

    Science.gov (United States)

    Guddattu, Vasudeva; Krishna, Y.

    2011-01-01

    The speech produced by human vocal tract is a complex acoustic signal, with diverse applications in phonetics, speech synthesis, automatic speech recognition, speaker identification, communication aids, speech pathology, speech perception, machine translation, hearing research, rehabilitation and assessment of communication disorders and many…

  17. Development of auditory-vocal perceptual skills in songbirds.

    Directory of Open Access Journals (Sweden)

    Vanessa C Miller-Sims

    Full Text Available Songbirds are one of the few groups of animals that learn the sounds used for vocal communication during development. Like humans, songbirds memorize vocal sounds based on auditory experience with vocalizations of adult "tutors", and then use auditory feedback of self-produced vocalizations to gradually match their motor output to the memory of tutor sounds. In humans, investigations of early vocal learning have focused mainly on perceptual skills of infants, whereas studies of songbirds have focused on measures of vocal production. In order to fully exploit songbirds as a model for human speech, understand the neural basis of learned vocal behavior, and investigate links between vocal perception and production, studies of songbirds must examine both behavioral measures of perception and neural measures of discrimination during development. Here we used behavioral and electrophysiological assays of the ability of songbirds to distinguish vocal calls of varying frequencies at different stages of vocal learning. The results show that neural tuning in auditory cortex mirrors behavioral improvements in the ability to make perceptual distinctions of vocal calls as birds are engaged in vocal learning. Thus, separate measures of neural discrimination and behavioral perception yielded highly similar trends during the course of vocal development. The timing of this improvement in the ability to distinguish vocal sounds correlates with our previous work showing substantial refinement of axonal connectivity in cortico-basal ganglia pathways necessary for vocal learning.

  18. Development of auditory-vocal perceptual skills in songbirds.

    Science.gov (United States)

    Miller-Sims, Vanessa C; Bottjer, Sarah W

    2012-01-01

    Songbirds are one of the few groups of animals that learn the sounds used for vocal communication during development. Like humans, songbirds memorize vocal sounds based on auditory experience with vocalizations of adult "tutors", and then use auditory feedback of self-produced vocalizations to gradually match their motor output to the memory of tutor sounds. In humans, investigations of early vocal learning have focused mainly on perceptual skills of infants, whereas studies of songbirds have focused on measures of vocal production. In order to fully exploit songbirds as a model for human speech, understand the neural basis of learned vocal behavior, and investigate links between vocal perception and production, studies of songbirds must examine both behavioral measures of perception and neural measures of discrimination during development. Here we used behavioral and electrophysiological assays of the ability of songbirds to distinguish vocal calls of varying frequencies at different stages of vocal learning. The results show that neural tuning in auditory cortex mirrors behavioral improvements in the ability to make perceptual distinctions of vocal calls as birds are engaged in vocal learning. Thus, separate measures of neural discrimination and behavioral perception yielded highly similar trends during the course of vocal development. The timing of this improvement in the ability to distinguish vocal sounds correlates with our previous work showing substantial refinement of axonal connectivity in cortico-basal ganglia pathways necessary for vocal learning.

  19. Vocal-tract filtering by lingual articulation in a parrot.

    Science.gov (United States)

    Beckers, Gabriël J L; Nelson, Brian S; Suthers, Roderick A

    2004-09-07

    Human speech and bird vocalization are complex communicative behaviors with notable similarities in development and underlying mechanisms. However, there is an important difference between humans and birds in the way vocal complexity is generally produced. Human speech originates from independent modulatory actions of a sound source, e.g., the vibrating vocal folds, and an acoustic filter, formed by the resonances of the vocal tract (formants). Modulation in bird vocalization, in contrast, is thought to originate predominantly from the sound source, whereas the role of the resonance filter is only subsidiary in emphasizing the complex time-frequency patterns of the source. However, it has been suggested that, analogous to human speech production, tongue movements observed in parrot vocalizations modulate formant characteristics independently from the vocal source. As yet, direct evidence of such a causal relationship is lacking. In five Monk parakeets, Myiopsitta monachus, we replaced the vocal source, the syrinx, with a small speaker that generated a broad-band sound, and we measured the effects of tongue placement on the sound emitted from the beak. The results show that tongue movements cause significant frequency changes in two formants and cause amplitude changes in all four formants present between 0.5 and 10 kHz. We suggest that lingual articulation may thus in part explain the well-known ability of parrots to mimic human speech, and, even more intriguingly, may also underlie a speech-like formant system in natural parrot vocalizations.

  20. An Investigation of Vocal Tract Characteristics for Acoustic Discrimination of Pathological Voices

    Directory of Open Access Journals (Sweden)

    Jung-Won Lee

    2013-01-01

    Full Text Available This paper investigates the effectiveness of measures related to vocal tract characteristics in classifying normal and pathological speech. Unlike conventional approaches that mainly focus on features related to the vocal source, vocal tract characteristics are examined to determine if interaction effects between vocal folds and the vocal tract can be used to detect pathological speech. Especially, this paper examines features related to formant frequencies to see if vocal tract characteristics are affected by the nature of the vocal fold-related pathology. To test this hypothesis, stationary fragments of vowel /aa/ produced by 223 normal subjects, 472 vocal fold polyp subjects, and 195 unilateral vocal cord paralysis subjects are analyzed. Based on the acoustic-articulatory relationships, phonation for pathological subjects is found to be associated with measures correlated with a raised tongue body or an advanced tongue root. Vocal tract-related features are also found to be statistically significant from the Kruskal-Wallis test in distinguishing normal and pathological speech. Classification results demonstrate that combining the formant measurements with vocal fold-related features results in improved performance in differentiating vocal pathologies including vocal polyps and unilateral vocal cord paralysis, which suggests that measures related to vocal tract characteristics may provide additional information in diagnosing vocal disorders.
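
    As a schematic of the statistical test mentioned above (not the study's data or code), the sketch below runs a Kruskal-Wallis test on a placeholder formant measure across the three groups; the group sizes follow the abstract, but the values and the choice of F1 are invented for illustration.

      # Hedged sketch: Kruskal-Wallis test of a formant measure across groups.
      import numpy as np
      from scipy.stats import kruskal

      rng = np.random.default_rng(42)
      # Placeholder first-formant values for sustained /aa/, one per subject.
      f1_normal = rng.normal(700, 60, 223)
      f1_polyp = rng.normal(730, 70, 472)
      f1_paralysis = rng.normal(745, 80, 195)

      statistic, p_value = kruskal(f1_normal, f1_polyp, f1_paralysis)
      print(f"Kruskal-Wallis H = {statistic:.2f}, p = {p_value:.3g}")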

  1. Improvement of a Vocal Fold Imaging System

    Energy Technology Data Exchange (ETDEWEB)

    Krauter, K. G. [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)

    2017-02-01

    Medical professionals can better serve their patients through continual updates of their imaging tools. A wide range of pathologies and diseases may afflict human vocal cords or, as they’re also known, vocal folds. These diseases can affect human speech, hampering the ability of the patient to communicate. Vocal folds must be opened for breathing and then closed to produce speech. Current methodologies to image markers of potential pathologies are difficult to use and often fail to detect early signs of disease. These current methodologies rely on a strobe light and a slower frame rate camera in an attempt to obtain images as the vocal folds travel over the full extent of their motion.

  2. Inflectional morphology in primary progressive aphasia: an elicited production study.

    Science.gov (United States)

    Wilson, Stephen M; Brandt, Temre H; Henry, Maya L; Babiak, Miranda; Ogar, Jennifer M; Salli, Chelsey; Wilson, Lisa; Peralta, Karen; Miller, Bruce L; Gorno-Tempini, Maria Luisa

    2014-09-01

    Inflectional morphology lies at the intersection of phonology, syntax and the lexicon, three language domains that are differentially impacted in the three main variants of primary progressive aphasia (PPA). To characterize spared and impaired aspects of inflectional morphology in PPA, we elicited inflectional morphemes in 48 individuals with PPA and 13 healthy age-matched controls. We varied the factors of regularity, frequency, word class, and lexicality, and used voxel-based morphometry to identify brain regions where atrophy was predictive of deficits on particular conditions. All three PPA variants showed deficits in inflectional morphology, with the specific nature of the deficits dependent on the anatomical and linguistic features of each variant. Deficits in inflecting low-frequency irregular words were associated with semantic PPA, with lexical/semantic deficits, and with left temporal atrophy. Deficits in inflecting pseudowords were associated with non-fluent/agrammatic and logopenic variants, with phonological deficits, and with left frontal and parietal atrophy.

  3. Effects of voice training and voice hygiene education on acoustic and perceptual speech parameters and self-reported vocal well-being in female teachers.

    Science.gov (United States)

    Ilomaki, Irma; Laukkanen, Anne-Maria; Leppanen, Kirsti; Vilkman, Erkki

    2008-01-01

    Voice education programs may help in optimizing teachers' voice use. This study compared the effects of voice training (VT) and a voice hygiene lecture (VHL) in 60 randomly assigned female teachers. All 60 attended the lecture, and 30 completed a short training course in addition. Text reading was recorded in working environments and analyzed for fundamental frequency (F0), equivalent sound level (Leq), alpha ratio, jitter, shimmer, and perceptual quality. Self-reports of vocal well-being were registered. In the VHL group, increased F0 and increased difficulty of phonation were found; in the VT group, decreased perturbation, increased alpha ratio, easier phonation, and improved perceptual and self-reported voice quality were found. Both groups equally self-reported an increase of voice care knowledge. The results seem to indicate improved vocal well-being after training.
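
    For reference, the sketch below implements the standard "local" jitter and shimmer formulas (relative mean absolute difference of consecutive periods and of consecutive peak amplitudes); these are textbook definitions with made-up numbers, not the measurement pipeline used in the study.

      # Hedged sketch of local jitter and shimmer from cycle-level measurements.
      import numpy as np

      def local_jitter(periods):
          periods = np.asarray(periods, dtype=float)
          return np.mean(np.abs(np.diff(periods))) / np.mean(periods)

      def local_shimmer(amplitudes):
          amplitudes = np.asarray(amplitudes, dtype=float)
          return np.mean(np.abs(np.diff(amplitudes))) / np.mean(amplitudes)

      # Made-up cycle-to-cycle measurements (periods in seconds, amplitudes arbitrary).
      periods = [0.0050, 0.0051, 0.0049, 0.0050, 0.0052]
      amplitudes = [0.80, 0.78, 0.81, 0.79, 0.80]
      print(f"jitter:  {100 * local_jitter(periods):.2f} %")
      print(f"shimmer: {100 * local_shimmer(amplitudes):.2f} %")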

  4. Inflectional spelling deficits in developmental dyslexia.

    Science.gov (United States)

    Egan, Joanne; Tainturier, Marie-Josèphe

    2011-01-01

    The goal of this study was to examine past-tense spelling deficits in developmental dyslexia and their relationship to phonological abilities, spoken morphological awareness and word specific orthographic memory. Three groups of children (28 9-year-old dyslexic, 28 chronological age-matched and 28 reading/spelling age-matched children) completed a battery of tests including spelling regularly inflected words (e.g., kissed) and matched one-morpheme words (e.g., wrist). They were also assessed on a range of tests of reading and spelling abilities and associated linguistic measures. Dyslexic children were impaired in relation to chronological age-matched controls on all measures. Furthermore, they were significantly poorer than younger reading and spelling age-matched controls at spelling inflected verbs, supporting the existence of a specific deficit in past-tense spelling in dyslexia. In addition to under-using the -ed spelling on inflected verbs, the dyslexic children were less likely to erroneously apply this spelling to one-morpheme words than younger controls. Dyslexics were also poorer than younger controls at using a consistent spelling for stems presented in isolation versus as part of an inflected word, indicating that they make less use of the morphological relations between words to support their spelling. In line with this interpretation, regression analyses revealed another qualitative difference between the spelling and reading age-matched group and the dyslexic group: while both spoken morphological awareness and orthographic word specific memory were significant predictors of the accuracy of past-tense spelling in the former group, only orthographic memory (irregular word reading and spelling) was a significant factor in the dyslexic group. Finally, we identified a subgroup of seven dyslexic children who were severely deficient in past-tense spelling. This subgroup was also significantly worse than other dyslexics and than younger controls on scores

  5. A child with ictal vocalizations and generalized epilepsy.

    Science.gov (United States)

    Kurian, Mary; Heritier Barras, Anne-Chantal; Korff, Christian M

    2015-03-01

    Ictal vocalizations in the form of both articulate speech and non-speech vocalizations have been described in focal epilepsies, with seizures originating mainly from the frontal and temporal lobe, however, this phenomenon has not been described in generalized epilepsies. We report the case of an adolescent boy with juvenile-onset generalized epilepsy who presented with ictal "ovine vocalizations" (resembling the bleating of sheep). The ictal EEG revealed a clear correlate of vocalizations with time-locked generalized spikes and polyspike discharges. The 3T cerebral MRI ruled out any focal lesion. The boy is currently seizure-free under valproic acid, after twelve months of follow-up. We conclude that ictal non-speech vocalizations may be observed not only in focal or structural epilepsies, but also in generalized epilepsies; the exact underlying mechanism of this phenomenon needs to be further delineated. [Published with video sequence].

  6. Avaliação vocal de crianças disfônicas pré e pós intervenção fonoaudiológica em grupo: estudo de caso Evaluating the dynamic vocal dysphonic children in a pre and post intervention speech therapy group: a case study

    Directory of Open Access Journals (Sweden)

    Vanessa Veis Ribeiro

    2012-01-01

    Full Text Available This study aims to characterize the vocal dynamics of dysphonic children before and after group speech therapy, by means of auditory-perceptual and acoustic evaluation of the voice. Six children participated, two boys and four girls, aged between seven and ten years, with a diagnosis of functional or organofunctional dysphonia. The children underwent case-history taking and auditory-perceptual and acoustic voice analysis before and after a weekly group therapy process, totalling twelve sessions of forty minutes each. The therapeutic strategies included dramatization activities, drawing, games, the creation of posters, and vocal exercises (carried out collectively and playfully). The sessions sought the exchange of experiences among group members, the joint construction of knowledge about voice production and vocal health, and direct intervention through techniques and exercises. The data were analysed at a significance level of 0.10. For the roughness and overall dysphonia grade parameters of the auditory-perceptual evaluation, there was a difference between the assessments carried out before and after the group therapy process (p=0.024 and p=0.074, respectively). Regarding the acoustic analysis of the voice pre- and post-therapy, there was no difference in fundamental frequency or mean vocal intensity (p=0.288 and p=0.906, respectively). For the noise, jitter and shimmer measures, there was a difference between the initial and final assessments (p=0.079 and p=0.046, respectively). Group speech therapy promotes changes in the vocal dynamics of dysphonic children with respect to auditory-perceptual and acoustic parameters.

  8. Vocal tract articulation revisited: the case of the monk parakeet.

    Science.gov (United States)

    Ohms, Verena R; Beckers, Gabriël J L; ten Cate, Carel; Suthers, Roderick A

    2012-01-01

    Birdsong and human speech share many features with respect to vocal learning and development. However, the vocal production mechanisms have long been considered to be distinct. The vocal organ of songbirds is more complex than the human larynx, leading to the hypothesis that vocal variation in birdsong originates mainly at the sound source, while in humans it is primarily due to vocal tract filtering. However, several recent studies have indicated the importance of vocal tract articulators such as the beak and oropharyngeal-esophageal cavity. In contrast to most other bird groups, parrots have a prominent tongue, raising the possibility that tongue movements may also be of significant importance in vocal production in parrots, but evidence is rare and observations often anecdotal. In the current study we used X-ray cinematographic imaging of naturally vocalizing monk parakeets (Myiopsitta monachus) to assess which articulators are possibly involved in vocal tract filtering in this species. We observed prominent tongue height changes, beak opening movements and tracheal length changes, which suggests that all of these components play an important role in modulating vocal tract resonance. Moreover, the observation of tracheal shortening as a vocal articulator in live birds has to our knowledge not been described before. We also found strong positive correlations between beak opening and amplitude as well as changes in tongue height and amplitude in several types of vocalization. Our results suggest considerable differences between parrot and songbird vocal production while at the same time the parrot's vocal articulation might more closely resemble human speech production in the sense that both make extensive use of the tongue as a vocal articulator.

  9. Alteração de mobilidade de prega vocal unilateral: avaliação subjetiva e objetiva da voz nos momentos pré e pós-fonoterapia Unilateral vocal fold mobility alteration: objective and subjective evaluation of voice quality on prior and post speech therapy

    Directory of Open Access Journals (Sweden)

    Ana Cristina Cortes Gama

    2011-08-01

    Full Text Available PURPOSE: to evaluate subjectively and objectively the voices of patients with unilateral vocal fold paralysis before and after treatment. METHODS: this is a retrospective chart-review study that analysed the voice recordings of 12 individuals with an otorhinolaryngological diagnosis of unilateral vocal fold paralysis. The voice material collected was the sustained emission of the vowel /a/, followed by connected speech. Pre- and post-therapy voices were analysed using the GRBASI scale, spectrographic analysis, and measurement of maximum phonation time (MPT). The spectrographic parameters were: shape of the tracing, degree of darkening of the harmonics, continuity of the tracing, presence of noise, presence of sub-harmonics, and well-defined harmonics. The MPT for the vowel /a/ was the longest of three emissions. The data were submitted to descriptive analysis of central tendency and dispersion, and to the Wilcoxon test. RESULTS: in the auditory-perceptual analysis, the parameter that changed most after treatment was breathiness (B) (p=0.003), followed by overall grade of dysphonia (G) (p=0.004) and asthenia (A) (p=0.01); these results were statistically significant. With respect to the spectrogram, the tracing improved in 91% of patients, and the parameters that changed most were an increase in the number of harmonics (32%) and a reduction in noise (24%). The MPT for the vowel /a/ was significantly longer after speech therapy (p=0.003). CONCLUSION: patients with vocal fold paralysis who underwent speech therapy showed improvement in auditory-perceptual and spectrographic measures and in MPT.

  10. Verb Inflections in Agrammatic Aphasia: Encoding of Tense Features

    Science.gov (United States)

    Faroqi-Shah, Yasmeen; Thompson, Cynthia K.

    2007-01-01

    Across most languages, verbs produced by agrammatic aphasic individuals are frequently marked by syntactically and semantically inappropriate inflectional affixes, such as "Last night, I walking home." As per language production models, verb inflection errors in English agrammatism could arise from three potential sources: encoding the verbs'…

  11. Reinforcement of vocalizations through contingent vocal imitation.

    Science.gov (United States)

    Pelaez, Martha; Virues-Ortega, Javier; Gewirtz, Jacob L

    2011-01-01

    Maternal vocal imitation of infant vocalizations is highly prevalent during face-to-face interactions of infants and their caregivers. Although maternal vocal imitation has been associated with later verbal development, its potentially reinforcing effect on infant vocalizations has not been explored experimentally. This study examined the reinforcing effect of maternal vocal imitation of infant vocalizations using a reversal probe BAB design. Eleven 3- to 8-month-old infants at high risk for developmental delays experienced contingent maternal vocal imitation during reinforcement conditions. Differential reinforcement of other behavior served as the control condition. The behavior of 10 infants showed evidence of a reinforcement effect. Results indicated that vocal imitations can serve to reinforce early infant vocalizations.

  12. Viscous flow features in scaled-up physical models of normal and pathological vocal phonation

    Energy Technology Data Exchange (ETDEWEB)

    Erath, Byron D., E-mail: berath@purdue.edu [School of Mechanical Engineering, Purdue University, 585 Purdue Mall, West Lafayette, IN 47907 (United States); Plesniak, Michael W., E-mail: plesniak@gwu.edu [Department of Mechanical and Aerospace Engineering, George Washington University, 801 22nd Street NW, Suite 739, Washington, DC 20052 (United States)

    2010-06-15

    Unilateral vocal fold paralysis results when the recurrent laryngeal nerve, which innervates the muscles of the vocal folds, becomes damaged. The loss of muscle and tension control to the damaged vocal fold renders it ineffectual. The mucosal wave disappears during phonation, and the vocal fold becomes largely immobile. The influence of unilateral vocal fold paralysis on viscous flow development within the glottis during phonation, which impacts speech quality, was investigated. Driven, scaled-up vocal fold models were employed to replicate both normal and pathological patterns of vocal fold motion. Spatial and temporal velocity fields were captured using particle image velocimetry and laser Doppler velocimetry. Flow parameters were scaled to match the physiological values associated with human speech. Loss of motion in one vocal fold resulted in a suppression of typical glottal flow fields, including decreased spatial variability in the location of the flow separation point throughout the phonatory cycle, as well as a decrease in the vorticity magnitude.

  13. A retrospective study of long-term treatment outcomes for reduced vocal intensity in hypokinetic dysarthria

    OpenAIRE

    Watts, Christopher R

    2016-01-01

    Background: Reduced vocal intensity is a core impairment of hypokinetic dysarthria in Parkinson's disease (PD). Speech treatments have been developed to rehabilitate the vocal subsystems underlying this impairment. Intensive treatment programs requiring high-intensity voice and speech exercises with clinician-guided prompting and feedback have been established as effective for improving vocal function. Less is known, however, regarding long-term outcomes of clinical benefit in speakers with PD...

  14. The effects of physiological adjustments on the perceptual and acoustical characteristics of simulated laryngeal vocal tremor

    OpenAIRE

    Rosemary A Lester; Story, Brad H.

    2015-01-01

    The purpose of this study was to determine if adjustments to the voice source [i.e., fundamental frequency (F0), degree of vocal fold adduction] or vocal tract filter (i.e., vocal tract shape for vowels) reduce the perception of simulated laryngeal vocal tremor and to determine if listener perception could be explained by characteristics of the acoustical modulations. This research was carried out using a computational model of speech production that allowed for precise control and manipulati...

  15. Gestures, vocalizations and memory in language origins.

    Directory of Open Access Journals (Sweden)

    Francisco eAboitiz

    2012-02-01

    Full Text Available This article discusses the possible homologies between the human language networks and comparable auditory projection systems in the macaque brain, in an attempt to reconcile two existing views on language evolution: one that places emphasis on hand control and gestures, and the other that places emphasis on auditory-vocal mechanisms. The capacity for language is based on relatively well defined neural substrates whose rudiments have been traced into the non-human primate brain. At its core, this circuit makes up an auditory-vocal sensorimotor circuit with two main components, a ventral pathway connecting anterior auditory regions with anterior ventrolateral prefrontal areas, and a dorsal pathway connecting auditory areas with parietal areas and with posterior ventrolateral prefrontal areas via the arcuate fasciculus and the superior longitudinal fasciculus. In humans, the dorsal circuit is especially important for phonological processing and phonological working memory, capacities that are critical for language acquisition and for complex syntax processing. In the macaque, the homologue to the dorsal circuit overlaps with an inferior parietal-ventrolateral prefrontal network for hand and gestural action selection that is under voluntary control, while vocalizations are largely fixed and involuntary. The recruitment of this dorsal component for vocalization behavior in the human lineage, together with a direct cortical control of the subcortical vocalizing system, is proposed to have marked a fundamental innovation in human evolution, generating an inflection point that permitted the explosion of language and human communication. In this context, vocal communication and gesturing have a common history in primate communication.

  16. Dynamical origin of spectrally rich vocalizations in birdsong

    Science.gov (United States)

    Sitt, J. D.; Amador, A.; Goller, F.; Mindlin, G. B.

    2008-07-01

    Birdsong is a model system for learned vocal behavior with remarkable parallels to human vocal development and sound production mechanisms. Upper vocal tract filtering plays an important role in human speech, and its importance has recently also been recognized in birdsong. However, the mechanisms of how the avian sound source might contribute to spectral richness are largely unknown. Here we show in the most widely studied songbird, the zebra finch (Taeniopygia guttata), that the broad range of upper harmonic content in different low-frequency song elements is the fingerprint of the dynamics displayed by its vocal apparatus, which can be captured by a two-dimensional dynamical model. As in human speech and singing, the varying harmonic content of birdsong is not only the result of vocal tract filtering but of a varying degree of tonality emerging from the sound source. The spectral content carries a strong signature of the intrinsic dynamics of the sound source.

  17. A HMM-Based Method for Vocal Fold Pathology Diagnosis

    Directory of Open Access Journals (Sweden)

    Vahid Majidnezhad

    2012-11-01

    Full Text Available Acoustic analysis is a suitable method for vocal fold pathology diagnosis, as it can complement, and in some cases replace, other invasive methods based on direct observation of the vocal folds. There are different approaches to vocal fold pathology diagnosis. This paper presents a method based on hidden Markov models which classifies speech into two classes: normal and pathological. Two hidden Markov models are trained on these two classes of speech, and the trained models are then used to classify the dataset. The proposed method classifies the speech samples with an accuracy of 93.75%. The results of this algorithm provide insights that can help biologists and computer scientists design high-performance systems for the detection of vocal fold pathology.
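
    The abstract does not give implementation details; one common way to realize this kind of two-class classifier is to train one Gaussian HMM per class on MFCC features and pick the class with the higher log-likelihood. The sketch below assumes librosa for features and hmmlearn for the models, and the file lists are hypothetical.

      # Sketch of a two-class HMM voice-pathology classifier (assumed design choices,
      # not the implementation from the paper).
      import numpy as np
      import librosa
      from hmmlearn.hmm import GaussianHMM

      def mfcc_features(path):
          """Return an (n_frames, 13) MFCC matrix for one recording."""
          y, sr = librosa.load(path, sr=None)
          return librosa.feature.mfcc(y=y, sr=sr, n_mfcc=13).T

      def train_hmm(paths, n_states=5):
          """Train one Gaussian HMM on the concatenated features of a class."""
          feats = [mfcc_features(p) for p in paths]
          X = np.vstack(feats)
          lengths = [f.shape[0] for f in feats]
          model = GaussianHMM(n_components=n_states, covariance_type="diag", n_iter=50)
          model.fit(X, lengths)
          return model

      def classify(path, hmm_normal, hmm_pathological):
          """Assign the class whose model scores the recording higher."""
          X = mfcc_features(path)
          return "normal" if hmm_normal.score(X) > hmm_pathological.score(X) else "pathological"

      # normal_files and pathological_files are hypothetical lists of WAV paths:
      # hmm_n = train_hmm(normal_files); hmm_p = train_hmm(pathological_files)
      # print(classify("test.wav", hmm_n, hmm_p))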

  18. Voice analysis before and after vocal rehabilitation in patients following open surgery on vocal cords

    Directory of Open Access Journals (Sweden)

    Bunijevac Mila

    2016-01-01

    Full Text Available Background/Aim. The major role of the larynx in speech, respiration and swallowing makes carcinomas of this region and their treatment very influential for patients’ life quality. The aim of this study was to assess the importance of voice therapy in patients after open surgery on vocal cords. Methods. This study included 21 male patients and the control group of 19 subjects. The vowel (A) was recorded and analyzed for each examinee. All the patients were recorded twice: firstly, when they contacted the clinic and secondly, after three months of vocal therapy, which was held twice per week on an outpatient basis. The voice analysis was carried out in the Ear, Nose and Throat (ENT) Clinic, Clinical Hospital Center “Zvezdara” in Belgrade. Results. The values of the acoustic parameters in the patients submitted to open surgery on the vocal cords before vocal rehabilitation and the control group subjects were significantly different in all specified parameters. These results suggest that the voice of the patients was damaged before vocal rehabilitation. The results of the acoustic parameters of the vowel (A) before and after vocal rehabilitation of the patients with open surgery on vocal cords were statistically significantly different. Among the parameters - Jitter (%) and Shimmer (%) - the observed difference was highly statistically significant (p < 0.05). Conclusion. There was a significant improvement of the acoustic parameters of the vowel (A) in the study subjects three months following vocal therapy. Only one out of five representative parameters showed no significant improvement.

  19. Influence of Productivity on the Acquisition of Inflectional Markers

    DEFF Research Database (Denmark)

    Kjærbæk, Laila; Basbøll, Hans; Christensen, René dePont

    Studies on the acquisition of inflectional morphology often distinguish between ‘regular’ and ‘irregular’ inflection. This distinction originates from studies of English, which is characterized by having one default inflectional marker for a grammatical category (e.g. the PL suffix -s) and a minor number of exceptions to this default rule. We find this distinction rather inexpedient since this is not the case for all languages (e.g. Danish, German). In order to address this issue we have developed a scale with three degrees … Participants: 160 monolingual Danish-speaking children between 3 and 10 years. Results and conclusion: The study shows that PL acquisition is affected by morphophonological category: children produce more correct PL forms of nouns with a Fully Productive PL marker than of nouns with a Semi…

  20. Second Inflection Point of the Surface Tension of Water

    Science.gov (United States)

    Kalova, Jana; Mares, Radim

    2012-06-01

    The theme of a second inflection point of the temperature dependence of the surface tension of water remains a subject of controversy. Using data above 273 K, it is difficult to obtain proof of the existence of the second inflection point because of experimental uncertainties. Data for the surface tension of supercooled water and results of a molecular dynamics study were included in the exploration of the existence of an inflection point. A new term was added to the IAPWS equation to describe the surface tension in the supercooled water region. The new equation describes the surface tension of ordinary water between 228 K and 647 K and places the inflection point at a temperature of about 1.5 °C.

  1. Human mutant huntingtin disrupts vocal learning in transgenic songbirds.

    Science.gov (United States)

    Liu, Wan-Chun; Kohn, Jessica; Szwed, Sarah K; Pariser, Eben; Sepe, Sharon; Haripal, Bhagwattie; Oshimori, Naoki; Marsala, Martin; Miyanohara, Atsushi; Lee, Ramee

    2015-11-01

    Speech and vocal impairments characterize many neurological disorders. However, the neurogenetic mechanisms of these disorders are not well understood, and current animal models do not have the necessary circuitry to recapitulate vocal learning deficits. We developed germline transgenic songbirds, zebra finches (Taeniopygia guttata) expressing human mutant huntingtin (mHTT), a protein responsible for the progressive deterioration of motor and cognitive function in Huntington's disease (HD). Although generally healthy, the mutant songbirds had severe vocal disorders, including poor vocal imitation, stuttering, and progressive syntax and syllable degradation. Their song abnormalities were associated with HD-related neuropathology and dysfunction of the cortical-basal ganglia (CBG) song circuit. These transgenics are, to the best of our knowledge, the first experimentally created, functional mutant songbirds. Their progressive and quantifiable vocal disorder, combined with circuit dysfunction in the CBG song system, offers a model for genetic manipulation and the development of therapeutic strategies for CBG-related vocal and motor disorders.

  2. Self-Organization of Early Vocal Development in Infants and Machines: The Role of Intrinsic Motivation

    Directory of Open Access Journals (Sweden)

    Clément eMoulin-Frier

    2014-01-01

    Full Text Available We bridge the gap between two issues in infant development: vocal development and intrinsic motivation. We propose and experimentally test the hypothesis that general mechanisms of intrinsically motivated spontaneous exploration, also called curiosity-driven learning, can self-organize developmental stages during early vocal learning. We introduce a computational model of intrinsically motivated vocal exploration, which allows the learner to autonomously structure its own vocal experiments, and thus its own learning schedule, through a drive to maximize competence progress. This model relies on a physical model of the vocal tract, the auditory system and the agent's motor control as well as vocalizations of social peers. We present computational experiments that show how such a mechanism can explain the adaptive transition from vocal self-exploration with little influence from the speech environment, to a later stage where vocal exploration becomes influenced by vocalizations of peers. Within the initial self-exploration phase, we show that a sequence of vocal production stages self-organizes, and shares properties with data from infant developmental psychology: the vocal learner first discovers how to control phonation, then focuses on vocal variations of unarticulated sounds, and finally automatically discovers and focuses on babbling with articulated proto-syllables. As the vocal learner becomes more proficient at producing complex sounds, imitating vocalizations of peers starts to provide high learning progress explaining an automatic shift from self-exploration to vocal imitation.

  3. Speech research

    Science.gov (United States)

    1992-06-01

    Phonology is traditionally seen as the discipline that concerns itself with the building blocks of linguistic messages. It is the study of the structure of sound inventories of languages and of the participation of sounds in rules or processes. Phonetics, in contrast, concerns speech sounds as produced and perceived. Two extreme positions on the relationship between phonological messages and phonetic realizations are represented in the literature. One holds that the primary home for linguistic symbols, including phonological ones, is the human mind, itself housed in the human brain. The second holds that their primary home is the human vocal tract.

  4. Voice Modulation: A Window into the Origins of Human Vocal Control?

    Science.gov (United States)

    Pisanski, Katarzyna; Cartei, Valentina; McGettigan, Carolyn; Raine, Jordan; Reby, David

    2016-04-01

    An unresolved issue in comparative approaches to speech evolution is the apparent absence of an intermediate vocal communication system between human speech and the less flexible vocal repertoires of other primates. We argue that humans' ability to modulate nonverbal vocal features evolutionarily linked to expression of body size and sex (fundamental and formant frequencies) provides a largely overlooked window into the nature of this intermediate system. Recent behavioral and neural evidence indicates that humans' vocal control abilities, commonly assumed to subserve speech, extend to these nonverbal dimensions. This capacity appears in continuity with context-dependent frequency modulations recently identified in other mammals, including primates, and may represent a living relic of early vocal control abilities that led to articulated human speech.

  5. Phonology and vocal behavior in toddlers with autism spectrum disorders.

    Science.gov (United States)

    Schoen, Elizabeth; Paul, Rhea; Chawarska, Katarzyna

    2011-06-01

    The purpose of this study is to examine the phonological and other vocal productions of children, 18-36 months, with autism spectrum disorder (ASD) and to compare these productions to those of age-matched and language-matched controls. Speech samples were obtained from 30 toddlers with ASD, 11 age-matched toddlers and 23 language-matched toddlers during either parent-child or clinician-child play sessions. Samples were coded for a variety of speech-like and nonspeech vocalization productions. Toddlers with ASD produced speech-like vocalizations similar to those of language-matched peers, but produced significantly more atypical nonspeech vocalizations when compared to both control groups. Toddlers with ASD show speech-like sound production that is linked to their language level, in a manner similar to that seen in typical development. The main area of difference in vocal development in this population is in the production of atypical vocalizations. Findings suggest that toddlers with ASDs do not tune into the language model of their environment. Failure to attend to the ambient language environment negatively impacts the ability to acquire spoken language. Copyright © 2011, International Society for Autism Research, Wiley Periodicals, Inc.

  6. Vocal Patterns in Infants with Autism Spectrum Disorder: Canonical Babbling Status and Vocalization Frequency

    Science.gov (United States)

    Patten, Elena; Belardi, Katie; Baranek, Grace T.; Watson, Linda R.; Labban, Jeffrey D.; Oller, D. Kimbrough

    2014-01-01

    Canonical babbling is a critical milestone for speech development and is usually well in place by 10 months. The possibility that infants with autism spectrum disorder (ASD) show late onset of canonical babbling has so far eluded evaluation. Rate of vocalization or "volubility" has also been suggested as possibly aberrant in infants with…

  7. Irregular vocal fold dynamics incited by asymmetric fluid loading in a model of recurrent laryngeal nerve paralysis

    Science.gov (United States)

    Sommer, David; Erath, Byron D.; Zanartu, Matias; Peterson, Sean D.

    2011-11-01

    Voiced speech is produced by dynamic fluid-structure interactions in the larynx. Traditionally, reduced order models of speech have relied upon simplified inviscid flow solvers to prescribe the fluid loadings that drive vocal fold motion, neglecting viscous flow effects that occur naturally in voiced speech. Viscous phenomena, such as skewing of the intraglottal jet, have the most pronounced effect on voiced speech in cases of vocal fold paralysis where one vocal fold loses some, or all, muscular control. The impact of asymmetric intraglottal flow in pathological speech is captured in a reduced order two-mass model of speech by coupling a boundary-layer estimation of the asymmetric pressures with asymmetric tissue parameters that are representative of recurrent laryngeal nerve paralysis. Nonlinear analysis identifies the emergence of irregular and chaotic vocal fold dynamics at values representative of pathological speech conditions.
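
    As a schematic illustration only: reduced-order models of this kind represent each fold with lumped masses, springs, and dampers, and paralysis is mimicked by making the tissue parameters asymmetric. The toy sketch below integrates two independent damped oscillators with a placeholder sinusoidal forcing; it omits the aerodynamic coupling, collision forces, and boundary-layer pressure estimation that the study actually uses.

      # Toy illustration of tissue-parameter asymmetry in a lumped-element model.
      # NOT the authors' two-mass model: aerodynamic loading and collisions are omitted.
      import numpy as np
      from scipy.integrate import solve_ivp

      m, k, c = 0.1, 80.0, 0.05              # mass, stiffness, damping (illustrative values)
      asymmetry = 0.6                        # stiffness ratio of the "paralyzed" fold (hypothetical)
      drive = lambda t: 0.02 * np.sin(2 * np.pi * 120.0 * t)   # placeholder forcing at 120 Hz

      def rhs(t, y):
          xl, vl, xr, vr = y                 # left/right fold displacement and velocity
          al = (drive(t) - c * vl - k * xl) / m
          ar = (drive(t) - c * vr - asymmetry * k * xr) / m
          return [vl, al, vr, ar]

      sol = solve_ivp(rhs, (0.0, 0.1), [0.0, 0.0, 0.0, 0.0], max_step=1e-4)
      print("final displacements (left, right):", sol.y[0, -1], sol.y[2, -1])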

  8. Vocal tract articulation in zebra finches.

    Directory of Open Access Journals (Sweden)

    Verena R Ohms

    Full Text Available BACKGROUND: Birdsong and human vocal communication are both complex behaviours which show striking similarities mainly thought to be present in the area of development and learning. Recent studies, however, suggest that there are also parallels in vocal production mechanisms. While it has been long thought that vocal tract filtering, as it occurs in human speech, only plays a minor role in birdsong, there is an increasing number of studies indicating the presence of sound filtering mechanisms in bird vocalizations as well. METHODOLOGY/PRINCIPAL FINDINGS: Correlating high-speed X-ray cinematographic imaging of singing zebra finches (Taeniopygia guttata) to song structures, we identified beak gape and the expansion of the oropharyngeal-esophageal cavity (OEC) as potential articulators. We subsequently manipulated both structures in an experiment in which we played sound through the vocal tract of dead birds. Comparing acoustic input with acoustic output showed that OEC expansion causes an energy shift towards lower frequencies and an amplitude increase whereas a wide beak gape emphasizes frequencies around 5 kilohertz and above. CONCLUSION: These findings confirm that birds can modulate their song by using vocal tract filtering and demonstrate how OEC and beak gape contribute to this modulation.

  9. Vocal health fitness to different music styles

    Directory of Open Access Journals (Sweden)

    Maria Cláudia Mendes Caminha Muniz

    2010-09-01

    Full Text Available Objective: To present genres and styles currently present on the Western music scene, focusing on the practice of the singing voice. Methods: An observational and documentary study for which sound sources presenting musical genres and styles familiar to the researchers were selected and analyzed with respect to origins, formative elements, and vocal features. Alongside, we carried out a literature review grounded in database searches and a free review of websites and classic books in the area. Results: The selected styles (Rock and Roll, Heavy Metal, Thrash Metal, Grunge, Gothic Metal, Rap, Funk, Blues, R&B – Rhythm and Blues, Soul, Gospel, MPB, Samba, Forró, Sertanejo, Bossa Nova, Opera and Chamber Music) were described, pointing out why the speech therapist should be informed about them and about singing voice aspects. The therapist's guidance may minimize possible vocal damage caused by each style, since each of them carries its own patterns to which the interpreter must submit. Conclusions: We conclude that the singer will use a specific vocal pattern that resembles the musical style he intends to sing, regardless of any harm it may or may not cause to vocal health. When choosing a musical style, it is important that the singer has the knowledge and understanding of how the use of his vocal apparatus will cause or not cause injury to his voice. The singer should also be aware that singing technique is necessary for vocal longevity.

  10. The impact of voice on speech realization

    OpenAIRE

    Jelka Breznik

    2014-01-01

    The study discusses spoken literary language and the impact of voice on speech realization. The voice consists of a sound made by a human being using the vocal folds for talking, singing, laughing, crying, screaming… The human voice is specifically the part of human sound production in which the vocal folds (vocal cords) are the primary sound source. Our voice is our instrument and identity card. How does the voice (voice tone) affect others and how do they respond, positively or negatively? ...

  11. Influence of Familiarity on Identifying Prosodic Vocalizations Produced by Children with Severe Dysarthria

    Science.gov (United States)

    Patel, Rupal; Schroeder, Bethany

    2007-01-01

    Familiarity is thought to aid listeners in decoding disordered speech; however, as the speech signal degrades, the "familiarity advantage" becomes less beneficial. Despite highly unintelligible speech sound production, many children with dysarthria vocalize when interacting with familiar caregivers. Perhaps listeners can understand these…

  12. Exploring the anatomical encoding of voice with a mathematical model of the vocal system.

    Science.gov (United States)

    Assaneo, M Florencia; Sitt, Jacobo; Varoquaux, Gael; Sigman, Mariano; Cohen, Laurent; Trevisan, Marcos A

    2016-11-01

    The faculty of language depends on the interplay between the production and perception of speech sounds. A relevant open question is whether the dimensions that organize voice perception in the brain are acoustical or depend on properties of the vocal system that produced it. One of the main empirical difficulties in answering this question is to generate sounds that vary along a continuum according to the anatomical properties of the vocal apparatus that produced them. Here we use a mathematical model that offers the unique possibility of synthesizing vocal sounds by controlling a small set of anatomically based parameters. In a first stage, the quality of the synthetic voice was evaluated. Using specific time traces for sub-glottal pressure and tension of the vocal folds, the synthetic voices generated perceptual responses that were indistinguishable from those of real speech. The synthesizer was then used to investigate how the auditory cortex responds to the perception of voice depending on the anatomy of the vocal apparatus. Our fMRI results show that sounds are perceived as human vocalizations when produced by a vocal system that follows a simple relationship between the size of the vocal folds and the vocal tract. We found that these anatomical parameters encode the perceptual vocal identity (male, female, child) and show that the brain areas that respond to human speech also encode vocal identity. On the basis of these results, we propose that this low-dimensional model of the vocal system is capable of generating realistic voices and represents a novel tool to explore voice perception with precise control of the anatomical variables that generate speech. Furthermore, the model provides an explanation of how auditory cortices encode voices in terms of the anatomical parameters of the vocal system.
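
    As a rough illustration of the source-filter idea that underlies such synthesizers (not the article's anatomically based model), the sketch below excites an impulse train at a chosen fundamental frequency and shapes it with a cascade of second-order formant resonators; all parameter values are assumptions chosen for the example.

      # Minimal source-filter vowel sketch: pulse train + formant resonators.
      # Illustrative only; the formant values roughly suggest an /a/-like vowel.
      import numpy as np
      from scipy.signal import lfilter

      fs = 16000                                   # sample rate (Hz)
      f0 = 120                                     # fundamental frequency (Hz)
      dur = 0.5                                    # seconds
      formants = [(700, 110), (1200, 120), (2500, 160)]   # (center Hz, bandwidth Hz)

      n = int(fs * dur)
      source = np.zeros(n)                         # impulse train as a crude glottal source
      source[::int(fs / f0)] = 1.0

      signal = source
      for f, bw in formants:                       # cascade of two-pole resonators
          r = np.exp(-np.pi * bw / fs)
          theta = 2 * np.pi * f / fs
          a = [1.0, -2.0 * r * np.cos(theta), r * r]   # resonator poles
          b = [sum(a)]                             # rough gain normalization
          signal = lfilter(b, a, signal)

      signal /= np.max(np.abs(signal))             # normalize amplitude to +/- 1
      print("synthesized", signal.size, "samples at", fs, "Hz")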

  13. Techniques for Vocal Health.

    Science.gov (United States)

    Wiest, Lori

    1997-01-01

    Outlines a series of simple yet effective practices, techniques, and tips for improving the singing voice and minimizing stress on the vocal cords. Describes the four components for producing vocal sound: respiration, phonation, resonation, and articulation. Provides exercises for each and lists symptoms of sickness and vocal strain. (MJP)

  14. Efficient Encoding of Inflection Rules in NLP Systems

    Directory of Open Access Journals (Sweden)

    Péter BARABÁSS

    2012-12-01

    Full Text Available The grammatical parsing unit is a core module in natural language processing engines. This unit determines the grammatical roles of the incoming words and it converts the sentences into semantic models. A special grammar rule in agglutinative languages is the inflection rule. The traditional, automata-based parsers are usually not very effective in the parsing of inflection transformations. The paper presents implementation alternatives and compares them from the viewpoint of time efficiency and accuracy. The prototype system was tested with examples from Hungarian.
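
    As a toy illustration of what encoding inflection rules can mean in practice (not the paper's implementation), the sketch below stores suffix-based rules in a table and strips the longest matching suffix; the Hungarian-style endings are a simplified, hypothetical sample, and real systems also have to model vowel harmony and stem changes.

      # Toy longest-suffix matcher for inflection rules (illustrative only).
      SUFFIX_RULES = {
          "ban": "inessive", "ben": "inessive",           # "in"
          "nak": "dative",   "nek": "dative",             # "to/for"
          "val": "instrumental", "vel": "instrumental",   # "with"
          "t":   "accusative",
      }

      def analyze(word):
          """Return (stem, grammatical role) by stripping the longest matching suffix."""
          for suffix in sorted(SUFFIX_RULES, key=len, reverse=True):
              if word.endswith(suffix) and len(word) > len(suffix):
                  return word[:-len(suffix)], SUFFIX_RULES[suffix]
          return word, "nominative"                       # fall-through: treat as uninflected

      print(analyze("házban"))   # ('ház', 'inessive')
      print(analyze("házat"))    # naive result; real parsers must handle linking vowels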

  15. Neurocognitive mechanisms for processing inflectional and derivational complexity in English

    Directory of Open Access Journals (Sweden)

    Božić Mirjana

    2013-01-01

    Full Text Available In the current paper we discuss the mechanisms that underlie the processing of inflectional and derivational complexity in English. We address this issue from a neurocognitive perspective and present evidence from a new fMRI study that the two types of morphological complexity engage the language processing network in different ways. The processing of inflectional complexity selectively activates a left-lateralised frontotemporal system, specialised for combinatorial grammatical computations, while derivational complexity primarily engages a distributed bilateral system, argued to support whole-word, stem based lexical access. We discuss the implications of our findings for theories of the processing and representation of morphologically complex words.

  16. Modal locking between vocal fold and vocal tract oscillations: Simulations in time domain

    CERN Document Server

    Aalto, Atte; Malinen, Jarmo; Aalto, Daniel; Vainio, Martti

    2015-01-01

    It is well known that during voiced speech, the human vocal folds interact with the vocal tract acoustics. The resulting source-filter coupling has been observed using mathematical and physical models as well as in in vivo phonation. We propose a computational time-domain model of the full speech apparatus that, in particular, contains a feedback mechanism from the vocal tract acoustics to the vocal fold oscillations. It is based on numerical solution of ordinary and partial differential equations defined on vocal tract geometries that have been obtained by Magnetic Resonance Imaging. The model is used to simulate rising and falling pitch glides of [a, i] in the fundamental frequency (f_0) interval [180 Hz, 360 Hz]. The interval contains the first formant F1 of [i] as well as the subformants F1/4 and F1/3 of [a]. The simulations reveal a locking pattern of the f_0-trajectory at F1 of [i] in falling and rising glides. The subformants of [a] produce perturbations in the waveforms of glottal signals but no locki...

  17. Singers' Vocal Function Knowledge Levels, Sensorimotor Self-awareness of Vocal Tract, and Impact of Functional Voice Rehabilitation on the Vocal Function Knowledge and Self-awareness of Vocal Tract.

    Science.gov (United States)

    Sielska-Badurek, Ewelina; Osuch-Wójcikiewicz, Ewa; Sobol, Maria; Kazanecka, Ewa; Niemczyk, Kazimierz

    2017-01-01

    This study investigated vocal function knowledge and vocal tract sensorimotor self-awareness and the impact of functional voice rehabilitation on vocal function knowledge and self-awareness. This is a prospective, randomized study. Twenty singers (study group [SG]) completed a questionnaire before and after functional voice rehabilitation. Twenty additional singers, representing the control group, also completed the questionnaire without functional voice rehabilitation at a 3-month interval. The questionnaire consisted of three parts. The first part evaluated the singers' attitude to the anatomical and physiological knowledge of the vocal tract and their self-esteem of the knowledge level. The second part assessed the theoretical knowledge of the singers' vocal tract physiology. The third part of the questionnaire assessed singers' sensorimotor self-awareness of the vocal tract. The results showed that most singers indicated that knowledge of the vocal tract's anatomy and physiology is useful (59% SG, 67% control group). However, 75% of all participants defined their knowledge of the vocal tract's anatomy and physiology as weak or inadequate. In the SG, vocal function knowledge at the first assessment was 45%. After rehabilitation, the level increased to 67.7%. Vocal tract sensorimotor self-awareness initially was 38.9% in SG but rose to 66.7%. Findings of the study suggest that classical singers lack knowledge about the physiology of the vocal mechanism, especially the breathing patterns. In addition, they have low sensorimotor self-awareness of their vocal tract. The results suggest that singers would benefit from receiving services from phoniatrists and speech-language pathologists during their voice training. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  18. Functional flexibility in wild bonobo vocal behaviour

    Directory of Open Access Journals (Sweden)

    Zanna Clay

    2015-08-01

    Full Text Available A shared principle in the evolution of language and the development of speech is the emergence of functional flexibility, the capacity of vocal signals to express a range of emotional states independently of context and biological function. Functional flexibility has recently been demonstrated in the vocalisations of pre-linguistic human infants, which has been contrasted to the functionally fixed vocal behaviour of non-human primates. Here, we revisited the presumed chasm in functional flexibility between human and non-human primate vocal behaviour, with a study on our closest living primate relatives, the bonobo (Pan paniscus). We found that wild bonobos use a specific call type (the “peep”) across a range of contexts that cover the full valence range (positive-neutral-negative) in much of their daily activities, including feeding, travel, rest, aggression, alarm, nesting and grooming. Peeps were produced in functionally flexible ways in some contexts, but not others. Crucially, calls did not vary acoustically between neutral and positive contexts, suggesting that recipients take pragmatic information into account to make inferences about call meaning. In comparison, peeps during negative contexts were acoustically distinct. Our data suggest that the capacity for functional flexibility has evolutionary roots that predate the evolution of human speech. We interpret this evidence as an example of an evolutionary early transition away from fixed vocal signalling towards functional flexibility.

  19. Acoustic Vocal Tract Model of One-year-old Children

    Directory of Open Access Journals (Sweden)

    M. Vojnović

    2014-11-01

    Full Text Available The physical shape of the vocal tract and its formant (resonant) frequencies are directly related. The study of this functional connectivity is essential in speech therapy practice with children. Most of the perceived children’s speech anomalies can be explained on a physical level: malfunctioning movement of articulation organs. The current problem is that there is not enough data on the anatomical shape of children’s vocal tracts to create an acoustic model. Classical techniques for vocal tract shape imaging (X-ray, magnetic resonance, etc.) are not appropriate for children. One possibility is to start from the shape of the adult vocal tract and correct it based on anatomical, morphological and articulatory differences between children and adults. This paper presents a method for vocal tract shape estimation for a one-year-old child. The initial shapes of the vocal tract refer to the Russian vowels spoken by an adult male. All the relevant anatomical and articulation parameters that influence the formant frequencies are analyzed. Finally, the hypothetical configurations of the children’s vocal tract, for the five vowels, are presented.
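
    A standard back-of-the-envelope check for such models (an assumption added here, not taken from the article) treats the vocal tract as a uniform tube closed at the glottis and open at the lips, with resonances F_n = (2n - 1)c / 4L; shortening the tract from adult-like to infant-like lengths shifts every formant upward.

      # Uniform-tube (closed-open) formant estimate: F_n = (2n - 1) * c / (4 * L).
      # Tract lengths are rough textbook-style assumptions, not measurements from the study.
      c = 35000.0                         # approximate speed of sound in warm, moist air (cm/s)

      def tube_formants(length_cm, n_formants=3):
          return [(2 * n - 1) * c / (4.0 * length_cm) for n in range(1, n_formants + 1)]

      for label, length in [("adult male, ~17.5 cm", 17.5), ("one-year-old, ~8.5 cm", 8.5)]:
          print(label, [f"{f:.0f} Hz" for f in tube_formants(length)])
      # The shorter tract roughly doubles every resonance, which is why infant formants sit much higher.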

  20. Comportamento vocal de cantores populares Vocal behavior of popular singers

    Directory of Open Access Journals (Sweden)

    Valquíria Zimmer

    2012-04-01

    vocal behavior of popular singers, according to gender and professional and amateur categories. METHOD: interview with 47 singers, 25 men and 22 women. RESULTS: there were statistical significance differences in the following findings: MALE - microphone during rehearsal, absence of diagnosed voice problems, lack of assistance on vocal hygiene, pain or discomfort after singing, but no allergies or respiratory problems; FEMALES - singing lessons and awareness of posture; AMATEUR - no dancing while singing, no imitating voices, lack of otolaryngological evaluation (ENT, no diagnosed vocal problems, lack of speech-language therapy, absence of guidelines on vocal anatomy/physiology and without alcohol consumption during the rehearsals; PROFESSIONAL- hoarseness, knowledge about articulation, alcohol consumption during performance, excess throat clearing, pain after singing. CONCLUSIONS: the comparison between genders showed male singers were using microphone in rehearsals, did not have respiratory or allergic problems, nor voice problems were diagnosed, but they had pain sensation or discomfort after singing and did not have vocal hygiene, and female singers had singing lessons and followed posture guidelines. The comparison between amateurs and professionals showed that amateur singers did not dance while singing, did not imitate voices, did not consume alcohol during rehearsals, and did not have diagnosed voice problems, but they did not have ENT evaluation, nor did they engage in speech-language therapy, and had no awareness of vocal anatomy/physiology; and the professional singers complained of hoarseness, excess throat clearing and pain after singing, and they consumed alcohol during singing, despite having knowledge about articulation.

  1. Case inflection of construct state constructions in Dinka

    DEFF Research Database (Denmark)

    Andersen, Torben

    2016-01-01

    and a following modifier. It is demonstrated that the case inflection of such noun phrases is manifested almost exclusively by tonal overlays on the nominative (lexical) tones, and that such overlays may occur either in the head or in the modifier or in both the head and the modifier. In this way, a head noun may...

  2. Effects of age on the acquisition of agreement inflection

    NARCIS (Netherlands)

    Blom, E.; Polišenská, D.; Weerman, F.

    2007-01-01

    Grammaticality judgement tasks show that second language learners who started during childhood are significantly more accurate on judging inflection than learners who started after puberty [Johnson, J., & Newport, E. (1989). Cognitive Psychology, 21, 60-99; Johnson, J., & Newport, E. (1991). Cogniti

  3. L2 Processing of Plural Inflection in English

    Science.gov (United States)

    Song, Yoonsang

    2015-01-01

    This study investigates (1) whether late second language (L2) learners can attain native-like knowledge of English plural inflection even when their first language (L1) lacks an equivalent and (2) whether they construct hierarchically structured representations during online sentence processing like native speakers. In a self-paced reading task,…

  4. Rhetorical Scarcity: Spatial and Economic Inflections on Genre Change

    Science.gov (United States)

    Applegarth, Risa

    2012-01-01

    This study examines how changes in a key scientific genre supported anthropology's early twentieth-century bid for scientific status. Combining spatial theories of genre with inflections from the register of economics, I develop the concept of "rhetorical scarcity" to characterize this genre change not as evolution but as manipulation that…

  5. Assessing Linguistic Competence: Verbal Inflection in Child Tamil

    Science.gov (United States)

    Lakshmanan, Usha

    2006-01-01

    Within child language acquisition research, there has been a fair amount of controversy regarding children's knowledge of the grammatical properties associated with verbal inflection (e.g., tense, agreement, and aspect). Some researchers have proposed that the child's early grammar is fundamentally different from the adult grammar, whereas others…

  7. Quantitative microlaryngoscopic measurements of vocal fold polyps, glottal gap and their relation to vocal function.

    Science.gov (United States)

    Uloza, Virgilijus; Kaseta, Marius; Pribuisiene, Rūta; Saferis, Viktoras; Jokūzis, Vytautas; Gelzinis, Adas; Bacauskiene, Marija

    2008-01-01

    The purpose of this study was to quantify the size of vocal fold polyps and to investigate the relationship between the glottal gap and parameters of acoustic voice analysis and phonetography. Eighty-one microlaryngoscopic images and digital recordings of voices (acoustic analysis and phonetogram) acquired from the patients with vocal fold polyps (VFPs) were employed in this study. Vocal fold (VF) images were collected during routine direct microlaryngoscopy using a Moller-Wedel Universa 300 surgical microscope, a 3-CCD Elmo 768 x 576-pixel color video camera and a 300 W Xenon light source. Acoustic voice analysis and phonetography were established using Dr. Speech (Tiger Electronics Inc.) software. Microlaryngoscopic images were processed by original software created by ELINTA and displayed on a monitor. The relative lengths and widths of the vocal fold polyps, as well as the percentage area of the VFP, were calculated. Pearson's correlation was applied to reveal the correlation between VFP dimensions and acoustic voice parameters. There were no statistically significant differences between the dimensions of left and right vocal folds and VFPs. Statistically significant slight to mild correlations between the measured VFP dimensions and the acoustic and phonetogram parameters were revealed, with HNR and phonetogram area showing the strongest correlation to the size of VFPs. The results of our study confirm that quantitative microlaryngoscopic measurements of vocal fold polyp and glottal gap dimensions may be a useful tool for objective assessment of glottic incompetence and voice impairment.
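
    The reported analysis comes down to Pearson's r between each lesion-size measure and each acoustic or phonetogram parameter; a minimal sketch with placeholder arrays (not the study's 81 cases) is shown below.

      # Pearson correlation between a lesion-size measure and an acoustic measure.
      # The arrays are illustrative placeholders, not clinical data from the study.
      from scipy.stats import pearsonr

      polyp_area_pct = [1.2, 0.8, 2.5, 3.1, 1.9, 0.5, 2.2, 1.4]          # % of vocal fold area
      hnr_db = [18.0, 20.5, 12.3, 10.1, 14.8, 21.2, 13.0, 17.5]          # harmonics-to-noise ratio (dB)

      r, p = pearsonr(polyp_area_pct, hnr_db)
      print(f"Pearson r = {r:.2f}, p = {p:.3f}")   # in this toy data, larger polyps go with lower HNR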

  8. Reducing Vocalized Pauses in Public Speaking Situations Using the VP Card

    Science.gov (United States)

    Ramos Salazar, Leslie

    2014-01-01

    This article describes a speaking problem very common in today's world--"vocalized pauses" (VP). Vocalized pauses are defined as utterances such as "uh," "like," and "um" that occur between words in oral sentences. This practice of everyday speech can affect how a speaker's intentions are…

  9. The Causes for the loss of Inflection and Its Impact upon the English Language

    Institute of Scientific and Technical Information of China (English)

    余佳

    2015-01-01

    As a linguistic phenomenon, inflection bears strong marks of its times and has undergone enormous change, the overall trend of which is the loss of inflection. This thesis analyzes the causes of this trend in terms of internal and external factors, and explains the profound influence of the reduction of inflection on the English language at both the micro and the macro level.

  10. The Effects of Pitch Shifts on Delay-Induced Changes in Vocal Sequencing in a Songbird

    Science.gov (United States)

    Kelly, Conor W.

    2017-01-01

    Abstract Like human speech, vocal behavior in songbirds depends critically on auditory feedback. In both humans and songbirds, vocal skills are acquired by a process of imitation whereby current vocal production is compared to an acoustic target. Similarly, performance in adulthood relies strongly on auditory feedback, and online manipulations of auditory signals can dramatically alter acoustic production even after vocalizations have been well learned. Artificially delaying auditory feedback can disrupt both speech and birdsong, and internal delays in auditory feedback have been hypothesized as a cause of vocal dysfluency in persons who stutter. Furthermore, in both song and speech, online shifts of the pitch (fundamental frequency) of auditory feedback lead to compensatory changes in vocal pitch for small perturbations, but larger pitch shifts produce smaller changes in vocal output. Intriguingly, large pitch shifts can partially restore normal speech in some dysfluent speakers, suggesting that the effects of auditory feedback delays might be ameliorated by online pitch manipulations. Although birdsong provides a promising model system for understanding speech production, the interactions between sensory feedback delays and pitch shifts have not yet been assessed in songbirds. To investigate this, we asked whether the addition of a pitch shift modulates delay-induced changes in Bengalese finch song, hypothesizing that pitch shifts would reduce the effects of feedback delays. Compared with the effects of delays alone, combined delays and pitch shifts resulted in a significant reduction in behavioral changes in one type of sequencing (branch points) but not another (distribution of repeated syllables). PMID:28144622

  11. The Effects of Pitch Shifts on Delay-Induced Changes in Vocal Sequencing in a Songbird.

    Science.gov (United States)

    Wyatt, MacKenzie; Berthiaume, Emily A; Kelly, Conor W; Sober, Samuel J

    2017-01-01

    Like human speech, vocal behavior in songbirds depends critically on auditory feedback. In both humans and songbirds, vocal skills are acquired by a process of imitation whereby current vocal production is compared to an acoustic target. Similarly, performance in adulthood relies strongly on auditory feedback, and online manipulations of auditory signals can dramatically alter acoustic production even after vocalizations have been well learned. Artificially delaying auditory feedback can disrupt both speech and birdsong, and internal delays in auditory feedback have been hypothesized as a cause of vocal dysfluency in persons who stutter. Furthermore, in both song and speech, online shifts of the pitch (fundamental frequency) of auditory feedback lead to compensatory changes in vocal pitch for small perturbations, but larger pitch shifts produce smaller changes in vocal output. Intriguingly, large pitch shifts can partially restore normal speech in some dysfluent speakers, suggesting that the effects of auditory feedback delays might be ameliorated by online pitch manipulations. Although birdsong provides a promising model system for understanding speech production, the interactions between sensory feedback delays and pitch shifts have not yet been assessed in songbirds. To investigate this, we asked whether the addition of a pitch shift modulates delay-induced changes in Bengalese finch song, hypothesizing that pitch shifts would reduce the effects of feedback delays. Compared with the effects of delays alone, combined delays and pitch shifts resulted in a significant reduction in behavioral changes in one type of sequencing (branch points) but not another (distribution of repeated syllables).

  12. Inflectional Morphology in Cri du Chat Syndrome--A Case Study

    Science.gov (United States)

    Kristoffersen, Kristian Emil

    2012-01-01

    This study examined morphological skills in a girl with cri du chat syndrome, addressing three questions: (1) To what extent does the subject inflect words? (2) To what extent are words inflected correctly? (3) To what extent do the inflected words reflect productive morphological rules, and to what extent can they be considered to be…

  13. Literary Sinhala Inflected Forms: A Synopsis with a Transliteration Guide to Sinhala Script.

    Science.gov (United States)

    Gair, James W.; Karunatilaka, W. S.

    This summary gathers together for easy reference the inflected forms of Literary Sinhala together with a transliteration guide to the writing system. This work differs, therefore, from the authors' previous work, "Literary Sinhala" (1974), which presented the inflected forms in a pedagogical sequence. In this summary, the inflected forms…

  14. Bee Dances, Bird Songs, Monkey Calls, and Cetacean Sonar: Is Speech Unique?

    Science.gov (United States)

    Liska, Jo

    1993-01-01

    Examines to what extent, and in what ways, speech is unusual and how it compares to other semiotic systems. Discusses language and speech, neurolinguistic processing, comparative vocal/auditory abilities, primate evolution, and semiogenesis. (SR)

  15. To Inflect or Not to Inflect Is the Question Indeed: Infinitives in Second Language (L2) Portuguese

    Directory of Open Access Journals (Sweden)

    Jason Rothman

    2007-12-01

    Full Text Available In light of the predictions of two competing approaches to adult L2 acquisition – Full Access (FA) (e.g., White 1989, 2003; Schwartz & Sprouse 1996) and Failed Features (FF) (e.g., Hawkins & Chan 1997; Liceras & Díaz, 1999) – the present study examines the acquisition of inflected infinitives by English and Spanish/English bilingual adult learners of L2 Portuguese. Target-like acquisition of inflected infinitives requires the resetting of both a syntactic parameter (the Null Subject Parameter) and a morphological parameter (the Infl-parameter) for these learners. Since FF approaches maintain that there is a post-critical-period failure to acquire new L2 features lacking from the L1, target-like acquisition is predicted not to be possible. Conversely, FA approaches, which maintain that adult parameter resetting is possible, predict that native-like competence of inflected infinitives is attainable, but not inevitably so. The data we present support FA approaches, demonstrating that advanced adult learners achieve native-like interpretative knowledge of Portuguese inflected infinitives. We also consider the role of L1 transfer and its possible implications, as they differ for both groups.

  16. Mechanisms underlying the social enhancement of vocal learning in songbirds.

    Science.gov (United States)

    Chen, Yining; Matheson, Laura E; Sakata, Jon T

    2016-06-14

    Social processes profoundly influence speech and language acquisition. Despite the importance of social influences, little is known about how social interactions modulate vocal learning. Like humans, songbirds learn their vocalizations during development, and they provide an excellent opportunity to reveal mechanisms of social influences on vocal learning. Using yoked experimental designs, we demonstrate that social interactions with adult tutors for as little as 1 d significantly enhanced vocal learning. Social influences on attention to song seemed central to the social enhancement of learning because socially tutored birds were more attentive to the tutor's songs than passively tutored birds, and because variation in attentiveness and in the social modulation of attention significantly predicted variation in vocal learning. Attention to song was influenced by both the nature and amount of tutor song: Pupils paid more attention to songs that tutors directed at them and to tutors that produced fewer songs. Tutors altered their song structure when directing songs at pupils in a manner that resembled how humans alter their vocalizations when speaking to infants, that was distinct from how tutors changed their songs when singing to females, and that could influence attention and learning. Furthermore, social interactions that rapidly enhanced learning increased the activity of noradrenergic and dopaminergic midbrain neurons. These data highlight striking parallels between humans and songbirds in the social modulation of vocal learning and suggest that social influences on attention and midbrain circuitry could represent shared mechanisms underlying the social modulation of vocal learning.

  17. Vocal improvement after voice therapy in the treatment of benign vocal fold lesions.

    Science.gov (United States)

    Schindler, A; Mozzanica, F; Ginocchio, D; Maruzzi, P; Atac, M; Ottaviani, F

    2012-10-01

    Benign vocal fold lesions are common in the general population, and have important public health implications and impact on patient quality of life. Nowadays, phonomicrosurgery is the most common treatment of these lesions. Voice therapy is generally associated in order to minimize detrimental vocal behaviours that increase the stress at the mid-membranous vocal folds. Nonetheless, the most appropriate standard of care for treating benign vocal fold lesion has not been established. The aim of this study was to analyze voice changes in a group of dysphonic patients affected by benign vocal fold lesions, evaluated with a multidimensional protocol before and after voice therapy. Sixteen consecutive patients, 12 females and 4 males, with a mean age of 49.7 years were enrolled. Each subject had 10 voice therapy sessions with an experienced speech/language pathologist for a period of 1-2 months, and was evaluated before and at the end of voice therapy with a multidimensional protocol that included self-assessment measures and videostroboscopic, perceptual, aerodynamic and acoustic ratings. Videostroboscopic examination did not reveal resolution of the initial pathology in any case. No improvement was observed in aerodynamic and perceptual ratings. A clear and significant improvement was visible on Wilcoxon signed-rank test for the mean values of Jitt%, Noise to Harmonic Ratio (NHR) and Voice Handicap Index (VHI) scores. Even if it is possible that, for benign vocal fold lesions, only a minor improvement of voice quality can be achieved after voice therapy, rehabilitation treatment still seems useful as demonstrated by improvement in self-assessment measures. If voice therapy is provided as an initial treatment to the patients with benign vocal fold lesions, this may lead to an improvement in the perceived voice quality, making surgical intervention unnecessary. This is one of the first reports on the efficacy of voice therapy in the management of benign vocal fold

  18. Audio-vocal interaction in single neurons of the monkey ventrolateral prefrontal cortex.

    Science.gov (United States)

    Hage, Steffen R; Nieder, Andreas

    2015-05-06

    Complex audio-vocal integration systems depend on a strong interconnection between the auditory and the vocal motor system. To gain cognitive control over audio-vocal interaction during vocal motor control, the PFC needs to be involved. Neurons in the ventrolateral PFC (VLPFC) have been shown to separately encode the sensory perceptions and motor production of vocalizations. It is unknown, however, whether single neurons in the PFC reflect audio-vocal interactions. We therefore recorded single-unit activity in the VLPFC of rhesus monkeys (Macaca mulatta) while they produced vocalizations on command or passively listened to monkey calls. We found that 12% of randomly selected neurons in VLPFC modulated their discharge rate in response to acoustic stimulation with species-specific calls. Almost three-fourths of these auditory neurons showed an additional modulation of their discharge rates either before and/or during the monkeys' motor production of vocalization. Based on these audio-vocal interactions, the VLPFC might be well positioned to combine higher order auditory processing with cognitive control of the vocal motor output. Such audio-vocal integration processes in the VLPFC might constitute a precursor for the evolution of complex learned audio-vocal integration systems, ultimately giving rise to human speech.

  19. SPEECH DISORDERS ENCOUNTERED DURING SPEECH THERAPY AND THERAPY TECHNIQUES

    Directory of Open Access Journals (Sweden)

    İlhan ERDEM

    2013-06-01

    Full Text Available Speech is a physical and mental process in which agreed signs and sounds are used to convey a message from one mind to another. To identify the sounds of speech, it is essential to know the structure and function of the various organs that make conversation possible. Because speech is both a physical and a mental process, many factors can lead to speech disorders; these may be related to language acquisition as well as to medical and psychological causes. Speaking is the collective work of many organs, much like an orchestra, and since it is a very complex skill with a strong mental dimension, it must be determined which of these obstacles inhibits conversation. A speech disorder is a defect in speech flow, rhythm, pitch, stress, composition, or vocalization. In this study, speech disorders such as articulation disorders, stuttering, aphasia, dysarthria, local dialect speech, tongue and lip laziness, and rapid speech are examined as defects in language skills. The causes of these speech disorders were investigated, and suggestions for their remediation are discussed.

  20. On how the brain decodes vocal cues about speaker confidence.

    Science.gov (United States)

    Jiang, Xiaoming; Pell, Marc D

    2015-05-01

    In speech communication, listeners must accurately decode vocal cues that refer to the speaker's mental state, such as their confidence or 'feeling of knowing'. However, the time course and neural mechanisms associated with online inferences about speaker confidence are unclear. Here, we used event-related potentials (ERPs) to examine the temporal neural dynamics underlying a listener's ability to infer speaker confidence from vocal cues during speech processing. We recorded listeners' real-time brain responses while they evaluated statements wherein the speaker's tone of voice conveyed one of three levels of confidence (confident, close-to-confident, unconfident) or were spoken in a neutral manner. Neural responses time-locked to event onset show that the perceived level of speaker confidence could be differentiated at distinct time points during speech processing: unconfident expressions elicited a weaker P2 than all other expressions of confidence (or neutral-intending utterances), whereas close-to-confident expressions elicited a reduced negative response in the 330-500 msec and 550-740 msec time window. Neutral-intending expressions, which were also perceived as relatively confident, elicited a more delayed, larger sustained positivity than all other expressions in the 980-1270 msec window for this task. These findings provide the first piece of evidence of how quickly the brain responds to vocal cues signifying the extent of a speaker's confidence during online speech comprehension; first, a rough dissociation between unconfident and confident voices occurs as early as 200 msec after speech onset. At a later stage, further differentiation of the exact level of speaker confidence (i.e., close-to-confident, very confident) is evaluated via an inferential system to determine the speaker's meaning under current task settings. These findings extend three-stage models of how vocal emotion cues are processed in speech comprehension (e.g., Schirmer & Kotz, 2006) by
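
    Operationally, the windowed ERP comparison amounts to averaging the time-locked epochs of each condition and taking the mean amplitude inside a latency window such as the 330-500 msec interval mentioned above. The sketch below uses random placeholder arrays rather than the study's recordings.

      # Mean ERP amplitude in a latency window, per condition (placeholder data).
      import numpy as np

      fs = 500                                     # sampling rate (Hz)
      t = np.arange(-0.2, 1.4, 1.0 / fs)           # epoch time axis in seconds, stimulus at 0
      rng = np.random.default_rng(0)

      # epochs[condition] has shape (n_trials, n_samples); random noise stands in for EEG.
      conditions = ("confident", "close_to_confident", "unconfident", "neutral")
      epochs = {c: rng.normal(0.0, 5.0, size=(40, t.size)) for c in conditions}

      window = (t >= 0.330) & (t <= 0.500)         # the 330-500 msec window from the abstract
      for cond, data in epochs.items():
          erp = data.mean(axis=0)                  # average over trials -> ERP waveform
          print(f"{cond:>20s}: mean amplitude {erp[window].mean():+.2f} (placeholder units)")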

  1. Different Vocal Parameters Predict Perceptions of Dominance and Attractiveness.

    Science.gov (United States)

    Hodges-Simeon, Carolyn R; Gaulin, Steven J C; Puts, David A

    2010-12-01

    Low mean fundamental frequency (F(0)) in men's voices has been found to positively influence perceptions of dominance by men and attractiveness by women using standardized speech. Using natural speech obtained during an ecologically valid social interaction, we examined relationships between multiple vocal parameters and dominance and attractiveness judgments. Male voices from an unscripted dating game were judged by men for physical and social dominance and by women in fertile and non-fertile menstrual cycle phases for desirability in short-term and long-term relationships. Five vocal parameters were analyzed: mean F(0) (an acoustic correlate of vocal fold size), F(0) variation, intensity (loudness), utterance duration, and formant dispersion (D(f), an acoustic correlate of vocal tract length). Parallel but separate ratings of speech transcripts served as controls for content. Multiple regression analyses were used to examine the independent contributions of each of the predictors. Physical dominance was predicted by low F(0) variation and physically dominant word content. Social dominance was predicted only by socially dominant word content. Ratings of attractiveness by women were predicted by low mean F(0), low D(f), high intensity, and attractive word content across cycle phase and mating context. Low D(f) was perceived as attractive by fertile-phase women only. We hypothesize that competitors and potential mates may attend more strongly to different components of men's voices because of the different types of information these vocal parameters provide.
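
    The five vocal parameters analyzed here (mean F0, F0 variation, intensity, utterance duration, and formant dispersion) can all be estimated from a recording with standard signal-processing tools. The sketch below uses librosa; the file name, pitch search range, and LPC order are illustrative assumptions and are not taken from the study.

```python
import numpy as np
import librosa

def vocal_parameters(path="male_voice.wav", n_formants=4):
    """Estimate mean F0, F0 variation, intensity, duration and formant
    dispersion (Df) from a mono recording. All settings are illustrative."""
    y, sr = librosa.load(path, sr=16000)

    # Mean F0 and F0 variation from the pYIN tracker (voiced frames only).
    f0, voiced, _ = librosa.pyin(y, fmin=60, fmax=300, sr=sr)
    f0 = f0[voiced & np.isfinite(f0)]
    mean_f0, f0_sd = float(np.mean(f0)), float(np.std(f0))

    # Intensity as mean RMS level in dB (arbitrary reference).
    intensity_db = float(np.mean(librosa.amplitude_to_db(librosa.feature.rms(y=y))))

    # Utterance duration in seconds.
    duration = len(y) / sr

    # Rough formant estimates from LPC roots; Df is the mean spacing
    # between adjacent formant frequencies F1..F4.
    a = librosa.lpc(y, order=int(2 + sr / 1000))
    roots = [r for r in np.roots(a) if np.imag(r) > 0]
    freqs = sorted(np.angle(r) * sr / (2 * np.pi) for r in roots)
    formants = [f for f in freqs if 90.0 < f < 5000.0][:n_formants]
    df = float(np.mean(np.diff(formants))) if len(formants) > 1 else float("nan")

    return {"mean_f0": mean_f0, "f0_sd": f0_sd, "intensity_db": intensity_db,
            "duration_s": duration, "formant_dispersion": df}
```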

  2. Tangent lines, inflections, and vertices of closed curves

    CERN Document Server

    Ghomi, Mohammad

    2012-01-01

    We show that every smooth closed curve C immersed in Euclidean 3-space satisfies the sharp inequality 2(P + I) + V > 5, which relates the numbers P of pairs of parallel tangent lines, I of inflections (or points of vanishing curvature), and V of vertices (or points of vanishing torsion) of C. We also show that 2(P' + I) + V > 3, where P' is the number of pairs of concordant parallel tangent lines. The proofs, which employ curve shortening flow with surgery, are based on corresponding inequalities for the numbers of double points, singularities, and inflections of closed curves in the real projective plane and the sphere which intersect every closed geodesic. These findings extend some classical results in curve theory, including works of Moebius, Fenchel, and Segre, the last of which is also known as Arnold's "tennis ball theorem".

  3. Inflection point analysis for Greek seismicity 1900-2006

    CERN Document Server

    Christopoulos, Demetris T

    2014-01-01

    The seismicity of the Greek region is studied through the prism of the ESE and EDE methods for finding the inflection point of the cumulative energy released. Main shocks are chosen from 106 years of data. With both methods, a critical time region is found at the end of 1982 to early 1983. After this time the seismicity tends to increase and produces remarkable events, such as the Athens earthquake of September 1999.
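
    The analysis described here hinges on locating the inflection point of a cumulative-energy curve. The snippet below is a generic numerical sketch (smooth the cumulative sum and look for the sharpest change of slope); it is not an implementation of the ESE or EDE estimators named in the abstract, and the synthetic series is purely illustrative.

```python
import numpy as np

def inflection_index(times, values, window=11):
    """Locate the sharpest bend of the cumulative curve: smooth the cumulative
    sum and return the index where its second derivative is largest. This is a
    simple numerical proxy for the inflection point of a noisy series."""
    cum = np.cumsum(values)
    kernel = np.ones(window) / window                      # light box smoothing
    smooth = np.convolve(cum, kernel, mode="same")
    d2 = np.gradient(np.gradient(smooth, times), times)    # second derivative
    interior = d2[window:-window]                          # ignore edge artefacts
    return int(np.argmax(interior)) + window

# Illustrative synthetic series: yearly totals whose rate steps up after 1982.
rng = np.random.default_rng(0)
years = np.arange(1900, 2007, dtype=float)
energy = np.where(years < 1983, 1.0, 2.5) + 0.1 * rng.standard_normal(years.size)
print("estimated inflection near year", years[inflection_index(years, energy)])
```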

  4. The behaviour of curvature functions at cusps and inflection points

    CERN Document Server

    Shiba, Shohei

    2011-01-01

    At a 3/2-cusp of a given plane curve $\gamma(t)$, both of the Euclidean curvature $\kappa_g$ and the affine curvature $\kappa_A$ diverge. In this paper, we show that each of $\sqrt{|s_g|}\kappa_g$ and $(s_A)^2 \kappa_A$ (called the Euclidean and affine normalized curvature, respectively) at a 3/2-cusp is a smooth function of the variable $t$, where $s_g$ (resp. $s_A$) is the Euclidean (resp. affine) arclength parameter of the curve corresponding to the 3/2-cusp $s_g=0$ (resp. $s_A=0$). Moreover, we give a characterization of the behaviour of the curvature functions $\kappa_g$ and $\kappa_A$ at 3/2-cusps. On the other hand, inflection points are also singular points of curves in affine geometry. We give a similar characterization of affine curvature functions near generic inflection points. As an application, new affine invariants of 3/2-cusps and generic inflection points are given.

  5. Predicting Phonetic Transcription Agreement: Insights from Research in Infant Vocalizations

    Science.gov (United States)

    Ramsdell, Heather L.; Oller, D. Kimbrough; Ethington, Corinna A.

    2007-01-01

    The purpose of this study is to provide new perspectives on correlates of phonetic transcription agreement. Our research focuses on phonetic transcription and coding of infant vocalizations. The findings are presumed to be broadly applicable to other difficult cases of transcription, such as found in severe disorders of speech, which similarly…

  6. Vocal Pitch Shift in Congenital Amusia (Pitch Deafness)

    Science.gov (United States)

    Hutchins, Sean; Peretz, Isabelle

    2013-01-01

    We tested whether congenital amusics, who exhibit pitch perception deficits, nevertheless adjust the pitch of their voice in response to a sudden pitch shift applied to vocal feedback. Nine amusics and matched controls imitated their own previously-recorded speech or singing, while the online feedback they received was shifted mid-utterance by 25…

  7. Aging Affects Identification of Vocal Emotions in Semantically Neutral Sentences

    Science.gov (United States)

    Dupuis, Kate; Pichora-Fuller, M. Kathleen

    2015-01-01

    Purpose: The authors determined the accuracy of younger and older adults in identifying vocal emotions using the Toronto Emotional Speech Set (TESS; Dupuis & Pichora-Fuller, 2010a) and investigated the possible contributions of auditory acuity and suprathreshold processing to emotion identification accuracy. Method: In 2 experiments, younger…

  9. Semiotic aspects of human nonverbal vocalizations: a functional imaging study.

    Science.gov (United States)

    Dietrich, Susanne; Hertrich, Ingo; Alter, Kai; Ischebeck, Anja; Ackermann, Hermann

    2007-12-03

    Humans produce a variety of distinct nonverbal vocalizations. Whereas affective bursts, for example, laughter, have an intrinsic communicative role bound to social behavior, vegetative sounds, for example, snoring, just signal autonomic-physiological states. However, the latter events, for example, belching, may also be used as intentional communicative actions (vocal gestures), characterized by an arbitrary culture-dependent sound-to-meaning (semiotic) relationship, comparable to verbal utterances. Using a decision task, hemodynamic responses to affective bursts, vegetative sounds, and vocal gestures were measured by means of functional magnetic resonance imaging. Affective bursts elicited activation of anterior left superior temporal gyrus. In contrast, arbitrary vocal gestures yielded hemodynamic reactions of the left temporo-parietal junction. Conceivably, a listener's interpretation of nonverbal utterances as intentional events depends upon a left-hemisphere temporo-parietal 'auditory-to-meaning interface' related to our mechanisms of speech processing.

  10. Automatic classification and speaker identification of African elephant (Loxodonta africana) vocalizations

    Science.gov (United States)

    Clemins, Patrick J.; Johnson, Michael T.; Leong, Kirsten M.; Savage, Anne

    2005-02-01

    A hidden Markov model (HMM) system is presented for automatically classifying African elephant vocalizations. The development of the system is motivated by successful models from human speech analysis and recognition. Classification features include frequency-shifted Mel-frequency cepstral coefficients (MFCCs) and log energy, spectrally motivated features which are commonly used in human speech processing. Experiments, including vocalization type classification and speaker identification, are performed on vocalizations collected from captive elephants in a naturalistic environment. The system classified vocalizations with accuracies of 94.3% and 82.5% for type classification and speaker identification classification experiments, respectively. Classification accuracy, statistical significance tests on the model parameters, and qualitative analysis support the effectiveness and robustness of this approach for vocalization analysis in nonhuman species.
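
    The pipeline sketched in this record (cepstral features scored against one hidden Markov model per vocalization type, with the best-scoring model giving the label) can be outlined with librosa and hmmlearn. This is a simplified, hedged sketch: the feature settings, number of HMM states, and file names below are placeholders, and the frequency-shifting of the MFCC filterbank used in the paper is not reproduced.

```python
import numpy as np
import librosa
from hmmlearn.hmm import GaussianHMM

def features(path, sr=8000, n_mfcc=12):
    """Per-frame MFCCs plus log energy (frames as rows); a rough stand-in
    for the paper's frequency-shifted cepstral features."""
    y, sr = librosa.load(path, sr=sr)
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc)
    log_e = np.log(librosa.feature.rms(y=y) + 1e-10)
    return np.vstack([mfcc, log_e]).T

def train_models(files_by_type, n_states=5):
    """Fit one Gaussian-emission HMM per vocalization type."""
    models = {}
    for label, paths in files_by_type.items():
        feats = [features(p) for p in paths]
        X, lengths = np.concatenate(feats), [len(f) for f in feats]
        models[label] = GaussianHMM(n_components=n_states, covariance_type="diag",
                                    n_iter=50).fit(X, lengths)
    return models

def classify(models, path):
    """Label a new call by the model with the highest log-likelihood."""
    X = features(path)
    return max(models, key=lambda label: models[label].score(X))

# Hypothetical usage with placeholder file names:
# models = train_models({"rumble": ["r1.wav", "r2.wav"], "trumpet": ["t1.wav", "t2.wav"]})
# print(classify(models, "unknown_call.wav"))
```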

  11. Reinforcement of Infant Vocalizations through Contingent Vocal Imitation

    Science.gov (United States)

    Pelaez, Martha; Virues-Ortega, Javier; Gewirtz, Jacob L.

    2011-01-01

    Maternal vocal imitation of infant vocalizations is highly prevalent during face-to-face interactions of infants and their caregivers. Although maternal vocal imitation has been associated with later verbal development, its potentially reinforcing effect on infant vocalizations has not been explored experimentally. This study examined the…

  12. Vocal cord dysfunction in children and adolescents.

    Science.gov (United States)

    Tilles, Stephen A

    2003-11-01

    Vocal cord dysfunction (VCD) is a nonorganic disorder of the larynx that involves unintentional paradoxical adduction of the vocal cords while breathing. The resultant symptoms can include dyspnea, chest tightness, cough, throat tightness, wheezing, or voice change. Most patients with VCD are female, and among adolescents and children, VCD tends to be triggered by exercise and is typically confused with exercise-induced asthma. Both gastroesophageal reflux disease (GERD) and psychiatric illness have been reported as having strong associations with VCD, although, to date, there is no evidence that either causes VCD. VCD often coexists with asthma, and should be suspected in any patient in whom asthma treatment fails. Confirming the diagnosis involves direct visualization of abnormal vocal cord motion, and this usually only occurs during symptoms. Adolescent athletes often require free running exercise challenge to reproduce their symptoms and confirm abnormal vocal cord motion laryngoscopically. The primary treatment for VCD involves a combination of patient education and speech therapy, and, in most cases, patients may resume their activities without significant limitation.

  13. The impact of intraglottal vortices on vocal fold dynamics

    Science.gov (United States)

    Erath, Byron; Pirnia, Alireza; Peterson, Sean

    2016-11-01

    During voiced speech a critical pressure is produced in the lungs that separates the vocal folds and creates a passage (the glottis) for airflow. As air passes through the vocal folds the resulting aerodynamic loading, coupled with the tissue properties of the vocal folds, produces self-sustained oscillations. Throughout each cycle a complex flow field develops, characterized by a plethora of viscous flow phenomena. Air passing through the glottis creates a jet, with periodically-shed vortices developing due to flow separation and the Kelvin-Helmholtz instability in the shear layer. These vortices have been hypothesized to be a crucial mechanism for producing vocal fold vibrations. In this study the effect of vortices on the vocal fold dynamics is investigated experimentally by passing a vortex ring over a flexible beam with the same non-dimensional mechanical properties as the vocal folds. Synchronized particle image velocimetry data are acquired in tandem with the beam dynamics. The resulting impact of the vortex ring loading on vocal fold dynamics is discussed in detail. This work was supported by the National Science Foundation Grant CBET #1511761.

  14. Convergent differential regulation of SLIT-ROBO axon guidance genes in the brains of vocal learners.

    Science.gov (United States)

    Wang, Rui; Chen, Chun-Chun; Hara, Erina; Rivas, Miriam V; Roulhac, Petra L; Howard, Jason T; Chakraborty, Mukta; Audet, Jean-Nicolas; Jarvis, Erich D

    2015-04-15

    Only a few distantly related mammals and birds have the trait of complex vocal learning, which is the ability to imitate novel sounds. This ability is critical for speech acquisition and production in humans, and is attributed to specialized forebrain vocal control circuits that have several unique connections relative to adjacent brain circuits. As a result, it has been hypothesized that there could exist convergent changes in genes involved in neural connectivity of vocal learning circuits. In support of this hypothesis, expanding on our related study (Pfenning et al. [2014] Science 346: 1256846), here we show that the forebrain part of this circuit that makes a relatively rare direct connection to brainstem vocal motor neurons in independent lineages of vocal learning birds (songbird, parrot, and hummingbird) has specialized regulation of axon guidance genes from the SLIT-ROBO molecular pathway. The SLIT1 ligand was differentially downregulated in the motor song output nucleus that makes the direct projection, whereas its receptor ROBO1 was developmentally upregulated during critical periods for vocal learning. Vocal nonlearning bird species and male mice, which have much more limited vocal plasticity and associated circuits, did not show comparable specialized regulation of SLIT-ROBO genes in their nonvocal motor cortical regions. These findings are consistent with SLIT and ROBO gene dysfunctions associated with autism, dyslexia, and speech sound language disorders and suggest that convergent evolution of vocal learning was associated with convergent changes in the SLIT-ROBO axon guidance pathway.

  15. Sensorimotor learning in children and adults: Exposure to frequency-altered auditory feedback during speech production.

    Science.gov (United States)

    Scheerer, N E; Jacobson, D S; Jones, J A

    2016-02-09

    Auditory feedback plays an important role in the acquisition of fluent speech; however, this role may change once speech is acquired and individuals no longer experience persistent developmental changes to the brain and vocal tract. For this reason, we investigated whether the role of auditory feedback in sensorimotor learning differs across children and adult speakers. Participants produced vocalizations while they heard their vocal pitch predictably or unpredictably shifted downward one semitone. The participants' vocal pitches were measured at the beginning of each vocalization, before auditory feedback was available, to assess the extent to which the deviant auditory feedback modified subsequent speech motor commands. Sensorimotor learning was observed in both children and adults, with participants' initial vocal pitch increasing following trials where they were exposed to predictable, but not unpredictable, frequency-altered feedback. Participants' vocal pitch was also measured across each vocalization, to index the extent to which the deviant auditory feedback was used to modify ongoing vocalizations. While both children and adults were found to increase their vocal pitch following predictable and unpredictable changes to their auditory feedback, adults produced larger compensatory responses. The results of the current study demonstrate that both children and adults rapidly integrate information derived from their auditory feedback to modify subsequent speech motor commands. However, these results also demonstrate that children and adults differ in their ability to use auditory feedback to generate compensatory vocal responses during ongoing vocalization. Since vocal variability also differed across the children and adult groups, these results also suggest that compensatory vocal responses to frequency-altered feedback manipulations initiated at vocalization onset may be modulated by vocal variability.

  16. Measurement of vocal doses in virtual classrooms

    DEFF Research Database (Denmark)

    Bottalico, Pasquale; Pelegrin Garcia, David

    2010-01-01

    This work shows the results of a preliminary study about the determination of the optimal acoustical conditions for speakers in small classrooms. An experiment was carried out in a laboratory facility with 22 untrained talkers, who read a text passage from “Goldilocks” during two minutes under 13 different acoustical conditions that combined different kinds of background noise and virtual classroom acoustics. Readings from the vocal fold vibrations were registered with an Ambulatory Phonation Monitor device. The speech signal from the talker in the center of the facility was picked up with a head… with an artificial head (corresponding to the mouth-ears path) placed at the talker position while simulating the classrooms. Time histories of the vocal fold vibration readings, with the trend of the fundamental frequency and an estimation of the sound pressure level, sampled every 50 ms, were obtained. From…

  17. A case study of vocal features associated with galvanic skin response to stressors in a clinical interaction

    NARCIS (Netherlands)

    Nilsenova, Marie; Holt, Erik; Heyn, Lena; Groeneveld, Kim; Finset, Arnstein

    2016-01-01

    Objective: We investigated vocal characteristics associated with physiologically determined stressful episodes by means of post-hoc acoustic analyses of speech recorded in a clinical setting. Our research addressed the understudied question of which vocal features may serve as cues naturally…

  18. ARMA Modelling for Whispered Speech

    Institute of Scientific and Technical Information of China (English)

    Xue-li LI; Wei-dong ZHOU

    2010-01-01

    The Autoregressive Moving Average (ARMA) model for whispered speech is proposed. Compared with normal speech, whispered speech has no fundamental frequency because of the glottis being semi-opened and turbulent flow being created, and formant shifting exists in the lower frequency region due to the narrowing of the tract in the false vocal fold regions and weak acoustic coupling with the subglottal system. Analysis shows that the effect of the subglottal system is to introduce additional pole-zero pairs into the vocal tract transfer function. Theoretically, the method based on an ARMA process is superior to that based on an AR process in the spectral analysis of the whispered speech. Two methods, the least squared modified Yule-Walker likelihood estimate (LSMY) algorithm and the Frequency-Domain Steiglitz-Mcbride (FDSM) algorithm, are applied to the ARMA model for the whispered speech. The performance evaluation shows that the ARMA model is much more appropriate for representing the whispered speech than the AR model, and the FDSM algorithm provides a more accurate estimation of the whispered speech spectral envelope than the LSMY algorithm with higher computational complexity.
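
    The AR-versus-ARMA comparison made here can be illustrated on a single analysis frame: fit an all-pole (LPC) model and a pole-zero model and compare their spectral envelopes. The sketch below uses statsmodels' generic ARMA estimator in place of the LSMY and FDSM algorithms discussed in the abstract, and the model orders and frame length are assumptions.

```python
import numpy as np
from scipy.signal import freqz
import librosa
from statsmodels.tsa.arima.model import ARIMA

def envelopes(frame, sr, p=12, q=4):
    """Return (freqs, AR envelope, ARMA envelope) in dB for one frame."""
    # All-pole (AR) envelope from LPC coefficients.
    a_ar = librosa.lpc(frame, order=p)
    w, h_ar = freqz([1.0], a_ar, worN=512, fs=sr)

    # Pole-zero (ARMA) envelope via a generic ARMA(p, q) fit on the zero-mean frame.
    fit = ARIMA(frame - frame.mean(), order=(p, 0, q), trend="n").fit()
    a_arma = np.r_[1.0, -fit.arparams]   # denominator (AR part)
    b_arma = np.r_[1.0, fit.maparams]    # numerator (MA part supplies the extra zeros)
    _, h_arma = freqz(b_arma, a_arma, worN=512, fs=sr)

    to_db = lambda h: 20.0 * np.log10(np.abs(h) + 1e-12)
    return w, to_db(h_ar), to_db(h_arma)

# Hypothetical usage on a 30 ms frame of a whispered recording:
# y, sr = librosa.load("whisper.wav", sr=8000)
# freqs, ar_db, arma_db = envelopes(y[2000:2000 + int(0.03 * sr)], sr)
```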

  19. High-Resolution, Non-Invasive Imaging of Upper Vocal Tract Articulators Compatible with Human Brain Recordings.

    Directory of Open Access Journals (Sweden)

    Kristofer E Bouchard

    Full Text Available A complete neurobiological understanding of speech motor control requires determination of the relationship between simultaneously recorded neural activity and the kinematics of the lips, jaw, tongue, and larynx. Many speech articulators are internal to the vocal tract, and therefore simultaneously tracking the kinematics of all articulators is nontrivial--especially in the context of human electrophysiology recordings. Here, we describe a noninvasive, multi-modal imaging system to monitor vocal tract kinematics, demonstrate this system in six speakers during production of nine American English vowels, and provide new analysis of such data. Classification and regression analysis revealed considerable variability in the articulator-to-acoustic relationship across speakers. Non-negative matrix factorization extracted basis sets capturing vocal tract shapes allowing for higher vowel classification accuracy than traditional methods. Statistical speech synthesis generated speech from vocal tract measurements, and we demonstrate perceptual identification. We demonstrate the capacity to predict lip kinematics from ventral sensorimotor cortical activity. These results demonstrate a multi-modal system to non-invasively monitor articulator kinematics during speech production, describe novel analytic methods for relating kinematic data to speech acoustics, and provide the first decoding of speech kinematics from electrocorticography. These advances will be critical for understanding the cortical basis of speech production and the creation of vocal prosthetics.

  20. High-Resolution, Non-Invasive Imaging of Upper Vocal Tract Articulators Compatible with Human Brain Recordings

    Science.gov (United States)

    Anumanchipalli, Gopala K.; Dichter, Benjamin; Chaisanguanthum, Kris S.; Johnson, Keith; Chang, Edward F.

    2016-01-01

    A complete neurobiological understanding of speech motor control requires determination of the relationship between simultaneously recorded neural activity and the kinematics of the lips, jaw, tongue, and larynx. Many speech articulators are internal to the vocal tract, and therefore simultaneously tracking the kinematics of all articulators is nontrivial—especially in the context of human electrophysiology recordings. Here, we describe a noninvasive, multi-modal imaging system to monitor vocal tract kinematics, demonstrate this system in six speakers during production of nine American English vowels, and provide new analysis of such data. Classification and regression analysis revealed considerable variability in the articulator-to-acoustic relationship across speakers. Non-negative matrix factorization extracted basis sets capturing vocal tract shapes allowing for higher vowel classification accuracy than traditional methods. Statistical speech synthesis generated speech from vocal tract measurements, and we demonstrate perceptual identification. We demonstrate the capacity to predict lip kinematics from ventral sensorimotor cortical activity. These results demonstrate a multi-modal system to non-invasively monitor articulator kinematics during speech production, describe novel analytic methods for relating kinematic data to speech acoustics, and provide the first decoding of speech kinematics from electrocorticography. These advances will be critical for understanding the cortical basis of speech production and the creation of vocal prosthetics. PMID:27019106
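
    One analysis step shared by the two records above, extracting a small set of basis vocal tract shapes with non-negative matrix factorization and classifying vowels from the resulting weights, can be sketched with scikit-learn. The data below are random placeholders (so accuracy will sit at chance level); the number of components, measurements, and vowel categories are assumptions rather than the authors' settings.

```python
import numpy as np
from sklearn.decomposition import NMF
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline

# Hypothetical layout: each row is one vowel token described by non-negative
# articulator measurements (lip aperture, tongue heights, ...), one label per token.
rng = np.random.default_rng(0)
X = np.abs(rng.normal(size=(540, 40)))     # 540 tokens x 40 measurements
y = rng.integers(0, 9, size=540)           # 9 vowel categories

# Factorize the measurements into basis vocal tract shapes, then classify
# vowels from the per-token activation weights.
clf = make_pipeline(NMF(n_components=8, init="nndsvda", max_iter=500),
                    LogisticRegression(max_iter=1000))
scores = cross_val_score(clf, X, y, cv=5)
print("vowel classification accuracy: %.2f +/- %.2f" % (scores.mean(), scores.std()))
```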

  1. Temporal recalibration in vocalization induced by adaptation of delayed auditory feedback.

    Directory of Open Access Journals (Sweden)

    Kosuke Yamamoto

    Full Text Available BACKGROUND: We ordinarily perceive our voice sound as occurring simultaneously with vocal production, but the sense of simultaneity in vocalization can be easily interrupted by delayed auditory feedback (DAF). DAF causes normal people to have difficulty speaking fluently but helps people with stuttering to improve speech fluency. However, the underlying temporal mechanism for integrating the motor production of voice and the auditory perception of vocal sound remains unclear. In this study, we investigated the temporal tuning mechanism integrating vocal sensory and voice sounds under DAF with an adaptation technique. METHODS AND FINDINGS: Participants produced a single voice sound repeatedly with specific delay times of DAF (0, 66, 133 ms) during three minutes to induce 'Lag Adaptation'. They then judged the simultaneity between motor sensation and vocal sound given feedback. We found that lag adaptation induced a shift in simultaneity responses toward the adapted auditory delays. This indicates that the temporal tuning mechanism in vocalization can be temporally recalibrated after prolonged exposure to delayed vocal sounds. Furthermore, we found that the temporal recalibration in vocalization can be affected by averaging delay times in the adaptation phase. CONCLUSIONS: These findings suggest vocalization is finely tuned by the temporal recalibration mechanism, which acutely monitors the integration of temporal delays between motor sensation and vocal sound.

  2. Speech versus singing: Infants choose happier sounds

    Directory of Open Access Journals (Sweden)

    Marieve eCorbeil

    2013-06-01

    Full Text Available Infants prefer speech to non-vocal sounds and to non-human vocalizations, and they prefer happy-sounding speech to neutral speech. They also exhibit an interest in singing, but there is little knowledge of their relative interest in speech and singing. The present study explored infants' attention to unfamiliar audio samples of speech and singing. In Experiment 1, infants 4-13 months of age were exposed to happy-sounding infant-directed speech versus hummed lullabies by the same woman. They listened significantly longer to the speech, which had considerably greater acoustic variability and expressiveness, than to the lullabies. In Experiment 2, infants of comparable age who heard the lyrics of a Turkish children's song spoken versus sung in a joyful/happy manner did not exhibit differential listening. Infants in Experiment 3 heard the happily sung lyrics of the Turkish children's song versus a version that was spoken in an adult-directed or affectively neutral manner. They listened significantly longer to the sung version. Overall, happy voice quality rather than vocal mode (speech or singing) was the principal contributor to infant attention, regardless of age.

  3. Speech vs. singing: infants choose happier sounds

    Science.gov (United States)

    Corbeil, Marieve; Trehub, Sandra E.; Peretz, Isabelle

    2013-01-01

    Infants prefer speech to non-vocal sounds and to non-human vocalizations, and they prefer happy-sounding speech to neutral speech. They also exhibit an interest in singing, but there is little knowledge of their relative interest in speech and singing. The present study explored infants' attention to unfamiliar audio samples of speech and singing. In Experiment 1, infants 4–13 months of age were exposed to happy-sounding infant-directed speech vs. hummed lullabies by the same woman. They listened significantly longer to the speech, which had considerably greater acoustic variability and expressiveness, than to the lullabies. In Experiment 2, infants of comparable age who heard the lyrics of a Turkish children's song spoken vs. sung in a joyful/happy manner did not exhibit differential listening. Infants in Experiment 3 heard the happily sung lyrics of the Turkish children's song vs. a version that was spoken in an adult-directed or affectively neutral manner. They listened significantly longer to the sung version. Overall, happy voice quality rather than vocal mode (speech or singing) was the principal contributor to infant attention, regardless of age. PMID:23805119

  4. Teachers' vocal impact (Impacto vocal de professores)

    Directory of Open Access Journals (Sweden)

    Adriana Ricarte

    2011-08-01

    Full Text Available PURPOSE: to analyze the vocal impact on the daily activities of high-school teachers, and to correlate self-perception of the voice problem with its effects on work, daily communication, social communication, and emotion. METHODS: the sample consisted of 107 teachers, 86 with and 21 without vocal complaints, selected from private schools in Maceió-AL, Brazil. Each teacher individually answered the Voice Activity and Participation Profile protocol in the presence of the researcher, marking the responses on a visual scale ranging from 0 to 10. The protocol is composed of 28 questions covering five aspects used to evaluate quality of life and the outcome of vocal treatment, and it also yields two additional scores: activity limitation (PLA) and participation restriction (PRP). RESULTS: in the comparison of the groups with and without vocal complaints, all results were statistically significant (p…

  5. Advances in non-invasive measures of vocal acoustics.

    Science.gov (United States)

    LaBlance, G R; Steckol, K F; Cooper, M H

    1991-10-01

    Objective assessment of vocal pitch, loudness, and quality is a crucial adjunct to endoscopy in the diagnosis and treatment of vocal pathology. Historically, this assessment was made through subjective, perceptual measures that were questionable in terms of validity and reliability. Recent advances in electronic technology now permit objective analysis of the acoustic characteristics of voice. Kay Elemetric's Visi-Pitch, DSP 5500 Digital Spectrograph, and Nasometer are representative of these new instruments and are used as illustrations in the discussion of the assessment of speech acoustics.
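
    Two of the standard objective quality indices referred to here (and reported in the treatment studies later in this list) are jitter and shimmer, which reduce to simple statistics over consecutive glottal cycles. Below is a minimal sketch of the common "local" definitions, assuming cycle-by-cycle period and peak-amplitude estimates are already available from a pitch tracker.

```python
import numpy as np

def jitter_local(periods_s):
    """Local jitter (%): mean absolute difference between consecutive glottal
    periods, divided by the mean period."""
    p = np.asarray(periods_s, dtype=float)
    return 100.0 * np.mean(np.abs(np.diff(p))) / np.mean(p)

def shimmer_local(peak_amps):
    """Local shimmer (%): mean absolute difference between consecutive cycle
    peak amplitudes, divided by the mean amplitude."""
    a = np.asarray(peak_amps, dtype=float)
    return 100.0 * np.mean(np.abs(np.diff(a))) / np.mean(a)

# Illustrative cycles from a roughly 200 Hz voice (values are made up).
rng = np.random.default_rng(1)
periods = 1.0 / (200.0 + rng.normal(0.0, 1.5, size=50))
amps = 1.0 + rng.normal(0.0, 0.02, size=50)
print("jitter  %.2f %%" % jitter_local(periods))
print("shimmer %.2f %%" % shimmer_local(amps))
```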

  6. The Human Voice in Speech and Singing

    Science.gov (United States)

    Lindblom, Björn; Sundberg, Johan

    This chapter describes various aspects of the human voice as a means of communication in speech and singing. From the point of view of function, vocal sounds can be regarded as the end result of a three stage process: (1) the compression of air in the respiratory system, which produces an exhalatory airstream, (2) the vibrating vocal folds' transformation of this air stream to an intermittent or pulsating air stream, which is a complex tone, referred to as the voice source, and (3) the filtering of this complex tone in the vocal tract resonator. The main function of the respiratory system is to generate an overpressure of air under the glottis, or a subglottal pressure. Section 16.1 describes different aspects of the respiratory system of significance to speech and singing, including lung volume ranges, subglottal pressures, and how this pressure is affected by the ever-varying recoil forces. The complex tone generated when the air stream from the lungs passes the vibrating vocal folds can be varied in at least three dimensions: fundamental frequency, amplitude and spectrum. Section 16.2 describes how these properties of the voice source are affected by the subglottal pressure, the length and stiffness of the vocal folds and how firmly the vocal folds are adducted. Section 16.3 gives an account of the vocal tract filter, how its form determines the frequencies of its resonances, and Sect. 16.4 gives an account of how these resonance frequencies or formants shape the vocal sounds by imposing spectrum peaks separated by spectrum valleys, and how the frequencies of these peaks determine vowel and voice qualities. The remaining sections of the chapter describe various aspects of the acoustic signals used for vocal communication in speech and singing. The syllable structure is discussed in Sect. 16.5, the closely related aspects of rhythmicity and timing in speech and singing are described in Sect. 16.6, and pitch and rhythm aspects in Sect. 16.7. The impressive
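
    The three-stage description above (an exhalatory airstream, a pulsating voice source, and vocal tract filtering that imposes formant peaks) is the classical source-filter model, which can be sketched as an impulse-train source passed through a cascade of second-order formant resonators. The formant frequencies and bandwidths below are rough neutral-vowel assumptions, not values from the chapter.

```python
import numpy as np
from scipy.signal import lfilter

def formant_resonator(fc_hz, bw_hz, sr):
    """Second-order all-pole resonator (one formant), normalized to unit gain at DC."""
    r = np.exp(-np.pi * bw_hz / sr)
    a = [1.0, -2.0 * r * np.cos(2.0 * np.pi * fc_hz / sr), r * r]
    return [sum(a)], a

def synthesize(f0=120.0, sr=16000, dur=0.5,
               formants=((500, 80), (1500, 90), (2500, 120))):
    """Impulse-train voice source filtered through cascaded formant resonators."""
    n = int(sr * dur)
    source = np.zeros(n)
    source[::int(sr / f0)] = 1.0                       # glottal pulses every 1/f0 s
    source = lfilter([1.0], [1.0, -0.98], source)      # crude glottal spectral tilt
    out = source
    for fc, bw in formants:
        b, a = formant_resonator(fc, bw, sr)
        out = lfilter(b, a, out)
    return 0.9 * out / np.max(np.abs(out)), sr

# audio, sr = synthesize()   # e.g. soundfile.write("vowel.wav", audio, sr) to listen
```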

  7. Emancipation of the voice: Vocal complexity as a fitness indicator.

    Science.gov (United States)

    Locke, John L

    2017-02-01

    Although language is generally spoken, most evolutionary proposals say little about any changes that may have induced vocal control. Here I suggest that the interaction of two changes in our species (one in sociality, the other in life history) liberated the voice from its affective moorings, enabling it to serve as a fitness cue or signal. The modification of life history increased the helplessness of infants, thus their competition for care, pressuring them to emit, and parents (and others) to evaluate, new vocal cues in bids for attention. This change elaborated and formalized the care communication system that was used in infancy and, because of parental adoption of social criteria, extended it into childhood, supporting the extrafamilial relationships that intensify in those stages. The remodeling of life history, in conjunction with intensified sociality, also enhanced vocal signaling in adolescence (a second stage that is unique to humans) and adulthood. Building on the new vocal skills and fitness criteria that emerged earlier, I claim that males with ornamented speech enjoyed advantages in their pursuit of dominance and reproductive opportunities in evolutionary history, as they do today. There are implications of this scenario for the mechanistic level of vocal diversification. Today, intentionality plays a role both in the instrumental crying of infants and the modulated vocalizations of adults. In evolutionary history, I claim that in both cases, spontaneously emitted behavioral cues elicited perceptible responses, giving rise to strategic signals that were sent, and processed, under a new and fundamentally different neural regime.

  8. Implicitly perceived vocal attractiveness modulates prefrontal cortex activity.

    Science.gov (United States)

    Bestelmeyer, Patricia E G; Latinus, Marianne; Bruckert, Laetitia; Rouger, Julien; Crabbe, Frances; Belin, Pascal

    2012-06-01

    Social interactions involve more than "just" language. As important is a more primitive nonlinguistic mode of communication acting in parallel with linguistic processes and driving our decisions to a much higher degree than is generally suspected. Amongst the "honest signals" that influence our behavior is perceived vocal attractiveness. Not only does vocal attractiveness reflect important biological characteristics of the speaker, it also influences our social perceptions according to the "what sounds beautiful is good" phenomenon. Despite the widespread influence of vocal attractiveness on social interactions revealed by behavioral studies, its neural underpinnings are yet unknown. We measured brain activity while participants listened to a series of vocal sounds ("ah") and performed an unrelated task. We found that voice-sensitive auditory and inferior frontal regions were strongly correlated with implicitly perceived vocal attractiveness. While the involvement of auditory areas reflected the processing of acoustic contributors to vocal attractiveness ("distance to mean" and spectrotemporal regularity), activity in inferior prefrontal regions (traditionally involved in speech processes) reflected the overall perceived attractiveness of the voices despite their lack of linguistic content. These results suggest the strong influence of hidden nonlinguistic aspects of communication signals on cerebral activity and provide an objective measure of this influence.

  9. The effect of superior auditory skills on vocal accuracy

    Science.gov (United States)

    Amir, Ofer; Amir, Noam; Kishon-Rabin, Liat

    2003-02-01

    The relationship between auditory perception and vocal production has been typically investigated by evaluating the effect of either altered or degraded auditory feedback on speech production in either normal hearing or hearing-impaired individuals. Our goal in the present study was to examine this relationship in individuals with superior auditory abilities. Thirteen professional musicians and thirteen nonmusicians, with no vocal or singing training, participated in this study. For vocal production accuracy, subjects were presented with three tones. They were asked to reproduce the pitch using the vowel /a/. This procedure was repeated three times. The fundamental frequency of each production was measured using an autocorrelation pitch detection algorithm designed for this study. The musicians' superior auditory abilities (compared to the nonmusicians) were established in a frequency discrimination task reported elsewhere. Results indicate that (a) musicians had better vocal production accuracy than nonmusicians (production errors of 1/2 a semitone compared to 1.3 semitones, respectively); (b) frequency discrimination thresholds explain 43% of the variance of the production data, and (c) all subjects with superior frequency discrimination thresholds showed accurate vocal production; the reverse relationship, however, does not hold true. In this study we provide empirical evidence to the importance of auditory feedback on vocal production in listeners with superior auditory skills.
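
    The two computational ingredients of this study, an autocorrelation pitch detector and a production-error measure in semitones, are easy to sketch. Below is a generic textbook autocorrelation method, not the authors' specific algorithm; the frame length and pitch search range are assumptions.

```python
import numpy as np

def autocorr_f0(frame, sr, fmin=80.0, fmax=500.0):
    """Estimate F0 of a voiced frame from the autocorrelation peak whose lag
    lies inside the plausible pitch-period range."""
    frame = frame - np.mean(frame)
    ac = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
    lo, hi = int(sr / fmax), int(sr / fmin)
    lag = lo + int(np.argmax(ac[lo:hi]))
    return sr / lag

def semitone_error(f_produced, f_target):
    """Signed production error in semitones (12 semitones per octave)."""
    return 12.0 * np.log2(f_produced / f_target)

# Illustrative check: a synthetic 220 Hz production imitating a 220 Hz target.
sr = 16000
t = np.arange(int(0.04 * sr)) / sr
f0 = autocorr_f0(np.sin(2 * np.pi * 220.0 * t), sr)
print("F0 = %.1f Hz, error = %.2f semitones" % (f0, semitone_error(f0, 220.0)))
```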

  10. From Gesture to Speech

    Directory of Open Access Journals (Sweden)

    Maurizio Gentilucci

    2012-11-01

    Full Text Available One of the major problems concerning the evolution of human language is to understand how sounds became associated with meaningful gestures. It has been proposed that the circuit controlling gestures and speech evolved from a circuit involved in the control of arm and mouth movements related to ingestion. This circuit contributed to the evolution of spoken language, moving from a system of communication based on arm gestures. The discovery of the mirror neurons has provided strong support for the gestural theory of speech origin because they offer a natural substrate for the embodiment of language and create a direct link between sender and receiver of a message. Behavioural studies indicate that manual gestures are linked to mouth movements used for syllable emission. Grasping with the hand selectively affected movement of inner or outer parts of the mouth according to syllable pronunciation, and hand postures, in addition to hand actions, influenced the control of mouth grasp and vocalization. Gestures and words are also related to each other. It was found that when producing communicative gestures (emblems) the intention to interact directly with a conspecific was transferred from gestures to words, inducing modification in voice parameters. Transfer effects of the meaning of representational gestures were found on both vocalizations and meaningful words. It has been concluded that the results of our studies suggest the existence of a system relating gesture to vocalization which was a precursor of a more general system reciprocally relating gesture to word.

  11. Convergent differential regulation of parvalbumin in the brains of vocal learners.

    Directory of Open Access Journals (Sweden)

    Erina Hara

    Full Text Available Spoken language and learned song are complex communication behaviors found in only a few species, including humans and three groups of distantly related birds--songbirds, parrots, and hummingbirds. Despite their large phylogenetic distances, these vocal learners show convergent behaviors and associated brain pathways for vocal communication. However, it is not clear whether this behavioral and anatomical convergence is associated with molecular convergence. Here we used oligo microarrays to screen for genes differentially regulated in brain nuclei necessary for producing learned vocalizations relative to adjacent brain areas that control other behaviors in avian vocal learners versus vocal non-learners. A top candidate gene in our screen was a calcium-binding protein, parvalbumin (PV). In situ hybridization verification revealed that PV was expressed significantly higher throughout the song motor pathway, including brainstem vocal motor neurons relative to the surrounding brain regions of all distantly related avian vocal learners. This differential expression was specific to PV and vocal learners, as it was not found in avian vocal non-learners nor for control genes in learners and non-learners. Similar to the vocal learning birds, higher PV up-regulation was found in the brainstem tongue motor neurons used for speech production in humans relative to a non-human primate, macaques. These results suggest repeated convergent evolution of differential PV up-regulation in the brains of vocal learners separated by more than 65-300 million years from a common ancestor and that the specialized behaviors of learned song and speech may require extra calcium buffering and signaling.

  12. Convergent differential regulation of parvalbumin in the brains of vocal learners.

    Science.gov (United States)

    Hara, Erina; Rivas, Miriam V; Ward, James M; Okanoya, Kazuo; Jarvis, Erich D

    2012-01-01

    Spoken language and learned song are complex communication behaviors found in only a few species, including humans and three groups of distantly related birds--songbirds, parrots, and hummingbirds. Despite their large phylogenetic distances, these vocal learners show convergent behaviors and associated brain pathways for vocal communication. However, it is not clear whether this behavioral and anatomical convergence is associated with molecular convergence. Here we used oligo microarrays to screen for genes differentially regulated in brain nuclei necessary for producing learned vocalizations relative to adjacent brain areas that control other behaviors in avian vocal learners versus vocal non-learners. A top candidate gene in our screen was a calcium-binding protein, parvalbumin (PV). In situ hybridization verification revealed that PV was expressed significantly higher throughout the song motor pathway, including brainstem vocal motor neurons relative to the surrounding brain regions of all distantly related avian vocal learners. This differential expression was specific to PV and vocal learners, as it was not found in avian vocal non-learners nor for control genes in learners and non-learners. Similar to the vocal learning birds, higher PV up-regulation was found in the brainstem tongue motor neurons used for speech production in humans relative to a non-human primate, macaques. These results suggest repeated convergent evolution of differential PV up-regulation in the brains of vocal learners separated by more than 65-300 million years from a common ancestor and that the specialized behaviors of learned song and speech may require extra calcium buffering and signaling.

  13. Vocal coordination and vocal imitation: a role for mirror neurons?

    Science.gov (United States)

    Newman, John D

    2014-04-01

    Some birds and mammals have vocal communication systems in which coordination between individuals is important. Examples would include duetting or antiphonal calling in some birds and mammals, rapid exchanges of the same vocalization, and vocal exchanges between paired individuals and other nearby pairs. Mirror neurons may play a role in such systems but become functional only after experience.

  14. Verb inflection in Monolingual Dutch and Sequential Bilingual Turkish-Dutch Children with and without SLI

    Science.gov (United States)

    Blom, Elma; De Jong, Jan; Orgassa, Antje; Baker, Anne; Weerman, Fred

    2013-01-01

    Both children with specific language impairment (SLI) and children who acquire a second language (L2) make errors with verb inflection. This overlap between SLI and L2 raises the question if verb inflection can discriminate between L2 children with and without SLI. In this study we addressed this question for Dutch. The secondary goal of the study…

  15. Study on the offset of inflection point about probability function method

    Institute of Scientific and Technical Information of China (English)

    LIAN Chuan-jie; LIU Li-min

    2001-01-01

    In this paper, the rock mass is regarded as a medium of anisotropic clastic units and the partial differential equation of subsidence is verified. The solution for the homogeneous anisotropic case is obtained, and the existence of the inflection point offset in the subsidence formula is proved. Lastly, the factors influencing the offset of the inflection point are briefly discussed.

  16. Prevalence of vocal fry in young adult male American English speakers.

    Science.gov (United States)

    Abdelli-Beruh, Nassima B; Wolk, Lesley; Slavin, Dianne

    2014-03-01

    The purpose of this study was to assess possible gender differences in the prevalence of vocal fry in the voices of young male college students. Results were compared with previously published findings derived from a matched sample of female speakers. Thirty-four male college students, all native speakers of American English, produced speech samples in two speaking conditions: (1) sustained isolated vowel /a/ and (2) reading task. Data analyses included perceptual evaluations by two licensed speech-language pathologists. Results showed that vocal fry was perceived significantly more frequently in sentences than in isolated vowel productions. When vocal fry occurred in sentences, it was detected significantly more often in sentence-final position than in initial- and/or mid-sentence position. Furthermore, the prevalence of vocal fry in sentences was significantly lower for male speakers than has previously been reported for female speakers. Possible physiological and sociolinguistic explanations are discussed.

  17. Glottal jet measurements in synthetic, MRI-based human vocal fold models

    Science.gov (United States)

    Thomson, Scott; Pickup, Brian; Gollnick, Paul

    2007-11-01

    Human vocal fold vibration generates a time-varying elliptically-shaped glottal jet that produces sound in speech. Improved understanding of glottal jet dynamics can yield insight into voice production mechanisms and improve the diagnosis and treatment of voice disorders. Experiments using recently developed life-sized synthetic models of the vocal folds are presented. The fabrication process of converting MRI images to synthetic models is described. The process allows for varying the Young's modulus of the models, allowing for asymmetric conditions to be created by casting opposing vocal folds using materials of different stiffness. The models are shown to oscillate at frequencies, pressures, and flow rates typical of human speech. Phase-locked particle image velocimetry (PIV) results are presented which characterize the glottal jet, including jet direction, vortical structures, and turbulence levels. Results are shown for symmetric and asymmetric vocal fold models. The degree of material asymmetry required to cause significant asymmetry in the glottal jet is reported.

  18. Speech recovery device

    Energy Technology Data Exchange (ETDEWEB)

    Frankle, Christen M.

    2004-04-20

    There is provided an apparatus and method for assisting speech recovery in people with inability to speak due to aphasia, apraxia or another condition with similar effect. A hollow, rigid, thin-walled tube with semi-circular or semi-elliptical cut out shapes at each open end is positioned such that one end mates with the throat/voice box area of the neck of the assistor and the other end mates with the throat/voice box area of the assisted. The speaking person (assistor) makes sounds that produce standing wave vibrations at the same frequency in the vocal cords of the assisted person. Driving the assisted person's vocal cords with the assisted person being able to hear the correct tone enables the assisted person to speak by simply amplifying the vibration of membranes in their throat.

  19. Speech recovery device

    Energy Technology Data Exchange (ETDEWEB)

    Frankle, Christen M.

    2000-10-19

    There is provided an apparatus and method for assisting speech recovery in people with inability to speak due to aphasia, apraxia or another condition with similar effect. A hollow, rigid, thin-walled tube with semi-circular or semi-elliptical cut out shapes at each open end is positioned such that one end mates with the throat/voice box area of the neck of the assistor and the other end mates with the throat/voice box area of the assisted. The speaking person (assistor) makes sounds that produce standing wave vibrations at the same frequency in the vocal cords of the assisted person. Driving the assisted person's vocal cords with the assisted person being able to hear the correct tone enables the assisted person to speak by simply amplifying the vibration of membranes in their throat.

  20. Children with specific language impairment in Finnish: the use of tense and agreement inflections.

    Science.gov (United States)

    Kunnari, Sari; Savinainen-Makkonen, Tuula; Leonard, Laurence B; Mäkinen, Leena; Tolonen, Anna-Kaisa; Luotonen, Mirja; Leinonen, Eeva

    2011-11-01

    Children with specific language impairment (SLI) vary widely in their ability to use tense/agreement inflections depending on the type of language being acquired, a fact that current accounts of SLI have tried to explain. Finnish provides an important test case for these accounts because: (1) verbs in the first and second person permit null subjects whereas verbs in the third person do not; and (2) tense and agreement inflections are agglutinating and thus one type of inflection can appear without the other. Probes were used to compare the verb inflection use of Finnish-speaking children with SLI, and both age-matched and younger typically developing children. The children with SLI were less accurate, and the pattern of their errors did not match predictions based on current accounts of SLI. It appears that children with SLI have difficulty learning complex verb inflection paradigms apart from any problem specific to tense and agreement.

  1. Children with Specific Language Impairment in Finnish: The Use of Tense and Agreement Inflections

    Science.gov (United States)

    Kunnari, Sari; Savinainen-Makkonen, Tuula; Leonard, Laurence B.; Mäkinen, Leena; Tolonen, Anna-Kaisa; Luotonen, Mirja; Leinonen, Eeva

    2013-01-01

    Children with specific language impairment (SLI) vary widely in their ability to use tense/agreement inflections depending on the type of language being acquired, a fact that current accounts of SLI have tried to explain. Finnish provides an important test case for these accounts because: (1) verbs in first and second person permit null subjects whereas verbs in third person do not; and (2) tense and agreement inflections are agglutinating and thus one type of inflection can appear without the other. Probes were used to compare the verb inflection use of Finnish-speaking children with SLI, and both age-matched and younger typically developing children. The children with SLI were less accurate, and the pattern of their errors did not match predictions based on current accounts of SLI. It appears that children with SLI have difficulty learning complex verb inflection paradigms apart from any problem specific to tense and agreement. PMID:21281548

  2. Inflections in threshold electrotonus to depolarizing currents in sensory axons.

    Science.gov (United States)

    Burke, David; Howells, James; Trevillion, Louise; Kiernan, Matthew C; Bostock, Hugh

    2007-12-01

    Threshold electrotonus involves tracking the changes in axonal excitability produced by subthreshold polarizing currents and is the only technique that allows insight into the function of internodal conductances in human subjects in vivo. There is often an abrupt transient reversal of the threshold change as excitability increases in response to conditioning depolarizing currents (S1 phase). In recordings from motor axons, it has been recently demonstrated that this notch or inflection is due to activation of low-threshold axons. We report that a notch is frequently seen in sensory recordings (in 33 of 50 healthy subjects) using the standard threshold electrotonus protocol. When large, the notch can distort subsequent phases of threshold electrotonus and could complicate quantitative measurements and modeling studies.

  3. Apoptosis at inflection point in liquid culture of budding yeasts.

    Directory of Open Access Journals (Sweden)

    Toshiyuki Hagiwara

    Full Text Available Budding yeasts are highly suitable for aging studies, because the number of bud scars (the stage) correlates proportionally with age. The maximum stage is known to reach 20-30 on an isolated agar medium. However, the stage dynamics in a liquid culture is virtually unknown. We investigate the population dynamics by counting the scars on each cell. Here, one cell division produces one new cell and one bud scar. This simple rule leads to a conservation law: "The total number of bud scars is equal to the total number of cells." We find a large discrepancy: far fewer cells with over 5 scars than expected. Almost all cells with 6 or more scars disappear within a short period of time in the late log phase (which corresponds to the inflection point). This discrepancy is confirmed directly by microscopic observations of broken cells. The finding implies apoptosis in older cells (6 scars or more).

  4. Speech motor learning in profoundly deaf adults.

    Science.gov (United States)

    Nasir, Sazzad M; Ostry, David J

    2008-10-01

    Speech production, like other sensorimotor behaviors, relies on multiple sensory inputs--audition, proprioceptive inputs from muscle spindles and cutaneous inputs from mechanoreceptors in the skin and soft tissues of the vocal tract. However, the capacity for intelligible speech by deaf speakers suggests that somatosensory input alone may contribute to speech motor control and perhaps even to speech learning. We assessed speech motor learning in cochlear implant recipients who were tested with their implants turned off. A robotic device was used to alter somatosensory feedback by displacing the jaw during speech. We found that implant subjects progressively adapted to the mechanical perturbation with training. Moreover, the corrections that we observed were for movement deviations that were exceedingly small, on the order of millimeters, indicating that speakers have precise somatosensory expectations. Speech motor learning is substantially dependent on somatosensory input.

  5. Psychometric properties associated with perceived vocal roughness using a matching task

    OpenAIRE

    Eddins, David A.; Shrivastav, Rahul

    2013-01-01

    A psychophysical matching paradigm has been used to better quantify voice quality under laboratory conditions. The goals of this study were to establish which of two candidate comparison stimuli would best ensure that the range of perceived vocal roughness could be adequately bracketed using a matching task and to provide a general solution to the problem of estimating vocal roughness. Psychometric functions for roughness matching indicated that a speech-like sawtooth-plus-noise complex (20 d...
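
    The comparison stimulus named here, a speech-like sawtooth-plus-noise complex, is straightforward to generate, which is what makes it convenient for bracketing perceived roughness in a matching task. The sketch below is only an illustration: the fundamental frequency, tone-to-noise ratio, and duration are arbitrary assumptions, since the record is truncated before the study's actual parameters are given.

```python
import numpy as np
from scipy.signal import sawtooth

def sawtooth_plus_noise(f0=100.0, tone_to_noise_db=10.0, dur=1.0, sr=44100, seed=0):
    """Sawtooth carrier at f0 mixed with white noise at the requested
    tone-to-noise ratio; lower ratios sound noisier."""
    t = np.arange(int(sr * dur)) / sr
    tone = sawtooth(2.0 * np.pi * f0 * t)
    noise = np.random.default_rng(seed).standard_normal(t.size)
    # Scale the noise so that 10*log10(P_tone / P_noise) equals the requested ratio.
    gain = np.sqrt(np.mean(tone ** 2) /
                   (np.mean(noise ** 2) * 10.0 ** (tone_to_noise_db / 10.0)))
    mix = tone + gain * noise
    return 0.9 * mix / np.max(np.abs(mix)), sr

# stim, sr = sawtooth_plus_noise(tone_to_noise_db=5.0)   # vary the ratio per trial
```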

  6. Neurons controlling voluntary vocalization in the macaque ventral premotor cortex.

    Directory of Open Access Journals (Sweden)

    Gino Coudé

    Full Text Available The voluntary control of phonation is a crucial achievement in the evolution of speech. In humans, ventral premotor cortex (PMv) and Broca's area are known to be involved in voluntary phonation. In contrast, no neurophysiological data are available about the role of the oro-facial sector of nonhuman primates' PMv in this function. In order to address this issue, we recorded PMv neurons from two monkeys trained to emit coo-calls. Results showed that a population of motor neurons specifically fire during vocalization. About two thirds of them discharged before sound onset, while the remaining were time-locked with it. The response of vocalization-selective neurons was present only during conditioned (voluntary) but not spontaneous (emotional) sound emission. These data suggest that the control of vocal production exerted by PMv neurons constitutes a newly emerging property in the monkey lineage, shedding light on the evolution of phonation-based communication from a nonhuman primate species.

  7. Perceptual fluency and judgments of vocal aesthetics and stereotypicality.

    Science.gov (United States)

    Babel, Molly; McGuire, Grant

    2015-05-01

    Research has shown that processing dynamics on the perceiver's end determine aesthetic pleasure. Specifically, typical objects, which are processed more fluently, are perceived as more attractive. We extend this notion of perceptual fluency to judgments of vocal aesthetics. Vocal attractiveness has traditionally been examined with respect to sexual dimorphism and the apparent size of a talker, as reconstructed from the acoustic signal, despite evidence that gender-specific speech patterns are learned social behaviors. In this study, we report on a series of three experiments using 60 voices (30 females) to compare the relationship between judgments of vocal attractiveness, stereotypicality, and gender categorization fluency. Our results indicate that attractiveness and stereotypicality are highly correlated for female and male voices. Stereotypicality and categorization fluency were also correlated for male voices, but not female voices. Crucially, stereotypicality and categorization fluency interacted to predict attractiveness, suggesting the role of perceptual fluency is present, but nuanced, in judgments of human voices.

  8. Non-Linguistic Vocal Event Detection Using Online Random

    DEFF Research Database (Denmark)

    Abou-Zleikha, Mohamed; Tan, Zheng-Hua; Christensen, Mads Græsbøll

    2014-01-01

    Accurate detection of non-linguistic vocal events in social signals can have a great impact on the applicability of speech enabled interactive systems. In this paper, we investigate the use of random forests for vocal event detection. The random forest technique has been successfully employed in many areas such as object detection, face recognition, and audio event detection. This paper proposes to use an online random forest technique for detecting laughter and filler and for analyzing the importance of various features for non-linguistic vocal event classification through permutation. The results show that, according to the Area Under Curve measure, the online random forest achieved 88.1% compared to 82.9% obtained by the baseline support vector machines for laughter classification, and 86.8% compared to 83.6% for filler classification.

  9. Vocal Fold Collision Modeling

    DEFF Research Database (Denmark)

    Granados, Alba; Brunskog, Jonas; Misztal, M. K.

    2015-01-01

    When vocal folds vibrate at normal speaking frequencies, collisions occur. The numerics and formulations behind a position-based continuum model of contact are an active field of research in the contact mechanics community. In this paper, a frictionless three-dimensional finite element model of the vocal fold collision is proposed, which incorporates different procedures used in contact mechanics and mathematical optimization theories. The penalty approach and the Lagrange multiplier method are investigated. The contact force solution obtained by the penalty formulation is highly dependent...

  10. Immediate effect of vocal techniques in women without vocal complaint

    Directory of Open Access Journals (Sweden)

    Eliane Cristina Pereira

    2011-10-01

    Full Text Available PURPOSE: to verify the immediate effect of the vocal techniques of vibration, nasal sound, and overarticulation on the voice and larynx of women without vocal complaints. METHOD: 32 female subjects aged 20 to 45 years, without vocal complaints and with vocal quality rated between normal and mildly deviated, took part in the study. Subjects underwent auditory-perceptual analysis (visual analog scale of the vowel /ε/ and spontaneous speech), acoustic analysis, and laryngostroboscopy before and after performing the techniques. RESULTS: the auditory-perceptual analysis showed significant improvement in the parameters overall voice impression, hoarseness, and stability for the vowel /ε/, and in articulation for spontaneous speech. The acoustic analysis showed significant improvement in jitter and shimmer. Laryngostroboscopy showed significant improvement in glottic closure and in the mucosal wave movement of the vocal folds. CONCLUSION: the vocal techniques studied provide significant immediate improvement in vocal quality and laryngeal configuration.

  11. A hypothesis on improving foreign accents by optimizing variability in vocal learning brain circuits.

    Science.gov (United States)

    Simmonds, Anna J

    2015-01-01

    Rapid vocal motor learning is observed when acquiring a language in early childhood, or learning to speak another language later in life. Accurate pronunciation is one of the hardest things for late learners to master and they are almost always left with a non-native accent. Here, I propose a novel hypothesis that this accent could be improved by optimizing variability in vocal learning brain circuits during learning. Much of the neurobiology of human vocal motor learning has been inferred from studies on songbirds. Jarvis (2004) proposed the hypothesis that as in songbirds there are two pathways in humans: one for learning speech (the striatal vocal learning pathway), and one for production of previously learnt speech (the motor pathway). Learning new motor sequences necessary for accurate non-native pronunciation is challenging and I argue that in late learners of a foreign language the vocal learning pathway becomes inactive prematurely. The motor pathway is engaged once again and learners maintain their original native motor patterns for producing speech, resulting in speaking with a foreign accent. Further, I argue that variability in neural activity within vocal motor circuitry generates vocal variability that supports accurate non-native pronunciation. Recent theoretical and experimental work on motor learning suggests that variability in the motor movement is necessary for the development of expertise. I propose that there is little trial-by-trial variability when using the motor pathway. When using the vocal learning pathway variability gradually increases, reflecting an exploratory phase in which learners try out different ways of pronouncing words, before decreasing and stabilizing once the "best" performance has been identified. The hypothesis proposed here could be tested using behavioral interventions that optimize variability and engage the vocal learning pathway for longer, with the prediction that this would allow learners to develop new motor

  12. A hypothesis on improving foreign accents by optimizing variability in vocal learning brain circuits

    Directory of Open Access Journals (Sweden)

    Anna J Simmonds

    2015-11-01

    Full Text Available Rapid vocal motor learning is observed when acquiring a language in early childhood, or learning to speak another language later in life. Accurate pronunciation is one of the hardest things for late learners to master and they are almost always left with a non-native accent. Here I propose a novel hypothesis that this accent could be improved by optimizing variability in vocal learning brain circuits during learning. Much of the neurobiology of human vocal motor learning has been inferred from studies on songbirds. Jarvis (2004) proposed the hypothesis that as in songbirds there are two pathways in humans: one for learning speech (the striatal vocal learning pathway), and one for production of previously learnt speech (the motor pathway). Learning new motor sequences necessary for accurate non-native pronunciation is challenging and I argue that in late learners of a foreign language the vocal learning pathway becomes inactive prematurely. The motor pathway is engaged once again and learners maintain their original native motor patterns for producing speech, resulting in speaking with a foreign accent. Further, I argue that variability in neural activity within vocal motor circuitry generates vocal variability that supports accurate non-native pronunciation. Recent theoretical and experimental work on motor learning suggests that variability in the motor movement is necessary for the development of expertise. I propose that there is little trial-by-trial variability when using the motor pathway. When using the vocal learning pathway variability gradually increases, reflecting an exploratory phase in which learners try out different ways of pronouncing words, before decreasing and stabilizing once the ‘best’ performance has been identified. The hypothesis proposed here could be tested using behavioral interventions that optimize variability and engage the vocal learning pathway for longer, with the prediction that this would allow learners to

  13. Methods and apparatus for non-acoustic speech characterization and recognition

    Energy Technology Data Exchange (ETDEWEB)

    Holzrichter, J.F.

    1999-12-21

    By simultaneously recording EM wave reflections and acoustic speech information, the positions and velocities of the speech organs as speech is articulated can be defined for each acoustic speech unit. Well defined time frames and feature vectors describing the speech, to the degree required, can be formed. Such feature vectors can uniquely characterize the speech unit being articulated each time frame. The onset of speech, rejection of external noise, vocalized pitch periods, articulator conditions, accurate timing, the identification of the speaker, acoustic speech unit recognition, and organ mechanical parameters can be determined.

  14. Methods and apparatus for non-acoustic speech characterization and recognition

    Energy Technology Data Exchange (ETDEWEB)

    Holzrichter, John F. (Berkeley, CA)

    1999-01-01

    By simultaneously recording EM wave reflections and acoustic speech information, the positions and velocities of the speech organs as speech is articulated can be defined for each acoustic speech unit. Well defined time frames and feature vectors describing the speech, to the degree required, can be formed. Such feature vectors can uniquely characterize the speech unit being articulated each time frame. The onset of speech, rejection of external noise, vocalized pitch periods, articulator conditions, accurate timing, the identification of the speaker, acoustic speech unit recognition, and organ mechanical parameters can be determined.

  15. Voice and vocal fold position in men with unilateral vocal fold paralysis

    Directory of Open Access Journals (Sweden)

    Karine Schwarz

    2011-12-01

    Full Text Available The position of the paralyzed vocal fold and the degree of dysphonia are important factors when deciding among treatment options for unilateral vocal fold paralysis (UVFP). OBJECTIVE: To verify the auditory-perceptual characteristics of the voice and the position of the paralyzed vocal fold in men with UVFP. MATERIALS AND METHODS: This is a retrospective, historical cross-sectional cohort study with data from 24 men with UVFP (mean age of 60.7 years), submitted to auditory-perceptual voice assessment by three speech-language pathologists and to perceptual-visual assessment of the laryngeal images, with classification of the paralyzed vocal fold position by three ENT physicians. RESULTS: The paralyzed vocal fold was in the paramedian position in 45.83% of the cases, intermediate in 25%, lateral in 20.83%, and median in 4.16%. The dysphonia resulting from UVFP was characterized by moderate hoarseness, roughness, and strain; breathiness was most often severe, while asthenia and instability were most often mild. The position of the paralyzed vocal fold significantly influenced the overall degree of vocal deviation. CONCLUSION: The overall degree of dysphonia is related to the position of the paralyzed vocal fold; the dysphonia is characterized by hoarseness, breathiness, roughness, and strain of moderate to severe degree.

  16. A Fast Semiautomatic Algorithm for Centerline-Based Vocal Tract Segmentation

    Directory of Open Access Journals (Sweden)

    Anton A. Poznyakovskiy

    2015-01-01

    Full Text Available Vocal tract morphology is an important factor in voice production. Its analysis has potential implications for educational matters as well as medical issues like voice therapy. The knowledge of the complex adjustments in the spatial geometry of the vocal tract during phonation is still limited, largely because of difficulties in acquiring geometry data of the vocal tract in the process of voice production. In this study, a centerline-based segmentation method using active contours was introduced to extract the geometry data of the vocal tract obtained with MRI during sustained vowel phonation. The applied semiautomatic algorithm was found to be time- and interaction-efficient and allowed various three-dimensional measurements to be performed on the resulting model. The method is suitable for an improved detailed analysis of the vocal tract morphology during speech or singing, which might give some insights into the underlying mechanical processes.

  17. Vocal cord paralysis in children.

    Science.gov (United States)

    King, Ericka F; Blumin, Joel H

    2009-12-01

    Vocal fold paralysis (VFP) is an increasingly commonly identified problem in the pediatric patient. Diagnostic and management techniques honed in adult laryngologic practice have been successfully applied to children. Iatrogenic causes, including cardiothoracic procedures, remain a common cause of unilateral VFP. Neurologic disorders predominate in the cause of bilateral VFP. Diagnosis with electromyography is currently being evaluated in children. Treatment of VFP is centered around symptomology, which is commonly divided between voice and airway concerns. Speech therapy shows promise in older children. Surgical management for unilateral VFP with injection laryngoplasty is commonly performed and well tolerated. Laryngeal reinnervation is currently being applied to the pediatric population as a permanent treatment and offers several advantages over laryngeal framework procedures. For bilateral VFP, tracheotomy is still commonly performed. Glottic dilation procedures are performed both openly and endoscopically with a high degree of success. VFP is a well recognized problem in pediatric patients with disordered voice and breathing. Some patients will spontaneously recover their laryngeal function. For those who do not, a variety of reliable techniques are available for rehabilitative treatment.

  18. Social learning of vocal structure in a nonhuman primate?

    Directory of Open Access Journals (Sweden)

    Lemasson Alban

    2011-12-01

    Full Text Available Abstract Background Non-human primate communication is thought to be fundamentally different from human speech, mainly due to vast differences in vocal control. The lack of these abilities in non-human primates is especially striking if compared to some marine mammals and bird species, which has generated somewhat of an evolutionary conundrum. What are the biological roots and underlying evolutionary pressures of the human ability to voluntarily control sound production and learn the vocal utterances of others? One hypothesis is that this capacity has evolved gradually in humans from an ancestral stage that resembled the vocal behavior of modern primates. Support for this has come from studies that have documented limited vocal flexibility and convergence in different primate species, typically in calls used during social interactions. The mechanisms underlying these patterns, however, are currently unknown. Specifically, it has been difficult to rule out explanations based on genetic relatedness, suggesting that such vocal flexibility may not be the result of social learning. Results To address this point, we compared the degree of acoustic similarity of contact calls in free-ranging Campbell's monkeys as a function of their social bonds and genetic relatedness. We calculated three different indices to compare the similarities between the calls' frequency contours, the duration of grooming interactions and the microsatellite-based genetic relatedness between partners. We found a significantly positive relation between bond strength and acoustic similarity that was independent of genetic relatedness. Conclusion Genetic factors determine the general species-specific call repertoire of a primate species, while social factors can influence the fine structure of some of the call types. The finding is in line with the more general hypothesis that human speech has evolved gradually from earlier primate-like vocal communication.

  19. Primate vocalization, gesture, and the evolution of human language.

    Science.gov (United States)

    Arbib, Michael A; Liebal, Katja; Pika, Simone

    2008-12-01

    The performance of language is multimodal, not confined to speech. Review of monkey and ape communication demonstrates greater flexibility in the use of hands and body than for vocalization. Nonetheless, the gestural repertoire of any group of nonhuman primates is small compared with the vocabulary of any human language and thus, presumably, of the transitional form called protolanguage. We argue that it was the coupling of gestural communication with enhanced capacities for imitation that made possible the emergence of protosign to provide essential scaffolding for protospeech in the evolution of protolanguage. Similarly, we argue against a direct evolutionary path from nonhuman primate vocalization to human speech. The analysis refines aspects of the mirror system hypothesis on the role of the primate brain's mirror system for manual action in evolution of the human language-ready brain.

  20. In Search of Autocorrelation Based Vocal Cord Cues for Speaker Identification

    CERN Document Server

    Sahidullah, Md

    2011-01-01

    In this paper, we investigate a technique to derive vocal source-based features from the LP residual of the speech signal for automatic speaker identification. Autocorrelation at a few specific lags is computed for the residual signal to derive these features. Compared to traditional features like MFCC and PLPCC, which represent vocal tract information, these features represent complementary vocal cord information. Our experiment in fusing these two sources of information to represent speaker characteristics yields better speaker identification accuracy. We have used Gaussian mixture model (GMM) based speaker modeling, and results are shown on two public databases to validate our proposition.
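
    The following minimal sketch (not the authors' code; the frame length, LP order, and lag set are assumptions for illustration) shows how vocal-source features of this kind could be computed as normalized autocorrelation values of the LP residual at a few fixed lags, using numpy, scipy, and librosa. Such residual features would then be fused with vocal tract features such as MFCCs before GMM-based speaker modeling.

```python
# Illustrative sketch only: autocorrelation-of-LP-residual features per frame.
import numpy as np
import librosa
from scipy.signal import lfilter

def residual_autocorr_features(signal, sr, frame_len=0.025, hop=0.010,
                               lp_order=12, lags=(8, 16, 24, 32)):
    n, h = int(frame_len * sr), int(hop * sr)
    feats = []
    for start in range(0, len(signal) - n, h):
        frame = signal[start:start + n] * np.hamming(n)
        a = librosa.lpc(frame, order=lp_order)       # LP coefficients [1, a1..ap]
        residual = lfilter(a, [1.0], frame)          # inverse filtering -> LP residual
        r = np.correlate(residual, residual, mode="full")[n - 1:]
        r = r / (r[0] + 1e-12)                       # normalized autocorrelation
        feats.append(r[list(lags)])                  # keep selected lags as features
    return np.array(feats)
```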

  1. Automatic type classification and speaker identification of african elephant (Loxodonta africana) vocalizations

    Science.gov (United States)

    Clemins, Patrick J.; Johnson, Michael T.

    2003-04-01

    This paper presents a system for automatically classifying African elephant vocalizations based on systems used for human speech recognition and speaker identification. The experiments are performed on vocalizations collected from captive elephants in a naturalistic environment. Features used for classification include Mel-Frequency Cepstral Coefficients (MFCCs) and log energy which are the most common features used in human speech processing. Since African elephants use lower frequencies than humans in their vocalizations, the MFCCs are computed using a shifted Mel-Frequency filter bank to emphasize the infrasound range of the frequency spectrum. In addition to these features, the use of less traditional features such as those based on fundamental frequency and the phase of the frequency spectrum is also considered. A Hidden Markov Model with Gaussian mixture state probabilities is used to model each type of vocalization. Vocalizations are classified based on type, speaker and estrous cycle. Experiments on continuous call type recognition, which can classify multiple vocalizations in the same utterance, are also performed. The long-term goal of this research is to develop a universal analysis framework and robust feature set for animal vocalizations that can be applied to many species.
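
    As a rough illustration of the shifted filter-bank idea (the parameter values below are assumptions, not those of the study), the sketch builds Mel-spaced triangular filters confined to a low-frequency band so that the infrasound range is emphasized; the resulting filter bank would be applied to a power spectrum before the usual log and DCT steps.

```python
# Sketch of a Mel-style triangular filter bank shifted toward low frequencies.
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def shifted_mel_filterbank(n_filters=20, n_fft=4096, sr=8000, f_low=5.0, f_high=150.0):
    """Triangular filters spaced on the Mel scale between f_low and f_high (Hz)."""
    mel_pts = np.linspace(hz_to_mel(f_low), hz_to_mel(f_high), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_pts) / sr).astype(int)
    fbank = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        left, center, right = bins[i - 1], bins[i], bins[i + 1]
        fbank[i - 1, left:center] = (np.arange(left, center) - left) / max(center - left, 1)
        fbank[i - 1, center:right] = (right - np.arange(center, right)) / max(right - center, 1)
    return fbank  # apply to a power spectrum, then log and DCT for cepstral features
```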

  2. L1 effects on the processing of inflected nouns in L2.

    Science.gov (United States)

    Portin, Marja; Lehtonen, Minna; Harrer, Gabor; Wande, Erling; Niemi, Jussi; Laine, Matti

    2008-07-01

    This study investigated the effect of L1 on the recognition of L2 Swedish inflected nouns. Two groups of late L2 learners with typologically very different native languages, Hungarian (agglutinative) and Chinese (isolating), participated in a visual lexical decision experiment. The target words were matched inflected vs. monomorphemic nouns from three frequency levels. The Hungarian group showed a morphological processing cost (longer reaction times for the inflected words) for low and medium frequency words but not for high frequency words, suggesting morphological decomposition of low and medium frequency Swedish inflected nouns. In contrast, for the Chinese group the reaction times of the inflected vs. monomorphemic words were similar at all frequency levels, indicating full-form processing of all the inflected nouns. This cross-language difference suggests that L1 can exert an effect on the morphological processing in L2. The application of full-form processing for the Swedish inflected nouns in the Chinese group might reflect strategy transfer from their isolating native language to Swedish.

  3. Emotion Recognition using Speech Features

    CERN Document Server

    Rao, K Sreenivasa

    2013-01-01

    “Emotion Recognition Using Speech Features” covers emotion-specific features present in speech and discussion of suitable models for capturing emotion-specific information for distinguishing different emotions.  The content of this book is important for designing and developing  natural and sophisticated speech systems. Drs. Rao and Koolagudi lead a discussion of how emotion-specific information is embedded in speech and how to acquire emotion-specific knowledge using appropriate statistical models. Additionally, the authors provide information about using evidence derived from various features and models. The acquired emotion-specific knowledge is useful for synthesizing emotions. Discussion includes global and local prosodic features at syllable, word and phrase levels, helpful for capturing emotion-discriminative information; use of complementary evidences obtained from excitation sources, vocal tract systems and prosodic features in order to enhance the emotion recognition performance;  and pro...

  4. Vocal Cord Paralysis

    Science.gov (United States)

    ... your voice affects your ability to communicate. A speech therapist can help you develop the skills you need to communicate. Even if you're not able to regain the voice you once ... In addition, a speech-language pathologist can teach you efficient ways to ...

  5. Superfast vocal muscles control song production in songbirds.

    Directory of Open Access Journals (Sweden)

    Coen P H Elemans

    Full Text Available Birdsong is a widely used model for vocal learning and human speech, which exhibits high temporal and acoustic diversity. Rapid acoustic modulations are thought to arise from the vocal organ, the syrinx, by passive interactions between the two independent sound generators or intrinsic nonlinear dynamics of sound generating structures. Additionally, direct neuromuscular control could produce such rapid and precisely timed acoustic features if syringeal muscles exhibit rare superfast muscle contractile kinetics. However, no direct evidence exists that avian vocal muscles can produce modulations at such high rates. Here, we show that (1) syringeal muscles are active in phase with sound modulations during song over 200 Hz, (2) direct stimulation of the muscles in situ produces sound modulations at the frequency observed during singing, and that (3) syringeal muscles produce mechanical work at the required frequencies and up to 250 Hz in vitro. The twitch kinematics of these so-called superfast muscles are the fastest measured in any vertebrate muscle. Superfast vocal muscles enable birds to directly control the generation of many observed rapid acoustic changes and to actuate the millisecond precision of neural activity into precise temporal vocal control. Furthermore, birds now join the list of vertebrate classes in which superfast muscle kinetics evolved independently for acoustic communication.

  6. Phrase-level speech simulation with an airway modulation model of speech production.

    Science.gov (United States)

    Story, Brad H

    2013-06-01

    Artificial talkers and speech synthesis systems have long been used as a means of understanding both speech production and speech perception. The development of an airway modulation model is described that simulates the time-varying changes of the glottis and vocal tract, as well as acoustic wave propagation, during speech production. The result is a type of artificial talker that can be used to study various aspects of how sound is generated by humans and how that sound is perceived by a listener. The primary components of the model are introduced and simulation of words and phrases is demonstrated.

  7. Public Speech.

    Science.gov (United States)

    Green, Thomas F.

    1994-01-01

    Discusses the importance of public speech in society, noting the power of public speech to create a world and a public. The paper offers a theory of public speech, identifies types of public speech, and types of public speech fallacies. Two ways of speaking of the public and of public life are distinguished. (SM)

  8. Epoch-based analysis of speech signals

    Indian Academy of Sciences (India)

    B Yegnanarayana; Suryakanth V Gangashetty

    2011-10-01

    Speech analysis is traditionally performed using short-time analysis to extract features in time and frequency domains. The window size for the analysis is fixed somewhat arbitrarily, mainly to account for the time-varying vocal tract system during production. However, speech is primarily produced by impulse-like excitation in each glottal cycle. Anchoring the speech analysis around the glottal closure instants (epochs) yields significant benefits for speech analysis. Epoch-based analysis of speech helps not only to segment the speech signals based on speech production characteristics, but also helps in accurate analysis of speech. It enables extraction of important acoustic-phonetic features such as glottal vibrations, formants, instantaneous fundamental frequency, etc. The epoch sequence is useful for manipulating prosody in speech synthesis applications. Accurate estimation of epochs helps in characterizing voice quality features. Epoch extraction also helps in speech enhancement and multispeaker separation. In this tutorial article, the importance of epochs for speech analysis is discussed, and methods to extract the epoch information are reviewed. Applications of epoch extraction to some speech applications are demonstrated.
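
    As an illustrative sketch of the idea (the inputs and parameter values are assumptions, not the authors' implementation), once epoch locations are available, instantaneous F0 and epoch-anchored analysis frames follow directly:

```python
# Sketch: derive instantaneous F0 and epoch-anchored frames from epoch locations.
import numpy as np

def instantaneous_f0(epochs, sr):
    """One F0 estimate per glottal cycle from successive epoch intervals (samples)."""
    periods = np.diff(epochs) / sr          # pitch periods in seconds
    return 1.0 / periods

def epoch_anchored_frames(signal, epochs, sr, width=0.004):
    """Extract short frames centered on each epoch (width in seconds)."""
    half = int(width * sr / 2)
    return [signal[max(e - half, 0):e + half] for e in epochs]
```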

  9. A retrospective study of long-term treatment outcomes for reduced vocal intensity in hypokinetic dysarthria.

    Science.gov (United States)

    Watts, Christopher R

    2016-01-01

    Reduced vocal intensity is a core impairment of hypokinetic dysarthria in Parkinson's disease (PD). Speech treatments have been developed to rehabilitate the vocal subsystems underlying this impairment. Intensive treatment programs requiring high-intensity voice and speech exercises with clinician-guided prompting and feedback have been established as effective for improving vocal function. Less is known, however, regarding long-term outcomes of clinical benefit in speakers with PD who receive these treatments. A retrospective cohort design was utilized. Data from 78 patient files across a three-year period were analyzed. All patients received a structured, intensive program of voice therapy focusing on speaking intent and loudness. The dependent variable for all analyses was vocal intensity in decibels (dBSPL). Vocal intensity during sustained vowel production, reading, and novel conversational speech was compared at pre-treatment, post-treatment, six month follow-up, and twelve month follow-up periods. Statistically significant increases in vocal intensity were found at post-treatment, 6 months, and 12 month follow-up periods with intensity gains ranging from 5 to 17 dB depending on speaking condition and measurement period. Significant treatment effects were found in all three speaking conditions. Effect sizes for all outcome measures were large, suggesting a strong degree of practical significance. Significant increases in vocal intensity measured at 6 and 12 month follow-up periods suggested that the sample of patients maintained treatment benefit for up to a year. These findings are supported by outcome studies reporting treatment outcomes within a few months post-treatment, in addition to prior studies that have reported long-term outcome results. The positive treatment outcomes experienced by the PD cohort in this study are consistent with treatment responses subsequent to other treatment approaches which focus on high-intensity, clinician guided motor

  10. Brain potentials to inflected adjectives: beyond storage and decomposition.

    Science.gov (United States)

    Leminen, Alina; Clahsen, Harald

    2014-01-16

    This study uses event-related brain potentials (ERPs) to investigate the temporal sequencing of structural (grammatical) and lexical (semantic) properties of complex words during language comprehension. Morphologically complex words do not only consist of stems and affixes (e.g., 'feel'+'-s'), but affixes also contain grammatical structure, viz. feature bundles specifying their morpho-syntactic functions (e.g., -s= [3rd person, singular, present tense]). We examined inflected adjectives of German, which consist of an unaltered stem plus a portmanteau affix encoding case, number and gender. The same group of 24 adult native speakers was tested in two cross-modal ERP priming experiments separately studying effects of lexical-semantic relatedness and related affixes. The results of these experiments revealed clearly distinct brain potentials. Prime-target overlap with respect to morpho-syntactic features was associated with a reduced positivity, whereas lexical-level priming led to a reduced negativity. The former was most pronounced between 200 and 300 ms and the latter in a later time window, between 300 and 400 ms. We interpret the reduced early positivity as reflecting ease of grammatical processing effort in case of primed (relative to unprimed) morpho-syntactic features and the reduced negativity as signaling facilitation in lexical retrieval for primed (compared to unprimed) words. Our ERP results indicate that grammatical information becomes available earlier than semantic information providing support for structure-first models of language processing.

  11. The effects of physiological adjustments on the perceptual and acoustical characteristics of simulated laryngeal vocal tremor.

    Science.gov (United States)

    Lester, Rosemary A; Story, Brad H

    2015-08-01

    The purpose of this study was to determine if adjustments to the voice source [i.e., fundamental frequency (F0), degree of vocal fold adduction] or vocal tract filter (i.e., vocal tract shape for vowels) reduce the perception of simulated laryngeal vocal tremor and to determine if listener perception could be explained by characteristics of the acoustical modulations. This research was carried out using a computational model of speech production that allowed for precise control and manipulation of the glottal and vocal tract configurations. Forty-two healthy adults participated in a perceptual study involving pair-comparisons of the magnitude of "shakiness" with simulated samples of laryngeal vocal tremor. Results revealed that listeners perceived a higher magnitude of voice modulation when simulated samples had a higher mean F0, greater degree of vocal fold adduction, and vocal tract shape for /i/ vs /ɑ/. However, the effect of F0 was significant only when glottal noise was not present in the acoustic signal. Acoustical analyses were performed with the simulated samples to determine the features that affected listeners' judgments. Based on regression analyses, listeners' judgments were predicted to some extent by modulation information present in both low and high frequency bands.
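
    As a purely illustrative sketch of how such modulation can be parameterized (the rate and depth values below are assumptions, not the study's settings), a tremor-like F0 contour can be generated as a low-frequency sinusoidal modulation around a mean F0:

```python
# Sketch: sinusoidal F0 modulation as a simple stand-in for laryngeal tremor.
import numpy as np

def tremor_f0_contour(mean_f0=200.0, mod_rate=5.0, mod_depth=0.05, dur=2.0, frame_rate=100):
    """F0 contour (Hz) modulated at mod_rate Hz with fractional depth mod_depth,
    sampled at frame_rate frames per second for dur seconds."""
    t = np.arange(0, dur, 1.0 / frame_rate)
    return mean_f0 * (1.0 + mod_depth * np.sin(2 * np.pi * mod_rate * t))
```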

  12. Speech Problems

    Science.gov (United States)

    ... of your treatment plan may include seeing a speech therapist , a person who is trained to treat speech disorders. How often you have to see the speech therapist will vary — you'll probably start out seeing ...

  13. The existence of inflection points for generalized log-aesthetic curves satisfying G1 data

    Science.gov (United States)

    Karpagavalli, R.; Gobithaasan, R. U.; Miura, K. T.; Shanmugavel, Madhavan

    2015-12-01

    Log-Aesthetic (LA) curves have been implemented in a CAD/CAM system for various design feats. LA curves possess a linear Logarithmic Curvature Graph (LCG) with gradient (shape parameter) denoted as α. In 2009, a generalized form of LA curves, called Generalized Log-Aesthetic Curves (GLAC), was proposed, which has an extra shape parameter ν compared to LA curves. Recently, a G1 continuous GLAC algorithm has been proposed which utilizes the extra shape parameter using four control points. This paper discusses the existence of inflection points in a GLAC segment satisfying G1 Hermite data and the effect of an inflection point on the convex hull property. It is found that the existence of an inflection point can be avoided by manipulating the value of α. Numerical experiments show that increasing α may remove the inflection point (if any) in a GLAC segment.
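
    For reference (a general fact about planar parametric curves, not a formula taken from the abstract), an inflection point occurs where the signed curvature changes sign, i.e. where it vanishes:

```latex
% Signed curvature of a planar parametric curve (x(t), y(t));
% an inflection point requires kappa(t) = 0 with a sign change.
\[
  \kappa(t) = \frac{x'(t)\,y''(t) - y'(t)\,x''(t)}
                   {\left(x'(t)^{2} + y'(t)^{2}\right)^{3/2}} = 0
  \quad\Longleftrightarrow\quad
  x'(t)\,y''(t) - y'(t)\,x''(t) = 0 .
\]
```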

  14. Rules or connections in past-tense inflections: what does the evidence rule out?

    Science.gov (United States)

    McClelland, James L.; Patterson, Karalyn

    2002-11-01

    Pinker and colleagues propose two mechanisms - a rule system and a lexical memory - to form past tenses and other inflections. They predict that children's acquisition of the regular inflection is sudden; that the regular inflection applies uniformly regardless of phonological, semantic or other factors; and that the rule system is separably vulnerable to disruption. A connectionist account makes the opposite predictions. Pinker has taken existing evidence as support for his theory, but the review of the evidence presented here contradicts this assessment. Instead, it supports all three connectionist predictions: gradual acquisition of the past tense inflection; graded sensitivity to phonological and semantic content; and a single, integrated mechanism for regular and irregular forms, dependent jointly on phonology and semantics.

  15. Vocal therapy of hyperkinetic dysphonia

    OpenAIRE

    Mumović Gordana; Veselinović Mila; Arbutina Tanja; Škrbić Renata

    2014-01-01

    Introduction. Hyperkinetic (hyperfunctional) dysphonia is a common pathology. The disorder is often found in vocal professionals faced with high vocal requirements. Objective. The objective of this study was to evaluate the effects of vocal therapy on voice condition characterized by hyperkinetic dysphonia with prenodular lesions and soft nodules. Methods. The study included 100 adult patients and 27 children aged 4-16 years with prenodular lesions and soft...

  16. Career inflection points of women who successfully achieved the hospital CEO position.

    Science.gov (United States)

    Sexton, Donald W; Lemak, Christy Harris; Wainio, Joyce Anne

    2014-01-01

    Women are significantly underrepresented in hospital CEO positions, and this gender disparity has changed little over the past few decades. The purpose of this study was to analyze the career trajectories of successful female healthcare executives to determine factors that generated inflections in their careers. Using qualitative research methodology, we studied the career trajectories of 20 women who successfully ascended into a hospital CEO position. Our findings revealed 25 inflection points related to education and training, experience, career management, family, networking, and mentorship and sponsorship. We found substantial differences in the career inflection points by functional background. Inflections were more pronounced early in the careers of women in healthcare management, while clinical and administrative support executives experienced more inflections later as they took on responsibilities outside of their professional roles. Only two inflections were common among all the executives: completing a graduate degree and obtaining experience as a chief operating officer. More importantly, our findings show that organizational support factors are critical for the career advancement of women. We conclude with recommendations for individuals in an effort to enhance their career trajectories. We also provide recommended activities for organizations to support the careers of women in healthcare leadership.

  17. Lower Vocal Tract Morphologic Adjustments Are Relevant for Voice Timbre in Singing.

    Science.gov (United States)

    Mainka, Alexander; Poznyakovskiy, Anton; Platzek, Ivan; Fleischer, Mario; Sundberg, Johan; Mürbe, Dirk

    2015-01-01

    The vocal tract shape is crucial to voice production. Its lower part seems particularly relevant for voice timbre. This study analyzes the detailed morphology of parts of the epilaryngeal tube and the hypopharynx for the sustained German vowels /a/, /e/, /i/, /o/, and /u/ by thirteen male singer subjects who were at the beginning of their academic singing studies. Analysis was based on two different phonatory conditions: a natural, speech-like phonation and a singing phonation, like in classical singing. 3D models of the vocal tract were derived from magnetic resonance imaging and compared with long-term average spectrum analysis of audio recordings from the same subjects. Comparison of singing to the speech-like phonation, which served as reference, showed significant adjustments of the lower vocal tract: an average lowering of the larynx by 8 mm and an increase of the hypopharyngeal cross-sectional area (+21.9%) and volume (+16.8%). Changes in the analyzed epilaryngeal portion of the vocal tract were not significant. Consequently, lower larynx-to-hypopharynx area and volume ratios were found in singing compared to the speech-like phonation. All evaluated measures of the lower vocal tract varied significantly with vowel quality. Acoustically, an increase of high frequency energy in singing correlated with a wider hypopharyngeal area. The findings offer an explanation of how classical male singers might succeed in producing a voice timbre with increased high frequency energy, creating a singer's formant cluster.

  18. Lower Vocal Tract Morphologic Adjustments Are Relevant for Voice Timbre in Singing.

    Directory of Open Access Journals (Sweden)

    Alexander Mainka

    Full Text Available The vocal tract shape is crucial to voice production. Its lower part seems particularly relevant for voice timbre. This study analyzes the detailed morphology of parts of the epilaryngeal tube and the hypopharynx for the sustained German vowels /a/, /e/, /i/, /o/, and /u/ by thirteen male singer subjects who were at the beginning of their academic singing studies. Analysis was based on two different phonatory conditions: a natural, speech-like phonation and a singing phonation, like in classical singing. 3D models of the vocal tract were derived from magnetic resonance imaging and compared with long-term average spectrum analysis of audio recordings from the same subjects. Comparison of singing to the speech-like phonation, which served as reference, showed significant adjustments of the lower vocal tract: an average lowering of the larynx by 8 mm and an increase of the hypopharyngeal cross-sectional area (+21.9%) and volume (+16.8%). Changes in the analyzed epilaryngeal portion of the vocal tract were not significant. Consequently, lower larynx-to-hypopharynx area and volume ratios were found in singing compared to the speech-like phonation. All evaluated measures of the lower vocal tract varied significantly with vowel quality. Acoustically, an increase of high frequency energy in singing correlated with a wider hypopharyngeal area. The findings offer an explanation of how classical male singers might succeed in producing a voice timbre with increased high frequency energy, creating a singer's formant cluster.

  19. On Short-Time Estimation of Vocal Tract Length from Formant Frequencies.

    Directory of Open Access Journals (Sweden)

    Adam C Lammert

    Full Text Available Vocal tract length is highly variable across speakers and determines many aspects of the acoustic speech signal, making it an essential parameter to consider for explaining behavioral variability. A method for accurate estimation of vocal tract length from formant frequencies would afford normalization of interspeaker variability and facilitate acoustic comparisons across speakers. A framework for considering estimation methods is developed from the basic principles of vocal tract acoustics, and an estimation method is proposed that follows naturally from this framework. The proposed method is evaluated using acoustic characteristics of simulated vocal tracts ranging from 14 to 19 cm in length, as well as real-time magnetic resonance imaging data with synchronous audio from five speakers whose vocal tracts range from 14.5 to 18.0 cm in length. Evaluations show improvements in accuracy over previously proposed methods, with 0.631 and 1.277 cm root mean square error on simulated and human speech data, respectively. Empirical results show that the effectiveness of the proposed method is based on emphasizing higher formant frequencies, which seem less affected by speech articulation. Theoretical predictions of formant sensitivity reinforce this empirical finding. Moreover, theoretical insights are explained regarding the reason for differences in formant sensitivity.
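
    A minimal sketch of formant-based length estimation under the uniform closed-open tube assumption (the weighting toward higher formants below is an assumption inspired by the abstract, not the paper's exact estimator): each formant of such a tube satisfies F_n = (2n - 1)c / (4L), so a length estimate can be formed from each formant and averaged.

```python
# Sketch: vocal tract length (cm) from formant frequencies (Hz), uniform-tube model.
import numpy as np

def estimate_vtl(formants_hz, c=35000.0, weights=None):
    """c is the speed of sound in cm/s; default weights emphasize higher formants."""
    formants = np.asarray(formants_hz, dtype=float)
    n = np.arange(1, len(formants) + 1)
    lengths = (2 * n - 1) * c / (4.0 * formants)   # per-formant length estimates
    if weights is None:
        weights = n.astype(float)                  # simple ramp favoring higher formants
    return float(np.average(lengths, weights=weights))

# Example: formants roughly consistent with a ~17.5 cm neutral vocal tract.
print(estimate_vtl([500.0, 1500.0, 2500.0, 3500.0]))
```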

  20. Distinct Neural Activities in Premotor Cortex during Natural Vocal Behaviors in a New World Primate, the Common Marmoset (Callithrix jacchus).

    Science.gov (United States)

    Roy, Sabyasachi; Zhao, Lingyun; Wang, Xiaoqin

    2016-11-30

    Although evidence from human studies has long indicated the crucial role of the frontal cortex in speech production, it has remained uncertain whether the frontal cortex in nonhuman primates plays a similar role in vocal communication. Previous studies of prefrontal and premotor cortices of macaque monkeys have found neural signals associated with cue- and reward-conditioned vocal production, but not with self-initiated or spontaneous vocalizations (Coudé et al., 2011; Hage and Nieder, 2013), which casts doubt on the role of the frontal cortex of the Old World monkeys in vocal communication. A recent study of marmoset frontal cortex observed modulated neural activities associated with self-initiated vocal production (Miller et al., 2015), but it did not delineate whether these neural activities were specifically attributed to vocal production or if they may result from other nonvocal motor activity such as orofacial motor movement. In the present study, we attempted to resolve these issues and examined single neuron activities in premotor cortex during natural vocal exchanges in the common marmoset (Callithrix jacchus), a highly vocal New World primate. Neural activation and suppression were observed both before and during self-initiated vocal production. Furthermore, by comparing neural activities between self-initiated vocal production and nonvocal orofacial motor movement, we identified a subpopulation of neurons in marmoset premotor cortex that was activated or suppressed by vocal production, but not by orofacial movement. These findings provide clear evidence of the premotor cortex's involvement in self-initiated vocal production in natural vocal behaviors of a New World primate.

  1. Attentional demands influence vocal compensations to pitch errors heard in auditory feedback.

    Directory of Open Access Journals (Sweden)

    Anupreet K Tumber

    Full Text Available Auditory feedback is required to maintain fluent speech. At present, it is unclear how attention modulates auditory feedback processing during ongoing speech. In this event-related potential (ERP) study, participants vocalized /a/, while they heard their vocal pitch suddenly shifted downward a ½ semitone in both single and dual-task conditions. During the single-task condition participants passively viewed a visual stream for cues to start and stop vocalizing. In the dual-task condition, participants vocalized while they identified target stimuli in a visual stream of letters. The presentation rate of the visual stimuli was manipulated in the dual-task condition in order to produce a low, intermediate, and high attentional load. Visual target identification accuracy was lowest in the high attentional load condition, indicating that attentional load was successfully manipulated. Results further showed that participants who were exposed to the single-task condition, prior to the dual-task condition, produced larger vocal compensations during the single-task condition. Thus, when participants' attention was divided, less attention was available for the monitoring of their auditory feedback, resulting in smaller compensatory vocal responses. However, P1-N1-P2 ERP responses were not affected by divided attention, suggesting that the effect of attentional load was not on the auditory processing of pitch-altered feedback, but instead it interfered with the integration of auditory and motor information, or motor control itself.
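
    For reference, the frequency ratio corresponding to a shift of s semitones is 2^(s/12); the sketch below (illustrative arithmetic only, not the study's software) applies a downward ½-semitone shift:

```python
# Sketch: frequency ratio for a pitch shift expressed in semitones.
def shift_f0(f0_hz, semitones):
    return f0_hz * 2.0 ** (semitones / 12.0)

print(shift_f0(200.0, -0.5))   # ~194.3 Hz for a 200 Hz voice shifted down a half semitone
```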

  2. Development of interfaces for the detection of sub-vocal speech

    Directory of Open Access Journals (Sweden)

    Jenny Alejandra Gutiérrez Calderón

    2013-09-01

    Full Text Available This paper explores the most important techniques currently used to detect sub-vocal speech, both in people with cerebral palsy and for commercial purposes (e.g., to allow communication in very noisy places). The methodologies presented deal with speech-signal acquisition and processing. Signal detection and analysis methods are described throughout the whole speech process, from signal generation (as neural impulses in the brain) to the production of sound in the vocal apparatus (located in the throat). Acquisition and processing quality depends on several factors that are presented in various sections. A brief explanation of the whole voice-generation process is provided in the first part of the article. Subsequently, sub-vocal speech signal acquisition and analysis techniques are presented. Finally, a section on the advantages and disadvantages of the various techniques illustrates different implementations in a sub-vocal or silent speech detection device. The results from this research indicate that the Non-audible Murmur Microphone (NAM) is one of the options that offers major benefits, not only for signal acquisition and processing, but also for future Spanish-language phoneme discrimination.

  3. Measurement of vocal doses in virtual classrooms

    DEFF Research Database (Denmark)

    Bottalico, Pasquale; Pelegrin Garcia, David

    2010-01-01

    This work shows the results of a preliminary study on the determination of the optimal acoustical conditions for speakers in small classrooms. An experiment was carried out in a laboratory facility with 22 untrained talkers, who read a text passage from “Goldilocks” during two minutes under 13 different acoustical conditions that combined different kinds of background noise and virtual classroom acoustics. Readings of the vocal fold vibrations were registered with an Ambulatory Phonation Monitor device. The speech signal from the talker in the center of the facility was picked up with a head-worn microphone, convolved in real time with the impulse response of the chosen classroom, and reproduced through 29 loudspeakers placed around the subject. In particular, two different primary school classrooms were selected, with very low and very high reverberation time, and, for each of them, two speaker...

  4. Vocal dose in teachers: correlation with dysphonia.

    Science.gov (United States)

    Gama, Ana Cristina Côrtes; Santos, Juliana Nunes; Pedra, Elisângela de Fátima Pereira; Rabelo, Alessandra Terra Vasconcelos; Magalhães, Max de Castro; Casas, Estevam Barbosa de Las

    2016-04-01

    Teachers are professionals with a high prevalence of dysphonia, whose main risk factors are long working hours in classrooms in the presence of background noise. The purpose of the study was to calculate the phonation time and the cycle dose of teachers with dysphonia and teachers without voice disorders during class. Two groups were analyzed: the first consisted of five teachers with functional dysphonia and the second of five teachers without voice disorders. Data were collected with the VoxLog® dosimeter, and the parameters were: intensity, fundamental frequency, phonation time, and cycle dose. The statistical analysis used ANOVA, Student's t-test, and the Kruskal-Wallis test. Dysphonic teachers showed higher values of phonation time and cycle dose compared with teachers without voice disorders. The dysphonia is related to extended periods of speech time and greater exposure of the vocal fold tissue to phonotrauma.
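
    In the vocal dose literature, the cycle dose is commonly taken as the accumulated number of vocal fold oscillation cycles, i.e. the integral of F0 over phonated time. A minimal sketch of that computation, assuming frame-wise F0 estimates and voicing flags as inputs (not the dosimeter's internal algorithm):

```python
# Sketch: phonation time and cycle dose from frame-wise F0 and voicing decisions.
import numpy as np

def vocal_doses(f0_hz, voiced, frame_dur=0.01):
    """f0_hz: per-frame F0 estimates; voiced: per-frame boolean voicing flags;
    frame_dur: frame duration in seconds. Returns (phonation time in s, cycle dose)."""
    f0 = np.asarray(f0_hz, dtype=float)
    v = np.asarray(voiced, dtype=bool)
    phonation_time = float(v.sum() * frame_dur)
    cycle_dose = float(np.sum(f0[v]) * frame_dur)   # cycles = sum of F0 * dt over voiced frames
    return phonation_time, cycle_dose
```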

  5. Generalized perceptual features for animal vocalization classification

    Science.gov (United States)

    Clemins, Patrick J.; Johnson, Michael T.

    2001-05-01

    Two sets of generalized, perceptual-based features are investigated for use in classifying animal vocalizations. Since many species, especially mammals, share similar physical sound perception mechanisms which vary in size, two feature sets commonly used in human speech processing, mel-frequency cepstral coefficients (MFCCs) and perceptual linear prediction (PLP) analysis, are modified for use in other species. One modification made to the feature extraction process is incorporating the frequency range of hearing and the length of the basilar membrane of the animal in order to correctly determine the width and location of the critical band filters used for signal processing. Experimentally determined critical bands (equivalent rectangular bandwidth) and equal loudness curves (audiograms) can also be incorporated directly into the feature extraction process. Experiments are performed on African elephant (Loxodonta africana) vocalizations using a hidden Markov model (HMM) based classifier, showing increased classification accuracy when using feature sets based on the specific animal's perceptual abilities compared to the original human perception-based feature sets.

  6. Hidden Markov models in automatic speech recognition

    Science.gov (United States)

    Wrzoskowicz, Adam

    1993-11-01

    This article describes a method for constructing an automatic speech recognition system based on hidden Markov models (HMMs). The author discusses the basic concepts of HMM theory and the application of these models to the analysis and recognition of speech signals. The author provides algorithms which make it possible to train the ASR system and recognize signals on the basis of distinct stochastic models of selected speech sound classes. The author describes the specific components of the system and the procedures used to model and recognize speech. The author discusses problems associated with the choice of optimal signal detection and parameterization characteristics and their effect on the performance of the system. The author presents different options for the choice of speech signal segments and their consequences for the ASR process. The author gives special attention to the use of lexical, syntactic, and semantic information for the purpose of improving the quality and efficiency of the system. The author also describes an ASR system developed by the Speech Acoustics Laboratory of the IBPT PAS. The author discusses the results of experiments on the effect of noise on the performance of the ASR system and describes methods of constructing HMM's designed to operate in a noisy environment. The author also describes a language for human-robot communications which was defined as a complex multilevel network from an HMM model of speech sounds geared towards Polish inflections. The author also added mandatory lexical and syntactic rules to the system for its communications vocabulary.
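
    As a compact illustration of the core HMM computation referred to above (a generic textbook sketch, not the IBPT PAS system), the forward algorithm scores an observation sequence against a discrete-emission HMM:

```python
# Sketch: HMM forward algorithm for a discrete-emission model.
import numpy as np

def forward_likelihood(A, B, pi, obs):
    """A: (N,N) transition matrix; B: (N,M) emission matrix; pi: (N,) initial
    probabilities; obs: sequence of observation indices. Returns P(obs | model)."""
    alpha = pi * B[:, obs[0]]                # initialization
    for o in obs[1:]:
        alpha = (alpha @ A) * B[:, o]        # induction step
    return float(alpha.sum())                # termination
```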

  7. Start/End Delays of Voiced and Unvoiced Speech Signals

    Energy Technology Data Exchange (ETDEWEB)

    Herrnstein, A

    1999-09-24

    Recent experiments using low-power EM-radar-like sensors (e.g., GEMs) have demonstrated a new method for measuring vocal fold activity and the onset times of voiced speech, as vocal fold contact begins to take place. Similarly, the end time of a voiced speech segment can be measured. Secondly, it appears that in most normal uses of American English speech, unvoiced-speech segments directly precede or directly follow voiced-speech segments. For many applications, it is useful to know typical duration times of these unvoiced speech segments. A corpus of spoken TIMIT words, phrases, and sentences, assembled earlier and recorded using simultaneously measured acoustic and EM-sensor glottal signals from 16 male speakers, was used for this study. By inspecting the onset (or end) of unvoiced speech using the acoustic signal, and the onset (or end) of voiced speech using the EM sensor signal, the average duration times for unvoiced segments preceding the onset of vocalization were found to be 300 ms, and for following segments, 500 ms. An unvoiced speech period is then defined in time, first by using the onset of the EM-sensed glottal signal as the onset-time marker for the voiced speech segment and the end marker for the unvoiced segment. Then, by subtracting 300 ms from the onset time mark of voicing, the unvoiced speech segment start time is found. Similarly, the times for a following unvoiced speech segment can be found. While data of this nature have proven to be useful for work in our laboratory, a great deal of additional work remains to validate such data for use with general populations of users. These procedures have been useful for applying optimal processing algorithms over time segments of unvoiced, voiced, and non-speech acoustic signals. For example, these data appear to be of use in speaker validation, in vocoding, and in denoising algorithms.
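
    Using the average delays reported above, a simple rule-of-thumb segmentation can be sketched as follows (illustrative only; the 300 ms and 500 ms values are the reported averages, everything else is assumed):

```python
# Sketch: nominal unvoiced-segment boundaries around an EM-sensed voiced segment.
def unvoiced_boundaries(voiced_onset_s, voiced_offset_s, pre_delay=0.300, post_delay=0.500):
    """Given a voiced segment [onset, offset] in seconds, return the assumed
    preceding and following unvoiced segments as (start, end) tuples."""
    preceding = (max(voiced_onset_s - pre_delay, 0.0), voiced_onset_s)
    following = (voiced_offset_s, voiced_offset_s + post_delay)
    return preceding, following

print(unvoiced_boundaries(1.20, 1.85))   # ((0.90, 1.20), (1.85, 2.35))
```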

  8. Vocal intervention for telemarketing service consultants: vocal well-being

    Directory of Open Access Journals (Sweden)

    Taís de Campos Moreira

    2010-12-01

    Full Text Available PURPOSE: to determine the effects of a vocal well-being program for consultants of a telehealth call center. METHODS: the study involved 27 consultants of the VivaVoz Call Center, who were assessed before and after voice training by auditory-perceptual analysis of vocal quality, articulation, speech rate, resonance, intensity, and frequency, using sustained vowels and connected speech (counting from 1 to 20 and reciting the days of the week). Voice recordings were made digitally and analyzed with the Japanese GRBAS-I scale. Vocal self-assessment and a survey of vocal symptoms were also carried out. The voice activities were divided into five workshops covering vocal health, warm-up and cool-down techniques, articulation, and breathing. Descriptive and bivariate analyses were performed using the paired-samples t-test and McNemar's test. RESULTS: before the intervention, 50% of the female and 33% of the male consultants showed alterations in breathing, 50% of the women showed alterations in vocal quality, and 33% of the men had difficulties with articulation. After the workshops, the results showed improvement in vocal quality, articulatory pattern, and speech fluency, as well as vocal satisfaction as reported by the consultants. CONCLUSION: positive changes were observed in the quality of service provided to the population, along with increased knowledge among the operators regarding correct voice use and maintenance of vocal well-being.

  9. The Sound of Dominance: Vocal Precursors of Perceived Dominance during Interpersonal Influence.

    Science.gov (United States)

    Tusing, Kyle James; Dillard, James Price

    2000-01-01

    Determines the effects of vocal cues on judgments of dominance in an interpersonal influence context. Indicates that mean amplitude and amplitude standard deviation were positively associated with dominance judgments, whereas speech rate was negatively associated with dominance judgments. Finds that mean fundamental frequency was positively…

  10. Vocal fold hyalinosis in Urbach-Wiethe disease, a rare cause of hoarseness

    NARCIS (Netherlands)

    Honings, J.; Rossum, M.M. van; Hoogen, F.J.A. van den

    2015-01-01

    BACKGROUND: Lipoid proteinosis is an autosomal recessive disorder characterized by hyalin deposits in the skin and mucosa of the upper aerodigestive tract; currently, no treatment exists. Nearly all patients experience hoarseness and speech difficulties, due to hyalin deposition in the vocal folds a

  11. The Impact of Vocal Hyperfunction on Relative Fundamental Frequency during Voicing Offset and Onset

    Science.gov (United States)

    Stepp, Cara E.; Hillman, Robert E.; Heaton, James T.

    2010-01-01

    Purpose: This study tested the hypothesis that individuals with vocal hyperfunction would show decreases in relative fundamental frequency (RFF) surrounding a voiceless consonant. Method: This retrospective study of 2 clinical databases used speech samples from 15 control participants and women with hyperfunction-related voice disorders: 82 prior…

  12. The synergy between speech production and perception

    Science.gov (United States)

    Ru, Powen; Chi, Taishih; Shamma, Shihab

    2003-01-01

    Speech intelligibility is known to be relatively unaffected by certain deformations of the acoustic spectrum. These include translations, stretching or contracting dilations, and shearing of the spectrum (represented along the logarithmic frequency axis). It is argued here that such robustness reflects a synergy between vocal production and auditory perception. Thus, on the one hand, it is shown that these spectral distortions are produced by common and unavoidable variations among different speakers pertaining to the length, cross-sectional profile, and losses of their vocal tracts. On the other hand, it is argued that these spectral changes leave the auditory cortical representation of the spectrum largely unchanged except for translations along one of its representational axes. These assertions are supported by analyses of production and perception models. On the production side, a simplified sinusoidal model of the vocal tract is developed which analytically relates a few "articulatory" parameters, such as the extent and location of the vocal tract constriction, to the spectral peaks of the acoustic spectra synthesized from it. The model is evaluated by comparing the identification of synthesized sustained vowels to labeled natural vowels extracted from the TIMIT corpus. On the perception side a "multiscale" model of sound processing is utilized to elucidate the effects of the deformations on the representation of the acoustic spectrum in the primary auditory cortex. Finally, the implications of these results for the perception of generally identifiable classes of sound sources beyond the specific case of speech and the vocal tract are discussed.
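
    The claim that speaker-dependent vocal tract differences appear as translations along the logarithmic frequency axis can be illustrated with a toy calculation. The sketch below uses an idealized uniform quarter-wavelength tube, not the paper's sinusoidal model, and the tract lengths are assumed values.

    ```python
    import numpy as np

    # Sketch: formants of an idealized uniform tube (quarter-wavelength resonator).
    # This is not the paper's sinusoidal model, just an illustration of why a change
    # in vocal tract length appears as a pure translation on a log-frequency axis.

    C = 35000.0  # speed of sound in cm/s (approximate)

    def tube_formants(length_cm, n_formants=3):
        """Resonances F_n = (2n - 1) * c / (4 * L) of a uniform tube closed at one end."""
        n = np.arange(1, n_formants + 1)
        return (2 * n - 1) * C / (4.0 * length_cm)

    f_long = tube_formants(17.5)          # nominal adult tract length (assumed)
    f_short = tube_formants(17.5 / 1.2)   # a 20% shorter tract

    # The log-frequency difference is the same constant log(1.2) for every formant,
    # i.e. a translation along the log-frequency axis.
    print(np.log(f_short) - np.log(f_long))   # ~[0.182, 0.182, 0.182]
    ```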

  13. Auditory feedback control of vocal pitch during sustained vocalization: a cross-sectional study of adult aging.

    Directory of Open Access Journals (Sweden)

    Peng Liu

    Full Text Available BACKGROUND: Auditory feedback has been demonstrated to play an important role in the control of voice fundamental frequency (F0), but the mechanisms underlying the processing of auditory feedback remain poorly understood. It has been well documented that young adults can use auditory feedback to stabilize their voice F0 by making compensatory responses to perturbations they hear in their vocal pitch feedback. However, little is known about the effects of aging on the processing of audio-vocal feedback during vocalization. METHODOLOGY/PRINCIPAL FINDINGS: In the present study, we recruited adults who were between 19 and 75 years of age and divided them into five age groups. Using a pitch-shift paradigm, the pitch of their vocal feedback was unexpectedly shifted ±50 or ±100 cents during sustained vocalization of the vowel sound /u/. Compensatory vocal F0 response magnitudes and latencies to pitch feedback perturbations were examined. A significant effect of age was found such that response magnitudes increased with increasing age until maximal values were reached for adults 51-60 years of age and then decreased for adults 61-75 years of age. Adults 51-60 years of age were also more sensitive to the direction and magnitude of the pitch feedback perturbations compared to younger adults. CONCLUSION: These findings demonstrate that the pitch-shift reflex systematically changes across the adult lifespan. Understanding aging-related changes to the role of auditory feedback is critically important for our theoretical understanding of speech production and the clinical applications of that knowledge.
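
    For readers unfamiliar with the cent scale used in the pitch-shift paradigm, a shift of c cents corresponds to a frequency ratio of 2^(c/1200). The short sketch below applies ±50 and ±100 cent shifts to a hypothetical 200 Hz voice; the numbers are illustrative, not data from the study.

    ```python
    # Sketch: convert a pitch shift expressed in cents to a frequency ratio.
    # A cent is 1/100 of an equal-tempered semitone, so ratio = 2 ** (cents / 1200).

    def shift_f0(f0_hz, cents):
        """Return the perturbed fundamental frequency after a shift of `cents`."""
        return f0_hz * 2.0 ** (cents / 1200.0)

    f0 = 200.0  # hypothetical sustained-vowel F0 in Hz
    for cents in (-100, -50, 50, 100):
        print(f"{cents:+d} cents -> {shift_f0(f0, cents):.1f} Hz")
    # -100 cents ~ 188.8 Hz, +100 cents ~ 211.9 Hz
    ```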

  14. Child vocalization composition as discriminant information for automatic autism detection.

    Science.gov (United States)

    Xu, Dongxin; Gilkerson, Jill; Richards, Jeffrey; Yapanel, Umit; Gray, Sharmi

    2009-01-01

    Early identification is crucial for young children with autism to access early intervention. The existing screens require either a parent-report questionnaire and/or direct observation by a trained practitioner. Although an automatic tool would benefit parents, clinicians and children, there is no automatic screening tool in clinical use. This study reports a fully automatic mechanism for autism detection/screening for young children. This is a direct extension of the LENA (Language ENvironment Analysis) system, which utilizes speech signal processing technology to analyze and monitor a child's natural language environment and the vocalizations/speech of the child. It is discovered that child vocalization composition contains rich discriminant information for autism detection. By applying pattern recognition and machine learning approaches to child vocalization composition data, accuracy rates of 85% to 90% in cross-validation tests for autism detection have been achieved at the equal-error-rate (EER) point on a data set with 34 children with autism, 30 language delayed children and 76 typically developing children. Due to its easy and automatic procedure, it is believed that this new tool can serve a significant role in childhood autism screening, especially in regards to population-based or universal screening.
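
    The equal-error-rate operating point mentioned above is the threshold at which false-accept and false-reject rates coincide. The sketch below locates that point for synthetic classifier scores; it does not reproduce the LENA features or the classifiers used in the study.

    ```python
    import numpy as np

    # Sketch: locate the equal-error-rate (EER) threshold of a binary classifier,
    # i.e. the operating point where the false-accept and false-reject rates match.
    # Scores are synthetic; the study's actual features/classifier are not reproduced.

    def equal_error_rate(pos_scores, neg_scores):
        thresholds = np.sort(np.concatenate([pos_scores, neg_scores]))
        best = (1.0, 0.0)  # (|FAR - FRR|, EER estimate)
        for t in thresholds:
            far = np.mean(neg_scores >= t)   # negatives accepted as positive
            frr = np.mean(pos_scores < t)    # positives rejected
            gap = abs(far - frr)
            if gap < best[0]:
                best = (gap, (far + frr) / 2.0)
        return best[1]

    rng = np.random.default_rng(0)
    pos = rng.normal(1.0, 1.0, 1000)   # scores for the target class
    neg = rng.normal(-1.0, 1.0, 1000)  # scores for the non-target class
    print(f"EER ~ {equal_error_rate(pos, neg):.2%}")
    ```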

  15. Vocal aging of institutionalized elderly people

    Directory of Open Access Journals (Sweden)

    Letícia Neiva de Menezes

    2007-03-01

    evaluation and speech patterns. METHODS: cross-sectional clinical study based on interviews and speech-language examination of a randomized sample of 48 aged patients, residents of Casa do Ancião Francisco Azevedo - Belo Horizonte/MG, who did not show neurological disabilities. Specific protocols developed by the authors were used, covering aspects related to the purposes of this study. RESULTS: vocal parameters were analyzed and the following conditions were observed: hoarse voice (70.8%), of moderate degree (33.3%), reduced loudness (56.2%), low pitch (62.5%) and short maximum phonation times (81.2%). Forty-one patients (85.4%) reported that vocal problems do not interfere with their communication process. Normal aging did not directly result in alterations in speech patterns, but changes in the oral motor system due to healthy aging showed different associations with those patterns. CONCLUSIONS: this study showed the presence of alterations in aspects related to voice, but they do not interfere with the communication process. There were no alterations in speech patterns due to healthy aging, but changes in the oral motor system showed different associations with those patterns. This study contributes to improving knowledge about voice in elderly people who undergo changes due to healthy aging and live in nursing homes.

  16. Altered vocal fold kinematics in synthetic self-oscillating models that employ adipose tissue as a lateral boundary condition.

    Science.gov (United States)

    Saidi, Hiba; Erath, Byron D.

    2015-11-01

    The vocal folds play a major role in human communication by initiating voiced sound production. During voiced speech, the vocal folds are set into sustained vibrations. Synthetic self-oscillating vocal fold models are regularly employed to gain insight into flow-structure interactions governing the phonation process. Commonly, a fixed boundary condition is applied to the lateral, anterior, and posterior sides of the synthetic vocal fold models. However, physiological observations reveal the presence of adipose tissue on the lateral surface between the thyroid cartilage and the vocal folds. The goal of this study is to investigate the influence of including this substrate layer of adipose tissue on the dynamics of phonation. For a more realistic representation of the human vocal folds, synthetic multi-layer vocal fold models have been fabricated and tested while including a soft lateral layer representative of adipose tissue. Phonation parameters have been collected and are compared to those of the standard vocal fold models. Results show that vocal fold kinematics are affected by adding the adipose tissue layer as a new boundary condition.

  17. Decreased approach behavior and nucleus accumbens immediate early gene expression in response to Parkinsonian ultrasonic vocalizations in rats

    OpenAIRE

    2015-01-01

    Many individuals with Parkinson disease (PD) have difficulty producing normal speech and voice, resulting in problems with interpersonal communication and reduced quality of life. Translational animal models of communicative dysfunction have been developed to assess disease pathology. However, it is unknown whether acoustic feature changes associated with vocal production deficits in these animal models lead to compromised communication. In rodents, male ultrasonic vocalizations (USVs) have a...

  18. Neural correlates of infants’ sensitivity to vocal expressions of peers

    Directory of Open Access Journals (Sweden)

    Manuela Missana

    2017-08-01

    Full Text Available Responding to others’ emotional expressions is an essential and early developing social skill among humans. Much research has focused on how infants process facial expressions, while much less is known about infants’ processing of vocal expressions. We examined 8-month-old infants’ processing of other infants’ vocalizations by measuring event-related brain potentials (ERPs) to positive (infant laughter), negative (infant cries), and neutral (adult hummed speech) vocalizations. Our ERP results revealed that hearing another infant cry elicited an enhanced negativity (N200) at temporal electrodes around 200 ms, whereas listening to another infant laugh resulted in an enhanced positivity (P300) at central electrodes around 300 ms. This indicates that infants’ brains rapidly respond to a crying peer during early auditory processing stages, but also selectively respond to a laughing peer during later stages associated with familiarity detection processes. These findings provide evidence for infants’ sensitivity to vocal expressions of peers and shed new light on the neural processes underpinning emotion processing in infants.

  19. Biomechanical control of vocal plasticity in an echolocating bat.

    Science.gov (United States)

    Luo, Jinhong; Wiegrebe, Lutz

    2016-03-01

    Many animal species adjust the spectral composition of their acoustic signals to variable environments. However, the physiological foundation of such spectral plasticity is often unclear. The source-filter theory of sound production, initially established for human speech, applies to vocalizations in birds and mammals. According to this theory, adjusting the spectral structure of vocalizations could be achieved by modifying either the laryngeal/syringeal source signal or the vocal tract, which filters the source signal. Here, we show that in pale spear-nosed bats, spectral plasticity induced by moderate level background noise is dominated by the vocal tract rather than the laryngeal source signal. Specifically, we found that with increasing background noise levels, bats consistently decreased the spectral centroid of their echolocation calls up to 3.2 kHz, together with other spectral parameters. In contrast, noise-induced changes in fundamental frequency were small (maximally 0.1 kHz) and were inconsistent across individuals. Changes in spectral centroid did not correlate with changes in fundamental frequency, whereas they correlated negatively with changes in call amplitude. Furthermore, while bats consistently increased call amplitude with increasing noise levels (the Lombard effect), increases in call amplitude typically did not lead to increases in fundamental frequency. In summary, our results suggest that at least to a certain degree echolocating bats are capable of adjusting call amplitude, fundamental frequency and spectral parameters independently. © 2016. Published by The Company of Biologists Ltd.
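
    The spectral centroid reported above is the amplitude-weighted mean frequency of a call's magnitude spectrum. A minimal sketch of that computation on a synthetic two-component call is shown below; it is not the authors' analysis pipeline, and the sampling rate and call parameters are assumed.

    ```python
    import numpy as np

    # Sketch: spectral centroid of a short call segment, i.e. the amplitude-weighted
    # mean frequency of its magnitude spectrum. Signal and sample rate are synthetic.

    def spectral_centroid(signal, sample_rate_hz):
        spectrum = np.abs(np.fft.rfft(signal))
        freqs = np.fft.rfftfreq(len(signal), d=1.0 / sample_rate_hz)
        return np.sum(freqs * spectrum) / np.sum(spectrum)

    fs = 250_000  # ultrasonic sampling rate in Hz (assumed)
    t = np.arange(0, 0.003, 1.0 / fs)  # a 3 ms call
    call = np.sin(2 * np.pi * 55_000 * t) + 0.5 * np.sin(2 * np.pi * 110_000 * t)
    print(f"centroid ~ {spectral_centroid(call, fs) / 1000:.1f} kHz")
    ```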

  20. Status report on speech research. A report on the status and progress of studies on the nature of speech, instrumentation for its investigation, and practical applications

    Science.gov (United States)

    Liberman, A. M.

    1985-10-01

    This interim status report on speech research discusses the following topics: On Vagueness and Fictions as Cornerstones of a Theory of Perceiving and Acting: A Comment on Walter (1983); The Informational Support for Upright Stance; Determining the Extent of Coarticulation-effects of Experimental Design; The Roles of Phoneme Frequency, Similarity, and Availability in the Experimental Elicitation of Speech Errors; On Learning to Speak; The Motor Theory of Speech Perception Revised; Linguistic and Acoustic Correlates of the Perceptual Structure Found in an Individual Differences Scaling Study of Vowels; Perceptual Coherence of Speech: Stability of Silence-cued Stop Consonants; Development of the Speech Perceptuomotor System; Dependence of Reading on Orthography-Investigations in Serbo-Croatian; The Relationship between Knowledge of Derivational Morphology and Spelling Ability in Fourth, Sixth, and Eighth Graders; Relations among Regular and Irregular, Morphologically-Related Words in the Lexicon as Revealed by Repetition Priming; Grammatical Priming of Inflected Nouns by the Gender of Possessive Adjectives; Grammatical Priming of Inflected Nouns by Inflected Adjectives; Deaf Signers and Serial Recall in the Visual Modality-Memory for Signs, Fingerspelling, and Print; Did Orthographies Evolve?; The Development of Children's Sensitivity to Factors Influencing Vowel Reading.

  1. Research of whispered speech vocal tract system conversion based on universal background model and effective Gaussian components

    Institute of Scientific and Technical Information of China (English)

    陈雪勤; 赵鹤鸣

    2013-01-01

    Addressing the weakness of existing fixed-value mapping methods (method_F), a vocal tract system conversion method based on the universal background model (UBM) is proposed to improve the performance of speech conversion systems from Chinese whispered speech to normal speech. Because of the large number of UBM components, the errors produced by the acoustical probability density statistical model cannot be ignored. Thus an effective Gaussian mixture component selection method, based on the posterior probability summation with minimum spectral distortion, is developed to optimize system performance. The proposed method (method_U) is analyzed and compared using a performance index (PI) based on the Itakura-Saito spectral distortion measure. It is shown experimentally that the performance of method_U is more stable across different speakers and different phonemes than that of method_F, and the average PI of method_U is better than that of method_F. By selecting effective Gaussian mixture components, the PI of method_U can be further improved by 5.11%. Subjective auditory tests also show that the proposed method improves the clarity and intelligibility of the converted speech.
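
    The Itakura-Saito measure underlying the performance index compares a reference power spectrum with an approximation bin by bin. The sketch below shows the standard distortion formula applied to synthetic spectral envelopes; it does not reproduce the paper's UBM-based conversion system.

    ```python
    import numpy as np

    # Sketch: Itakura-Saito distortion between a reference power spectrum P and an
    # approximation P_hat:  d_IS = mean( P/P_hat - log(P/P_hat) - 1 ).
    # Spectra here are synthetic; the paper's UBM-based conversion is not reproduced.

    def itakura_saito(p_ref, p_hat, eps=1e-12):
        ratio = (p_ref + eps) / (p_hat + eps)
        return np.mean(ratio - np.log(ratio) - 1.0)

    freqs = np.linspace(0, np.pi, 257)
    p_ref = 1.0 / (np.abs(1.0 - 0.9 * np.exp(-1j * freqs)) ** 2)   # reference envelope
    p_hat = 1.0 / (np.abs(1.0 - 0.8 * np.exp(-1j * freqs)) ** 2)   # converted envelope
    print(f"IS distortion ~ {itakura_saito(p_ref, p_hat):.3f}")
    ```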

  2. Monkey drumming reveals common networks for perceiving vocal and nonvocal communication sounds.

    Science.gov (United States)

    Remedios, Ryan; Logothetis, Nikos K; Kayser, Christoph

    2009-10-20

    Salient sounds such as those created by drumming can serve as means of nonvocal acoustic communication in addition to vocal sounds. Despite the ubiquity of drumming across human cultures, its origins and the brain regions specialized in processing such signals remain unexplored. Here, we report that an important animal model for vocal communication, the macaque monkey, also displays drumming behavior, and we exploit this finding to show that vocal and nonvocal communication sounds are represented by overlapping networks in the brain's temporal lobe. Observing social macaque groups, we found that these animals use artificial objects to produce salient periodic sounds, similar to acoustic gestures. Behavioral tests confirmed that these drumming sounds attract the attention of listening monkeys similarly as conspecific vocalizations. Furthermore, in a preferential looking experiment, drumming sounds influenced the way monkeys viewed their conspecifics, suggesting that drumming serves as a multimodal signal of social dominance. Finally, by using high-resolution functional imaging we identified those brain regions preferentially activated by drumming sounds or by vocalizations and found that the representations of both these communication sounds overlap in caudal auditory cortex and the amygdala. The similar behavioral responses to drumming and vocal sounds, and their shared neural representation, suggest a common origin of primate vocal and nonvocal communication systems and support the notion of a gestural origin of speech and music.

  3. The impact of vocal hyperfunction on relative fundamental frequency during voicing offset and onset.

    Science.gov (United States)

    Stepp, Cara E; Hillman, Robert E; Heaton, James T

    2010-10-01

    This study tested the hypothesis that individuals with vocal hyperfunction would show decreases in relative fundamental frequency (RFF) surrounding a voiceless consonant. This retrospective study of 2 clinical databases used speech samples from 15 control participants and women with hyperfunction-related voice disorders: 82 prior to treatment (muscle tension dysphonia, n=22; vocal fold nodules, n=30; vocal fold polyps, N=30) and 18 before and after surgical removal of vocal fold nodules or polyps. Acoustic samples were analyzed with respect to the RFF at the offset and onset of voicing surrounding a voiceless consonant. Individuals with vocal hyperfunction in a large clinical sample showed significant lowering of offset and onset RFF compared with controls. Voicing offset and onset RFFs were not significantly changed by the removal of vocal fold lesions in the surgical group. Altered offset and onset RFF in patients with hyperfunction-related voice disorders can be interpreted as a by-product of heightened levels of laryngeal muscle tension. Measurement of RFF during voice offset and onset has potential for use as a simple, noninvasive measure of vocal hyperfunction.
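
    Relative fundamental frequency as used above is conventionally expressed in semitones relative to a steady-state reference F0. The sketch below illustrates that normalization for a few vocal-cycle F0 values; the numbers are hypothetical and cycle extraction from the acoustic signal is not shown.

    ```python
    import math

    # Sketch: relative fundamental frequency (RFF) of vocal cycles adjacent to a
    # voiceless consonant, expressed in semitones relative to a steady-state F0.
    # F0 values are hypothetical; cycle extraction from audio is not shown.

    def rff_semitones(cycle_f0_hz, reference_f0_hz):
        return 12.0 * math.log2(cycle_f0_hz / reference_f0_hz)

    reference = 200.0                      # steady-state vowel F0 (Hz)
    offset_cycles = [203.0, 206.0, 210.0]  # last cycles before the voiceless consonant
    for f0 in offset_cycles:
        print(f"{f0:.0f} Hz -> RFF {rff_semitones(f0, reference):+.2f} ST")
    ```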

  4. Transcriptome analysis of genetic mechanism of growth curve inflection point using a pig model

    Directory of Open Access Journals (Sweden)

    Linyuan Shen

    2015-12-01

    Full Text Available Animal growth curves play an important role for animal breeders in optimizing feeding and management strategies (De Lange et al., 2001 [1]; Brossard et al., 2009 [2]; Strathe et al., 2010 [3]). However, the genetic mechanism behind the phenotypic difference between the inflection point and non-inflection points of the growth curve remains unclear. Here, we report the differentially expressed gene pattern in pig longissimus dorsi among three typical time points of the growth curve: the inflection point (IP), before the inflection point (BIP) and after the inflection point (AIP). The whole-genome RNA-seq data were deposited at GenBank under the accession number PRJNA2284587. The RNA-seq libraries generated 117 million reads totaling 5.89 gigabases. Totals of 21,331, 20,996 and 20,139 expressed transcripts were identified in IP, BIP and AIP, respectively. Furthermore, we identified 757 differentially expressed genes (DEGs) between IP and BIP, and 271 DEGs between AIP and IP. Function enrichment analysis of the DEGs found that the genes highly expressed at IP were mainly enriched in energy metabolism, global transcriptional activity and bone development. This study contributes to revealing the genetic mechanism of the growth curve inflection point.

  5. Vocal Motor Schemes.

    Science.gov (United States)

    McCune, Lorraine; Vihman, Marilyn May

    A study examined the consistency of consonant use in the infant's transition period from babbling to early words. Phonetic data were collected from the speech of 10 infants aged 9 to 15 months. Analysis of consonant distribution patterns indicate striking segmental preferences in all 10 children, with some segments more prominent for the sample as…

  6. Vocal characterization of patients with hyperthyroidism and hypothyroidism

    Directory of Open Access Journals (Sweden)

    Roberta Werlang Isolan-Cury

    2007-06-01

    Twenty non-smoking women aged between 18 and 55 years from the Endocrinology Ambulatory of the institution were evaluated after clinical and laboratory diagnosis of hyperthyroidism or hypothyroidism. The parameters investigated were: time bearing the disease, vocal complaint, maximum phonation time for /a/, /s/ and /z/, fundamental frequency (F0) and glottal noise (GNE). The aspects evaluated in the auditory-perceptual analysis were: pneumo-phono-articulatory coordination (coordinated or uncoordinated), pitch, loudness, vocal attack, resonance, speech speed and vocal quality, which could be classified as one or two of the following: neutral, hoarse, whispered, coarse, or tense, and by degree: light, moderate or severe. Data were statistically analyzed with the EPI-INFO 6.04b software using the Fisher test for qualitative variables, considering a significance level of 0.05. RESULTS: The auditory-perceptual analysis showed that seven patients with hypothyroidism and nine with hyperthyroidism presented changes in vocal quality. Eight subjects from both groups presented pneumo-phono-articulatory incoordination. Eight subjects from group A and six from group B reported vocal complaints, such as hoarseness and thick voice, respectively. In the acoustic analysis, nine subjects presented changes in glottal noise. CONCLUSION: The results showed a high incidence of vocal changes in the studied groups (both hyper- and hypothyroidism), which evidences the relation between dysphonia and thyroid dysfunction.

  7. A neural mechanism for recognizing speech spoken by different speakers.

    Science.gov (United States)

    Kreitewolf, Jens; Gaudrain, Etienne; von Kriegstein, Katharina

    2014-05-01

    Understanding speech from different speakers is a sophisticated process, particularly because the same acoustic parameters convey important information about both the speech message and the person speaking. How the human brain accomplishes speech recognition under such conditions is unknown. One view is that speaker information is discarded at early processing stages and not used for understanding the speech message. An alternative view is that speaker information is exploited to improve speech recognition. Consistent with the latter view, previous research identified functional interactions between the left- and the right-hemispheric superior temporal sulcus/gyrus, which process speech- and speaker-specific vocal tract parameters, respectively. Vocal tract parameters are one of the two major acoustic features that determine both speaker identity and speech message (phonemes). Here, using functional magnetic resonance imaging (fMRI), we show that a similar interaction exists for glottal fold parameters between the left and right Heschl's gyri. Glottal fold parameters are the other main acoustic feature that determines speaker identity and speech message (linguistic prosody). The findings suggest that interactions between left- and right-hemispheric areas are specific to the processing of different acoustic features of speech and speaker, and that they represent a general neural mechanism when understanding speech from different speakers. Copyright © 2014 Elsevier Inc. All rights reserved.

  8. [Vocal cord dysfunction. An important differential diagnosis to bronchial asthma].

    Science.gov (United States)

    Kothe, C; Schade, G; Fleischer, S; Hess, M

    2004-03-01

    Vocal cord dysfunction (VCD) is described as a functional disorder of the vocal folds which leads to an intermittent, inspiratory 'paradoxical' glottal closure. We report on three women with frequent repetitive shortness of breath attacks caused by VCD. This was diagnosed by transnasal videofiberendoscopy, with glottal closure being seen during inspiration. Because of the different etiologies, one of the patients was treated with breathing and speech therapy, another received Omeprazol for laryngopharyngeal reflux, and the third was treated by intralaryngeal botulinum toxin injections. All three patients showed a reduction in attacks. Clinically, VCD seems to mimic asthma. However, with a thorough patient history and diagnostics, especially with transnasal laryngoscopy during a (triggered) attack, a precise diagnosis seems possible.

  9. Vocal aging and adductor spasmodic dysphonia: Response to botulinum toxin injection

    Directory of Open Access Journals (Sweden)

    Michael P Cannito

    2008-03-01

    Full Text Available Michael P Cannito, Joel C Kahane, Lesya Chorna - School of Audiology and Speech-Language Pathology, The University of Memphis, Memphis, TN, USA. Abstract: Aging of the larynx is characterized by involutional changes which alter its biomechanical and neural properties and create a biological environment that is different from that of younger counterparts. Illustrative anatomical examples are presented. This natural, non-disease process appears to set conditions which may influence the effectiveness of botulinum toxin injection and our expectations for its success. Adductor spasmodic dysphonia, a type of laryngeal dystonia, is typically treated using botulinum toxin injections of the vocal folds in order to suppress adductory muscle spasms which are disruptive to production of speech and voice. A few studies have suggested diminished response to treatment in older patients with adductor spasmodic dysphonia. This retrospective study provides a reanalysis of existing pre-to-post treatment data as a function of age. Perceptual judgments of speech produced by 42 patients with ADSD were made by two panels of professional listeners with expertise in voice or fluency of speech. Results demonstrate a markedly reduced positive response to botulinum toxin treatment in the older patients. Perceptual findings are further elucidated by means of acoustic spectrography. Literature on vocal aging is reviewed to provide a specific set of biological mechanisms that best account for the observed interaction of botulinum toxin treatment with advancing age. Keywords: vocal aging, adductor spasmodic dysphonia, botulinum toxin, voice quality, speech fluency

  10. Vocal therapy of hyperkinetic dysphonia

    Directory of Open Access Journals (Sweden)

    Mumović Gordana

    2014-01-01

    Full Text Available Introduction. Hyperkinetic (hyperfunctional) dysphonia is a common pathology. The disorder is often found in vocal professionals faced with high vocal requirements. Objective. The objective of this study was to evaluate the effects of vocal therapy on voice condition characterized by hyperkinetic dysphonia with prenodular lesions and soft nodules. Methods. The study included 100 adult patients and 27 children aged 4-16 years with prenodular lesions and soft nodules. A subjective acoustic analysis using the GIRBAS scale was performed prior to and after vocal therapy. Twenty adult patients and 10 children underwent objective acoustic analysis including several acoustic parameters. Pathological vocal qualities (hoarse, harsh and breathy voice) were also obtained by computer analysis. Results. The subjective acoustic analysis revealed a significant (p<0.01) reduction in all dysphonia parameters after vocal treatment in adults and children. After treatment, all levels of dysphonia were lowered in 85% (85/100) of adult patients and 29% (29/100) had a normal voice. Before vocal therapy 9 children had severe, 13 had moderate and 8 slight dysphonia. After vocal therapy only 1 child had severe dysphonia, 7 had moderate, 10 had slight levels of dysphonia and 9 were without voice disorder. The objective acoustic analysis in adults revealed a significant improvement (p≤0.025) in all dysphonia parameters except SD F0 and jitter %. In children, the acoustic parameters SD F0, jitter % and NNE (normal noise energy) were significantly improved (p=0.003-0.03). Pathological voice qualities were also improved in adults and children (p<0.05). Conclusion. Vocal therapy effectively improves the voice in hyperkinetic dysphonia with prenodular lesions and soft nodules in both adults and children, affecting diverse acoustic parameters.

  11. The impact of voice on speech realization

    Directory of Open Access Journals (Sweden)

    Jelka Breznik

    2014-12-01

    Full Text Available The study discusses spoken literary language and the impact of voice on speech realization. The voice is the sound made by a human being using the vocal folds for talking, singing, laughing, crying, or screaming. The human voice is specifically the part of human sound production in which the vocal folds (vocal cords) are the primary sound source. Our voice is our instrument and identity card. How does the voice (voice tone) affect others, and how do they respond, positively or negatively? How important is voice (voice tone) in the communication process? The study presents how certain individuals perceive voice. It reports the results of research on the relationships between the spoken word, the excellent speaker, the voice, and the description/definition/identification of specific voices, carried out with experts in the field of speech and voice as well as non-professionals. The study encompasses two focus groups: one consists of amateurs (non-specialists in the field of speech or voice) who have no knowledge in this field, and the other consists of professionals who work with speech, language or voice. The questions progressed from general to specific, directly related to the topic. The purpose of this method of questioning was to create a relaxed atmosphere, promote discussion, allow participants to interact and complement one another, and encourage self-listening and additional comments.

  12. Physiologically driven avian vocal synthesizer

    Science.gov (United States)

    Sitt, Jacobo D.; Arneodo, Ezequiel M.; Goller, Franz; Mindlin, Gabriel B.

    2010-03-01

    In this work, we build an electronic syrinx, i.e., a programmable electronic device capable of integrating biomechanical model equations for the avian vocal organ in order to synthesize song. This vocal prosthesis is controlled by the bird’s neural instructions to the respiratory and syringeal motor systems, thus opening great potential for studying motor control and its modification by sensory feedback mechanisms. Furthermore, a well-functioning subject-controlled vocal prosthesis can lay the foundation for similar devices in humans and thus provide directly health-related data and procedures.

  13. Testing the stem dominance hypothesis: meaning analysis of inflected words and prepositional phrases.

    Directory of Open Access Journals (Sweden)

    Minna Lehtonen

    Full Text Available We tested the hypothesis that lexical-semantic access of inflected words is governed by the word stem. Object drawings overlaid with a dot/arrow marking position/movement were matched with corresponding linguistic expressions like "from the house". To test whether the stem dominates lexical-semantic access irrespective of its position, we used Swedish prepositional phrases (locative information via preposition immediately preceding the stem) or Finnish case-inflected words (locative information via suffix immediately following the stem). Both in monolingual Swedish and in bilingual Finnish-Swedish speakers, correct stems with incorrect prepositions/case-endings were hardest to reject. This finding supports the view that the stem is indeed the dominant unit in meaning access of inflected words.

  14. Testing the stem dominance hypothesis: meaning analysis of inflected words and prepositional phrases.

    Science.gov (United States)

    Lehtonen, Minna; Harrer, Gabor; Wande, Erling; Laine, Matti

    2014-01-01

    We tested the hypothesis that lexical-semantic access of inflected words is governed by the word stem. Object drawings overlaid with a dot/arrow marking position/movement were matched with corresponding linguistic expressions like "from the house". To test whether the stem dominates lexical-semantic access irrespective of its position, we used Swedish prepositional phrases (locative information via preposition immediately preceding the stem) or Finnish case-inflected words (locative information via suffix immediately following the stem). Both in monolingual Swedish and in bilingual Finnish-Swedish speakers, correct stems with incorrect prepositions/case-endings were hardest to reject. This finding supports the view that the stem is indeed the dominant unit in meaning access of inflected words.

  15. Attention mechanisms and the mosaic evolution of speech

    Directory of Open Access Journals (Sweden)

    Pedro Tiago Martins

    2014-12-01

    Full Text Available There is still no categorical answer for why humans, and no other species, have speech, or why speech is the way it is. Several purely anatomical arguments have been put forward, but they have been shown to be false, biologically implausible, or of limited scope. This perspective paper supports the idea that evolutionary theories of speech could benefit from a focus on the cognitive mechanisms that make speech possible, for which antecedents in evolutionary history and brain correlates can be found. This type of approach is part of a very recent, but rapidly growing, tradition, which has provided crucial insights on the nature of human speech by focusing on the biological bases of vocal learning. Here, we call attention to what might be an important ingredient for speech. We contend that a general mechanism of attention, which manifests itself not only in the visual but also the auditory (and possibly other) modalities, might be one of the key pieces of human speech, in addition to the mechanisms underlying vocal learning and the pairing of facial gestures with vocalic units.

  16. The development of inflectional morphology in L2 acquisition: a cross-linguistic analysis

    Directory of Open Access Journals (Sweden)

    M. Rafael Salaberry

    2008-04-01

    Full Text Available The development of several grammatical features among adult L2 (second language) learners (e.g., inflectional morphology) may be guided by strictly general cognitive processes (e.g., Bley-Vroman, 1989; Schmidt, 1990). For instance, Flynn and Manuel (1991) argue that general learning mechanisms — non-modular and unrelated to Universal Grammar (UG) — may determine the acquisition of "peripheral" language phenomena: many studies that argue for differences between the child L1 learner and the adult L2 learner in ultimate attainment focus on surface aspects of L2 language knowledge connected to the "periphery" of language knowledge (e.g., lexical or language-specific agreement phenomena) rather than to the more abstract subsystems of principles and rules of UG. Similarly, Schwartz (1993, p. 159) claims that it is not warranted to extend the UG argument for the acquisition of syntax "to the other domains of the grammar, in particular to the lexicon and morphology (e.g., paradigms of inflection)." Schwartz states further that inflectional endings are among the most difficult features of nonnative languages for adult learners: "highest amount of variability and lowest degree of success." Schwartz (1993, p. 160) speculates that "the syntax (being built on the basis of primary linguistic data) continues to grow but the morphology seems to lag behind: learned linguistic knowledge, in this case inflectional verbal morphology, just cannot feed into the grammar."

  17. Typification of symptoms related to voice and its production in teachers identified with absence of vocal alteration in Speech-Language Pathology evaluation

    Directory of Open Access Journals (Sweden)

    Emilse Aparecida Merlin Servilha

    2010-06-01

    Full Text Available PURPOSE: to typify the voice-related and voice-production symptoms self-reported by female teachers whose voices were identified as healthy in the speech-language pathology evaluation. METHODS: 36 teachers took part, with a mean age of 37 years, single (75%) and with higher education (83.33%), from the municipal school system of a city in the interior of the state of São Paulo. The teachers answered a questionnaire and reported their voice alterations; their voices were recorded and submitted to auditory-perceptual speech-language evaluation, and the two types of assessment were then compared. Sociodemographic characteristics, environmental and organizational working conditions, and the teachers' symptoms were typified and submitted to descriptive statistical analysis. RESULTS: the presence of dust (91.67%), noise (75%), work overload (88.88%), lack of time to carry out school activities (88.88%) and constant monitoring of performance (33.33%) was reported. Regarding the voice, the alteration had been perceived for more than four years (30.56%), secondary to intensive voice use (94.44%) and stress (61.11%), and was classified as moderate (66.67%); the most prominent auditory symptoms were hoarseness and tiredness when speaking, both at 69.44%, and the most prominent proprioceptive symptoms were throat clearing (63.88%) and dry throat (61.11%). The comparison between symptoms shows that proprioceptive symptoms (63.26%) were mentioned more often than auditory ones (36.73%). CONCLUSION: the teachers work in environments adverse to health and voice; the prevalence of proprioceptive symptoms was higher than that of auditory symptoms, which may have interfered with the perceptual evaluation of their voices by the speech-language pathologists, who relied only on the auditory cue.

  18. Speech Matters

    DEFF Research Database (Denmark)

    Hasse Jørgensen, Stina

    2011-01-01

    About Speech Matters - Katarina Gregos, the Greek curator's exhibition at the Danish Pavilion at the Venice Biennale 2011.

  19. Inflamagnetogenesis redux: Unzipping inflection-point inflation via various cosmoparticle probes

    CERN Document Server

    Choudhury, Sayantan

    2014-01-01

    In this paper I introduce a precise constraint on primordial magnetogenesis for a generic class of single-field inflection-point inflationary models followed by small field excursion below the Planck scale. I also establish a connection between the magnetic field at the present epoch and primordial gravity waves ($r$) via a non-vanishing CP asymmetry parameter ($\epsilon_{\bf CP}$), which triggers the leptogenesis scenario. Finally, I explore various hidden cosmophenomenological features of theoretical CMB B-mode polarization spectra, which can be treated as a significant probe to put further stringent constraints on low- and high-scale inflection-point inflationary models after the release of the Planck B-mode polarization data.

  20. Auditory Signal Processing in Communication: Perception and Performance of Vocal Sounds

    Science.gov (United States)

    Prather, Jonathan F.

    2013-01-01

    Learning and maintaining the sounds we use in vocal communication require accurate perception of the sounds we hear performed by others and feedback-dependent imitation of those sounds to produce our own vocalizations. Understanding how the central nervous system integrates auditory and vocal-motor information to enable communication is a fundamental goal of systems neuroscience, and insights into the mechanisms of those processes will profoundly enhance clinical therapies for communication disorders. Gaining the high-resolution insight necessary to define the circuits and cellular mechanisms underlying human vocal communication is presently impractical. Songbirds are the best animal model of human speech, and this review highlights recent insights into the neural basis of auditory perception and feedback-dependent imitation in those animals. Neural correlates of song perception are present in auditory areas, and those correlates are preserved in the auditory responses of downstream neurons that are also active when the bird sings. Initial tests indicate that singing-related activity in those downstream neurons is associated with vocal-motor performance as opposed to the bird simply hearing itself sing. Therefore, action potentials related to auditory perception and action potentials related to vocal performance are co-localized in individual neurons. Conceptual models of song learning involve comparison of vocal commands and the associated auditory feedback to compute an error signal that is used to guide refinement of subsequent song performances, yet the sites of that comparison remain unknown. Convergence of sensory and motor activity onto individual neurons points to a possible mechanism through which auditory and vocal-motor signals may be linked to enable learning and maintenance of the sounds used in vocal communication. PMID:23827717

  1. Song and speech: examining the link between singing talent and speech imitation ability.

    Science.gov (United States)

    Christiner, Markus; Reiterer, Susanne M

    2013-01-01

    In previous research on speech imitation, musicality, and an ability to sing were isolated as the strongest indicators of good pronunciation skills in foreign languages. We, therefore, wanted to take a closer look at the nature of the ability to sing, which shares a common ground with the ability to imitate speech. This study focuses on whether good singing performance predicts good speech imitation. Forty-one singers of different levels of proficiency were selected for the study and their ability to sing, to imitate speech, their musical talent and working memory were tested. Results indicated that singing performance is a better indicator of the ability to imitate speech than the playing of a musical instrument. A multiple regression revealed that 64% of the speech imitation score variance could be explained by working memory together with educational background and singing performance. A second multiple regression showed that 66% of the speech imitation variance of completely unintelligible and unfamiliar language stimuli (Hindi) could be explained by working memory together with a singer's sense of rhythm and quality of voice. This supports the idea that both vocal behaviors have a common grounding in terms of vocal and motor flexibility, ontogenetic and phylogenetic development, neural orchestration and auditory memory with singing fitting better into the category of "speech" on the productive level and "music" on the acoustic level. As a result, good singers benefit from vocal and motor flexibility, productively and cognitively, in three ways. (1) Motor flexibility and the ability to sing improve language and musical function. (2) Good singers retain a certain plasticity and are open to new and unusual sound combinations during adulthood both perceptually and productively. (3) The ability to sing improves the memory span of the auditory working memory.

  2. Song and speech: examining the link between singing talent and speech imitation ability

    Directory of Open Access Journals (Sweden)

    Markus eChristiner

    2013-11-01

    Full Text Available In previous research on speech imitation, musicality and an ability to sing were isolated as the strongest indicators of good pronunciation skills in foreign languages. We, therefore, wanted to take a closer look at the nature of the ability to sing, which shares a common ground with the ability to imitate speech. This study focuses on whether good singing performance predicts good speech imitation. Forty-one singers of different levels of proficiency were selected for the study and their ability to sing, to imitate speech, their musical talent and working memory were tested. Results indicated that singing performance is a better indicator of the ability to imitate speech than the playing of a musical instrument. A multiple regression revealed that 64% of the speech imitation score variance could be explained by working memory together with educational background and singing performance. A second multiple regression showed that 66% of the speech imitation variance of completely unintelligible and unfamiliar language stimuli (Hindi) could be explained by working memory together with a singer's sense of rhythm and quality of voice. This supports the idea that both vocal behaviors have a common grounding in terms of vocal and motor flexibility, ontogenetic and phylogenetic development, neural orchestration and sound memory, with singing fitting better into the category of "speech" on the productive level and "music" on the acoustic level. As a result, good singers benefit from vocal and motor flexibility, productively and cognitively, in three ways. 1. Motor flexibility and the ability to sing improve language and musical function. 2. Good singers retain a certain plasticity and are open to new and unusual sound combinations during adulthood both perceptually and productively. 3. The ability to sing improves the memory span of the auditory short-term memory.

  3. Oral and vocal fold diadochokinesis in dysphonic women

    Directory of Open Access Journals (Sweden)

    Talita Louzada

    2011-12-01

    Full Text Available The evaluation of oral and vocal fold diadochokinesis (DDK) in individuals with voice disorders may contribute to the understanding of factors that affect balanced vocal production. Scientific studies that make use of this assessment tool support the advance of knowledge in this area, reflected in the development of more appropriate therapeutic planning. Objective: To compare the results of oral and vocal fold DDK in dysphonic women and in women without vocal disorders. Material and methods: For this study, 28 voice recordings of women from 19 to 54 years old, diagnosed with dysphonia and submitted to a voice assessment by a speech pathologist and an otorhinolaryngologist, were used. The control group included 30 nondysphonic women evaluated in prior research on normal adults. The analysis parameters, such as number and duration of emissions as well as the regularity of the repetition of the syllables "pa", "ta", "ka" and the vowels "a" and "i", were provided by the Advanced Motor Speech Profile program (MSP Model-5141, version 2.5.2, KayPentax). The DDK sequence "pataka" was analyzed quantitatively through the Sound Forge 7.0 program, as well as manually with the audio-visual help of sound waves. Average values of oral and vocal fold DDK in dysphonic and nondysphonic women were compared using the Student's t-test and were considered significant when p<0.05. Results: The findings showed no significant differences between populations; however, the coefficient of variation of period (CvP) and jitter of period (JittP) averages of the "ka", "a" and "i" emissions, respectively, were higher in dysphonic women (CvP=10.42%, 12.79%, 12.05%; JittP=2.05%, 6.05%, 3.63%) compared to the control group (CvP=8.86%, 10.95%, 11.20%; JittP=1.82%, 2.98%, 3.15%). Conclusion: Although the results do not indicate any difficulties in oral and laryngeal motor control in the dysphonic group, the larger instability in vocal fold DDK in the experimental group should be considered, and
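
    The period-based measures reported above, the coefficient of variation of period (CvP) and jitter of period (JittP), can be illustrated over a list of glottal cycle durations. The sketch below uses standard textbook definitions and synthetic periods; it is not necessarily the exact algorithm implemented in the MSP software.

    ```python
    import numpy as np

    # Sketch: coefficient of variation of period (CvP) and local jitter of period
    # (JittP) over a sequence of glottal cycle durations. Periods are synthetic;
    # the MSP program's exact algorithms are not reproduced here.

    def cvp_percent(periods_s):
        periods = np.asarray(periods_s)
        return 100.0 * periods.std(ddof=1) / periods.mean()

    def jitt_percent(periods_s):
        periods = np.asarray(periods_s)
        return 100.0 * np.mean(np.abs(np.diff(periods))) / periods.mean()

    # Hypothetical cycle durations (s) around a 200 Hz phonation.
    periods = [0.00500, 0.00504, 0.00497, 0.00502, 0.00499, 0.00503]
    print(f"CvP   ~ {cvp_percent(periods):.2f} %")
    print(f"JittP ~ {jitt_percent(periods):.2f} %")
    ```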

  4. Some Interactions of Speech Rate, Signal Distortion, and Certain Linguistic Factors in Listening Comprehension. Professional Paper No. 39-68.

    Science.gov (United States)

    Sticht, Thomas G.

    This experiment was designed to determine the relative effects of speech rate and signal distortion due to the time-compression process on listening comprehension. In addition, linguistic factors--including sequencing of random words into story form, and inflection and phraseology--were qualitatively considered for their effects on listening…

  5. Causes of Acquired Vocal Cord Palsy in Indian Scenario.

    Directory of Open Access Journals (Sweden)

    Swapna Sebastian

    2012-10-01

    Full Text Available Vocal cord paresis or paralysis occurs due to a lesion of the vagus nerve. Vocal cord paralysis can lead to dysphonia as well as dysphagia, which can cause frustration and emotional problems for the patient. The literature available on the etiology of vocal cord palsy and the problems faced by these patients in the Indian population is very scanty. Hence a prospective study was done on 41 patients with vocal cord palsy who were referred to the Department of ENT for voice assessment and management from March 1st, 2012 to August 1st, 2012. The medical and surgical reports were examined. The patients were evaluated by an otorhinolaryngologist and a speech-language pathologist, and diagnosis was made based on videostroboscopic findings. We also examined voice-related quality of life (V-RQOL) outcomes in these patients. In this study, endotracheal intubation (15/41; 36.5%) was the major cause of vocal cord palsy. The second major cause was surgical (iatrogenic) trauma, which constituted 26.8% (11/41), of which thyroidectomy contributed 81.81% (9/11) and cardiac surgery (coronary artery bypass grafting, CABG) contributed 18.18% (2/11). Neurological problems caused 14.63% (6/41) of the total cases. Non-surgical trauma constituted 9.75% (4/41) of the total patients. Left recurrent laryngeal nerve paralysis was found as a complication of heart disease in 7.3% (3/41). Pulmonary tuberculosis and lung cancer were the rarest causes. Hoarseness of voice was the most common symptom, with associated dysphagia in a few. The voice-related quality of life of these patients was found to be poor, with problems in both the social-emotional and physical functioning domains.

  6. Songbirds tune their vocal tract to the fundamental frequency of their song

    Science.gov (United States)

    Riede, Tobias; Suthers, Roderick A.; Fletcher, Neville H.; Blevins, William E.

    2006-01-01

    In human speech, the sound generated by the larynx is modified by articulatory movements of the upper vocal tract, which acts as a variable resonant filter concentrating energy near particular frequencies, or formants, essential in speech recognition. Despite its potential importance in vocal communication, little is known about the presence of tunable vocal tract filters in other vertebrates. The tonal quality of much birdsong, in which upper harmonics have relatively little energy, depends on filtering of the vocal source, but the nature of this filter is controversial. Current hypotheses treat the songbird vocal tract as a rigid tube with a resonance that is modulated by the end-correction of a variable beak opening. Through x-ray cinematography of singing birds, we show that birdsong is accompanied by cyclical movements of the hyoid skeleton and changes in the diameter of the cranial end of the esophagus that maintain an inverse relationship between the volume of the oropharyngeal cavity and esophagus and the song’s fundamental frequency. A computational acoustic model indicates that this song-related motor pattern tunes the major resonance of the oropharyngeal–esophageal cavity to actively track the song’s fundamental frequency. PMID:16567614
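
    The inverse relationship between cavity volume and resonance frequency described above is reminiscent of a Helmholtz resonator, whose resonance rises as the cavity shrinks. The sketch below uses that textbook formula purely as an illustration; it is not the authors' computational acoustic model, and the geometry values are made up.

    ```python
    import math

    # Sketch: Helmholtz resonance f = (c / 2*pi) * sqrt(A / (V * L)) as a toy
    # illustration of how shrinking a cavity volume V raises its resonance.
    # Geometry values are assumed; this is not the paper's acoustic model.

    C = 343.0  # speed of sound in air, m/s

    def helmholtz_hz(neck_area_m2, cavity_volume_m3, neck_length_m):
        return (C / (2.0 * math.pi)) * math.sqrt(
            neck_area_m2 / (cavity_volume_m3 * neck_length_m))

    neck_area, neck_length = 2e-5, 0.01  # hypothetical opening geometry
    for volume in (4e-6, 2e-6, 1e-6):    # progressively smaller cavity volume
        print(f"V = {volume:.0e} m^3 -> f ~ {helmholtz_hz(neck_area, volume, neck_length):.0f} Hz")
    ```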

  7. Dynamics of vocalization-induced modulation of auditory cortical activity at mid-utterance.

    Directory of Open Access Journals (Sweden)

    Zhaocong Chen

    Full Text Available BACKGROUND: Recent research has addressed the suppression of cortical sensory responses to altered auditory feedback that occurs at utterance onset regarding speech. However, there is reason to assume that the mechanisms underlying sensorimotor processing at mid-utterance are different than those involved in sensorimotor control at utterance onset. The present study attempted to examine the dynamics of event-related potentials (ERPs) to different acoustic versions of auditory feedback at mid-utterance. METHODOLOGY/PRINCIPAL FINDINGS: Subjects produced a vowel sound while hearing their pitch-shifted voice (100 cents), a sum of their vocalization and pure tones, or a sum of their vocalization and white noise at mid-utterance via headphones. Subjects also passively listened to playback of what they heard during active vocalization. Cortical ERPs were recorded in response to different acoustic versions of feedback changes during both active vocalization and passive listening. The results showed that, relative to passive listening, active vocalization yielded enhanced P2 responses to the 100 cents pitch shifts, whereas suppression effects of P2 responses were observed when voice auditory feedback was distorted by pure tones or white noise. CONCLUSION/SIGNIFICANCE: The present findings, for the first time, demonstrate a dynamic modulation of cortical activity as a function of the quality of acoustic feedback at mid-utterance, suggesting that auditory cortical responses can be enhanced or suppressed to distinguish self-produced speech from externally-produced sounds.

  8. Vocal Improvisation for Elementary Students.

    Science.gov (United States)

    Thompson, Keith P.

    1980-01-01

    The author describes the three-phase process of musical creativity (exploratory, invention, organizational), identifying activities in each of the creative phases. Included are vocal impression, picture sounds, chord tones, and name improvisation. Selected readings and recordings are included. (KC)

  9. Speech Development

    Science.gov (United States)

    ... The speech-language pathologist should consistently assess your child’s speech and language development, as well as screen for hearing problems (with ... and caregivers play a vital role in a child’s speech and language development. It is important that you talk to your ...

  10. Vocal therapy of hyperkinetic dysphonia.

    Science.gov (United States)

    Mumović, Gordana; Veselinović, Mila; Arbutina, Tanja; Škrbić, Renata

    2014-01-01

    Hyperkinetic (hyperfunctional) dysphonia is a common pathology. The disorder is often found in vocal professionals faced with high vocal requirements. The objective of this study was to evaluate the effects of vocal therapy on a voice condition characterized by hyperkinetic dysphonia with prenodular lesions and soft nodules. The study included 100 adult patients and 27 children aged 4-16 years with prenodular lesions and soft nodules. A subjective acoustic analysis using the GIRBAS scale was performed prior to and after vocal therapy. Twenty adult patients and 10 children underwent objective acoustic analysis including several acoustic parameters. Pathological vocal qualities (hoarse, harsh and breathy voice) were also obtained by computer analysis. The subjective acoustic analysis revealed a significant improvement in dysphonia parameters after vocal treatment in adults and children. After treatment, all levels of dysphonia were lowered in 85% (85/100) of adult patients and 29% (29/100) had a normal voice. Before vocal therapy 9 children had severe, 13 had moderate and 8 slight dysphonia. After vocal therapy only 1 child had severe dysphonia, 7 had moderate, 10 had slight levels of dysphonia and 9 were without voice disorder. The objective acoustic analysis in adults revealed a significant improvement (p≤0.025) in all dysphonia parameters except SD F0 and jitter %. In children, the acoustic parameters SD F0, jitter % and NNE (normalized noise energy) were significantly improved (p=0.003-0.03). Pathological voice qualities were also improved in adults and children. Vocal therapy is thus effective for hyperkinetic dysphonia with prenodular lesions and soft nodules in both adults and children, affecting diverse acoustic parameters.
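
    The acoustic measures reported above (jitter %, shimmer, SD F0, NNE) are standard perturbation and noise statistics computed over successive glottal cycles. A minimal sketch of the two most common ones follows, assuming cycle-by-cycle period and amplitude sequences have already been extracted; the toy values are hypothetical, not data from the study:

        import numpy as np

        def local_jitter_percent(periods_s):
            # Mean absolute difference between consecutive pitch periods,
            # expressed as a percentage of the mean period.
            periods = np.asarray(periods_s, dtype=float)
            return 100.0 * np.mean(np.abs(np.diff(periods))) / np.mean(periods)

        def local_shimmer_percent(amplitudes):
            # The same statistic applied to consecutive cycle peak amplitudes.
            amps = np.asarray(amplitudes, dtype=float)
            return 100.0 * np.mean(np.abs(np.diff(amps))) / np.mean(amps)

        # Hypothetical cycle-by-cycle data for a roughly 200 Hz voice.
        periods = [0.0050, 0.0051, 0.0049, 0.0050, 0.0052]
        amps = [1.00, 0.97, 1.02, 0.99, 1.01]
        print(f"jitter  = {local_jitter_percent(periods):.2f} %")
        print(f"shimmer = {local_shimmer_percent(amps):.2f} %")

    Clinical packages apply additional voicing checks and smoothing, so values from a sketch like this are only indicative.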

  11. Modeling the development of pronunciation in infant speech acquisition.

    Science.gov (United States)

    Howard, Ian S; Messum, Piers

    2011-01-01

    Pronunciation is an important part of speech acquisition, but little attention has been given to the mechanism or mechanisms by which it develops. Speech sound qualities, for example, have just been assumed to develop by simple imitation. In most accounts this is then assumed to be by acoustic matching, with the infant comparing his output to that of his caregiver. There are theoretical and empirical problems with both of these assumptions, and we present a computational model, Elija, that does not learn to pronounce speech sounds this way. Elija starts by exploring the sound making capabilities of his vocal apparatus. Then he uses the natural responses he gets from a caregiver to learn equivalence relations between his vocal actions and his caregiver's speech. We show that Elija progresses from a babbling stage to learning the names of objects. This demonstrates the viability of a non-imitative mechanism in learning to pronounce.

  12. The Siren song of vocal fundamental frequency for romantic relationships

    Directory of Open Access Journals (Sweden)

    Sarah eWeusthoff

    2013-07-01

    Full Text Available A multitude of factors contribute to why and how romantic relationships are formed as well as whether they ultimately succeed or fail. Drawing on evolutionary models of attraction and speech production as well as integrative models of relationship functioning, this review argues that paralinguistic cues (more specifically the fundamental frequency of the voice) that are initially a strong source of attraction also increase couples’ risk for relationship failure. Conceptual similarities and differences between the multiple operationalizations and interpretations of vocal fundamental frequency are discussed and guidelines are presented for understanding both convergent and non-convergent findings. Implications for clinical practice and future research are discussed.

  13. The siren song of vocal fundamental frequency for romantic relationships.

    Science.gov (United States)

    Weusthoff, Sarah; Baucom, Brian R; Hahlweg, Kurt

    2013-01-01

    A multitude of factors contribute to why and how romantic relationships are formed as well as whether they ultimately succeed or fail. Drawing on evolutionary models of attraction and speech production as well as integrative models of relationship functioning, this review argues that paralinguistic cues (more specifically the fundamental frequency of the voice) that are initially a strong source of attraction also increase couples' risk for relationship failure. Conceptual similarities and differences between the multiple operationalizations and interpretations of vocal fundamental frequency are discussed and guidelines are presented for understanding both convergent and non-convergent findings. Implications for clinical practice and future research are discussed.

  14. Linear Classifier with Reject Option for the Detection of Vocal Fold Paralysis and Vocal Fold Edema

    Science.gov (United States)

    Kotropoulos, Constantine; Arce, Gonzalo R.

    2009-12-01

    Two distinct two-class pattern recognition problems are studied, namely, the detection of male subjects who are diagnosed with vocal fold paralysis against male subjects who are diagnosed as normal and the detection of female subjects who are suffering from vocal fold edema against female subjects who do not suffer from any voice pathology. To do so, utterances of the sustained vowel "ah" are employed from the Massachusetts Eye and Ear Infirmary database of disordered speech. Linear prediction coefficients extracted from the aforementioned utterances are used as features. The receiver operating characteristic curve of the linear classifier, that stems from the Bayes classifier when Gaussian class conditional probability density functions with equal covariance matrices are assumed, is derived. The optimal operating point of the linear classifier is specified with and without reject option. First results using utterances of the "rainbow passage" are also reported for completeness. The reject option is shown to yield statistically significant improvements in the accuracy of detecting the voice pathologies under study.
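
    To make the decision rule concrete, the sketch below trains a linear discriminant (the Bayes classifier under Gaussian class-conditional densities with a shared covariance matrix) and rejects test utterances whose maximum posterior falls below a threshold. The random "LPC-like" feature vectors, the class sizes and the 0.8 threshold are placeholders for illustration, not the authors' data or settings:

        import numpy as np
        from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

        rng = np.random.default_rng(0)
        # Placeholder 12-dimensional feature vectors standing in for LPC features.
        X = np.vstack([rng.normal(0.5, 1.0, size=(100, 12)),    # "pathological" class
                       rng.normal(-0.5, 1.0, size=(100, 12))])  # "normal" class
        y = np.array([1] * 100 + [0] * 100)

        clf = LinearDiscriminantAnalysis()   # linear boundary from the shared-covariance assumption
        clf.fit(X, y)

        def classify_with_reject(features, threshold=0.8):
            # Return 1 (pathology), 0 (normal), or None (reject) based on the maximum posterior.
            posterior = clf.predict_proba(np.asarray(features).reshape(1, -1))[0]
            label = int(np.argmax(posterior))
            return label if posterior[label] >= threshold else None

        print(classify_with_reject(rng.normal(size=12)))

    Raising the threshold trades coverage for accuracy, which is the effect the reject option exploits.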

  15. Descrição da qualidade vocal de personagens idosos dos filmes de Hollywood Vocal quality description of senile characters from Hollywood movies

    Directory of Open Access Journals (Sweden)

    Gisele Oliveira

    2010-06-01

    Full Text Available PURPOSE: to describe the vocal quality of elderly characters in Hollywood movies. METHODS: 50 speech samples of elderly characters, 11 female and 39 male, were collected from 38 Hollywood movies released between 1993 and 2001. Through auditory-perceptual analysis of the speech samples, 20 trained speech-language pathologists classified each character as elderly or non-elderly and rated the voices on the parameters most frequently reported in the literature as altered: hoarseness, creakiness (vocal fry), breathiness, strain, roughness, asthenia, nasality, tremor, modulation, pitch and stability of the fundamental frequency. RESULTS: the auditory-perceptual analysis showed that the large majority of the actors (82%) used an aged voice to portray their roles. The most evident marker in the voices was altered vocal quality (92%), expressed as creakiness (80%), breathiness (54%), strain (38%), hoarseness (30%) and asthenia (28%). The second most frequent marker used by the actors in their portrayals was wide and varied vocal modulation (44%). Alterations in voice control (36%) and instability of the fundamental frequency (38%) were also observed. CONCLUSION: the results indicate that Hollywood movies characterize elderly people through evident deviations in voice quality and modulation, using altered voice types and wide, unstable vocal modulation.

  16. Dopamine Regulation of Human Speech and Bird Song: A Critical Review

    Science.gov (United States)

    Simonyan, Kristina; Horwitz, Barry; Jarvis, Erich D.

    2012-01-01

    To understand the neural basis of human speech control, extensive research has been done using a variety of methodologies in a range of experimental models. Nevertheless, several critical questions about learned vocal motor control still remain open. One of them is the mechanism(s) by which neurotransmitters, such as dopamine, modulate speech and…

  18. Effects of Loud and Amplified Speech on Sentence and Word Intelligibility in Parkinson Disease

    Science.gov (United States)

    Neel, Amy T.

    2009-01-01

    Purpose: In the two experiments in this study, the author examined the effects of increased vocal effort (loud speech) and amplification on sentence and word intelligibility in speakers with Parkinson disease (PD). Methods: Five talkers with PD produced sentences and words at habitual levels of effort and using loud speech techniques. Amplified…

  19. Verb Inflection in German-Learning Children with Typical and Atypical Language Acquisition: The Impact of Subsyllabic Frequencies

    Science.gov (United States)

    Ott, Susan; Hohle, Barbara

    2013-01-01

    Previous research has shown that high phonotactic frequencies facilitate the production of regularly inflected verbs in English-learning children with specific language impairment (SLI) but not with typical development (TD). We asked whether this finding can be replicated for German, a language with a much more complex inflectional verb paradigm…

  20. Paradoxical Vocal Fold Movement (PVFM)

    Science.gov (United States)

  1. A Near-Infrared Spectroscopy Study on Cortical Hemodynamic Responses to Normal and Whispered Speech in 3- to 7-Year-Old Children

    Science.gov (United States)

    Remijn, Gerard B.; Kikuchi, Mitsuru; Yoshimura, Yuko; Shitamichi, Kiyomi; Ueno, Sanae; Tsubokawa, Tsunehisa; Kojima, Haruyuki; Higashida, Haruhiro; Minabe, Yoshio

    2017-01-01

    Purpose: The purpose of this study was to assess cortical hemodynamic response patterns in 3- to 7-year-old children listening to two speech modes: normally vocalized and whispered speech. Understanding whispered speech requires processing of the relatively weak, noisy signal, as well as the cognitive ability to understand the speaker's reason for…

  2. Sintomas vocais e perfil de professores em um programa de saúde vocal Vocal symptoms and profile of teachers in a vocal health program

    Directory of Open Access Journals (Sweden)

    Karin Choi-Cardim

    2010-10-01

    , and in G2 most of them used to work 6 to 10 hours a day. G1 had 51% of individuals who did not seek a laryngologist's or speech pathologist's help when needed, while in G2 a higher percentage of individuals (68.38%) had already looked for a specialist due to voice disorders. Both groups had a large number of voice symptoms (> 4); in G1 the mean number of symptoms was 3.5 while in G2 it was 5.8, demonstrating a statistically higher percentage of symptoms in G2 (98.05%) than in G1 (57%; p<0.001). CONCLUSION: although both groups had similar profiles, a higher mean number of vocal symptoms was found in G2, meaning that this group sought speech pathology assistance already at a higher risk of voice disorders, possibly because a different type of intervention was offered (vocal rehabilitation), which attracted teachers with more disorders. Thus, it is very important to offer vocal health programs focusing on both prevention and vocal treatment, because these will contribute not only to the subjects' work but also to their quality of life.

  3. Análise do trato vocal em pacientes com nódulos, fendas e cisto de prega vocal Vocal tract analysis in patients with vocal fold nodules, clefts and cysts

    Directory of Open Access Journals (Sweden)

    Raquel Buzelin Nunes

    2009-04-01

    the frequency of supraglottic vocal tract adjustments in dysphonic women with nodules, clefts and cysts. METHODS: We assessed 31 dysphonic women, aged between 18 and 45 years, with vocal alteration and a diagnosis of nodules, middle-posterior cleft or cyst. A summarized evaluation of the oral sensory-motor system was carried out and the patients underwent videolaryngostroboscopy and nasal and laryngeal fibroscopy. Three distinct groups were selected: patients with bilateral nodules, clefts and cysts, with similar glottic configuration. Their vocal tracts were visually analyzed through nasal and laryngeal fibroscopy exams by speech-language pathologists and otorhinolaryngologists, who checked the following parameters: supraglottic constriction, vertical mobility of the larynx, pharyngeal constriction and tongue mobility. The data were statistically described and analyzed. RESULTS: the visual analysis did not reveal statistically significant differences that would separate the glottic alteration groups. CONCLUSION: there was no correlation between supraglottic tract adjustments and any particular type of glottic alteration. These are individual behaviors that generate adjustments and justify the different vocal qualities in patients with the same type of laryngeal alteration.

  4. Evaluation of pitch coding alternatives for vibrotactile stimulation in speech training of the deaf

    Energy Technology Data Exchange (ETDEWEB)

    Barbacena, I L; Barros, A T [CEFET/PB, Joao Pessoa - PB (Brazil); Freire, R C S [DEE, UFCG, Campina Grande-PB (Brazil); Vieira, E C A [CEFET/PB, Joao Pessoa - PB (Brazil)

    2007-11-15

    Use of vibrotactile feedback stimulation as an aid for speech vocalization by the hearing impaired or deaf is reviewed. Architecture of a vibrotactile based speech therapy system is proposed. Different formulations for encoding the fundamental frequency of the vocalized speech into the pulsed stimulation frequency are proposed and investigated. Simulation results are also presented to obtain a comparative evaluation of the effectiveness of the different formulated transformations. Results of the perception sensitivity to the vibrotactile stimulus frequency to verify effectiveness of the above transformations are included.
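
    As an illustration of the kind of encoding compared in this work, the sketch below maps the fundamental frequency of vocalized speech onto a vibrotactile pulse rate with a log-linear transformation. The specific input and output ranges are assumptions chosen for the example, not the formulations evaluated in the paper:

        import math

        def f0_to_pulse_rate(f0_hz, f0_range=(80.0, 400.0), pulse_range=(20.0, 250.0)):
            # Log-linear mapping from speech F0 to a vibrotactile pulse frequency (Hz).
            f0_lo, f0_hi = f0_range
            p_lo, p_hi = pulse_range
            f0 = min(max(f0_hz, f0_lo), f0_hi)  # clamp to the assumed speech range
            x = (math.log(f0) - math.log(f0_lo)) / (math.log(f0_hi) - math.log(f0_lo))
            return p_lo + x * (p_hi - p_lo)

        for f0 in (100, 200, 300):
            print(f0, "Hz ->", round(f0_to_pulse_rate(f0), 1), "Hz pulse rate")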

  5. Verb inflection in monolingual Dutch and sequential bilingual Turkish-Dutch children with and without SLI.

    Science.gov (United States)

    Blom, Elma; de Jong, Jan; Orgassa, Antje; Baker, Anne; Weerman, Fred

    2013-01-01

    Both children with specific language impairment (SLI) and children who acquire a second language (L2) make errors with verb inflection. This overlap between SLI and L2 raises the question of whether verb inflection can discriminate between L2 children with and without SLI. In this study we addressed this question for Dutch. The secondary goal of the study was to investigate variation in error types and error profiles across groups. Data were collected from 6-8-year-old children with SLI who acquire Dutch as their first language (L1), Dutch L1 children with typical development (TD), Dutch L2 children with SLI, and Dutch L1 TD children who were on average 2 years younger. An experimental elicitation task was employed that tested use of verb inflection; context (3SG, 3PL) was manipulated and word order and verb type were controlled. Accuracy analyses revealed effects of impairment in both L1 and L2 children with SLI. However, individual variation indicated that there is no specific error profile for SLI. Verb inflection use as measured in our study discriminated fairly well in the L1 group, but classification was less accurate in the L2 group. Between-group differences emerged furthermore for certain types of errors, but all groups also showed considerable variation in errors and there was not a specific error profile that distinguished SLI from TD.

  6. On the Distinction between Regular and Irregular Inflectional Morphology: Evidence from Dinka

    Science.gov (United States)

    Ladd, D. Robert; Remijsen, Bert; Manyang, Caguor Adong

    2009-01-01

    Discussions of the psycholinguistic significance of regularity in inflectional morphology generally deal with languages in which regular forms can be clearly identified and revolve around whether there are distinct processing mechanisms for regular and irregular forms. We present a detailed description of Dinka's notoriously irregular noun number…

  7. Losing track of time? Processing of time reference inflection in agrammatic and healthy speakers of German

    NARCIS (Netherlands)

    Bos, Laura S.; Hanne, Sandra; Wartenburger, Isabell; Bastiaanse, Roelien

    2014-01-01

    Background: Individuals with agrammatic aphasia (IWAs) have problems with grammatical decoding of tense inflection. However, these difficulties depend on the time frame that the tense refers to. Verb morphology with reference to the past is more difficult than with reference to the non-past, because

  9. Testing the Domain-by-Age Model: inflection and placement of Dutch verbs

    NARCIS (Netherlands)

    E. Blom

    2008-01-01

    Generalizing over various observations on the language development of children acquiring a second language (child L2 acquisition), Schwartz (2003) concludes that "in the domain of inflectional morphology, child L2 acquisition is more like child L1 acquisition, and in the domain of syntax, child L2

  10. From Root Infinitive to Finite Sentence : The acquisition of verbal inflections and auxiliaries

    NARCIS (Netherlands)

    Blom, W.B.T.

    2003-01-01

    Across languages, children in the earliest stages of syntactic development tend to omit overt markings of finiteness, such as verbal inflections and auxiliaries: when children use a verb, they use an infinitival form (e.g. Dutch) or a bare stem (e.g. English). From Root Infinitive to Finite Sentence

  11. Lexical access to inflected words as measured by lateralized visual lexical decision.

    Science.gov (United States)

    Laine, M; Koivisto, M

    1998-01-01

    In two lateralized visual lexical decision experiments conducted with normal subjects, we studied hemispheric performance in the recognition of case-inflected Finnish nouns. Previous research employing mainly locative cases has indicated that such noun forms undergo morphological decomposition. The present experiments extend this finding to syntactic cases by showing that nouns with partitive or genitive endings take longer to recognize and elicit more errors than otherwise comparable monomorphemic nominative singular nouns. Morpheme-based recognition of all case-inflected forms would be a particularly appropriate solution for the mental lexicon in highly inflecting languages like Finnish: it saves storage space and enables fast recognition of inflected forms not encountered before. In real words, morphological structure did not interact with visual field. However, particularly demanding, morphologically decomposable nonwords elicited more errors in the left visual field/right hemisphere than did nondecomposable nonwords. Our results suggest that at least in Finnish, both hemispheres are capable of morpheme-based access, but this mechanism is more accurate in the left than in the right hemisphere.

  12. Three-Dimensional Flow Separation Induced by a Model Vocal Fold Polyp

    Science.gov (United States)

    Stewart, Kelley C.; Erath, Byron D.; Plesniak, Michael W.

    2012-11-01

    The fluid-structure energy exchange process for normal speech has been studied extensively, but it is not well understood for pathological conditions. Polyps and nodules, which are geometric abnormalities that form on the medial surface of the vocal folds, can disrupt vocal fold dynamics and thus can have devastating consequences on a patient's ability to communicate. A recent in-vitro investigation of a model polyp in a driven vocal fold apparatus demonstrated that such a geometric abnormality considerably disrupts the glottal jet behavior and that this flow field adjustment was a likely reason for the severe degradation of vocal quality in patients. Understanding the formation and propagation of vortical structures from a geometric protuberance, and their subsequent impact on the aerodynamic loadings that drive vocal fold dynamics, is a critical component in advancing the treatment of this pathological condition. The present investigation concerns the three-dimensional flow separation induced by a wall-mounted prolate hemispheroid with a 2:1 aspect ratio in cross flow, i.e., a model vocal fold polyp. Unsteady three-dimensional flow separation and its impact on the wall pressure loading are examined using skin friction line visualization and wall pressure measurements. Supported by the National Science Foundation, Grant No. CBET-1236351 and GW Center for Biomimetics and Bioinspired Engineering (COBRE).

  13. Vocal health fitness to different music styles - doi:10.5020/18061230.2010.p278

    Directory of Open Access Journals (Sweden)

    Maria Claudia Mendes Caminha Muniz

    2012-01-01

    Full Text Available Objective: To present the genres and styles currently found on the western music scene, focusing on the practice of the singing voice. Methods: An observational, documental study in which we selected sound sources presenting musical genres and styles that are part of the researchers' experience and analyzed them with respect to their origins, formative elements and vocal features. Alongside this, we carried out a literature review based on database searches and free review of websites and classic reference books in the area. Results: The selected styles (Rock and Roll, Heavy Metal, Thrash Metal, Grunge, Gothic Metal, Rap, Funk, Blues, R&B – Rhythm and Blues, Soul, Gospel, MPB, Samba, Forro, Sertanejo, Bossa Nova, Opera and Chamber Music) were described, pointing out the reasons for the speech therapist to be informed about them and about singing voice aspects. The speech therapist's guidance may minimize possible vocal damage caused by each style, since each of them carries its own patterns to which the interpreter must submit. Conclusions: We conclude that the singer will use a specific vocal pattern that resembles the musical style he intends to sing, regardless of any harm it may or may not cause to vocal health. When choosing a musical style, it is important that the singer knows and understands how the use of his vocal apparatus will or will not cause injury to his voice, and is aware that singing technique is necessary for vocal longevity.

  14. Processing emotions in sounds: cross-domain aftereffects of vocal utterances and musical sounds.

    Science.gov (United States)

    Bowman, Casady; Yamauchi, Takashi

    2016-11-16

    Nonlinguistic signals in the voice and musical instruments play a critical role in communicating emotion. Although previous research suggests a common mechanism for emotion processing in music and speech, the precise relationship between the two domains is unclear due to the paucity of direct evidence. By applying the adaptation paradigm developed by Bestelmeyer, Rouger, DeBruine, and Belin [2010. Auditory adaptation in vocal affect perception. Cognition, 117(2), 217-223. doi: 10.1016/j.cognition.2010.08.008 ], this study shows cross-domain aftereffects from vocal to musical sounds. Participants heard an angry or fearful sound four times, followed by a test sound and judged whether the test sound was angry or fearful. Results show cross-domain aftereffects in one direction - vocal utterances to musical sounds, not vice-versa. This effect occurred primarily for angry vocal sounds. It is argued that there is a unidirectional relationship between vocal and musical sounds where emotion processing of vocal sounds encompasses musical sounds but not vice-versa.

  15. Children's development of self-regulation in speech production.

    Science.gov (United States)

    MacDonald, Ewen N; Johnson, Elizabeth K; Forsythe, Jaime; Plante, Paul; Munhall, Kevin G

    2012-01-24

    Species-specific vocalizations fall into two broad categories: those that emerge during maturation, independent of experience, and those that depend on early life interactions with conspecifics. Human language and the communication systems of a small number of other species, including songbirds, fall into this latter class of vocal learning. Self-monitoring has been assumed to play an important role in the vocal learning of speech and studies demonstrate that perception of your own voice is crucial for both the development and lifelong maintenance of vocalizations in humans and songbirds. Experimental modifications of auditory feedback can also change vocalizations in both humans and songbirds. However, with the exception of large manipulations of timing, no study to date has ever directly examined the use of auditory feedback in speech production under the age of 4. Here we use a real-time formant perturbation task to compare the response of toddlers, children, and adults to altered feedback. Children and adults reacted to this manipulation by changing their vowels in a direction opposite to the perturbation. Surprisingly, toddlers' speech didn't change in response to altered feedback, suggesting that long-held assumptions regarding the role of self-perception in articulatory development need to be reconsidered.

  16. Expression of emotion in Eastern and Western music mirrors vocalization.

    Science.gov (United States)

    Bowling, Daniel Liu; Sundararajan, Janani; Han, Shui'er; Purves, Dale

    2012-01-01

    In Western music, the major mode is typically used to convey excited, happy, bright or martial emotions, whereas the minor mode typically conveys subdued, sad or dark emotions. Recent studies indicate that the differences between these modes parallel differences between the prosodic and spectral characteristics of voiced speech sounds uttered in corresponding emotional states. Here we ask whether tonality and emotion are similarly linked in an Eastern musical tradition. The results show that the tonal relationships used to express positive/excited and negative/subdued emotions in classical South Indian music are much the same as those used in Western music. Moreover, tonal variations in the prosody of English and Tamil speech uttered in different emotional states are parallel to the tonal trends in music. These results are consistent with the hypothesis that the association between musical tonality and emotion is based on universal vocal characteristics of different affective states.

  17. Expression of emotion in Eastern and Western music mirrors vocalization.

    Directory of Open Access Journals (Sweden)

    Daniel Liu Bowling

    Full Text Available In Western music, the major mode is typically used to convey excited, happy, bright or martial emotions, whereas the minor mode typically conveys subdued, sad or dark emotions. Recent studies indicate that the differences between these modes parallel differences between the prosodic and spectral characteristics of voiced speech sounds uttered in corresponding emotional states. Here we ask whether tonality and emotion are similarly linked in an Eastern musical tradition. The results show that the tonal relationships used to express positive/excited and negative/subdued emotions in classical South Indian music are much the same as those used in Western music. Moreover, tonal variations in the prosody of English and Tamil speech uttered in different emotional states are parallel to the tonal trends in music. These results are consistent with the hypothesis that the association between musical tonality and emotion is based on universal vocal characteristics of different affective states.

  18. Speech production as state feedback control.

    Science.gov (United States)

    Houde, John F; Nagarajan, Srikantan S

    2011-01-01

    Spoken language exists because of a remarkable neural process. Inside a speaker's brain, an intended message gives rise to neural signals activating the muscles of the vocal tract. The process is remarkable because these muscles are activated in just the right way that the vocal tract produces sounds a listener understands as the intended message. What is the best approach to understanding the neural substrate of this crucial motor control process? One of the key recent modeling developments in neuroscience has been the use of state feedback control (SFC) theory to explain the role of the CNS in motor control. SFC postulates that the CNS controls motor output by (1) estimating the current dynamic state of the thing (e.g., arm) being controlled, and (2) generating controls based on this estimated state. SFC has successfully predicted a great range of non-speech motor phenomena, but as yet has not received attention in the speech motor control community. Here, we review some of the key characteristics of speech motor control and what they say about the role of the CNS in the process. We then discuss prior efforts to model the role of CNS in speech motor control, and argue that these models have inherent limitations - limitations that are overcome by an SFC model of speech motor control which we describe. We conclude by discussing a plausible neural substrate of our model.
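
    To make steps (1) and (2) concrete, the sketch below simulates a one-dimensional plant whose hidden state is tracked by a simple observer (an internal forward model corrected by noisy sensory feedback) and driven toward a target by feedback computed from the estimate. The dynamics, gains and noise levels are arbitrary illustrative choices, not the authors' speech model:

        import numpy as np

        rng = np.random.default_rng(1)
        a, b = 0.95, 0.1      # scalar plant: x[t+1] = a*x[t] + b*u[t] + process noise
        k_fb = 4.0            # feedback gain applied to the *estimated* state
        l_obs = 0.5           # observer gain correcting the prediction with sensory feedback
        target = 1.0          # desired state (e.g., an intended acoustic/articulatory goal)
        u_ff = (1 - a) / b * target   # feedforward term that holds the plant at the target

        x, x_hat = 0.0, 0.0
        for t in range(50):
            u = u_ff + k_fb * (target - x_hat)       # (2) controls generated from the estimate
            x = a * x + b * u + rng.normal(0, 0.01)  # true (hidden) state evolves
            y = x + rng.normal(0, 0.05)              # noisy sensory feedback
            x_pred = a * x_hat + b * u               # (1) predict the next state internally...
            x_hat = x_pred + l_obs * (y - x_pred)    # ...then correct the estimate with feedback
        print(f"final state ~ {x:.2f}, estimate ~ {x_hat:.2f} (target {target})")

    A full SFC model would use a Kalman-style gain and a delayed, transformed sensory mapping, but the structure is the same: internal prediction corrected by feedback, with control computed from the estimate.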

  19. Speech production as state feedback control

    Directory of Open Access Journals (Sweden)

    John F Houde

    2011-10-01

    Full Text Available Spoken language exists because of a remarkable neural process. Inside a speaker’s brain, an intended message gives rise to neural signals activating the muscles of the vocal tract. The process is remarkable because these muscles are activated in just the right way that the vocal tract produces sounds a listener understands as the intended message. What is the best approach to understanding the neural substrate of this crucial motor control process? One of the key recent modeling developments in neuroscience has been the use of state feedback control (SFC) theory to explain the role of the CNS in motor control. SFC postulates that the CNS controls motor output by (1) estimating the current dynamic state of the thing (e.g., arm) being controlled, and (2) generating controls based on this estimated state. SFC has successfully predicted a great range of non-speech motor phenomena, but as yet has not received attention in the speech motor control community. Here, we review some of the key characteristics of speech motor control and what they say about the role of the CNS in the process. We then discuss prior efforts to model the role of CNS in speech motor control, and argue that these models have inherent limitations – limitations that are overcome by an SFC model of speech motor control which we describe. We conclude by discussing a plausible neural substrate of our model.

  20. Speed Accuracy Tradeoffs in Speech Production

    Science.gov (United States)

    2017-05-01

    [Fragmentary abstract] Only excerpts of this report survive in the index. They include a note, citing Hardcastle, that certain constrictions are formed more slowly than for the production of a stop, which may help to explain why vowels are often lengthened in advance of fricatives. The remaining text consists of broken reference entries: a paper on how speakers' vocal tract geometries shape their articulatory vowel space (8th International Seminar on Speech Production, ISSP'08, pp. 333-336), a truncated citation of Guenther (2013), and a study of interspeaker variability in relative tongue size and vowel production (The Journal of the Acoustical Society of America, 134(5), 4205).

  1. Song and speech: examining the link between singing talent and speech imitation ability

    Science.gov (United States)

    Christiner, Markus; Reiterer, Susanne M.

    2013-01-01

    In previous research on speech imitation, musicality, and an ability to sing were isolated as the strongest indicators of good pronunciation skills in foreign languages. We, therefore, wanted to take a closer look at the nature of the ability to sing, which shares a common ground with the ability to imitate speech. This study focuses on whether good singing performance predicts good speech imitation. Forty-one singers of different levels of proficiency were selected for the study and their ability to sing, to imitate speech, their musical talent and working memory were tested. Results indicated that singing performance is a better indicator of the ability to imitate speech than the playing of a musical instrument. A multiple regression revealed that 64% of the speech imitation score variance could be explained by working memory together with educational background and singing performance. A second multiple regression showed that 66% of the speech imitation variance of completely unintelligible and unfamiliar language stimuli (Hindi) could be explained by working memory together with a singer's sense of rhythm and quality of voice. This supports the idea that both vocal behaviors have a common grounding in terms of vocal and motor flexibility, ontogenetic and phylogenetic development, neural orchestration and auditory memory with singing fitting better into the category of “speech” on the productive level and “music” on the acoustic level. As a result, good singers benefit from vocal and motor flexibility, productively and cognitively, in three ways. (1) Motor flexibility and the ability to sing improve language and musical function. (2) Good singers retain a certain plasticity and are open to new and unusual sound combinations during adulthood both perceptually and productively. (3) The ability to sing improves the memory span of the auditory working memory. PMID:24319438

  2. Voice parameters and videonasolaryngoscopy in children with vocal nodules: a longitudinal study, before and after voice therapy.

    Science.gov (United States)

    Valadez, Victor; Ysunza, Antonio; Ocharan-Hernandez, Esther; Garrido-Bustamante, Norma; Sanchez-Valerio, Araceli; Pamplona, Ma C

    2012-09-01

    Vocal Nodules (VN) are a functional voice disorder associated with voice misuse and abuse in children. There are few reports addressing vocal parameters in children with VN, especially after a period of vocal rehabilitation. The purpose of this study is to describe measurements of vocal parameters including Fundamental Frequency (FF), Shimmer (S), and Jitter (J), videonasolaryngoscopy examination and clinical perceptual assessment, before and after voice therapy in children with VN. Voice therapy was provided using visual support through Speech-Viewer software. Twenty patients with VN were studied. An acoustical analysis of voice was performed and compared with data from subjects from a control group matched by age and gender. Also, clinical perceptual assessment of voice and videonasolaryngoscopy were performed on all patients with VN. After a period of voice therapy, provided with visual support using Speech Viewer-III (SV-III-IBM) software, new acoustical analyses, perceptual assessments and videonasolaryngoscopies were performed. Before the onset of voice therapy, there was a significant difference in voice parameters between patients with VN and the control group. After the voice therapy period, a significant improvement was observed in these parameters, and vocal nodules were no longer discernible on the vocal folds in any of the cases. SV-III software seems to be a safe and reliable method for providing voice therapy in children with VN. Acoustic voice parameters, perceptual data and videonasolaryngoscopy findings were significantly improved after the speech therapy period was completed. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.

  3. Using naturalistic utterances to investigate vocal communication processing and development in human and non-human primates.

    Science.gov (United States)

    Talkington, William J; Taglialatela, Jared P; Lewis, James W

    2013-11-01

    Humans and several non-human primates possess cortical regions that are most sensitive to vocalizations produced by their own kind (conspecifics). However, the use of speech and other broadly defined categories of behaviorally relevant natural sounds has led to many discrepancies regarding where voice-sensitivity occurs, and more generally the identification of cortical networks, "proto-networks" or protolanguage networks, and pathways that may be sensitive or selective for certain aspects of vocalization processing. In this prospective review we examine different approaches for exploring vocal communication processing, including pathways that may be, or become, specialized for conspecific utterances. In particular, we address the use of naturally produced non-stereotypical vocalizations (mimicry of other animal calls) as another category of vocalization for use with human and non-human primate auditory systems. We focus this review on two main themes, including progress and future ideas for studying vocalization processing in great apes (chimpanzees) and in very early stages of human development, including infants and fetuses. Advancing our understanding of the fundamental principles that govern the evolution and early development of cortical pathways for processing non-verbal communication utterances is expected to lead to better diagnoses and early intervention strategies in children with communication disorders, improve rehabilitation of communication disorders resulting from brain injury, and develop new strategies for intelligent hearing aid and implant design that can better enhance speech signals in noisy environments. This article is part of a Special Issue entitled "Communication Sounds and the Brain: New Directions and Perspectives".

  4. University Vocal Training and Vocal Health of Music Educators and Music Therapists

    Science.gov (United States)

    Baker, Vicki D.; Cohen, Nicki

    2017-01-01

    The purpose of this study was to describe the university vocal training and vocal health of music educators and music therapists. The participants (N = 426), music educators (n = 351) and music therapists (n = 75), completed a survey addressing demographics, vocal training, voice usage, and vocal health. Both groups reported singing at least 50%…

  5. The vocal monotony of monogamy

    Science.gov (United States)

    Thomas, Jeanette

    2003-04-01

    There are four phocids in waters around Antarctica: Weddell, leopard, crabeater, and Ross seals. These four species provide a unique opportunity to examine underwater vocal behavior in species sharing the same ecosystem. Some species live in pack ice, others in fast ice, but all are restricted to the Antarctic or sub-Antarctic islands. All breed and produce vocalizations under water. Social systems range from polygyny in large breeding colonies, to serial monogamy, to solitary species. The type of mating system influences the number of underwater vocalizations in the repertoire, with monogamous seals producing only a single call, polygynous species producing up to 35 calls, and solitary species an intermediate number of about 10 calls. Breeding occurs during the austral spring and each species carves out an acoustic niche for communicating, with species using different frequency ranges, temporal patterns, and amplitude changes to convey their species-specific calls and presumably reduce acoustic competition. Some species exhibit geographic variations in their vocalizations around the continent, which may reflect discrete breeding populations. Some seals become silent during a vulnerable time of predation by killer whales, perhaps to avoid detection. Overall, vocalizations of these seals exhibit adaptive characteristics that reflect the co-evolution among species in the same ecosystem.

  6. The Role of the Listener's State in Speech Perception

    Science.gov (United States)

    Viswanathan, Navin

    2009-01-01

    Accounts of speech perception disagree on whether listeners perceive the acoustic signal (Diehl, Lotto, & Holt, 2004) or the vocal tract gestures that produce the signal (e.g., Fowler, 1986). In this dissertation, I outline a research program using a phenomenon called "perceptual compensation for coarticulation" (Mann, 1980) to examine this…

  7. Mimicking Accented Speech as L2 Phonological Awareness

    Science.gov (United States)

    Mora, Joan C.; Rochdi, Youssef; Kivistö-de Souza, Hanna

    2014-01-01

    This study investigated Spanish-speaking learners' awareness of a non-distinctive phonetic difference between Spanish and English through a delayed mimicry paradigm. We assessed learners' speech production accuracy through voice onset time (VOT) duration measures in word-initial pre-vocalic /p t k/ in Spanish and English words, and in Spanish…

  8. Orofacial Movements Associated with Fluent Speech in Persons Who Stutter.

    Science.gov (United States)

    McClean, Michael D.; Tasko, Stephen M.; Runyan, Charles M.

    2004-01-01

    This study was intended to replicate and extend previous findings that (a) during fluent speech persons who stutter (PS) and those who do not (NS) differ in their vocal tract closing movements (L. Max, A. J. Caruso, & V. L. Gracco, 2003) and (b) ratios relating lip and tongue speed to jaw speed increase with stuttering severity (M. D. McClean & C.…

  10. The effect of vocal tract impedance on the vocal folds

    DEFF Research Database (Denmark)

    Agerkvist, Finn T.; Selamtzis, Andreas

    2011-01-01

    , which is the mode that is most limited in pitch range, was tested at its pitch limit C5 (523 Hz) under normal conditions and after the singer had inhaled helium. When inhaling helium, the acoustic impedance of the vocal tract is reduced in magnitude and the resonances are scaled upwards in frequency due to the different density and speed of sound in helium. The electroglottograph shows a change in waveform when the singer inhales helium. The percentage of the glottal cycle during which the vocal cords are open, the so-called open quotient, increases from 40 to 55%. When inhaling helium the male singer was able to reach Eb5...

  11. An Investigation of Extinction-Induced Vocalizations

    Science.gov (United States)

    Valentino, Amber L.; Shillingsburg, M. Alice; Call, Nathan A.; Burton, Britney; Bowen, Crystal N.

    2011-01-01

    Children with autism have significant communication delays. Although some children develop vocalizations through shaping and differential reinforcement, others rarely exhibit vocalizations, and alternative methods are targeted in intervention. However, vocal language often remains a goal for caregivers and clinicians. Thus, strategies to increase…

  12. How low can you go? Physical production mechanism of elephant infrasonic vocalizations.

    Science.gov (United States)

    Herbst, Christian T; Stoeger, Angela S; Frey, Roland; Lohscheller, Jörg; Titze, Ingo R; Gumpenberger, Michaela; Fitch, W Tecumseh

    2012-08-03

    Elephants can communicate using sounds below the range of human hearing ("infrasounds" below 20 hertz). It is commonly speculated that these vocalizations are produced in the larynx, either by neurally controlled muscle twitching (as in cat purring) or by flow-induced self-sustained vibrations of the vocal folds (as in human speech and song). We used direct high-speed video observations of an excised elephant larynx to demonstrate flow-induced self-sustained vocal fold vibration in the absence of any neural signals, thus excluding the need for any "purring" mechanism. The observed physical principles of voice production apply to a wide variety of mammals, extending across a remarkably large range of fundamental frequencies and body sizes, spanning more than five orders of magnitude.

  13. Cetacean vocal learning and communication.

    Science.gov (United States)

    Janik, Vincent M

    2014-10-01

    The cetaceans are one of the few mammalian clades capable of vocal production learning. Evidence for this comes from synchronous changes in song patterns of baleen whales and experimental work on toothed whales in captivity. While baleen whales like many vocal learners use this skill in song displays that are involved in sexual selection, toothed whales use learned signals in individual recognition and the negotiation of social relationships. Experimental studies demonstrated that dolphins can use learned signals referentially. Studies on wild dolphins demonstrated how this skill appears to be useful in their own communication system, making them an interesting subject for comparative communication studies.

  14. Pediatric paradoxical vocal-fold motion: presentation and natural history.

    Science.gov (United States)

    Maturo, Stephen; Hill, Courtney; Bunting, Glenn; Baliff, Cathy; Ramakrishna, Jyoti; Scirica, Christina; Fracchia, Shannon; Donovan, Abigail; Hartnick, Christopher

    2011-12-01

    To describe (1) a cohort of children with paradoxical vocal-fold motion (PVFM) who were referred to a multidisciplinary airway center and (2) the outcomes of various treatment modalities including speech therapy, gastroesophageal reflux disease treatment, and psychiatric treatment. This was a case series with chart review of children younger than 18 years with PVFM evaluated at a tertiary care pediatric airway center over a 36-month period. Fifty-nine children with PVFM were evaluated. The cohort had a mean age of 13.64 years (range: 8-18 years) and a female-to-male ratio of 3:1. Speech therapy as an initial treatment resulted in a 63% (24 of 38) success rate after an average of 3.7 treatment sessions. Speech therapy was a more successful treatment than antireflux therapy (P = .001). Ten percent (6 of 59) of the children presented with a known psychiatric diagnosis, and 30% (18 of 59) of children in the cohort were ultimately diagnosed with a psychiatric condition. Children with inspiratory stridor at rest had a lower initial success rate with speech therapy (56%), a higher rate of underlying psychiatric disorders (75%), and a high rate of success after psychiatric treatment (100%) that required, on average, 3 sessions over a 2-month period. To our knowledge, this is the largest study to date on pediatric PVFM. The majority of children with PVFM improve with speech therapy. Children with PVFM at rest may be better treated with psychiatric therapy than speech therapy. Furthermore, children who present with symptoms at rest may have a higher likelihood of underlying psychiatric disease.

  15. Percepção da voz e saúde vocal em idosos coralistas Perception of voice and vocal health in aged chorus members

    Directory of Open Access Journals (Sweden)

    Regina Zanella Penteado

    2010-04-01

    QVV. Despite this fact, they reported difficulties linked to respiration, articulation and modulation, besides a low degree of vocal alterations (hoarseness and breathiness) in the speech-language pathology evaluation. Their voice care is insufficient for promoting vocal health. CONCLUSION: the group of aged chorus members shows difficulties related to vocal health care, voice perception and the vocal health-illness process, as well as alterations in vocal parameters.

  16. Aerosol Emission during Human Speech

    Science.gov (United States)

    Asadi, Sima; Ristenpart, William

    2016-11-01

    The traditional emphasis for airborne disease transmission has been on coughing and sneezing, which are dramatic expiratory events that yield easily visible droplets. Recent research suggests that normal speech can release even larger quantities of aerosols that are too small to see with the naked eye, but are nonetheless large enough to carry a variety of pathogens (e.g., influenza A). This observation raises an important question: what types of speech emit the most aerosols? Here we show that the concentration of aerosols emitted during healthy human speech is positively correlated with both the amplitude (loudness) and fundamental frequency (pitch) of the vocalization. Experimental measurements with an aerodynamic particle sizer (APS) indicate that speaking in a loud voice (95 decibels) yields up to fifty times more aerosols than in a quiet voice (75 decibels), and that sounds associated with certain phonemes (e.g., [a] or [o]) release more aerosols than others. We interpret these results in terms of the egressive airflow rate associated with each phoneme and the corresponding fundamental frequency, which is known to vary significantly with gender and age. The results suggest that individual speech patterns could affect the probability of airborne disease transmission.

  17. [Vocal care: question of prevention and health].

    Science.gov (United States)

    Guimarães, Valeriana de Castro; Viana, Maria Aparecida do Divino Espírito Santo Reis; Barbosa, Maria Alves; Paiva, Maria Luiza de Faria; Tavares, João Antonio Gomes; Camargo, Leandro Azevedo de

    2010-09-01

    Planned by Brazilian doctors, the National Week of the Voice (Semana Nacional da Voz) has spread worldwide due to its huge success. This study aims to present the results obtained during the 9th National Week of the Voice (9ª Semana Nacional da Voz), which took place at the Hospital das Clínicas of the Federal University of Goiás. During the event, 125 patients were selected by the speech-language pathology team and filled out by hand a questionnaire prepared for the campaign regarding possible pharyngolaryngeal alterations. The patients were examined by an otorhinolaryngologist using indirect laryngoscopy and, when necessary, submitted to videolaryngoscopy. After medical evaluation, it was observed that 52 people (41.6%) presented alterations in the speech organs or in proximal regions; in one patient paralysis of the left vocal fold was detected, and another patient presented a tumoral lesion. Considering all the patients attended, only one presented a malignant neoplasm (squamous cell carcinoma), later confirmed by biopsy.

  18. Considerações sobre modificações vocais e laríngeas ocasionadas pelo som basal em mulheres sem queixa vocal Considerations regarding vocal and laryngeal modifications caused by vocal fry in women without voice complaints

    Directory of Open Access Journals (Sweden)

    Débora Meurer Brum

    2010-01-01

    Full Text Available The aim of this study was to verify the vocal and laryngeal changes caused by vocal fry in five adult female subjects without vocal or laryngeal alterations. For this purpose, the sustained emission of the vowel /a/ was digitally recorded and a videolaryngostroboscopic examination was performed. The subjects then produced vocal fry in three series of 15 repetitions, with a 30-second interval between series, after which the laryngeal examination and the recording of the sustained vowel /a/ were repeated. The laryngeal and vocal data obtained before and after the technique were submitted to acoustic, auditory-perceptual and videolaryngostroboscopic analyses. The acoustic analysis was generated with the Multi Speech program. After vocal fry, the following were observed: increased vibration of the vocal fold mucosa; change or maintenance of voice type and pitch; decrease or maintenance of the measures related to jitter and shimmer and of the index suggesting glottal noise; decrease of the soft phonation index; maintenance or change of vocal quality and of the resonance focus, with laryngopharyngeal predominance; decrease of the fundamental frequency; and increase of frequency and amplitude variation. It was concluded that, in this case series, vocal fry had a positive effect on the vibration of the vocal fold mucosa and on voice noise, and a negative effect on voice resonance and stability.

  19. Speech Indexing

    NARCIS (Netherlands)

    Ordelman, R.J.F.; Jong, de F.M.G.; Leeuwen, van D.A.; Blanken, H.M.; de Vries, A.P.; Blok, H.E.; Feng, L.

    2007-01-01

    This chapter will focus on the automatic extraction of information from the speech in multimedia documents. This approach is often referred to as speech indexing and it can be regarded as a subfield of audio indexing that also incorporates for example the analysis of music and sounds. If the objecti

  20. Plowing Speech

    OpenAIRE

    Zla ba sgrol ma

    2009-01-01

    This file contains a plowing speech and a discussion about the speech. This collection presents forty-nine audio files including: several folk song genres; folktales; and local history from the Sman shad Valley of Sde dge county. World Oral Literature Project

  1. Can Speech Development at 36 Months in Children with Hearing Loss Be Predicted from Information Available in the Second Year of Life?.

    Science.gov (United States)

    Obenchain, Patrick; Menn, Lise; Yoshinaga-Itano, Christine

    1999-01-01

    A study involving 19 children with hearing impairments found that those who developed intelligible speech by 36 months had at 16-23 months a high frequency of vocal utterances, a high proportion of vocal utterances that included intelligible true words, a large consonant inventory, and a high percentage of intonational utterances. (Contains…

  2. Speech coding

    Energy Technology Data Exchange (ETDEWEB)

    Ravishankar, C., Hughes Network Systems, Germantown, MD

    1998-05-08

    Speech is the predominant means of communication between human beings and, since the invention of the telephone by Alexander Graham Bell in 1876, speech services have remained the core service in almost all telecommunication systems. Original analog methods of telephony had the disadvantage of the speech signal getting corrupted by noise, cross-talk and distortion. Long-haul transmissions, which use repeaters to compensate for the loss in signal strength on transmission links, also increase the associated noise and distortion. On the other hand, digital transmission is relatively immune to noise, cross-talk and distortion, primarily because of the capability to faithfully regenerate the digital signal at each repeater purely based on a binary decision. Hence the end-to-end performance of the digital link essentially becomes independent of the length and operating frequency bands of the link. From a transmission point of view, digital transmission has therefore been the preferred approach due to its higher immunity to noise. The need to carry digital speech became extremely important from a service provision point of view as well. Modern requirements have introduced the need for robust, flexible and secure services that can carry a multitude of signal types (such as voice, data and video) without a fundamental change in infrastructure. Such a requirement could not have been easily met without the advent of digital transmission systems, thereby requiring speech to be coded digitally. The term speech coding usually refers to techniques that represent or code speech signals either directly as a waveform or as a set of parameters obtained by analyzing the speech signal. In either case, the codes are transmitted to the distant end where speech is reconstructed or synthesized using the received set of codes. A more generic term that is applicable to these techniques and is often used interchangeably with speech coding is voice coding. This term is more generic in the sense that the
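
    As a concrete illustration of the waveform end of this spectrum, the sketch below applies μ-law companding of the kind used in classic 8-bit, 8 kHz PCM telephony to a toy signal and reconstructs it; parametric coders would instead transmit analysis parameters such as linear prediction coefficients. This is a minimal sketch of the companding step only, not a complete codec:

        import numpy as np

        MU = 255.0  # mu-law parameter used in North American/Japanese PCM telephony

        def mulaw_encode(x):
            # Compress samples in [-1, 1] to 8-bit codes via mu-law companding.
            y = np.sign(x) * np.log1p(MU * np.abs(x)) / np.log1p(MU)   # compand to [-1, 1]
            return np.round((y + 1) / 2 * 255).astype(np.uint8)        # quantize to 8 bits

        def mulaw_decode(codes):
            # Expand 8-bit codes back to approximate samples in [-1, 1].
            y = codes.astype(float) / 255 * 2 - 1
            return np.sign(y) * np.expm1(np.abs(y) * np.log1p(MU)) / MU

        # Toy "speech" signal: a 200 Hz tone sampled at 8 kHz.
        t = np.arange(0, 0.01, 1 / 8000.0)
        x = 0.5 * np.sin(2 * np.pi * 200 * t)
        x_hat = mulaw_decode(mulaw_encode(x))
        print("max reconstruction error:", np.max(np.abs(x - x_hat)))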

  3. Exponentiated Weibull distribution approach based inflection S-shaped software reliability growth model

    Directory of Open Access Journals (Sweden)

    B.B. Sagar

    2016-09-01

    Full Text Available The aim of this paper was to estimate the number of defects in software and remove them successfully. The paper combines a Weibull distribution approach with the inflection S-shaped Software Reliability Growth Model (SRGM); in this combination, a two-parameter Weibull distribution methodology is used. The Relative Prediction Error (RPE) is calculated as the validity criterion of the developed model. Experimental results on actual data from five data sets are compared with two other existing models, and show that the proposed software reliability growth model gives better estimates of the defects to be removed. The paper thus presents a software reliability growth model that combines features of the Weibull distribution and the inflection S-shaped SRGM to estimate the defects of a software system, and aims to help researchers and software industries develop highly reliable software products.
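
    For orientation only, the sketch below evaluates the classical inflection S-shaped mean-value function m(t) = a(1 - e^(-bt)) / (1 + β e^(-bt)) and one common relative-prediction-error measure; the parameter values are assumed for illustration, and the exponentiated-Weibull extension proposed in the paper is not reproduced here.

    ```python
    import numpy as np

    def inflection_s_shaped_mvf(t, a, b, beta):
        """Expected cumulative number of detected defects at time t under the
        classical inflection S-shaped SRGM: m(t) = a(1 - e^-bt) / (1 + beta e^-bt)."""
        return a * (1.0 - np.exp(-b * t)) / (1.0 + beta * np.exp(-b * t))

    def relative_prediction_error(predicted, observed):
        """One common definition of RPE: (predicted - observed) / observed."""
        return (predicted - observed) / observed

    # Illustrative parameters (assumed, not estimated from the paper's data sets).
    a, b, beta = 120.0, 0.15, 2.0     # total defects, detection rate, inflection factor
    weeks = np.arange(0, 30)
    predicted = inflection_s_shaped_mvf(weeks, a, b, beta)

    observed_at_week_20 = 95.0        # hypothetical field observation
    print("RPE at week 20: %.3f"
          % relative_prediction_error(predicted[20], observed_at_week_20))
    ```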

  4. Morphomes as a level of representation capture unity of exponence across the inflection-derivation divide

    Directory of Open Access Journals (Sweden)

    Erich R. Round

    2011-12-01

    Full Text Available Inferential-realisational analyses formalise a language's inflectional morphology in terms of a mapping from, on the one side, a lexical index and a set of morphosyntactic properties to, on the other side, a phonological form. Round (2009) has argued that the Australian language Kayardild requires the postulation of an intermediate level of representation, identified with Aronoff's (1994) notion of a "morphome". This morphomic level serves to express patterns of identities of exponence abstracted away from the actual forms of exponents, and its use makes possible the expression of certain identities of form which defy expression by means of Rules of Referral (Zwicky 1985; Stump 1993). This paper considers identities of form that span the inflection-derivation divide in Kayardild and shows that they too are coherently captured by assuming that a morphomic level of representation is present. A consequence is that lexical stems must possess a morphomic representation in addition to their representations on other levels.

  5. Thermal runaway limit of tubular reactors, defined at the inflection point of the temperature profile

    Energy Technology Data Exchange (ETDEWEB)

    Bashir, S.; Chovan, T.; Masri, B.J.; Mukherjee, A.; Pant, A.; Sen, S.; Vijayaragharvan, P. (Akron Univ., OH (United States). Dept. of Chemical Engineering); Berty, J.M. (Berty Reaction Engineers, Ltd., Fogelsville, PA (United States))

    1992-09-01

    The predicted maximum temperature difference between reacting fluid and wall to avoid thermal runaway can be exceeded in production reactors. This has been known for some time, but an explanation has been lacking. The reason for the deviation is that the traditional approximation of the sensitivity criterion by ΔT ≤ RT²/E is correct as a limiting value at the inflection point of the temperature profile, but not at the hot spot, where the limit can be much higher. The exact expression for the limiting value at the inflection point is the total temperature derivative of the rate, and this is proven mathematically in this paper. The total temperature derivative of a rate can be measured in a few well-designed recycle reactor experiments. Results were checked by computer simulation of tubular reactors; the match to the limits predicted from CSTR or recycle reactor (RR) measurements was excellent. The proposed interpretation explains why previously predicted limits could be exceeded in practice.
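
    For reference, the classical approximate criterion discussed above can be written out as follows; the grouping into a single display is an editorial restatement of the standard Arrhenius-based form, not the paper's exact derivation.

    ```latex
    % Arrhenius rate constant and the classical approximate runaway criterion:
    % the reacting-fluid-to-wall temperature difference should stay below RT^2/E.
    \[
      k(T) = A \exp\!\left(-\frac{E_a}{R\,T}\right),
      \qquad
      \Delta T = T_{\mathrm{fluid}} - T_{\mathrm{wall}} \le \frac{R\,T^{2}}{E_a}.
    \]
    % The paper's point: this bound is the limiting value at the inflection point of
    % the axial temperature profile, where the governing quantity is the total
    % temperature derivative of the reaction rate, dr/dT; at the hot spot the
    % admissible temperature difference can be much higher.
    ```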

  6. Dysfunctions of the stomatognathic system and vocal aspects in Fahr disease: case report.

    Science.gov (United States)

    Santos, Karoline Weber dos; Fraga, Bruno Francisco de; Cardoso, Maria Cristina de Almeida Freitas

    2014-01-01

    The aim of this study is to report the case of a patient with Fahr's disease and to describe the main stomatognathic and vocal changes that can be found in individuals with this disease. In order to establish the diagnosis, an assessment of the conditions of the orofacial motor system and speech production, as well as the efficiency of swallowing, was carried out. Based on these assessments, there were difficulties in coordinating and sustaining muscle activity during speech, and oropharyngeal dysphagia was present. Speech disorders found in Fahr's disease manifest themselves in complex ways and cover various aspects of phonological knowledge; diseases that affect the basal ganglia show similar profiles of speech-language and stomatognathic disorders and may present with dysarthria.

  7. The response of the anterior striatum during adult human vocal learning.

    Science.gov (United States)

    Simmonds, Anna J; Leech, Robert; Iverson, Paul; Wise, Richard J S

    2014-08-15

    Research on mammals predicts that the anterior striatum is a central component of human motor learning. However, because vocalizations in most mammals are innate, much of the neurobiology of human vocal learning has been inferred from studies on songbirds. Essential for song learning is a pathway, the homolog of mammalian cortical-basal ganglia "loops," which includes the avian striatum. The present functional magnetic resonance imaging (fMRI) study investigated adult human vocal learning, a skill that persists throughout life, albeit imperfectly given that late-acquired languages are spoken with an accent. Monolingual adult participants were scanned while repeating novel non-native words. After training on the pronunciation of half the words for 1 wk, participants underwent a second scan. During scanning there was no external feedback on performance. Activity declined sharply in left and right anterior striatum, both within and between scanning sessions, and this change was independent of training and performance. This indicates that adult speakers rapidly adapt to the novel articulatory movements, possibly by using motor sequences from their native speech to approximate those required for the novel speech sounds. Improved accuracy correlated only with activity in motor-sensory perisylvian cortex. We propose that future studies on vocal learning, using different behavioral and pharmacological manipulations, will provide insights into adult striatal plasticity and its potential for modification in both educational and clinical contexts.

  8. Neuromotor speech impairment: it's all in the talking.

    Science.gov (United States)

    Ziegler, Wolfram; Ackermann, Hermann

    2013-01-01

    The aim of this article is to explicate the uniqueness of the motor activity implied in spoken language production and to emphasize how important it is, from a theoretical and a clinical perspective, to consider the motor events associated with speaking as domain-specific, i.e., as pertaining to the domain of linguistic expression. First, phylogenetic data are reviewed demonstrating the specificity of the human vocal tract motor network regarding (i) the entrenchment of laryngeal motor skills within the organization of vocal tract movements, (ii) the evolution of a neural basis for skill acquisition within this system, and (iii) the integration of this system into an auditory-motor network. Second, ontogenetic evidence and existing knowledge about the experience-dependent plasticity of the brain are reported to explicate that during speech acquisition the vocal tract motor system is constrained by universal properties of speech production and by the specific phonological properties of the speaker's ambient language. Third, clinical data from dysarthria and apraxia of speech provide the background for a discussion about the theoretical underpinnings of domain-general versus domain-specific views of speech motor control. The article ends with a brief sketch of a holistic neurophonetic approach in experimental inquiries, assessment, and treatment of neuromotor speech impairment. Copyright © 2013 S. Karger AG, Basel.

  9. Feature extraction and models for speech: An overview

    Science.gov (United States)

    Schroeder, Manfred

    2002-11-01

    Modeling of speech has a long history, beginning with Count von Kempelen's 1770 mechanical speaking machine. Even then, human vowel production was seen as resulting from a source (the vocal cords) driving a physically separate resonator (the vocal tract). Homer Dudley's 1928 frequency-channel vocoder and many of its descendants are based on the same successful source-filter paradigm. For linguistic studies as well as practical applications in speech recognition, compression, and synthesis (see M. R. Schroeder, Computer Speech), the extant models require the (often difficult) extraction of numerous parameters such as the fundamental and formant frequencies and various linguistic distinctive features. Some of these difficulties were obviated by the introduction of linear predictive coding (LPC) in 1967, in which the filter part is an all-pole filter, reflecting the fact that for non-nasalized vowels the vocal tract is well approximated by an all-pole transfer function. In the now ubiquitous code-excited linear prediction (CELP), the source part is replaced by a code book which (together with a perceptual error criterion) permits speech compression to very low bit rates at high speech quality for the Internet and cell phones.
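
    To make the all-pole (LPC) filter idea concrete, the sketch below estimates prediction coefficients for a single analysis frame using the autocorrelation method and the Levinson-Durbin recursion. The frame length, model order, and synthetic test signal are illustrative assumptions.

    ```python
    import numpy as np

    def lpc(frame, order):
        """Autocorrelation-method LPC via the Levinson-Durbin recursion.
        Returns predictor coefficients a[0..order-1] of A(z) = 1 - sum_k a_k z^-k,
        plus the final prediction-error energy."""
        r = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
        a = np.zeros(order)
        e = r[0]
        for i in range(order):
            acc = r[i + 1] - np.dot(a[:i], r[i:0:-1])
            k = acc / e                      # reflection coefficient
            a_new = a.copy()
            a_new[i] = k
            if i > 0:
                a_new[:i] = a[:i] - k * a[i - 1::-1]
            a = a_new
            e *= (1.0 - k * k)               # remaining prediction-error energy
        return a, e

    # Illustrative frame: a synthetic vowel-like signal sampled at 8 kHz.
    fs = 8000
    t = np.arange(0, 0.03, 1 / fs)
    frame = np.sin(2 * np.pi * 700 * t) + 0.5 * np.sin(2 * np.pi * 1200 * t)
    frame = frame * np.hamming(frame.size)

    coeffs, err = lpc(frame, order=10)
    print("LPC coefficients:", np.round(coeffs, 3))
    ```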

  10. Primordial blackholes and gravitational waves for an inflection-point model of inflation

    OpenAIRE

    Choudhury, Sayantan; Mazumdar, Anupam

    2014-01-01

    In this article we provide a new closed relationship between the cosmic abundance of primordial gravitational waves and primordial blackholes that originated from initial inflationary perturbations for inflection-point models of inflation where inflation occurs below the Planck scale. From the current Planck constraints on the tensor-to-scalar ratio and the running of the spectral tilt, and from the abundance of dark matter content in the universe, we can deduce a strict bound on the current abundance of primordi...

  11. Vocal Health for Physical Educators

    Science.gov (United States)

    Trout, Josh; McColl, Douglas

    2007-01-01

    Evidence suggests that teachers are often at risk for vocal disease and are more likely to change occupations because of their voice problems compared to non-teachers. Physical educators are especially at risk for voice problems due to the intense daily demands of voice projection. Chronic abuse can cause swelling and inflammation of the…

  12. Application of MRI and biomedical engineering in speech production study.

    Science.gov (United States)

    Ventura, S R; Freitas, D R; Tavares, João Manuel R S

    2009-12-01

    Speech production has always been a subject of interest at both the morphological and acoustic levels. This knowledge is useful for a better understanding of all the mechanisms involved and for the construction of articulatory models. Magnetic resonance imaging (MRI) is a powerful technique that allows the study of the whole vocal tract, with good soft tissue contrast and resolution, and permits the calculation of area functions towards a better understanding of this mechanism. Thus, our aim is to demonstrate the value and application of MRI in the study of speech production and its relationship with engineering, namely with biomedical engineering. After vocal tract contour extraction, the data were processed for 3D reconstruction, culminating in the construction of models of some of the sounds of European Portuguese. MRI provides useful morphological data about the position and shape of the different speech articulators, and biomedical engineering provides the computational tools for their analysis.

  13. Classroom acoustics design for speakers’ comfort and speech intelligibility

    DEFF Research Database (Denmark)

    Garcia, David Pelegrin; Rasmussen, Birgit; Brunskog, Jonas

    2014-01-01

    Current European regulatory requirements or guidelines for reverberation time in classrooms have the goal of enhancing speech intelligibility for students and reducing noise levels in classrooms. At the same time, school teachers frequently suffer from voice problems due to the high vocal load … experienced at work. With the aim of improving teachers' working conditions, this paper proposes adjustments to current regulatory requirements on classroom acoustics in Europe from novel insights on classroom acoustics design that simultaneously meet criteria of vocal comfort for teachers and speech … are combined with a model of speech intelligibility based on the useful-to-detrimental ratio and empirical models of signal-to-noise ratio in classrooms in order to derive classroom acoustic guidelines, taking into account physical volume restrictions linked to the number of students present in a classroom…
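
    As background on the useful-to-detrimental ratio mentioned above, the sketch below computes U50 from a room impulse response and a noise level, counting energy arriving within the first 50 ms as useful; the synthetic impulse response and noise ratio are assumptions for illustration, not data from the paper.

    ```python
    import numpy as np

    def useful_to_detrimental_ratio(ir, fs, noise_energy_ratio, early_ms=50.0):
        """U50 in dB: early (useful) impulse-response energy over late energy plus noise.
        noise_energy_ratio is the noise energy expressed relative to the total IR energy."""
        split = int(round(early_ms / 1000.0 * fs))
        energy = ir ** 2
        early, late = energy[:split].sum(), energy[split:].sum()
        noise = noise_energy_ratio * energy.sum()
        return 10.0 * np.log10(early / (late + noise))

    # Illustrative synthetic impulse response with an RT60 of about 0.6 s.
    rng = np.random.default_rng(0)
    fs, rt60 = 8000, 0.6
    t = np.arange(0, 1.0, 1 / fs)
    ir = rng.standard_normal(t.size) * np.exp(-6.91 * t / rt60)

    print("U50 = %.1f dB" % useful_to_detrimental_ratio(ir, fs, noise_energy_ratio=0.05))
    ```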

  14. Acoustic Features and Perceptive Cues of Songs and Dialogues in Whistled Speech: Convergences with Sung Speech

    OpenAIRE

    Meyer, Julien

    2007-01-01

    International audience; Whistled speech is a little studied local use of language shaped by several cultures of the world either for distant dialogues or for rendering traditional songs. This practice consists of an emulation of the voice thanks to a simple modulated pitch. It is therefore the result of a transformation of the vocal signal that implies simplifications in the frequency domain. The whistlers adapt their productions to the way each language combines the qualities of height perce...

  15. In search of meaning: semantic effects on past-tense inflection.

    Science.gov (United States)

    Butler, Rebecca; Patterson, Karalyn; Woollams, Anna M

    2012-01-01

    Within single-mechanism connectionist models of inflectional morphology, generating the past-tense form of a verb depends upon the interaction of semantic and phonological representations, with semantic information being particularly important for irregular or exception verbs. We assessed this hypothesis in two experiments requiring normal speakers to produce the past tense from a verb stem that takes a regular or exceptional past tense. Experiment 1 revealed significant latency advantages for high- over low-imageability words for both regular verbs (e.g., "lunged" faster than "loved") and exception items (e.g., "drank" faster than "dealt"); but critically, this effect was significantly larger for exceptions than for regulars. Experiment 2 employed a semantic priming paradigm where participants inflected verb stems (e.g., sit) preceded by related (e.g., chair) or unrelated primes (e.g., jug) and revealed a priming effect in accuracy that was confined to the exception items. Our results are consistent with predictions from single-mechanism connectionist models of inflectional morphology and converge with findings from neurological patients and studies of reading aloud.

  16. Age effects on the acquisition of nominal and verbal inflections in an instructed setting

    Directory of Open Access Journals (Sweden)

    Simone E. Pfenninger

    2011-09-01

    Full Text Available This study examines evidence for the hypothesis (e.g., Muñoz, 2006) that an early starting age is not necessarily more beneficial to the successful learning of L2 inflectional morphology in strictly formal instructional settings. The present author investigated the quantitative and qualitative differences in the production and reception of 5 selected inflectional morphemes in English written performance and competence tasks by 100 early classroom learners and 100 late classroom learners of the same age. While an earlier age of first exposure and a longer instructional period were not associated with higher accuracy scores, the findings suggest distinct patterns in the productive and receptive knowledge of inflectional morphology; the late classroom learners' superiority seems to be rooted in their greater reliance upon memory-based item-by-item associative learning, as they are significantly stronger on tasks that might cause semantic difficulties, whereas the early classroom learners are marginally better on pattern-based processes for certain morphemes. This finding possibly supports Ullman's (2005) proposal that, as procedural memory declines with age, older starters have difficulty in discovering regularities in the input and thus over-rely on the declarative memory system in L2 learning.

  17. Inflectional and derivational morphological spelling abilities of children with Specific Language Impairment.

    Science.gov (United States)

    Critten, Sarah; Connelly, Vincent; Dockrell, Julie E; Walter, Kirsty

    2014-01-01

    Children with Specific Language Impairment (SLI) are known to have difficulties with spelling, but the factors that underpin these difficulties are a matter of debate. The present study investigated the impact of oral language and literacy on the bound morpheme spelling abilities of children with SLI. Thirty-three children with SLI (9-10 years) and two control groups, one matched for chronological age (CA) and one for language and spelling age (LA) (aged 6-8 years), were given dictated spelling tasks of 24 words containing inflectional morphemes and 18 words containing derivational morphemes. There were no significant differences between the SLI group and their LA matches in accuracy or error patterns for inflectional morphemes. By contrast, when spelling derivational morphemes the SLI group was less accurate and made proportionately more omissions and phonologically implausible errors than both control groups. Spelling accuracy was associated with phonological awareness and reading; reading performance significantly predicted the ability to spell both inflectional and derivational morphemes. The particular difficulties experienced by the children with SLI for derivational morphemes are considered in relation to reading and oral language.

  18. Inflectional and derivational morphological spelling abilities of children with Specific Language Impairment

    Directory of Open Access Journals (Sweden)

    Sarah eCritten

    2014-08-01

    Full Text Available Children with Specific Language Impairment (SLI) are known to have difficulties with spelling, but the factors which underpin these difficulties are a matter of debate. The present study investigated the impact of oral language and literacy on the bound morpheme spelling abilities of children with SLI. Thirty-three children with SLI (9-10 years) and two control groups, one matched for chronological age (CA) and one for language and spelling age (LA) (aged 6-8 years), were given dictated spelling tasks of 24 words containing inflectional morphemes and 18 words containing derivational morphemes. There were no significant differences between the SLI group and their LA matches in accuracy or error patterns for inflectional morphemes. By contrast, when spelling derivational morphemes the SLI group was less accurate and made proportionately more omissions and phonologically implausible errors than both control groups. Spelling accuracy was associated with phonological awareness and reading; reading performance significantly predicted the ability to spell both inflectional and derivational morphemes. The particular difficulties experienced by the children with SLI for derivational morphemes are considered in relation to reading and oral language.

  19. Speech-like rhythm in a voiced and voiceless orangutan call.

    Directory of Open Access Journals (Sweden)

    Adriano R Lameira

    Full Text Available The evolutionary origins of speech remain obscure. Recently, it was proposed that speech derived from monkey facial signals which exhibit a speech-like rhythm of ∼5 open-close lip cycles per second. In monkeys, these signals may also be vocalized, offering a plausible evolutionary stepping stone towards speech. Three essential predictions remain, however, to be tested to assess this hypothesis' validity: (i) great apes, our closest relatives, should likewise produce 5 Hz-rhythm signals; (ii) speech-like rhythm should involve calls articulatorily similar to consonants and vowels, given that speech rhythm is the direct product of stringing together these two basic elements; and (iii) speech-like rhythm should be experience-based. Via cinematic analyses we demonstrate that an ex-entertainment orangutan produces two calls at a speech-like rhythm, coined "clicks" and "faux-speech." Like voiceless consonants, clicks required no vocal fold action, but did involve independent manoeuvring of the lips and tongue. In parallel to vowels, faux-speech showed harmonic and formant modulations, implying vocal fold and supralaryngeal action. This rhythm was several times faster than orangutan chewing rates, as observed in monkeys and humans. Critically, this rhythm was seven-fold faster than, and contextually distinct from, any other known rhythmic calls described to date in the largest database of the orangutan repertoire ever assembled. The first two predictions advanced by this study are validated and, based on parsimony and exclusion of potential alternative explanations, initial support is given to the third prediction. Irrespective of the putative origins of these calls and their underlying mechanisms, our findings demonstrate irrevocably that great apes are not respiratorily, articulatorily, or neurologically constrained for the production of consonant- and vowel-like calls at speech rhythm. Orangutan clicks and faux-speech confirm the importance of rhythmic speech

  20. How the human brain recognizes speech in the context of changing speakers.

    Science.gov (United States)

    von Kriegstein, Katharina; Smith, David R R; Patterson, Roy D; Kiebel, Stefan J; Griffiths, Timothy D

    2010-01-13

    We understand speech from different speakers with ease, whereas artificial speech recognition systems struggle with this task. It is unclear how the human brain solves this problem. The conventional view is that speech message recognition and speaker identification are two separate functions and that message processing takes place predominantly in the left hemisphere, whereas processing of speaker-specific information is located in the right hemisphere. Here, we distinguish the contribution of specific cortical regions, to speech recognition and speaker information processing, by controlled manipulation of task and resynthesized speaker parameters. Two functional magnetic resonance imaging studies provide evidence for a dynamic speech-processing network that questions the conventional view. We found that speech recognition regions in left posterior superior temporal gyrus/superior temporal sulcus (STG/STS) also encode speaker-related vocal tract parameters, which are reflected in the amplitude peaks of the speech spectrum, along with the speech message. Right posterior STG/STS activated specifically more to a speaker-related vocal tract parameter change during a speech recognition task compared with a voice recognition task. Left and right posterior STG/STS were functionally connected. Additionally, we found that speaker-related glottal fold parameters (e.g., pitch), which are not reflected in the amplitude peaks of the speech spectrum, are processed in areas immediately adjacent to primary auditory cortex, i.e., in areas in the auditory hierarchy earlier than STG/STS. Our results point to a network account of speech recognition, in which information about the speech message and the speaker's vocal tract are combined to solve the difficult task of understanding speech from different speakers.

  1. Decomposition of vocal cycle length perturbations into vocal jitter and vocal microtremor, and comparison of their size in normophonic speakers.

    Science.gov (United States)

    Schoentgen, J

    2003-06-01

    A statistical method that enables raw vocal cycle length perturbations to be decomposed into perturbations ascribed to vocal jitter and vocal tremor is presented, together with a comparison of the size of jitter and tremor. The method is based on a time series model that splits the vocal cycle length perturbations into uncorrelated cycle-to-cycle perturbations ascribed to vocal jitter and supra-cycle perturbations ascribed to vocal tremor. The corpus was composed of 114 vocal cycle length time series for sustained vowels [a], [i], and [u] produced by 22 male and 16 female normophonic speakers. The results were the following. First, 100 out of 114 time series were decomposed successfully by means of the time series model. Second, vocal perturbations ascribed to tremor were significantly larger than perturbations ascribed to jitter. Third, the correlation between vocal jitter and vocal tremor was moderate, but statistically significant. Fourth, small but statistically significant differences were observed among the three vowel timbres in the relative jitter and the arithmetic difference of jitter and tremor. Fifth, the differences between male and female speakers were not statistically significant in the relative raw perturbations, the relative jitter, or the modulation level owing to tremor.
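
    For readers unfamiliar with these measures, the sketch below computes a conventional relative jitter (cycle-to-cycle period perturbation) and a crude tremor-like trend obtained by smoothing the period track. This is a generic illustration of the two timescales involved, not the statistical time-series decomposition used in the paper; the period track is synthetic.

    ```python
    import numpy as np

    def relative_jitter(periods):
        """Mean absolute cycle-to-cycle period difference, relative to the mean period."""
        return np.abs(np.diff(periods)).mean() / periods.mean()

    def tremor_trend(periods, window=15):
        """Crude supra-cycle (tremor-like) component: a moving-average trend of the
        period track; the residual around this trend carries the cycle-to-cycle jitter."""
        kernel = np.ones(window) / window
        return np.convolve(periods, kernel, mode="same")

    # Illustrative period track: a 150 Hz voice with a slow 5 Hz tremor plus jitter.
    rng = np.random.default_rng(1)
    n = 300
    base = 1.0 / 150.0
    cycle_times = np.cumsum(np.full(n, base))
    periods = (base * (1 + 0.02 * np.sin(2 * np.pi * 5 * cycle_times))
               + rng.normal(0.0, 5e-5, n))

    print("relative jitter: %.4f" % relative_jitter(periods))
    print("tremor trend depth (peak-to-peak): %.6f s" % np.ptp(tremor_trend(periods)))
    ```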

  2. Vocal turn-taking in a non-human primate is learned during ontogeny.

    Science.gov (United States)

    Chow, Cecilia P; Mitchell, Jude F; Miller, Cory T

    2015-05-22

    Conversational turn-taking is an integral part of language development, as it reflects a confluence of social factors that mediate communication. Humans coordinate the timing of speech based on the behaviour of another speaker, a behaviour that is learned during infancy. While adults in several primate species engage in vocal turn-taking, the degree to which similar learning processes underlie its development in these non-human species or are unique to language is not clear. We recorded the natural vocal interactions of common marmosets (Callithrix jacchus) occurring with both their sibling twins and parents over the first year of life and observed at least two parallels with language development. First, marmoset turn-taking is a learned vocal behaviour. Second, marmoset parents potentially played a direct role in guiding the development of turn-taking by providing feedback to their offspring when errors occurred during vocal interactions, similarly to what has been observed in humans. Though species differences are also evident, these findings suggest that similar learning mechanisms may be implemented in the ontogeny of vocal turn-taking across our Order, a finding that has important implications for our understanding of language evolution. © 2015 The Author(s) Published by the Royal Society. All rights reserved.

  3. Paradoxical vocal changes in a trained singer by focally cooling the right superior temporal gyrus.

    Science.gov (United States)

    Katlowitz, Kalman A; Oya, Hiroyuki; Howard, Matthew A; Greenlee, Jeremy D W; Long, Michael A

    2017-04-01

    The production and perception of music is preferentially mediated by cortical areas within the right hemisphere, but little is known about how these brain regions individually contribute to this process. In an experienced singer undergoing awake craniotomy, we demonstrated that direct electrical stimulation to a portion of the right posterior superior temporal gyrus (pSTG) selectively interrupted singing but not speaking. We then focally cooled this region to modulate its activity during vocalization. In contrast to similar manipulations in left hemisphere speech production regions, pSTG cooling did not elicit any changes in vocal timing or quality. However, this manipulation led to an increase in the pitch of speaking with no such change in singing. Further analysis revealed that all vocalizations exhibited a cooling-induced increase in the frequency of the first formant, raising the possibility that potential pitch offsets may have been actively avoided during singing. Our results suggest that the right pSTG plays a key role in vocal sensorimotor processing whose impact is dependent on the type of vocalization produced. Copyright © 2017 Elsevier Ltd. All rights reserved.

  4. Comparison of Perceptual Signs of Voice before and after Vocal Hygiene Program in Adults with Dysphonia

    Directory of Open Access Journals (Sweden)

    Seyyedeh Maryam khoddami

    2011-12-01

    Full Text Available Background and Aim: Vocal abuse and misuse are the most frequent causes of voice disorders. Consequently, some therapy is needed to stop or modify such behaviors. This research was performed to study the effectiveness of a vocal hygiene program on perceptual signs of voice in people with dysphonia. Methods: A vocal hygiene program was administered to 8 adults with dysphonia for 6 weeks. At first, the Consensus Auditory-Perceptual Evaluation of Voice was used to assess perceptual signs. The program was then delivered, and individuals were followed up at visits in the second and fourth weeks. In the last session, perceptual assessment was performed and individuals' opinions were collected. Perceptual findings were compared before and after the therapy. Results: After the program, the mean score of the perceptual assessment decreased. The mean score of every perceptual sign showed a significant difference before and after the therapy (p ≤ 0.0001). «Loudness» had the maximum score and coordination between speech and respiration had the minimum score. All participants confirmed the efficiency of the therapy. Conclusion: The vocal hygiene program improves all perceptual signs of voice, although not equally. This conclusion is confirmed by both clinician-based and patient-based assessments. As a result, a vocal hygiene program is necessary for comprehensive voice therapy but is not by itself sufficient to resolve all voice problems.

  5. Performance of a reduced-order FSI model for flow-induced vocal fold vibration

    Science.gov (United States)

    Chang, Siyuan; Luo, Haoxiang; Luo's lab Team

    2016-11-01

    Vocal fold vibration during speech production involves a three-dimensional unsteady glottal jet flow and three-dimensional nonlinear tissue mechanics. A full 3D fluid-structure interaction (FSI) model is computationally expensive even though it provides the most accurate information about the system. On the other hand, an efficient reduced-order FSI model is useful for fast simulation and analysis of the vocal fold dynamics, which is often needed in procedures such as optimization and parameter estimation. In this work, we study the performance of a reduced-order model as compared with the corresponding full 3D model in terms of its accuracy in predicting the vibration frequency and deformation mode. In the reduced-order model, we use a 1D flow model coupled with a 3D tissue model. Two different hyperelastic tissue behaviors are assumed. In addition, the vocal fold thickness and subglottal pressure are varied for systematic comparison. The results show that the reduced-order model provides predictions consistent with the full 3D model across different tissue material assumptions and subglottal pressures. However, the vocal fold thickness has the greatest effect on the model accuracy, especially when the vocal fold is thin. Supported by the NSF.
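
    As orientation on what a 1D glottal flow description can look like, the sketch below uses a quasi-steady Bernoulli estimate of the volume flow through the glottis, a simplification commonly used in reduced-order vocal fold models; it is not the specific flow solver of this study, and the pressure and area values are assumptions.

    ```python
    import math

    def bernoulli_glottal_flow(p_sub, p_sup, a_min, rho=1.2):
        """Quasi-steady Bernoulli estimate of volume flow through the glottis.
        p_sub, p_sup: subglottal and supraglottal pressures in Pa;
        a_min: minimum glottal area in m^2; rho: air density in kg/m^3."""
        dp = max(p_sub - p_sup, 0.0)      # this crude sketch ignores reverse flow
        return a_min * math.sqrt(2.0 * dp / rho)

    # Illustrative values: 800 Pa subglottal pressure, 0.1 cm^2 minimum glottal area.
    flow = bernoulli_glottal_flow(p_sub=800.0, p_sup=0.0, a_min=0.1e-4)
    print("glottal volume flow = %.0f cm^3/s" % (flow * 1e6))
    ```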

  6. Female suprasegmental speech parameters in reproductive age and postmenopause.

    Science.gov (United States)

    Meurer, Eliséa Maria; Wender, Maria Celeste Osório; von Eye Corleta, Helena; Capp, Edison

    2004-05-28

    During the female life cycle, verbal processes are influenced by secondary effects of steroid hormones. Verbal expression abilities favor more effective interpersonal communication. Verbal motor fluency is produced by the synchronization among voice, resonance, and articulation. The present study compared phono-articulatory characteristics between women of reproductive age and postmenopausal women. Acoustic variations in vocal intonation, speed of speech, and pause pattern were measured. Forty-five reproductive-age women with regular menstrual cycles and taking no hormonal contraceptives and 45 postmenopausal women receiving no hormone replacement therapy for at least 3 years were interviewed and their verbal productions were recorded. Acoustic analyses were performed using the Kay Elemetrics Motor Speech Profile. Student's t-test was employed to compare data between the two groups when they presented normal distribution, and the Mann-Whitney test when they were asymmetrical. Results showed that in the postmenopause group the pause pattern was longer, the speed of speech was slower, there was a vocal deepening without reduction of the vocal range, and there was also less vocal stability. A better understanding in this field will make it possible to develop strategies to offer a better quality of life for postmenopausal women.

  7. A Generative Model of Speech Production in Broca's and Wernicke's Areas.

    Science.gov (United States)

    Price, Cathy J; Crinion, Jenny T; Macsweeney, Mairéad

    2011-01-01

    Speech production involves the generation of an auditory signal from the articulators and vocal tract. When the intended auditory signal does not match the produced sounds, subsequent articulatory commands can be adjusted to reduce the difference between the intended and produced sounds. This requires an internal model of the intended speech output that can be compared to the produced speech. The aim of this functional imaging study was to identify brain activation related to the internal model of speech production after activation related to vocalization, auditory feedback, and movement in the articulators had been controlled. There were four conditions: silent articulation of speech, non-speech mouth movements, finger tapping, and visual fixation. In the speech conditions, participants produced the mouth movements associated with the words "one" and "three." We eliminated auditory feedback from the spoken output by instructing participants to articulate these words without producing any sound. The non-speech mouth movement conditions involved lip pursing and tongue protrusions to control for movement in the articulators. The main difference between our speech and non-speech mouth movement conditions is that prior experience producing speech sounds leads to the automatic and covert generation of auditory and phonological associations that may play a role in predicting auditory feedback. We found that, relative to non-speech mouth movements, silent speech activated Broca's area in the left dorsal pars opercularis and Wernicke's area in the left posterior superior temporal sulcus. We discuss these results in the context of a generative model of speech production and propose that Broca's and Wernicke's areas may be involved in predicting the speech output that follows articulation. These predictions could provide a mechanism by which rapid movement of the articulators is precisely matched to the intended speech outputs during future articulations.

  8. Some aspects of vocal fold bowing.

    Science.gov (United States)

    Tanaka, S; Hirano, M; Chijiwa, K

    1994-05-01

    Bowing of the vocal fold frequently occurs in patients with vocal fold paralysis (VFP), those with sulcus vocalis, and those who have had laser surgery. Additionally, there are vocal folds that present bowing with no noticeable organic lesion. For the purpose of investigating the causes and mechanisms of vocal fold bowing, consecutive fiberscopic videorecordings of 127 patients with VFP, 33 with sulcus vocalis, 33 with laser surgery, and 33 with dysphonia having no clinically noticeable organic lesion were reviewed. Sixty-nine percent of the paralyzed vocal folds had bowing, and the occurrence of bowing was significantly related to the activity of the thyroarytenoid muscle as measured by electromyography. The cricothyroid activity had no significant relationship to vocal fold bowing. All vocal folds with sulcus presented with bowing. Thirty-five percent of the vocal folds that had had laser surgery had bowing. The extent of tissue removal was closely related to the occurrence of bowing. Twelve cases with no organic lesion had vocal fold bowing. Of these 12 patients, 8 were male and 9 were older than 60 years. Some aging process in the mucosa was presumed to be the cause of the bowing in this age group of patients without clinically noticeable organic lesions. Causes of vocal fold bowing in the younger group of patients without organic lesions were not determined in this study.

  9. Teacher's voice: vocal tract discomfort symptoms, vocal intensity and noise in the classroom.

    Science.gov (United States)

    Mendes, Amanda Louize Félix; Lucena, Brunna Thaís Luckwu de; De Araújo, Aline Menezes Guedes Dias; Melo, Luciana Pimentel Fernandes de; Lopes, Leonardo Wanderley; Silva, Maria Fabiana Bonfim de Lima

    2016-04-01

    To identify a possible correlation between teachers' vocal intensity and the noise in the classroom, as well as between vocal intensity and the symptoms of vocal tract discomfort before and after classes. 27 Elementary School I teachers participated in the study. We used the questionnaires "Vocal Production Condition of the Teacher" and "Vocal Tract Discomfort Scale - VTD", which were applied before and after the class. A properly calibrated noise meter was used for measuring noise in the classroom and the teachers' vocal intensity. There was a moderate positive correlation between vocal intensity and noise, and also a significant difference in VTD scale scores between the teachers with and without vocal complaints before and after classes. When compared separately on both occasions, there was an increase in the scores for both groups, with and without complaints. We found an association of vocal tract symptoms before and after classes; greater frequency of burning, itching, sore throat, and sensitive throat was observed. The intensity of symptoms was significant for sore throat, itching, and feeling of a lump in the throat. We observed significant values of vocal intensity and of the frequency and intensity of symptoms for sensitive throat and lump in the throat before the class, and sore throat and lump in the throat after the class. The increase in teachers' vocal intensity correlates with high noise levels in the classroom. The evidence suggests a correlation between vocal intensity and discomfort of the vocal tract, with most of the symptoms reported with greater frequency and intensity after the class.

  10. Duration, Pitch, and Loudness in Kunqu Opera Stage Speech.

    Science.gov (United States)

    Han, Qichao; Sundberg, Johan

    2017-03-01

    Kunqu is a special type of opera within the Chinese tradition with 600 years of history. In it, stage speech is used for the spoken dialogue. It is performed in the Ming Dynasty's mandarin language and is a much more dominant part of the play than singing. Stage speech deviates considerably from normal conversational speech with respect to duration, loudness, and pitch. This paper compares these properties in stage speech and conversational speech. A famous, highly experienced female singer's performed stage speech and her reading of the same lyrics in a conversational speech mode were analyzed. Clear differences are found. As compared with conversational speech, stage speech had longer word and sentence durations, and word duration was less variable. Average sound level was 16 dB higher. Also, mean fundamental frequency was considerably higher and more varied. Within sentences, both loudness and fundamental frequency tended to vary according to a low-high-low pattern. Some of the findings fail to support current opinions regarding the characteristics of stage speech, and in this sense the study demonstrates the relevance of objective measurements in descriptions of vocal styles. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  11. VOCALS-UK: An overview of UK VOCALS science (Invited)

    Science.gov (United States)

    Coe, H.; Vocals-Uk Science Team

    2010-12-01

    This paper will highlight a variety of process studies, observationally led studies and modelling studies, both completed and in progress, conducted by groups in the United Kingdom, working in collaboration with international partners on the VAMOS Ocean-Cloud-Atmosphere Land Study Regional Experiment (VOCALS-REx). The VOCALS field experiment was conducted out of Arica, Chile, between October and November, 2008. The study aims to better understand the nature and variability of interactions between the ocean, atmosphere and steep topography, as well as local and long-range transport of pollutants and aerosol, in the context of their role in controlling the climate of the South East Pacific - an important region in terms of the global energy budget and which is currently poorly characterised in global climate models. Specific highlights will include a statistical representation of the SEP marine boundary layer during VOCALS-Rex to inform future modelling; an analysis of the synoptic and large-scale dynamical influences on cloud in the SEP; results from improved Met Office Unified Model forecast runs which examine aerosol-cloud interactions with a comparison to results from WRF-CHEM; and large eddy modelling of simulated gravity waves and their potential to induce open cellular convection (create pockets of open cells). In addition, early results from a number of further studies will be presented.

  12. Low Vocal Pitch Preference Drives First Impressions Irrespective of Context in Male Voices but Not in Female Voices.

    Science.gov (United States)

    Tsantani, Maria S; Belin, Pascal; Paterson, Helena M; McAleer, Phil

    2016-08-01

    Vocal pitch has been found to influence judgments of perceived trustworthiness and dominance from a novel voice. However, the majority of findings arise from using only male voices and in context-specific scenarios. In two experiments, we first explore the influence of average vocal pitch on first-impression judgments of perceived trustworthiness and dominance, before establishing the existence of an overall preference for high or low pitch across genders. In Experiment 1, pairs of high- and low-pitched temporally reversed recordings of male and female vocal utterances were presented in a two-alternative forced-choice task. Results revealed a tendency to select the low-pitched voice over the high-pitched voice as more trustworthy, for both genders, and more dominant, for male voices only. Experiment 2 tested an overall preference for low-pitched voices, and whether judgments were modulated by speech content, using forward and reversed speech to manipulate context. Results revealed an overall preference for low pitch, irrespective of direction of speech, in male voices only. No such overall preference was found for female voices. We propose that an overall preference for low pitch is a default prior in male voices irrespective of context, whereas pitch preferences in female voices are more context- and situation-dependent. The present study confirms the important role of vocal pitch in the formation of first-impression personality judgments and advances understanding of the impact of context on pitch preferences across genders.

  13. Analysis of fundamental frequency, jitter, shimmer and vocal intensity in children with phonological disorders.

    Science.gov (United States)

    Wertzner, Haydée F; Schreiber, Solange; Amaro, Luciana

    2005-01-01

    Phonological disorder is a disturbance of primary manifestation and undefined cause that makes speech unintelligible. The analysis of vocal parameters is important in the diagnosis of this disorder, since voice disorders could interfere with the production of speech sounds. The objective of this study was to examine vocal characteristics related to intensity and fundamental frequency (F0) and their disturbance indexes (jitter and shimmer) in children with phonological disorders. The design was a clinical, prospective, cross-sectional study. There were 40 children, 20 of them with phonological disorders and 20 with no speech and language disturbances. Phonological exams with the ABFW child language test and spontaneous speech were applied. The Computer Speech Lab was used to record and perform acoustic analyses of the vowels /a/, /e/, and /i/ for the vocal parameters fundamental frequency, intensity, jitter, and shimmer. F0 of the vowel /e/ was lower, on average, in the Phonological Disorder Group; it was 126 Hz in the Control Group. For shimmer and jitter there was no evidence that the means of the Phonological Disorder Group differed from those of the Control Group (p = 0.191 and p = 0.865, respectively). As for intensity, there was evidence that the averages differed between the Phonological Disorder Group and the Control Group (p = 0.002). The F0 of the vowel /e/ was lower in the Phonological Disorder Group. There was a difference between the two groups in the mean intensity of the vowels /a/, /e/, and /i/, which was lower in the Phonological Disorder Group. No differences between the groups were found in the averages of jitter and shimmer.

  14. Effects of vocal training on singing and speaking voice characteristics in vocally healthy adults and children based on choral and nonchoral data.

    Science.gov (United States)

    Siupsinskiene, Nora; Lycke, Hugo

    2011-07-01

    This prospective cross-sectional study examines the effects of voice training on vocal capabilities in vocally healthy age and gender differentiated groups measured by voice range profile (VRP) and speech range profile (SRP). Frequency and intensity measurements of the VRP and SRP using standard singing and speaking voice protocols were derived from 161 trained choir singers (21 males, 59 females, and 81 prepubescent children) and from 188 nonsingers (38 males, 89 females, and 61 children). When compared with nonsingers, both genders of trained adult and child singers exhibited increased mean pitch range, highest frequency, and VRP area in high frequencies (PVRP area. The logistic regression analysis showed that VRP pitch range, highest frequency, maximum voice intensity, and maximum-minimum intensity range, and SRP slope of speaking curve were the key predictors of voice training. Age, gender, and voice training differentiated norms of VRP and SRP parameters are presented. Significant positive effect of voice training on vocal capabilities, mostly singing voice, was confirmed. The presented norms for trained singers, with key parameters differentiated by gender and age, are suggested for clinical practice of otolaryngologists and speech-language pathologists. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.

  15. Use of low power EM radar sensors for speech articulator measurements

    Energy Technology Data Exchange (ETDEWEB)

    Holzrichter, J.F.; Burnett, G.C.

    1997-05-14

    Very low power electromagnetic (EM) wave sensors are being used to measure speech articulator motions such as vocal fold oscillations and movements of the jaw, tongue, and soft palate. Data on vocal fold motions that correlate well with established laboratory techniques, as well as data on the jaw, tongue, and soft palate, are shown. The vocal fold measurements, together with a volume air flow model, are being used to perform pitch-synchronous estimates of the voiced transfer functions using ARMA (autoregressive moving average) techniques. 6 refs., 5 figs.

  16. Aeroacoustic production of low-frequency unvoiced speech sounds

    Science.gov (United States)

    Krane, Michael H.

    2005-07-01

    A theoretical approach to describing unvoiced speech sound production is outlined using the essentials of aerodynamics and aeroacoustics. The focus is on the character and role of nonacoustic air motion in the vocal tract. An idealized picture of speech sound production is presented showing that speech sound production involves the dynamics of a jet flow, characterized by vorticity. A formal expression is developed for the sound production by unsteady airflow in terms of jet vorticity and vocal-tract shape, and a scaling law for the aeroacoustic source power is derived. The generic features of internal jet flows such as those exhibited in speech sound production are discussed, particularly in terms of the vorticity field, and the relevant scales of motion are identified. An approximate description of a jet as a train of vortex rings, useful for sound-field prediction, is developed using the scales both of motion and of vocal-tract geometry. It is shown that the aeroacoustic source may be expressed as the convolution of (1) the acoustic source time series due to a single vortex ring with (2) a function describing the arrival of vortex rings in the source region. It is shown that, in general, the characteristics of the aeroacoustic source are determined not only by the strength, spatial distribution, and convection speed of the jet vorticity field, but also by the shape of the vocal tract through which the jet flow passes. For turbulent jets, such as those which occur in unvoiced sound production, however, vocal-tract shape is the dominant factor in determining the spectral content of the source.
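
    The convolution structure described above can be illustrated in a few lines: an assumed acoustic signature for a single vortex ring is convolved with an assumed ring-arrival function to form the source time series. All waveforms and timings here are illustrative assumptions, not the paper's data.

    ```python
    import numpy as np

    fs = 40000                                    # sample rate in Hz (assumed)
    t = np.arange(0, 0.002, 1 / fs)

    # (1) Assumed acoustic signature of a single vortex ring: a short windowed tone burst.
    single_ring = np.sin(2 * np.pi * 2000 * t) * np.hanning(t.size)

    # (2) Arrival function: unit impulses at the (irregular) times at which rings
    #     enter the source region.
    duration = 0.05
    rng = np.random.default_rng(2)
    ring_times = np.cumsum(rng.uniform(0.8e-3, 1.6e-3, size=60))
    ring_times = ring_times[ring_times < duration]
    arrivals = np.zeros(int(duration * fs))
    arrivals[(ring_times * fs).astype(int)] = 1.0

    # Aeroacoustic source time series = convolution of (1) with (2).
    source = np.convolve(arrivals, single_ring)
    print("source duration: %.3f s" % (source.size / fs))
    ```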

  17. Acoustic detection of manatee vocalizations

    Science.gov (United States)

    Niezrecki, Christopher; Phillips, Richard; Meyer, Michael; Beusse, Diedrich O.

    2003-09-01

    The West Indian manatee (Trichechus manatus latirostris) has become endangered partly because of a growing number of collisions with boats. A system that can warn boaters that manatees are present in the immediate vicinity could potentially reduce these boat collisions. In order to identify the presence of manatees, acoustic methods are employed. Within this paper, three different detection algorithms are used to detect the calls of the West Indian manatee. The detection systems are tested in the laboratory using simulated manatee vocalizations from an audio compact disk. The detection method that provides the best overall performance is able to correctly identify ~96% of the manatee vocalizations. However, the system also results in a false alarm rate of ~16%. The results of this work may ultimately lead to the development of a manatee warning system that can warn boaters of the presence of manatees.
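
    For context, the sketch below implements a generic band-limited energy detector, one of the simplest possible call-detection schemes; it is not one of the three algorithms evaluated in the paper, and the frequency band, threshold, and synthetic test signal are assumptions.

    ```python
    import numpy as np
    from scipy.signal import butter, sosfilt

    def band_energy_detector(x, fs, band=(3000.0, 5000.0), frame_ms=50.0, threshold_db=10.0):
        """Flag frames whose energy in `band` exceeds the median frame energy by
        `threshold_db` dB. Manatee calls are assumed (for illustration) to
        concentrate roughly in the 3-5 kHz band."""
        sos = butter(4, band, btype="bandpass", fs=fs, output="sos")
        y = sosfilt(sos, x)
        frame = int(frame_ms / 1000.0 * fs)
        n_frames = len(y) // frame
        energies = np.array([np.sum(y[i * frame:(i + 1) * frame] ** 2)
                             for i in range(n_frames)])
        floor = np.median(energies) + 1e-12
        return 10.0 * np.log10(energies / floor) > threshold_db

    # Illustrative use: 2 s of noise with an embedded 4 kHz tone burst ("call").
    fs = 48000
    t = np.arange(0, 2.0, 1 / fs)
    rng = np.random.default_rng(3)
    x = 0.05 * rng.standard_normal(t.size)
    x[fs // 2:fs // 2 + fs // 5] += 0.2 * np.sin(2 * np.pi * 4000 * t[:fs // 5])

    print("frames flagged:", int(band_energy_detector(x, fs).sum()))
    ```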

  18. Recording vocalizations with Bluetooth technology.

    Science.gov (United States)

    Gaona-González, Andrés; Santillán-Doherty, Ana María; Arenas-Rosas, Rita Virginia; Muñoz-Delgado, Jairo; Aguillón-Pantaleón, Miguel Angel; Ordoñez-Gómez, José Domingo; Márquez-Arias, Alejandra

    2011-06-01

    We propose a method for capturing vocalizations that is designed to avoid some of the limiting factors found in traditional bioacoustical methods, such as the impossibility of obtaining continuous long-term registers or analyzing amplitude due to the continuous change of distance between the subject and the position of the recording system. Using Bluetooth technology, vocalizations are captured and transmitted wirelessly into a receiving system without affecting the quality of the signal. The recordings of the proposed system were compared to those obtained as a reference, which were based on the coding of the signal with the so-called pulse-code modulation technique in WAV audio format without any compressing process. The evaluation showed p < .05 for the measured quantitative and qualitative parameters. We also describe how the transmitting system is encapsulated and fixed on the animal and a way to video record a spider monkey's behavior simultaneously with the audio recordings.

  19. Ultrasonic vocalizations emitted by flying squirrels.

    Directory of Open Access Journals (Sweden)

    Meghan N Murrant

    Full Text Available Anecdotal reports of ultrasound use by flying squirrels have existed for decades, yet there has been little detailed analysis of their vocalizations. Here we demonstrate that two species of flying squirrel emit ultrasonic vocalizations. We recorded vocalizations from northern (Glaucomys sabrinus) and southern (G. volans) flying squirrels calling in both the laboratory and at a field site in central Ontario, Canada. We demonstrate that flying squirrels produce ultrasonic emissions through recorded bursts of broadband noise and time-frequency structured frequency-modulated (FM) vocalizations, some of which were purely ultrasonic. Squirrels emitted three types of ultrasonic calls in laboratory recordings and one type in the field. The variety of signals that were recorded suggests that flying squirrels may use ultrasonic vocalizations to transfer information. Thus, vocalizations may be an important, although still poorly understood, aspect of flying squirrel social biology.

  20. Universal vocal signals of emotion

    OpenAIRE

    Sauter, D.; Eisner, F.; Ekman, P.; Scott, S.

    2009-01-01

    Emotional signals allow for the sharing of important information with conspecifics, for example to warn them of danger. Humans use a range of different cues to communicate to others how they feel, including facial, vocal, and gestural signals. Although much is known about facial expressions of emotion, less research has focused on affect in the voice. We compare British listeners to individuals from remote Namibian villages who have had no exposure to Western culture, and examine recognition ...

  1. Vocal improvement after voice therapy in the treatment of benign vocal fold lesions

    OpenAIRE

    Schindler, A; MOZZANICA, F.; Ginocchio, D.; MARUZZI, P.; Atac, M.; OTTAVIANI, F.

    2012-01-01

    SUMMARY Benign vocal fold lesions are common in the general population, and have important public health implications and impact on patient quality of life. Nowadays, phonomicrosurgery is the most common treatment of these lesions. Voice therapy is generally associated in order to minimize detrimental vocal behaviours that increase the stress at the mid-membranous vocal folds. Nonetheless, the most appropriate standard of care for treating benign vocal fold lesion has not been established. Th...

  2. Fonoterapia vocal e fisioterapia respiratória com idosos saudáveis: revisão de literatura

    OpenAIRE

    Carla Aparecida Cielo; Fernanda dos Santos Pascotini; Vanessa Veis Ribeiro; Ariane de Macedo Gomes; Léris Salete Bonfanti Haeffner

    2016-01-01

    ABSTRACT This study addresses voice therapy and respiratory physiotherapy in healthy elderly people. The objective of the present study was to review the literature on voice therapy and on respiratory physiotherapy with healthy elderly people. A bibliographic survey was carried out of articles published between 2004 and 2014 in the Lilacs, Bireme, MedLine, PubMed, and Scielo databases. Descriptors used: physical therapy specialty; breathing; speech therapy; aged; therapeutics; and voice. The litera...

  3. The Role of Somatosensory Information in Speech Perception: Imitation Improves Recognition of Disordered Speech.

    Science.gov (United States)

    Borrie, Stephanie A; Schäfer, Martina C M

    2015-12-01

    Perceptual learning paradigms involving written feedback appear to be a viable clinical tool to reduce the intelligibility burden of dysarthria. The underlying theoretical assumption is that pairing the degraded acoustics with the intended lexical targets facilitates a remapping of existing mental representations in the lexicon. This study investigated whether ties to mental representations can be strengthened by way of a somatosensory motor trace. Following an intelligibility pretest, 100 participants were assigned to 1 of 5 experimental groups. The control group received no training, but the other 4 groups received training with dysarthric speech under conditions involving a unique combination of auditory targets, written feedback, and/or a vocal imitation task. All participants then completed an intelligibility posttest. Training improved intelligibility of dysarthric speech, with the largest improvements observed when the auditory targets were accompanied by both written feedback and an imitation task. Further, a significant relationship between intelligibility improvement and imitation accuracy was identified. This study suggests that somatosensory information can strengthen the activation of speech sound maps of dysarthric speech. The findings, therefore, implicate a bidirectional relationship between speech perception and speech production as well as advance our understanding of the mechanisms that underlie perceptual learning of degraded speech.

  5. Vocal attractiveness increases by averaging.

    Science.gov (United States)

    Bruckert, Laetitia; Bestelmeyer, Patricia; Latinus, Marianne; Rouger, Julien; Charest, Ian; Rousselet, Guillaume A; Kawahara, Hideki; Belin, Pascal

    2010-01-26

    Vocal attractiveness has a profound influence on listeners-a bias known as the "what sounds beautiful is good" vocal attractiveness stereotype [1]-with tangible impact on a voice owner's success at mating, job applications, and/or elections. The prevailing view holds that attractive voices are those that signal desirable attributes in a potential mate [2-4]-e.g., lower pitch in male voices. However, this account does not explain our preferences in more general social contexts in which voices of both genders are evaluated. Here we show that averaging voices via auditory morphing [5] results in more attractive voices, irrespective of the speaker's or listener's gender. Moreover, we show that this phenomenon is largely explained by two independent by-products of averaging: a smoother voice texture (reduced aperiodicities) and a greater similarity in pitch and timbre with the average of all voices (reduced "distance to mean"). These results provide the first evidence for a phenomenon of vocal attractiveness increases by averaging, analogous to a well-established effect of facial averaging [6, 7]. They highlight prototype-based coding [8] as a central feature of voice perception, emphasizing the similarity in the mechanisms of face and voice perception.

  6. Aesthetic and Culture Origin of Vocal Art

    Institute of Scientific and Technical Information of China (English)

    张延春

    2010-01-01

    As one of the most commonly and widely adopted art forms, vocal art has been closely related to national culture and aesthetic trends. Traditional Chinese vocal art is rooted in China's long history and distinctive culture. By contrast, Italian bel canto stems from the prosperity of Italian opera art during the Renaissance period. This essay discusses the differences between Eastern and Western vocal art in terms of their aesthetic and cultural origins.

  7. Are articulatory settings mechanically advantageous for speech motor control?

    Directory of Open Access Journals (Sweden)

    Vikram Ramanarayanan

    We address the hypothesis that postures adopted during grammatical pauses in speech production are more "mechanically advantageous" than absolute rest positions for facilitating efficient postural motor control of vocal tract articulators. We quantify vocal tract posture corresponding to inter-speech pauses, absolute rest intervals, as well as vowel and consonant intervals, using automated analysis of video captured with real-time magnetic resonance imaging during production of read and spontaneous speech by 5 healthy speakers of American English. We then use locally-weighted linear regression to estimate the articulatory forward map from low-level articulator variables to high-level task/goal variables for these postures. We quantify the overall magnitude of the first derivative of the forward map as a measure of mechanical advantage. We find that postures assumed during grammatical pauses in speech as well as speech-ready postures are significantly more mechanically advantageous than postures assumed during absolute rest. Further, these postures represent empirical extremes of mechanical advantage, between which lie the postures assumed during various vowels and consonants. Relative mechanical advantage of different postures might be an important physical constraint influencing planning and control of speech production.
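
    For readers who want to experiment with the idea, the sketch below is a minimal, hypothetical illustration (not the authors' implementation) of estimating a local forward-map Jacobian from posture data by locally weighted linear regression and summarizing its magnitude as a mechanical-advantage score; the array shapes, the Gaussian kernel and the bandwidth are assumptions.

        import numpy as np

        def local_jacobian(X, Y, x0, bandwidth=1.0):
            # Estimate dY/dX at posture x0 (X: articulator variables, Y: task variables)
            # by Gaussian-weighted local linear regression.
            w = np.exp(-np.sum((X - x0) ** 2, axis=1) / (2.0 * bandwidth ** 2))
            A = np.hstack([np.ones((X.shape[0], 1)), X - x0])   # local design matrix
            sw = np.sqrt(w)[:, None]
            beta, *_ = np.linalg.lstsq(sw * A, sw * Y, rcond=None)
            return beta[1:].T                                    # (n_task, n_articulator) Jacobian

        def mechanical_advantage(X, Y, x0, bandwidth=1.0):
            # Overall magnitude of the first derivative of the forward map (Frobenius norm).
            return np.linalg.norm(local_jacobian(X, Y, x0, bandwidth))

        rng = np.random.default_rng(0)                           # toy data: 200 postures
        X = rng.normal(size=(200, 6))                            # 6 articulator variables
        Y = X @ rng.normal(size=(6, 3)) + 0.1 * rng.normal(size=(200, 3))  # 3 task variables
        print(mechanical_advantage(X, Y, X[0]))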

  8. Primordial blackholes and gravitational waves for an inflection-point model of inflation

    Energy Technology Data Exchange (ETDEWEB)

    Choudhury, Sayantan [Physics and Applied Mathematics Unit, Indian Statistical Institute, 203 B.T. Road, Kolkata 700 108 (India); Mazumdar, Anupam [Consortium for Fundamental Physics, Physics Department, Lancaster University, LA1 4YB (United Kingdom)

    2014-06-02

    In this article we provide a new closed relationship between the cosmic abundance of primordial gravitational waves and primordial blackholes that originated from initial inflationary perturbations, for inflection-point models of inflation where inflation occurs below the Planck scale. From the current Planck constraints on the tensor-to-scalar ratio and the running of the spectral tilt, and from the abundance of dark matter content in the universe, we can deduce a strict bound on the current abundance of primordial blackholes to be within the range 9.99712×10⁻³ < Ω_PBH h² < 9.99736×10⁻³.

  9. Primordial blackholes and gravitational waves for an inflection-point model of inflation

    Science.gov (United States)

    Choudhury, Sayantan; Mazumdar, Anupam

    2014-06-01

    In this article we provide a new closed relationship between the cosmic abundance of primordial gravitational waves and primordial blackholes that originated from initial inflationary perturbations, for inflection-point models of inflation where inflation occurs below the Planck scale. From the current Planck constraints on the tensor-to-scalar ratio and the running of the spectral tilt, and from the abundance of dark matter content in the universe, we can deduce a strict bound on the current abundance of primordial blackholes to be within the range 9.99712×10⁻³ < Ω_PBH h² < 9.99736×10⁻³.

  10. Modeling of PV Systems Based on Inflection Points Technique Considering Reverse Mode

    Directory of Open Access Journals (Sweden)

    Bonie J. Restrepo-Cuestas

    2013-11-01

    This paper proposes a methodology for photovoltaic (PV) systems modeling, considering their behavior in both direct and reverse operating modes and considering mismatching conditions. The proposed methodology is based on the inflection points technique, with a linear approximation to model the bypass diode and a simplified PV model. The proposed mathematical model makes it possible to evaluate the energetic performance of a PV system, exhibiting short simulation times for large PV systems. In addition, this methodology allows the condition of modules affected by partial shading to be estimated, since the power dissipated during operation in the second quadrant can be determined.
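
    As a purely illustrative companion (not the authors' formulation), the sketch below evaluates a heavily simplified module model in both quadrants: an ideal single-diode expression for forward operation plus a piecewise-linear bypass diode that conducts when a shaded module is driven into reverse, so that the power it dissipates can be estimated. All function names and parameter values are assumptions.

        import numpy as np

        def cell_string_current(v, i_ph=8.0, i_0=1e-9, n=1.3, v_t=0.02585, n_s=60):
            # Ideal single-diode model for a series string of n_s cells (no Rs/Rsh).
            return i_ph - i_0 * (np.exp(v / (n_s * n * v_t)) - 1.0)

        def bypass_diode_current(v, v_on=0.7, g_on=10.0):
            # Piecewise-linear approximation: the antiparallel bypass diode conducts
            # once the module voltage is driven below -v_on (second-quadrant operation).
            return np.where(v < -v_on, g_on * (-v - v_on), 0.0)

        def module_current(v):
            return cell_string_current(v) + bypass_diode_current(v)

        v = np.linspace(-5.0, 40.0, 500)                  # sweep from reverse to forward bias
        i = module_current(v)
        p_dissipated = np.where(v < 0.0, -v * i, 0.0)     # power burned in a shaded module
        print(float(p_dissipated.max()))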

  11. "Life is a verb": inflections of artificial life in cultural context.

    Science.gov (United States)

    Helmreich, Stefan

    2007-01-01

    This review essay surveys recent literature in the history of science, literary theory, anthropology, and art criticism dedicated to exploring how the artificial life enterprise has been inflected by--and might also reshape--existing social, historical, cognitive, and cultural frames of thought and action. The piece works through various possible interpretations of Kevin Kelly's phrase "life is a verb," in order to track recent shifts in cultural studies of artificial life from an aesthetic of critique to an aesthetic of conversation, discerning in the process different styles of translating between the concerns of the humanities, social sciences, natural sciences, and sciences of the artificial.

  12. Nonstatistical dynamics on potentials exhibiting reaction path bifurcations and valley-ridge inflection points

    CERN Document Server

    Collins, Peter; Ezra, Gregory S; Wiggins, Stephen

    2013-01-01

    We study reaction dynamics on a model potential energy surface exhibiting post-transition state bifurcation in the vicinity of a valley ridge inflection point. We compute fractional yields of products reached after the VRI region is traversed, both with and without dissipation. It is found that apparently minor variations in the potential lead to significant changes in the reaction dynamics. Moreover, when dissipative effects are incorporated, the product ratio depends in a complicated and highly non-monotonic fashion on the dissipation parameter. Dynamics in the vicinity of the VRI point itself play essentially no role in determining the product ratio, except in the highly dissipative regime.

  13. Next-to-Next-Leading Order analysis of electroweak vacuum stability and rising inflection point

    CERN Document Server

    Iacobellis, Giuseppe

    2016-01-01

    We present an analysis of the gauge-independent observables associated with two stationary configurations of the Standard Model (SM) potential (extrapolated to high energy at Next-to-Next-to-Leading Order (NNLO)): i) the value of the top mass ensuring stability of the SM electroweak vacuum and ii) the value of the Higgs potential at a rising inflection point. We examine in detail and reappraise the experimental and theoretical uncertainties that plague their determination, keeping alive the possibility that the SM is stable, and studying applications of such a configuration to models of primordial inflation.

  14. Amharic Speech Recognition for Speech Translation

    OpenAIRE

    Melese, Michael; Besacier, Laurent; Meshesha, Million

    2016-01-01

    The state-of-the-art speech translation can be seen as a cascade of Automatic Speech Recognition, Statistical Machine Translation and Text-To-Speech synthesis. In this study an attempt is made to experiment with Amharic speech recognition for Amharic-English speech translation in the tourism domain. Since there is no Amharic speech corpus, we developed a read-speech corpus of 7.43 hours in the tourism domain. The Amharic speech corpus has been recorded after translating standard Bas...

  15. Crossmodal integration of conspecific vocalizations in rhesus macaques.

    Directory of Open Access Journals (Sweden)

    Christa Payne

    Crossmodal integration of audio/visual information is vital for recognition, interpretation and appropriate reaction to social signals. Here we examined how rhesus macaques process bimodal species-specific vocalizations by eye tracking, using an unconstrained preferential looking paradigm. Six adult rhesus monkeys (3M, 3F) were presented two side-by-side videos of unknown male conspecifics emitting different vocalizations, accompanied by the audio signal corresponding to one of the videos. The percentage of time animals looked to each video was used to assess crossmodal integration ability and the percentages of time spent looking at each of the six a priori ROIs (eyes, mouth, and rest of each video) were used to characterize scanning patterns. Animals looked more to the congruent video, confirming reports that rhesus monkeys spontaneously integrate conspecific vocalizations. Scanning patterns showed that monkeys preferentially attended to the eyes and mouth of the stimuli, with subtle differences between males and females such that females showed a tendency to differentiate the eye and mouth regions more than males. These results were similar to studies in humans indicating that when asked to assess emotion-related aspects of visual speech, people preferentially attend to the eyes. Thus, the tendency for female monkeys to show a greater differentiation between the eye and mouth regions than males may indicate that female monkeys were slightly more sensitive to the socio-emotional content of complex signals than male monkeys. The current results emphasize the importance of considering both the sex of the observer and individual variability in passive viewing behavior in nonhuman primate research.

  16. The effects of noise on speech and warning signals

    Science.gov (United States)

    Suter, Alice H.

    1989-06-01

    To assess the effects of noise on speech communication it is necessary to examine certain characteristics of the speech signal. Speech level can be measured by a variety of methods, none of which has yet been standardized, and it should be kept in mind that vocal effort increases with background noise level and with different types of activity. Noise and filtering commonly degrade the speech signal, especially as it is transmitted through communications systems. Intelligibility is also adversely affected by distance, reverberation, and monaural listening. Communication systems currently in use may cause strain and delays on the part of the listener, but there are many possibilities for improvement. Individuals who need to communicate in noise may be subject to voice disorders. Shouted speech becomes progressively less intelligible at high voice levels, but improvements can be realized when talkers use clear speech. Tolerable listening levels are lower for negative than for positive S/Ns, and comfortable listening levels should be at a S/N of at least 5 dB, and preferably above 10 dB. Popular methods to predict speech intelligibility in noise include the Articulation Index, Speech Interference Level, Speech Transmission Index, and the sound level meter's A-weighting network. This report describes these methods, discussing certain advantages and disadvantages of each, and shows their interrelations.
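
    As a quick illustration of the kind of prediction methods listed above (a hedged sketch, not taken from the report): the four-band Speech Interference Level can be computed as the arithmetic mean of the octave-band noise levels centred at 500, 1000, 2000 and 4000 Hz, and the quoted signal-to-noise guidance can be expressed as a simple check. The example levels are made up.

        def speech_interference_level(band_levels_db):
            # Four-band SIL: mean of octave-band levels (dB) at 500, 1000, 2000, 4000 Hz.
            bands = (500, 1000, 2000, 4000)
            return sum(band_levels_db[f] for f in bands) / len(bands)

        def listening_comfort(speech_db, noise_db):
            # Reflects the guidance above: S/N of at least 5 dB, preferably above 10 dB.
            snr = speech_db - noise_db
            return "comfortable" if snr > 10 else "acceptable" if snr >= 5 else "strained"

        noise = {500: 62.0, 1000: 60.0, 2000: 57.0, 4000: 55.0}   # example octave-band levels
        print(speech_interference_level(noise), listening_comfort(68.0, 58.5))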

  17. Multidimensional effects of voice therapy in patients affected by unilateral vocal fold paralysis due to cancer.

    Science.gov (United States)

    Barcelos, Camila Barbosa; Silveira, Paula Angélica Lorenzon; Guedes, Renata Lígia Vieira; Gonçalves, Aline Nogueira; Slobodticov, Luciana Dall'Agnol Siqueira; Angelis, Elisabete Carrara-de

    2017-08-24

    Patients with unilateral vocal fold paralysis may demonstrate different degrees of voice perturbation depending on the position of the paralyzed vocal fold. Understanding the effectiveness of voice therapy in this population may be an important element in defining the therapeutic approach. To evaluate the effectiveness of voice therapy in the short, medium and long term in patients with unilateral vocal fold paralysis and to determine the risk factors for voice rehabilitation failure. Prospective study in which 61 patients affected by unilateral vocal fold paralysis were enrolled. Each subject had voice therapy with an experienced speech pathologist twice a week. A multidimensional assessment protocol was used pre-treatment and at three different times after voice treatment initiation: short term (1-3 months), medium term (4-6 months) and long term (12 months); it included videoendoscopy, maximum phonation time, the GRBASI scale, acoustic voice analysis and the Portuguese version of the Voice Handicap Index. Multiple comparisons for the GRBASI scale and VHI revealed statistically significant differences, except between the medium and long term (p<0.005). The data suggest that there is vocal improvement over time, with stabilization of the results after 6 months (medium term). Of the 28 patients with permanent unilateral vocal fold paralysis, 18 (69.2%) reached complete glottal closure following voice therapy (p=0.001). The logistic regression method indicated that jitter entered the final model as a risk factor for partial improvement. For every unit of increased jitter, there was a 0.1% increase (odds ratio 1.001) in the chance of only partial improvement, that is, an increased chance of not achieving full improvement during rehabilitation. Vocal rehabilitation improves perceptual and acoustic voice parameters and the Voice Handicap Index, besides favoring glottal closure, in patients with unilateral vocal fold paralysis. The results were also maintained over the period of 1 year. The jitter value, when elevated, is
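
    To make the reported odds ratio concrete, the sketch below shows (with entirely hypothetical data, not the study's) how a logistic regression of rehabilitation outcome on jitter yields a per-unit odds ratio as the exponentiated coefficient; an odds ratio of about 1.001 corresponds to the 0.1% increase quoted above.

        import numpy as np
        from sklearn.linear_model import LogisticRegression

        rng = np.random.default_rng(1)
        jitter = rng.gamma(shape=2.0, scale=1.5, size=200).reshape(-1, 1)   # hypothetical jitter values
        p_partial = 1.0 / (1.0 + np.exp(-(0.05 * jitter[:, 0] - 0.5)))      # hypothetical risk model
        partial_only = (rng.random(200) < p_partial).astype(int)            # 1 = only partial improvement

        model = LogisticRegression().fit(jitter, partial_only)
        odds_ratio = float(np.exp(model.coef_[0, 0]))                       # odds multiplier per unit of jitter
        print(f"odds ratio per unit of jitter: {odds_ratio:.3f}")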

  18. Meaning, intention, and inference in primate vocal communication.

    Science.gov (United States)

    Fischer, Julia; Price, Tabitha

    2016-10-20

    Two core questions in the study of speech evolution are whether nonhuman primate signals should be conceived as referential, and what the role of social cognition is in primate communication. Current evidence suggests that the structure of primate vocalizations is largely innate and related to the affective/motivational state of the caller, with a probabilistic and underdetermined relationship between specific events and calls. Moreover, nonhuman primates do not appear to express or comprehend communicative or informative intent, which is in line with a lack of mental state attribution to others. We argue that nonhuman primate vocalizations as well as gestures should be best conceived as goal-directed, where signallers are sensitive to the relation between their signalling and receivers' responses. Receivers in turn use signals to predict signaller behaviour. In combination with their ability to integrate information from multiple sources, this renders the system as a whole relatively powerful, despite the lack of higher-order intentionality on the side of sender or receiver. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.

  19. Hate speech

    Directory of Open Access Journals (Sweden)

    Anne Birgitta Nilsen

    2014-12-01

    The manifesto of the Norwegian terrorist Anders Behring Breivik is based on the “Eurabia” conspiracy theory. This theory is a key starting point for hate speech amongst many right-wing extremists in Europe, but also has ramifications beyond these environments. In brief, proponents of the Eurabia theory claim that Muslims are occupying Europe and destroying Western culture, with the assistance of the EU and European governments. By contrast, members of Al-Qaeda and other extreme Islamists promote the conspiracy theory “the Crusade” in their hate speech directed against the West. Proponents of the latter theory argue that the West is leading a crusade to eradicate Islam and Muslims, a crusade that is similarly facilitated by their governments. This article presents analyses of texts written by right-wing extremists and Muslim extremists in an effort to shed light on how hate speech promulgates conspiracy theories in order to spread hatred and intolerance. The aim of the article is to contribute to a more thorough understanding of hate speech’s nature by applying rhetorical analysis. Rhetorical analysis is chosen because it offers a means of understanding the persuasive power of speech. It is thus a suitable tool to describe how hate speech works to convince and persuade. The concepts from rhetorical theory used in this article are ethos, logos and pathos. The concept of ethos is used to pinpoint factors that contributed to Osama bin Laden's impact, namely factors that lent credibility to his promotion of the conspiracy theory of the Crusade. In particular, Bin Laden projected common sense, good morals and good will towards his audience. He seemed to have coherent and relevant arguments; he appeared to possess moral credibility; and his use of language demonstrated that he wanted the best for his audience. The concept of pathos is used to define hate speech, since hate speech targets its audience's emotions. In hate speech it is the

  20. Experimental Validation of Simplified Free Jet Turbulence Models Applied to the Vocal Tract

    CERN Document Server

    Grandchamp, Xavier; Pelorson, Xavier

    2008-01-01

    Sound production due to turbulence is widely shown to be an important phenomenon involved in, among others, fricatives, singing, whispering and speech pathologies. In spite of its relevance, turbulent flow is not considered in classical physical speech production models, which mostly deal with voiced sound production. The current study presents preliminary results of an experimental validation of simplified turbulence models in order to estimate the time-mean velocity distribution in a free jet downstream of a tube outlet. Aiming at a future application in speech production, the influence of typical vocal tract shape parameters on the velocity distribution is experimentally and theoretically explored: the tube shape, length and the degree and geometry of the constriction. Simplified theoretical predictions are obtained by applying similarity solutions of the bidimensional boundary layer theory to a plane and circular free jet in still air. The orifice velocity and shape are the main model input quantities. Results are discussed...

  1. Speech enhancement

    CERN Document Server

    Benesty, Jacob; Chen, Jingdong

    2006-01-01

    We live in a noisy world! In all applications (telecommunications, hands-free communications, recording, human-machine interfaces, etc.) that require at least one microphone, the signal of interest is usually contaminated by noise and reverberation. As a result, the microphone signal has to be "cleaned" with digital signal processing tools before it is played out, transmitted, or stored. This book is about speech enhancement. Different well-known and state-of-the-art methods for noise reduction, with one or multiple microphones, are discussed. By speech enhancement, we mean not only noise red

  2. At the interface of the auditory and vocal motor systems: NIf and its role in vocal processing, production and learning.

    Science.gov (United States)

    Lewandowski, Brian; Vyssotski, Alexei; Hahnloser, Richard H R; Schmidt, Marc

    2013-06-01

    Communication between auditory and vocal motor nuclei is essential for vocal learning. In songbirds, the nucleus interfacialis of the nidopallium (NIf) is part of a sensorimotor loop, along with auditory nucleus avalanche (Av) and song system nucleus HVC, that links the auditory and song systems. Most of the auditory information comes through this sensorimotor loop, with the projection from NIf to HVC representing the largest single source of auditory information to the song system. In addition to providing the majority of HVC's auditory input, NIf is also the primary driver of spontaneous activity and premotor-like bursting during sleep in HVC. Like HVC and RA, two nuclei critical for song learning and production, NIf exhibits behavioral-state dependent auditory responses and strong motor bursts that precede song output. NIf also exhibits extended periods of fast gamma oscillations following vocal production. Based on the converging evidence from studies of physiology and functional connectivity it would be reasonable to expect NIf to play an important role in the learning, maintenance, and production of song. Surprisingly, however, lesions of NIf in adult zebra finches have no effect on song production or maintenance. Only the plastic song produced by juvenile zebra finches during the sensorimotor phase of song learning is affected by NIf lesions. In this review, we carefully examine what is known about NIf at the anatomical, physiological, and behavioral levels. We reexamine conclusions drawn from previous studies in the light of our current understanding of the song system, and establish what can be said with certainty about NIf's involvement in song learning, maintenance, and production. Finally, we review recent theories of song learning integrating possible roles for NIf within these frameworks and suggest possible parallels between NIf and sensorimotor areas that form part of the neural circuitry for speech processing in humans.

  3. Compounding and variational morphology: the analysis of inflection in Spanish compounds

    Directory of Open Access Journals (Sweden)

    Cristina Buenafuentes

    2014-05-01

    This paper analyzes the morphological variation related to gender and number in Spanish compounding, such as the plural noun in [V+N]N compounds (el lavaplatos, not el lavaplato; el cazamariposas, not *el cazamariposa), the gender and number asymmetries between the actual compound and its parts (cabeza (fem.) + cuadrada (fem.) → el cabeza cuadrada (masc.), relaciones (fem. pl.) + públicas (fem. pl.) → el relaciones públicas (masc. sing.)), the presence of inflectional markers inside compounds (sord-o-muda (fem.), not *sord-a-muda (fem.)), and the variation that takes place in many plural compounds (casas cuartel or casas cuarteles ‘house quarter’, coches cama or coches camas ‘car and bed’). Basing ourselves on the classic model of level ordering with an admixture of Booij's distinction between inherent and contextual inflection, this piece of research shows that these cases of morphological variation can be handled by a morphological component that is accessible to syntax. This model also relativizes the importance of the head in compounding and highlights the value of the interfaces between morphology, lexis and syntax.

  4. Discrete Frenet frame, inflection point solitons, and curve visualization with applications to folded proteins

    Science.gov (United States)

    Hu, Shuangwei; Lundgren, Martin; Niemi, Antti J.

    2011-06-01

    We develop a transfer matrix formalism to visualize the framing of discrete piecewise linear curves in three-dimensional space. Our approach is based on the concept of an intrinsically discrete curve. This enables us to more effectively describe curves that in the limit where the length of line segments vanishes approach fractal structures in lieu of continuous curves. We verify that in the case of differentiable curves the continuum limit of our discrete equation reproduces the generalized Frenet equation. In particular, we draw attention to the conceptual similarity between inflection points where the curvature vanishes and topologically stable solitons. As an application we consider folded proteins, whose Hausdorff dimension is known to be fractal. We explain how to employ the orientation of Cβ carbons of amino acids along a protein backbone to introduce a preferred framing along the backbone. By analyzing the experimentally resolved fold geometries in the Protein Data Bank we observe that this Cβ framing relates intimately to the discrete Frenet framing. We also explain how inflection points (a.k.a. soliton centers) can be located in the loops and clarify their distinctive rôle in determining the loop structure of folded proteins.
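
    The sketch below is a minimal numerical illustration of the discrete Frenet framing that the paper builds on (it does not reproduce the authors' transfer-matrix formalism): unit tangents along a piecewise-linear curve, binormals and normals from consecutive tangents, and discrete bond angles whose near-zero values flag inflection-like points where the curvature vanishes. The toy backbone is illustrative.

        import numpy as np

        def discrete_frenet(points):
            # points: (N, 3) vertices of a piecewise-linear curve.
            r = np.asarray(points, dtype=float)
            seg = np.diff(r, axis=0)
            t = seg / np.linalg.norm(seg, axis=1, keepdims=True)          # unit tangent per segment
            cross = np.cross(t[:-1], t[1:])
            b = cross / np.clip(np.linalg.norm(cross, axis=1, keepdims=True), 1e-12, None)
            n = np.cross(b, t[1:])                                        # unit normal per vertex
            bond_angle = np.arccos(np.clip(np.sum(t[:-1] * t[1:], axis=1), -1.0, 1.0))
            return t, n, b, bond_angle                                    # bond_angle ~ discrete curvature

        s = np.linspace(0.0, 4.0 * np.pi, 60)                             # toy helical backbone
        backbone = np.stack([np.cos(s), np.sin(s), 0.2 * s], axis=1)
        *_, kappa = discrete_frenet(backbone)
        print(kappa.round(3))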

  5. On the phonetic and syntactic processing abilities of birds: from songs to speech and artificial grammars.

    Science.gov (United States)

    ten Cate, Carel

    2014-10-01

    Like speech and language, the songs of many songbirds consist of learned, rapidly produced, structured sequences of distinct vocal units, originating from an interplay between experience and learning biases. Songs are species specific, but also show considerable within species variation in elements or element sequencing. This variation implies that birds possess mechanisms to identify, categorize and combine sounds. I review the abilities for speech sound perception and categorization, as well as for grammatical rule learning by birds. Speech sound perception in birds is in many ways comparable to human speech perception. Birds can also detect and generalize patterns underlying artificially arranged strings of vocal elements. However, there is a need for more comparative studies to examine the limits of their rule learning abilities and how they relate to those of humans.

  6. Impact of Clear, Loud, and Slow Speech on Scaled Intelligibility and Speech Severity in Parkinson's Disease and Multiple Sclerosis

    Science.gov (United States)

    Tjaden, Kris; Sussman, Joan E.; Wilding, Gregory E.

    2014-01-01

    Purpose: The perceptual consequences of rate reduction, increased vocal intensity, and clear speech were studied in speakers with multiple sclerosis (MS), Parkinson's disease (PD), and healthy controls. Method: Seventy-eight speakers read sentences in habitual, clear, loud, and slow conditions. Sentences were equated for peak amplitude and…

  7. Is Babble the Gateway to Speech for All Children? A Longitudinal Study of Children Who Are Deaf or Hard of Hearing.

    Science.gov (United States)

    Wallace, Valerie; Menn, Lise; Yoshinaga-Itano, Christine

    1999-01-01

    The vocalizations and speech production of 20 infants with hearing impairments were examined at three age levels: between 5-13 months, 2-5 years, and 5-10 years. Degree of hearing loss had a significant and moderately strong relationship with speech outcome at the second and third age levels. (Contains extensive references.) (Author/CR)

  8. First Communion: The Emergence of Vocal Relationships.

    Science.gov (United States)

    Locke, John L.

    2001-01-01

    Proposes that vocal communion between infant and caregiver supports infants' language acquisition and connectedness with caregivers. Recommends research to determine whether social behaviors such as joint attention and vocal imitation are functionally related to language learning or are only symptomatic of a survival-centered caregiving…

  9. Pulmonary mucormycosis presenting with vocal cord paralysis

    OpenAIRE

    Gayathri Devi, H. J.; Mohan Rao, K.N.; K M Prathima; Moideen, Riyaz

    2013-01-01

    Pulmonary mucormycosis is a relatively uncommon infection. It can present in various forms. Very few cases of pulmonary mucormycosis presenting as vocal cord paralysis have been described in the literature. We report a case of pulmonary mucormycosis presenting as vocal cord paralysis in an uncontrolled diabetic patient.

  10. Phonetic characteristics of vocalizations during pain

    Directory of Open Access Journals (Sweden)

    Stefan Lautenbacher

    2017-06-01

    Conclusion: Vocalization characteristics of pain seem to be best described by an increase in pitch and in loudness. Future studies using more specific and comprehensive phonetic analyses will surely help to provide an even more precise characterization of vocalizations due to pain.

  11. Phonatory aerodynamics in connected speech.

    Science.gov (United States)

    Gartner-Schmidt, Jackie L; Hirai, Ryoji; Dastolfo, Christina; Rosen, Clark A; Yu, Lan; Gillespie, Amanda I

    2015-12-01

    1) Present phonatory aerodynamic data for healthy controls (HCs) in connected speech; 2) contrast these findings between HCs and patients with nontreated unilateral vocal fold paralysis (UVFP); 3) present pre- and post-vocal fold augmentation outcomes for patients with UVFP; 4) contrast data from patients with post-operative laryngeal augmentation to HCs. Retrospective, single-blinded. For phase I, 20 HC participants were recruited. For phase II, 20 patients with UVFP were age- and gender-matched to the 20 HC participants used in phase I. For phase III, 20 patients with UVFP represented a pre- and posttreatment cohort. For phase IV, 20 of the HC participants from phase I and 20 of the postoperative UVFP patients from phase III were used for direct comparison. Aerodynamic measures captured from a sample of the Rainbow Passage included: number of breaths, mean phonatory airflow rate, total duration of passage, inspiratory airflow duration, and expiratory airflow duration. The VHI-10 was also obtained pre- and post-operative laryngeal augmentation. All phonatory aerodynamic measures were significantly greater in patients with preoperative UVFP than in the HC group. Patients with laryngeal augmentation took significantly fewer breaths, had a lower mean phonatory airflow rate during voicing, and had a shorter inspiratory airflow duration than the preoperative UVFP group. None of the postoperative measures returned to HC values. Significant improvement in the Voice Handicap Index-10 scores post-laryngeal augmentation was also found. The methodology described in this study improves upon existing aerodynamic voice assessment by capturing characteristics germane to UVFP patient complaints and by measuring change before and after laryngeal augmentation in connected speech. 4. © 2015 The American Laryngological, Rhinological and Otological Society, Inc.
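
    The sketch below is a hypothetical illustration (not the study's protocol or code) of how such connected-speech aerodynamic measures could be derived from a calibrated oral airflow trace, with negative flow treated as inhalation; the threshold, sampling rate and toy signal are assumptions.

        import numpy as np

        def aerodynamic_measures(airflow, fs, inspiration_threshold=-0.05):
            # airflow: oral airflow in L/s sampled at fs Hz during the reading passage.
            total_s = len(airflow) / fs
            inspiring = airflow < inspiration_threshold                 # negative flow = inhalation
            breaths = int(np.sum(np.diff(inspiring.astype(int)) == 1))  # rising edges into inhalation
            exhaling = airflow > 0.0
            mean_flow = float(airflow[exhaling].mean()) if exhaling.any() else 0.0
            return {"breaths": breaths,
                    "total_duration_s": total_s,
                    "mean_phonatory_flow_Lps": mean_flow,
                    "inspiratory_s": float(inspiring.sum()) / fs,
                    "expiratory_s": float(exhaling.sum()) / fs}

        fs = 500                                                         # toy signal: 20 s of reading
        t = np.arange(0.0, 20.0, 1.0 / fs)
        flow = 0.15 + 0.05 * np.sin(2 * np.pi * 2.0 * t)
        flow[(t % 5.0) < 0.5] = -0.3                                     # brief inhalation every 5 s
        print(aerodynamic_measures(flow, fs))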

  12. Comportamento vocal de cantores populares

    OpenAIRE

    Zimmer, Valquíria; Cielo, Carla Aparecida; Ferreira, Fernanda Mariotto

    2012-01-01

    PURPOSE: to investigate aspects of the history, habits and vocal behaviors of popular singers, according to sex and to the professional and amateur categories. METHOD: interviews with 47 singers, 25 men and 22 women. RESULTS: statistical significance was found for the following findings: MALE - use of a microphone in rehearsals, absence of diagnosed vocal problems, absence of guidance on vocal hygiene, pain or discomfort after singing, absence of allergies and respiratory problems; FEMALE - lessons...

  13. A Multiple-Classifier Framework for Parkinson’s Disease Detection Based on Various Vocal Tests

    Directory of Open Access Journals (Sweden)

    Mahnaz Behroozi

    2016-01-01

    Recently, speech pattern analysis applications for building predictive telediagnosis and telemonitoring models for diagnosing Parkinson's disease (PD) have attracted many researchers. For this purpose, several datasets of voice samples exist; the UCI dataset named "Parkinson Speech Dataset with Multiple Types of Sound Recordings" has a variety of vocal tests, which include sustained vowels, words, numbers, and short sentences compiled from a set of speaking exercises for healthy people and people with Parkinson's disease (PWP). Some researchers claim that summarizing the multiple recordings of each subject with central tendency and dispersion metrics is an efficient strategy for building a predictive model for PD. However, they have overlooked the point that a PD patient may show more difficulty in pronouncing certain terms than others. Thus, summarizing the vocal tests may lead to a loss of valuable information. In order to address this issue, the classification setting must take this into account. As a solution, we introduce a new framework that applies an independent classifier for each vocal test. The final classification result is a majority vote over all of the classifiers. When our methodology is combined with filter-based feature selection, it enhances classification accuracy by up to 15%.
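
    A compact sketch of the framework's central idea, under assumed data structures (one feature matrix per vocal test, rows aligned by subject); the SVM base classifier and the SelectKBest filter stand in for the paper's classifiers and its filter-based feature selection, and are not taken from it.

        import numpy as np
        from sklearn.feature_selection import SelectKBest, f_classif
        from sklearn.pipeline import make_pipeline
        from sklearn.svm import SVC

        def fit_per_test_classifiers(train_tests, y_train, k=10):
            # train_tests: dict {vocal test name: (n_subjects, n_features) array}.
            models = {}
            for name, X in train_tests.items():
                clf = make_pipeline(SelectKBest(f_classif, k=min(k, X.shape[1])), SVC())
                models[name] = clf.fit(X, y_train)            # one independent classifier per test
            return models

        def majority_vote(models, test_tests):
            # Each per-test classifier votes; the final label is the majority class (0/1).
            votes = np.stack([models[name].predict(X) for name, X in test_tests.items()])
            return (votes.mean(axis=0) >= 0.5).astype(int)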

  14. L1 literacy affects L2 pronunciation intake and text vocalization

    Science.gov (United States)

    Walton, Martin

    2005-04-01

    For both deaf and hearing learners, L1 acquisition calls on auditive, gestural and visual modes in progressive processes over longer stages imposed in strictly anatomical and social order from the earliest pre-lexical phase [Jusczyk (1993), Kuhl & Meltzoff (1996)] to ultimate literacy. By contrast, L2 learning will call on accelerating procedures but with restricted input, arbitrated by L1 literacy as can be traced in the English of French-speaking learners, whether observed in spontaneous speech or in text vocalization modes. An inventory of their predictable omissions, intrusions and substitutions at suprasegmental and syllabic levels, many of which they can actually hear while unable to vocalize in real-time, suggests that a photogenic segmentation of continuous speech into alphabetical units has eclipsed the indispensable earlier phonogenic module, filtering L2 intake and output. This competing mode analysis hypothesizes a critical effect on L2 pronunciation of L1 graphemic procedures acquired usually before puberty, informing data for any Critical Period Hypothesis or amounts of L1 activation influencing L2 accent [Flege (1997, 1998)] or any psychoacoustic French deafness with regard to English stress-timing [Dupoux (1997)]. A metaphonic model [Howell & Dean (1991)] adapted for French learners may remedially distance L1 from L2 vocalization procedures.

  15. Auditory–vocal mirroring in songbirds

    Science.gov (United States)

    Mooney, Richard

    2014-01-01

    Mirror neurons are theorized to serve as a neural substrate for spoken language in humans, but the existence and functions of auditory–vocal mirror neurons in the human brain remain largely matters of speculation. Songbirds resemble humans in their capacity for vocal learning and depend on their learned songs to facilitate courtship and individual recognition. Recent neurophysiological studies have detected putative auditory–vocal mirror neurons in a sensorimotor region of the songbird's brain that plays an important role in expressive and receptive aspects of vocal communication. This review discusses the auditory and motor-related properties of these cells, considers their potential role in song learning and communication in relation to classical studies of birdsong, and points to the circuit and developmental mechanisms that may give rise to auditory–vocal mirroring in the songbird's brain. PMID:24778375

  16. Speech Intelligibility

    Science.gov (United States)

    Brand, Thomas

    Speech intelligibility (SI) is important for different fields of research, engineering and diagnostics in order to quantify very different phenomena like the quality of recordings, communication and playback devices, the reverberation of auditoria, characteristics of hearing impairment, the benefit of using hearing aids, or combinations of these things.

  17. Speech dynamics

    NARCIS (Netherlands)

    Pols, L.C.W.

    2011-01-01

    In order for speech to be informative and communicative, segmental and suprasegmental variation is mandatory. Only this leads to meaningful words and sentences. The building blocks are not stable entities put next to each other (like beads on a string or like printed text); rather, there are gradual tran

  18. Data-driven automated acoustic analysis of human infant vocalizations using neural network tools

    Science.gov (United States)

    Warlaumont, Anne S.; Oller, D. Kimbrough; Buder, Eugene H.; Dale, Rick; Kozma, Robert

    2010-01-01

    Acoustic analysis of infant vocalizations has typically employed traditional acoustic measures drawn from adult speech acoustics, such as f0, duration, formant frequencies, amplitude, and pitch perturbation. Here an alternative and complementary method is proposed in which data-derived spectrographic features are central. 1-s-long spectrograms of vocalizations produced by six infants recorded longitudinally between ages 3 and 11 months are analyzed using a neural network consisting of a self-organizing map and a single-layer perceptron. The self-organizing map acquires a set of holistic, data-derived spectrographic receptive fields. The single-layer perceptron receives self-organizing map activations as input and is trained to classify utterances into prelinguistic phonatory categories (squeal, vocant, or growl), identify the ages at which they were produced, and identify the individuals who produced them. Classification performance was significantly better than chance for all three classification tasks. Performance is compared to another popular architecture, the fully supervised multilayer perceptron. In addition, the network’s weights and patterns of activation are explored from several angles, for example, through traditional acoustic measurements of the network’s receptive fields. Results support the use of this and related tools for deriving holistic acoustic features directly from infant vocalization data and for the automatic classification of infant vocalizations. PMID:20370038
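
    The sketch below is a much-reduced numpy illustration of the two-stage architecture described (a self-organizing map that learns holistic receptive fields from flattened spectrogram vectors, followed by a single-layer perceptron trained on the map activations); the grid size, learning rates and soft-activation choice are assumptions, not the authors' settings.

        import numpy as np

        rng = np.random.default_rng(0)

        def train_som(X, grid=(8, 8), epochs=20, lr=0.5, sigma=2.0):
            # X: (n_samples, n_features) flattened spectrograms; returns SOM weight vectors.
            units = grid[0] * grid[1]
            W = rng.normal(size=(units, X.shape[1]))
            coords = np.array([(i, j) for i in range(grid[0]) for j in range(grid[1])], float)
            for e in range(epochs):
                a = lr * (1.0 - e / epochs)
                for x in X[rng.permutation(len(X))]:
                    bmu = int(np.argmin(np.sum((W - x) ** 2, axis=1)))            # best-matching unit
                    h = np.exp(-np.sum((coords - coords[bmu]) ** 2, axis=1) / (2 * sigma ** 2))
                    W += a * h[:, None] * (x - W)                                 # neighbourhood update
            return W

        def som_activations(W, X):
            d2 = ((X[:, None, :] - W[None, :, :]) ** 2).sum(axis=-1)
            return np.exp(-d2 / (d2.mean() + 1e-12))                              # soft map activations

        def train_perceptron(A, y, n_classes, epochs=100, lr=0.01):
            # Single-layer perceptron from SOM activations to category labels.
            V = np.zeros((A.shape[1], n_classes))
            for _ in range(epochs):
                pred = (A @ V).argmax(axis=1)
                for i in np.where(pred != y)[0]:
                    V[:, y[i]] += lr * A[i]
                    V[:, pred[i]] -= lr * A[i]
            return V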

  19. On the determination of the activation energy of a thermoluminescence peak by using the points of inflection

    Energy Technology Data Exchange (ETDEWEB)

    Gartia, R.K.; Singh, S.J.; Singh, T.S.C.; Mazumdar, P.S. (Manipur Univ. (India). Dept. of Physics)

    1991-08-14

    It has been shown that the points of inflection of a thermoluminescence peak can indicate its order of kinetics. Also a new set of expressions is derived to evaluate the thermal activation energy of a thermoluminescence peak of any order of kinetics using the points of inflection. The validity of these expressions is discussed by calculating the thermal activation energy of a number of numerically computed thermoluminescence peaks using the present sets of expressions and comparing them with those calculated using the formulae of a previous author. (author).
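
    The paper's analytical expressions are not reproduced here; as a hedged numerical companion, the sketch below only locates the inflection points of a sampled glow peak (sign changes of the second derivative), using a first-order Randall-Wilkins peak with illustrative parameters as the test signal.

        import numpy as np

        def inflection_temperatures(T, I, rel_threshold=0.01):
            # Temperatures where the second derivative of the glow peak changes sign,
            # restricted to the region where the intensity is non-negligible.
            d2 = np.gradient(np.gradient(I, T), T)
            idx = np.where(np.diff(np.sign(d2)) != 0)[0]
            core = I > rel_threshold * I.max()
            return T[[i for i in idx if core[i]]]

        k = 8.617e-5                              # Boltzmann constant (eV/K)
        E, s, beta, n0 = 1.0, 1e12, 1.0, 1.0      # illustrative first-order kinetics parameters
        T = np.linspace(300.0, 480.0, 4000)
        dT = T[1] - T[0]
        integral = np.cumsum(np.exp(-E / (k * T))) * dT
        I = n0 * s * np.exp(-E / (k * T)) * np.exp(-(s / beta) * integral)
        print(inflection_temperatures(T, I))      # one inflection on each side of the peak maximum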

  20. Weight-bearing MR imaging as an option in the study of gravitational effects on the vocal tract of untrained subjects in singing phonation.

    Science.gov (United States)

    Traser, Louisa; Burdumy, Michael; Richter, Bernhard; Vicari, Marco; Echternach, Matthias

    2014-01-01

    Magnetic Resonance Imaging (MRI) of subjects in a supine position can be used to evaluate the configuration of the vocal tract during phonation. However, studies of speech phonation have shown that gravity can affect vocal tract shape and bias measurements. This is one of the reasons that MRI studies of singing phonation have used professionally trained singers as subjects, because they are generally considered to be less affected by the supine body position and environmental distractions. A study of untrained singers might not only contribute to the understanding of intuitive singing function and aid the evaluation of potential hazards for vocal health, but also provide insights into the effect of the supine position on singers in general. In the present study, an open configuration 0.25 T MRI system with a rotatable examination bed was used to study the effect of body position in 20 vocally untrained subjects. The subjects were asked to sing sustained tones in both supine and upright body positions on different pitches and in different register conditions. Morphometric measurements were taken from the acquired images of a sagittal slice depicting the vocal tract. The analysis concerning the vocal tract configuration in the two body positions revealed differences in 5 out of 10 measured articulatory parameters. In the upright position the jaw was less protruded, the uvula was elongated, the larynx more tilted and the tongue was positioned more to the front of the mouth than in the supine position. The findings presented are in agreement with several studies on gravitational effects in speech phonation, but contrast with the results of a previous study on professional singers of our group where only minor differences between upright and supine body posture were observed. The present study demonstrates that imaging of the vocal tract using weight-bearing MR imaging is a feasible tool for the study of sustained phonation in singing for vocally untrained subjects.

  1. Neurophysiological origin of human brain asymmetry for speech and language.

    Science.gov (United States)

    Morillon, Benjamin; Lehongre, Katia; Frackowiak, Richard S J; Ducorps, Antoine; Kleinschmidt, Andreas; Poeppel, David; Giraud, Anne-Lise

    2010-10-26

    The physiological basis of human cerebral asymmetry for language remains mysterious. We have used simultaneous physiological and anatomical measurements to investigate the issue. Concentrating on neural oscillatory activity in speech-specific frequency bands and exploring interactions between gestural (motor) and auditory-evoked activity, we find, in the absence of language-related processing, that left auditory, somatosensory, articulatory motor, and inferior parietal cortices show specific, lateralized, speech-related physiological properties. With the addition of ecologically valid audiovisual stimulation, activity in auditory cortex synchronizes with left-dominant input from the motor cortex at frequencies corresponding to syllabic, but not phonemic, speech rhythms. Our results support theories of language lateralization that posit a major role for intrinsic, hardwired perceptuomotor processing in syllabic parsing and are compatible both with the evolutionary view that speech arose from a combination of syllable-sized vocalizations and meaningful hand gestures and with developmental observations suggesting phonemic analysis is a developmentally acquired process.

  2. The Risk of Vocal Fold Atrophy after Serial Corticosteroid Injections of the Vocal Fold.

    Science.gov (United States)

    Shi, Lucy L; Giraldez-Rodriguez, Laureano A; Johns, Michael M

    2016-11-01

    The aim of this study was to illustrate the risk of vocal fold atrophy in patients who receive serial subepithelial steroid injections for vocal fold scar. This study is a retrospective case report of two patients who underwent a series of weekly subepithelial infusions of 10 mg/mL dexamethasone for benign vocal fold lesions. Shortly after the procedures, both patients developed a weak and breathy voice. The first patient was a 53-year-old man with radiation-induced vocal fold stiffness. Six injections were performed unilaterally, and 1 week later, he developed unilateral vocal fold atrophy with new glottal insufficiency. The second patient was a 67-year-old woman with severe vocal fold inflammation related to laryngitis and calcinosis, Raynaud's phenomenon, esophageal dysmotility, sclerodactyly, and telangiectasia (CREST) syndrome. Five injections were performed bilaterally, and 1 week later, she developed bilateral vocal fold atrophy with a large midline glottal gap during phonation. In both cases, the steroid-induced vocal atrophy resolved spontaneously after 4 months. Serial subepithelial steroid infusions of the vocal folds, although safe in the majority of patients, carry the risk of causing temporary vocal fold atrophy when given at short intervals. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  3. Children's Development of Self-Regulation in Speech Production

    DEFF Research Database (Denmark)

    MacDonald, Ewen; Johnson, Elizabeth K.; Forsythe, Jaime

    2012-01-01

    [4–8]. Experimental modifications of auditory feedback can also change vocalizations in both humans and songbirds [9–13]. However, with the exception of large manipulations of timing [14, 15], no study to date has ever directly examined the use of auditory feedback in speech production under the age of 4. Here we use a real-time formant perturbation task [16] to compare the response of toddlers, children, and adults to altered feedback. Children and adults reacted to this manipulation by changing their vowels in a direction opposite to the perturbation. Surprisingly, toddlers' speech didn't change...

  4. Aspects of voice irregularity measurement in connected speech.

    Science.gov (United States)

    Fourcin, Adrian

    2009-01-01

    Applications of the use of connected speech material for the objective assessment of two primary physical aspects of voice quality are described and discussed. Simple auditory perceptual criteria are employed to guide the choice of analysis parameters for the physical correlate of pitch, and their utility is investigated by the measurement of the characteristics of particular examples of the normal-speaking voice. This approach is extended to the measurement of vocal fold contact phase control in connected speech and both techniques are applied to pathological voice data. Copyright 2009 S. Karger AG, Basel.

  5. Pitch deviation analysis of pathological voice in connected speech.

    Science.gov (United States)

    Laflen, J Brandon; Lazarus, Cathy L; Amin, Milan R

    2008-02-01

    This study compares normal and pathologic voices using a novel voice analysis algorithm that examines pitch deviation during connected speech. The study evaluates the clinical potential of the algorithm as a mechanism to distinguish between normal and pathologic voices using connected speech. Adult vocalizations from normal subjects and patients with known benign free-edge vocal fold lesions were analyzed. Recordings had been previously obtained in quiet under controlled conditions. Two phrases and sustained /a/ were recorded per subject. The subject populations consisted of 10 normal and 31 abnormal subjects. The voice analysis algorithm generated 2-dimensional patterns that represent pitch deviation in time and under variable window widths. Measures were collected from these patterns for window widths between 10 and 250 ms. For comparison, jitter and shimmer measures were collected from sustained /a/ by means of the Computerized Speech Lab (CSL). A t-test and tests of sensitivity and specificity assessed discrimination between normal and abnormal populations. More than 58% of the measures collected from connected speech outperformed the CSL jitter and shimmer measures in population discrimination. Twenty-five percent of the experimental measures (including /a/) indicated significantly different populations (p < .05) ... connected speech.
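
    For readers unfamiliar with the CSL baseline measures referred to above, the sketch below shows conventional local jitter and shimmer (percent) computed from estimated glottal periods and cycle peak amplitudes; it is not the authors' pitch-deviation algorithm, and the example values are made up.

        import numpy as np

        def jitter_percent(periods):
            # Mean absolute difference of consecutive periods, relative to the mean period.
            p = np.asarray(periods, dtype=float)
            return 100.0 * np.mean(np.abs(np.diff(p))) / np.mean(p)

        def shimmer_percent(amplitudes):
            # Mean absolute difference of consecutive cycle amplitudes, relative to the mean amplitude.
            a = np.asarray(amplitudes, dtype=float)
            return 100.0 * np.mean(np.abs(np.diff(a))) / np.mean(a)

        periods = [0.0050, 0.0051, 0.0049, 0.0050, 0.0052]   # seconds, from sustained /a/
        amplitudes = [0.81, 0.79, 0.80, 0.82, 0.78]          # arbitrary units
        print(jitter_percent(periods), shimmer_percent(amplitudes))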

  6. A Rat Excised Larynx Model of Vocal Fold Scar

    Science.gov (United States)

    Welham, Nathan V.; Montequin, Douglas W.; Tateya, Ichiro; Tateya, Tomoko; Choi, Seong Hee; Bless, Diane M.

    2009-01-01

    Purpose: To develop and evaluate a rat excised larynx model for the measurement of acoustic, aerodynamic, and vocal fold vibratory changes resulting from vocal fold scar. Method: Twenty-four 4-month-old male Sprague-Dawley rats were assigned to 1 of 4 experimental groups: chronic vocal fold scar, chronic vocal fold scar treated with 100-ng basic…

  7. Revisiting vocal perception in non-human animals: a review of vowel discrimination, speaker voice recognition, and speaker normalization

    Directory of Open Access Journals (Sweden)

    Buddhamas eKriengwatana

    2015-01-01

    The extent to which human speech perception evolved by taking advantage of predispositions and pre-existing features of vertebrate auditory and cognitive systems remains a central question in the evolution of speech. This paper reviews asymmetries in vowel perception, speaker voice recognition, and speaker normalization in non-human animals – topics that have not been thoroughly discussed in relation to the abilities of non-human animals, but are nonetheless important aspects of vocal perception. Throughout this paper we demonstrate that addressing these issues in non-human animals is relevant and worthwhile because many non-human animals must deal with similar issues in their natural environment. That is, they must also discriminate between similar-sounding vocalizations, determine signaler identity from vocalizations, and resolve signaler-dependent variation in vocalizations from conspecifics. Overall, we find that, although plausible, the current evidence is insufficiently strong to conclude that directional asymmetries in vowel perception are specific to humans, or that non-human animals can use voice characteristics to recognize human individuals. However, we do find some indication that non-human animals can normalize speaker differences. Accordingly, we identify avenues for future research that would greatly improve and advance our understanding of these topics.

  8. Representation of speech in human auditory cortex: is it special?

    Science.gov (United States)

    Steinschneider, Mitchell; Nourski, Kirill V; Fishman, Yonatan I

    2013-11-01

    Successful categorization of phonemes in speech requires that the brain analyze the acoustic signal along both spectral and temporal dimensions. Neural encoding of the stimulus amplitude envelope is critical for parsing the speech stream into syllabic units. Encoding of voice onset time (VOT) and place of articulation (POA), cues necessary for determining phonemic identity, occurs within shorter time frames. An unresolved question is whether the neural representation of speech is based on processing mechanisms that are unique to humans and shaped by learning and experience, or is based on rules governing general auditory processing that are also present in non-human animals. This question was examined by comparing the neural activity elicited by speech and other complex vocalizations in primary auditory cortex of macaques, who are limited vocal learners, with that in Heschl's gyrus, the putative location of primary auditory cortex in humans. Entrainment to the amplitude envelope is neither specific to humans nor to human speech. VOT is represented by responses time-locked to consonant release and voicing onset in both humans and monkeys. Temporal representation of VOT is observed both for isolated syllables and for syllables embedded in the more naturalistic context of running speech. The fundamental frequency of male speakers is represented by more rapid neural activity phase-locked to the glottal pulsation rate in both humans and monkeys. In both species, the differential representation of stop consonants varying in their POA can be predicted by the relationship between the frequency selectivity of neurons and the onset spectra of the speech sounds. These findings indicate that the neurophysiology of primary auditory cortex is similar in monkeys and humans despite their vastly different experience with human speech, and that Heschl's gyrus is engaged in general auditory, and not language-specific, processing. This article is part of a Special Issue entitled

  9. Improving on hidden Markov models: An articulatorily constrained, maximum likelihood approach to speech recognition and speech coding

    Energy Technology Data Exchange (ETDEWEB)

    Hogden, J.

    1996-11-05

    The goal of the proposed research is to test a statistical model of speech recognition that incorporates the knowledge that speech is produced by relatively slow motions of the tongue, lips, and other speech articulators. This model is called Maximum Likelihood Continuity Mapping (Malcom). Many speech researchers believe that by using constraints imposed by articulator motions, we can improve or replace the current hidden Markov model based speech recognition algorithms. Unfortunately, previous efforts to incorporate information about articulation into speech recognition algorithms have suffered because (1) slight inaccuracies in our knowledge or the formulation of our knowledge about articulation may decrease recognition performance, (2) small changes in the assumptions underlying models of speech production can lead to large changes in the speech derived from the models, and (3) collecting measurements of human articulator positions in sufficient quantity for training a speech recognition algorithm is still impractical. The most interesting (and in fact, unique) quality of Malcom is that, even though Malcom makes use of a mapping between acoustics and articulation, Malcom can be trained to recognize speech using only acoustic data. By learning the mapping between acoustics and articulation using only acoustic data, Malcom avoids the difficulties involved in collecting articulator position measurements and does not require an articulatory synthesizer model to estimate the mapping between vocal tract shapes and speech acoustics. Preliminary experiments that demonstrate that Malcom can learn the mapping between acoustics and articulation are discussed. Potential applications of Malcom aside from speech recognition are also discussed. Finally, specific deliverables resulting from the proposed research are described.

  10. Derivation of equations to define inflection point and its analysis in flattening filter free photon beams based on the principle of polynomial function

    Directory of Open Access Journals (Sweden)

    KR Muralidhar

    2015-03-01

    Purpose: The objective of this work is to (1) present a mechanism for calculating inflection points on profiles at various depths and field sizes, and (2) study the doses at the inflection points for various field sizes at the depth of maximum dose (Dmax) for flattening filter free (FFF) photon beam profiles. Methods: Percentage of dose versus inflection point was represented graphically. Also, using a polynomial function, the author formulated equations for calculating the exact inflection point on the profiles for both the 6 MV and 10 MV energies for different field sizes at various depths. Results: In a 10 MV FFF radiation beam, the dose at the inflection point of the profile decreases as the field size increases. However, in a 6 MV FFF radiation beam, the dose at the inflection point initially increases with field size up to 10 × 10 cm² and decreases beyond 10 × 10 cm². The polynomial function was fitted for both the 6 MV and 10 MV FFF beams for all field sizes and depths. Conclusion: A polynomial function is one of the easiest ways of identifying the inflection point in FFF beams for various field sizes and depths. Graphical representations of dose versus inflection point were derived for both FFF energies.
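
    A small sketch of the polynomial-fit idea described above, under assumed inputs (off-axis position in cm and relative dose in %): fit a polynomial to one penumbral side of the profile and take the root of its second derivative on the steepest part of the curve as the inflection point. The degree, the sigmoid test profile and all values are illustrative, not the paper's data or equations.

        import numpy as np

        def inflection_point(x, dose, degree=6):
            # Fit a polynomial to one side of the profile; the inflection point is the
            # real root of the second derivative lying on the steepest part of the curve.
            p = np.polynomial.Polynomial.fit(x, dose, degree)
            roots = p.deriv(2).roots()
            candidates = roots[np.isreal(roots)].real
            candidates = candidates[(candidates > x.min()) & (candidates < x.max())]
            best = candidates[np.argmax(np.abs(p.deriv(1)(candidates)))]
            return float(best), float(p(best))                       # position (cm), dose (%)

        x = np.linspace(0.0, 8.0, 81)                                # off-axis distance, one side
        dose = 100.0 / (1.0 + np.exp(2.5 * (x - 5.0)))               # sigmoid-like FFF penumbra
        print(inflection_point(x, dose))                             # expect roughly (5.0, 50%)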

  11. Vocal development and auditory perception in CBA/CaJ mice

    Science.gov (United States)

    Radziwon, Kelly E.

    Mice are useful laboratory subjects because of their small size, their modest cost, and the fact that researchers have created many different strains to study a variety of disorders. In particular, researchers have found nearly 100 naturally occurring mouse mutations with hearing impairments. For these reasons, mice have become an important model for studies of human deafness. Although much is known about the genetic makeup and physiology of the laboratory mouse, far less is known about mouse auditory behavior. To fully understand the effects of genetic mutations on hearing, it is necessary to determine the hearing abilities of these mice. Two experiments here examined various aspects of mouse auditory perception using CBA/CaJ mice, a commonly used mouse strain. The frequency difference limens experiment tested the mouse's ability to discriminate one tone from another based solely on the frequency of the tone. The mice had similar thresholds as wild mice and gerbils but needed a larger change in frequency than humans and cats. The second psychoacoustic experiment sought to determine which cue, frequency or duration, was more salient when the mice had to identify various tones. In this identification task, the mice overwhelmingly classified the tones based on frequency instead of duration, suggesting that mice are using frequency when differentiating one mouse vocalization from another. The other two experiments were more naturalistic and involved both auditory perception and mouse vocal production. Interest in mouse vocalizations is growing because of the potential for mice to become a model of human speech disorders. These experiments traced mouse vocal development from infant to adult, and they tested the mouse's preference for various vocalizations. This was the first known study to analyze the vocalizations of individual mice across development. Results showed large variation in calling rates among the three cages of adult mice but results were highly

  12. Speech recognition for the anaesthesia record during crisis scenarios.

    Science.gov (United States)

    Alapetite, Alexandre

    2008-07-01

    This article describes the evaluation of a prototype speech-input interface to an anaesthesia patient record, conducted in a full-scale anaesthesia simulator involving six doctor-nurse anaesthetist teams. The aims of the experiment were, first, to assess the potential advantages and disadvantages of a vocal interface compared to the traditional touch-screen and keyboard interface to an electronic anaesthesia record during crisis situations; second, to assess the usability in a realistic work environment of some speech input strategies (hands-free vocal interface activated by a keyword; combination of command and free text modes); finally, to quantify some of the gains that could be provided by the speech input modality. Six anaesthesia teams composed of one doctor and one nurse were each confronted with two crisis scenarios in a full-scale anaesthesia simulator. Each team would fill in the anaesthesia record, in one session using only the traditional touch-screen and keyboard interface, while in the other session they could also use the speech input interface. Audio-video recordings of the sessions were subsequently analysed and additional subjective data were gathered from a questionnaire. Data were analysed using a method inspired by queuing theory in order to compare the delays associated with the two interfaces and to quantify the workload inherent in the memorization of items to be entered into the anaesthesia record. The experiment showed on the one hand that the traditional touch-screen and keyboard interface imposes a steadily increasing mental workload in terms of items to keep in memory until there is time to update the anaesthesia record, and on the other hand that the speech input interface allows anaesthetists to enter medications and observations almost simultaneously when they are given or made. The tested speech input strategies were successful, even with the ambient noise. Speaking to the system while working appeared feasible, although

  13. Tashkeela: Novel corpus of Arabic vocalized texts, data for auto-diacritization systems.

    Science.gov (United States)

    Zerrouki, Taha; Balla, Amar

    2017-04-01

    Arabic diacritics are often omitted from Arabic script. This is a handicap for new learners reading Arabic, for text-to-speech conversion systems, and for the reading and semantic analysis of Arabic texts. Automatic diacritization systems are the best solution to handle this issue, but such automation needs resources, such as diacritized texts, to train and evaluate those systems. In this paper, we describe our corpus of Arabic diacritized texts, called Tashkeela. It can be used as a linguistic resource for natural language processing tasks such as automatic diacritization systems, disambiguation mechanisms, and feature and data extraction. The corpus is freely available; it contains 75 million fully vocalized words drawn mainly from 97 books of classical and modern Arabic, collected from manually vocalized texts using a web crawling process.
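
    A corpus like this is typically turned into training pairs by stripping the diacritics from the vocalized text and keeping the original as the target. A minimal sketch follows, assuming the tashkeel marks fall in the Unicode range U+064B–U+0652; this range and the preprocessing are assumptions of the sketch, not taken from the Tashkeela documentation.

      import re

      TASHKEEL = re.compile(r"[\u064B-\u0652]")   # assumed range of Arabic diacritic marks

      def strip_tashkeel(text: str) -> str:
          """Remove diacritic marks, keeping the base letters untouched."""
          return TASHKEEL.sub("", text)

      def make_pair(vocalized_line: str):
          """Return an (undiacritized input, diacritized target) training pair."""
          return strip_tashkeel(vocalized_line), vocalized_line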

  14. Speech communications in noise

    Science.gov (United States)

    1984-07-01

    The physical characteristics of speech, the methods of speech masking measurement, and the effects of noise on speech communication are investigated. Topics include the speech signal and intelligibility, the effects of noise on intelligibility, the articulation index, and various devices for evaluating speech systems.

  15. Vocal ontogeny in neotropical singing mice (Scotinomys).

    Directory of Open Access Journals (Sweden)

    Polly Campbell

    Full Text Available Isolation calls produced by dependent young are a fundamental form of communication. For species in which vocal signals remain important to adult communication, the function and social context of vocal behavior changes dramatically with the onset of sexual maturity. The ontogenetic relationship between these distinct forms of acoustic communication is surprisingly under-studied. We conducted a detailed analysis of vocal development in sister species of Neotropical singing mice, Scotinomys teguina and S. xerampelinus. Adult singing mice are remarkable for their advertisement songs, rapidly articulated trills used in long-distance communication; the vocal behavior of pups was previously undescribed. We recorded 30 S. teguina and 15 S. xerampelinus pups daily, from birth to weaning; 23 S. teguina and 11 S. xerampelinus were recorded until sexual maturity. Like other rodent species with poikilothermic young, singing mice were highly vocal during the first weeks of life and stopped vocalizing before weaning. Production of first advertisement songs coincided with the onset of sexual maturity after a silent period of ≧2 weeks. Species differences in vocal behavior emerged early in ontogeny and notes that comprise adult song were produced from birth. However, the organization and relative abundance of distinct note types was very different between pups and adults. Notably, the structure, note repetition rate, and intra-individual repeatability of pup vocalizations did not become more adult-like with age; the highly stereotyped structure of adult song appeared de novo in the first songs of young adults. We conclude that, while the basic elements of adult song are available from birth, distinct selection pressures during maternal dependency, dispersal, and territorial establishment favor major shifts in the structure and prevalence of acoustic signals. This study provides insight into how an evolutionarily conserved form of acoustic signaling provides

  16. Effect of Cues to Increase Sound Pressure Level on Respiratory Kinematic Patterns during Connected Speech

    Science.gov (United States)

    Huber, Jessica E.

    2007-01-01

    Purpose: This study examined the response of the respiratory system to 3 cues used to elicit increased vocal loudness to determine whether the effects of cueing, shown previously in sentence tasks, were present in connected speech tasks and to describe differences among tasks. Method: Fifteen young men and 15 young women produced a 2-paragraph…

  17. Evidence-based treatment of voice and speech disorders in Parkinson disease.

    Science.gov (United States)

    Mahler, Leslie A; Ramig, Lorraine O; Fox, Cynthia

    2015-06-01

    Voice and speech impairments are present in nearly 90% of people with Parkinson disease and negatively impact communication and quality of life. This review addresses the efficacy of Lee Silverman Voice Treatment (LSVT) LOUD to improve vocal loudness (as measured by vocal sound pressure level, vocSPL) and functional communication in people with Parkinson disease. The underlying physiologic mechanisms of Parkinson disease associated with voice and speech changes and the strength of the current treatment evidence are discussed with recommendations for best clinical practice. Two randomized controlled trials demonstrated that participants who received LSVT LOUD were significantly better on the primary outcome variable of improved vocSPL posttreatment than alternative-treatment and no-treatment groups. Treatment effects were maintained for up to 2 years. In addition, improvements have been demonstrated in associated outcome variables, including speech rate, monotone, voice quality, speech intelligibility, vocal fold adduction, swallowing, facial expression and neural activation. Advances in technology-supported treatment delivery are enhancing treatment accessibility. Data support the efficacy of LSVT LOUD to increase vocal loudness and functional communication in people with Parkinson disease. Timely intervention is essential for maximizing quality of life for people with Parkinson disease.

  18. How Can We Create the Conditions for Students' Freedom of Speech within Studies in Art?

    Science.gov (United States)

    Matthews, Miranda

    2008-01-01

    This study investigates how the dynamics of students' voice can be productively brought into teaching situations. I have researched the conditions required for constructive freedom of speech, within art education. I explored the potential for vocal peer assessment and for students' ownership of their educational experiences, for the…

  19. Visual-only discrimination between native and non-native speech

    NARCIS (Netherlands)

    Georgakis, Christos; Petridis, Stavros; Pantic, Maja

    2014-01-01

    Accent is an important biometric characteristic that is defined by the presence of specific traits in the speaking style of an individual. These are identified by patterns in the speech production system, such as those present in the vocal tract or in lip movements. Evidence from linguistics and spe

  20. The Relationship between Infants' Production Experience and Their Processing of Speech

    Science.gov (United States)

    Majorano, Marinella; Vihman, Marilyn M.; DePaolis, Rory A.

    2014-01-01

    The early relationship between children's emerging articulatory abilities and their capacity to process speech input was investigated, following recent studies with English-learning infants. Twenty-six monolingual Italian-learning infants were tested at 6 months (no consistent and stable use of consonants, or vocal motor schemes [VMS]) and at…

  1. The Pattern of Air Flow out of the Mouth during Speech.

    Science.gov (United States)

    LANE, H.; AND OTHERS

    Since the 19th century, kymographic recording of total air flow out of the mouth has been used to diagnose the varying durations and degrees of constrictions of the vocal tract during speech. The present project attempts to introduce a second dimension to recordings of air flow out of the mouth--namely, cross-sectional area of flow--on the…

  4. Vocal learning beyond imitation: mechanisms of adaptive vocal development in songbirds and human infants.

    Science.gov (United States)

    Tchernichovski, Ofer; Marcus, Gary

    2014-10-01

    Studies of vocal learning in songbirds typically focus on the acquisition of sensory templates for song imitation and on the consequent process of matching song production to templates. However, functional vocal development also requires the capacity to adaptively diverge from sensory templates, and to flexibly assemble vocal units. Examples of adaptive divergence include the corrective imitation of abnormal songs, and the decreased tendency to copy over-abundant syllables. Such frequency-dependent effects might mirror tradeoffs between the assimilation of group identity (culture) while establishing individual and flexibly expressive songs. Intriguingly, although the requirements for vocal plasticity vary across songbirds, and more so between birdsong and language, the capacity to flexibly assemble vocal sounds develops in a similar, stepwise manner across species. Therefore, universal features of vocal learning go well beyond the capacity to imitate. Copyright © 2014 Elsevier Ltd. All rights reserved.

  5. Inflectional instabilities in the wall region of bounded turbulent shear flows

    Science.gov (United States)

    Swearingen, Jerry D.; Blackwelder, Ron F.; Spalart, Philippe R.

    1987-01-01

    The primary thrust of this research was to identify one or more mechanisms responsible for strong turbulence production events in the wall region of bounded turbulent shear flows. Based upon previous work in a transitional boundary layer, it seemed highly probable that the production events were preceded by an inflectional velocity profile which formed on the interface between the low-speed streak and the surrounding fluid. In bounded transitional flows, this unstable profile developed velocity fluctuations in the streamwise direction and in the direction perpendicular to the sheared surface. The rapid growth of these instabilities leads to a breakdown and production of turbulence. Since bounded turbulent flows have many of the same characteristics, they may also experience a similar type of breakdown and turbulence production mechanism.

  6. Similarity Between Turkish & Akkadian Based on Rules of Inflective & Agglutinative Languages

    Directory of Open Access Journals (Sweden)

    Elşad Allili

    2014-08-01

    Full Text Available Akkadian, although a dead language, has left deep imprints on Semitic and some Indo-European languages, and has played an important role in the history of mankind. It is accepted as the ancestor of all the Semitic languages. Beginning from the era of Sargon I, it became the official language in a vast area from Anatolia to Egypt and to India. Akkadian was the "Lingua Franca" of the ancient world, and has passed on many words to other languages such as Persian, Sanskrit and Greek. Although Assyriologists at present ignore it, the language spoken in the very early days of Akkad, in the XXVIII-XXIV centuries BCE, may have been an agglutinative language like today's Turkish or Magyar, rather than an inflective language like today's Arabic and all Syriac languages. Thus it may show parallelism with Turkish.

  7. The Research of the Relationship between Perceived Stress Level and Times of Vibration of Vocal Folds

    Directory of Open Access Journals (Sweden)

    Yin Zhigang

    2012-11-01

    Full Text Available Whether a syllable is perceived as stressed or not, and whether the stress is strong or weak, are hot issues in speech prosody research and speech recognition. A focus of stress research is the investigation of the acoustic factors which contribute to the perception of stress level. This study examined all possible acoustic/physiological cues to stress based on data from the Annotated Chinese Speech Corpus and proposed that times of vibration of vocal folds (TVVF) reflects stress level best. It is traditionally held that pitch and duration are the most important acoustic parameters for stress. But for Chinese, which is a tone language and features a special strong-weak pattern in prosody, these two parameters might not be the best ones to represent stress degree. This paper proposes that TVVF, reflected as the number of wave pulses in the vocalic part of a syllable, is the ideal parameter for stress level. Since the number of pulses is the integral of pitch over duration (Pulse = ∫ f(pitch) dt), TVVF can embody the effect of stress on both pitch and duration. The analyses revealed that TVVF is the parameter most correlated with the grades of stress. Therefore, it can be a more effective parameter for indicating stress level.
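
    As a minimal sketch of this relationship (an illustration, not the authors' implementation), the pulse count over a vocalic segment can be approximated by integrating a frame-wise f0 contour over time.

      import numpy as np

      def times_of_vibration(f0_hz, frame_dt_s):
          """Approximate TVVF as the time integral of f0 over the vocalic segment,
          i.e. pulses ≈ ∫ f0(t) dt, evaluated with a rectangular rule; unvoiced
          frames (f0 = 0) contribute nothing."""
          f0 = np.asarray(f0_hz, dtype=float)
          return float(np.sum(f0[f0 > 0]) * frame_dt_s)

      # A 200 ms vowel at roughly 200 Hz, with f0 estimated every 10 ms, gives ~40 pulses.
      print(times_of_vibration([200.0] * 20, 0.010))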

  8. The impact of rate reduction and increased vocal intensity on coarticulation in dysarthria

    Science.gov (United States)

    Tjaden, Kris

    2003-04-01

    The dysarthrias are a group of speech disorders resulting from impairment to nervous system structures important for the motor execution of speech. Although numerous studies have examined how dysarthria impacts articulatory movements or changes in vocal tract shape, few studies of dysarthria consider that articulatory events and their acoustic consequences overlap or are coarticulated in connected speech. The impact of rate, loudness, and clarity on coarticulatory patterns in dysarthria is also poorly understood, although these prosodic manipulations are frequently employed as therapy strategies to improve intelligibility in dysarthria and are known to affect coarticulatory patterns for at least some neurologically healthy speakers. The current study examined the effects of slowed rate and increased vocal intensity on anticipatory coarticulation for speakers with dysarthria secondary to Multiple Sclerosis (MS), as inferred from the acoustic signal. Healthy speakers were studied for comparison purposes. Three repetitions of twelve target words embedded in the carrier phrase "It's a -- again" were produced in habitual, loud, and slow speaking conditions. F2 frequencies and first moment coefficients were used to infer coarticulation. Both group and individual speaker trends will be examined in the data analyses.

  9. Functional connectivity associated with acoustic stability during vowel production: implications for vocal-motor control.

    Science.gov (United States)

    Sidtis, John J

    2015-03-01

    Vowels provide the acoustic foundation of communication through speech and song, but little is known about how the brain orchestrates their production. Positron emission tomography was used to study regional cerebral blood flow (rCBF) during sustained production of the vowel /a/. Acoustic and blood flow data from 13 normal, right-handed, native speakers of American English were analyzed to identify CBF patterns that predicted the stability of the first and second formants of this vowel. Formants are bands of resonance frequencies that provide vowel identity and contribute to voice quality. The results indicated that formant stability was directly associated with blood flow increases and decreases in both left- and right-sided brain regions. Secondary brain regions (those associated with the regions predicting formant stability) were more likely to have an indirect negative relationship with first formant variability, but an indirect positive relationship with second formant variability. These results are not definitive maps of vowel production, but they do suggest that the level of motor control necessary to produce stable vowels is reflected in the complexity of an underlying neural system. These results also extend a systems approach to functional image analysis, previously applied to normal and ataxic speech rate, which is based solely on identifying patterns of brain activity associated with specific performance measures. Understanding the complex relationships between multiple brain regions and the acoustic characteristics of vocal stability may provide insight into the pathophysiology of the dysarthrias, vocal disorders, and other speech changes in neurological and psychiatric disorders.

  10. Vocal cord paralysis caused by stingray.

    Science.gov (United States)

    Kwon, Oh Jin; Park, Jung Je; Kim, Jin Pyeong; Woo, Seung Hoon

    2013-11-01

    Foreign bodies in the oral cavity and pharynx are commonly encountered in the emergency room and outpatient departments, and the most frequently observed of these foreign bodies are fish bones. Among the possible complications resulting from a pharyngeal foreign body, vocal cord fixation is extremely rare, with only three cases previously reported in the English literature. The mechanisms of vocal cord fixation can be classified into mechanical articular fixation, direct injury of the recurrent laryngeal nerve, or recurrent laryngeal nerve paralysis secondary to inflammation. The case discussed here is different from previous cases. We report a rare case of vocal cord paralysis caused by the venom of a stingray tail in the hypopharynx.

  11. Vocal cord paralysis in a fighter pilot.

    Science.gov (United States)

    Maturo, Stephen; Brennan, Joseph

    2006-01-01

    We present in this case report the return to flying duty of a pilot with vocal cord paralysis secondary to removal of a thymoma. We discuss the importance of glottic function as it pertains to the unique aviation environment. We also discuss the anatomy and physiology of the glottis, the evaluation for vocal cord paralysis, and surgical approaches for paralyzed vocal cords. Although the incidence of recurrent laryngeal nerve paralysis is low in the military aviation community, it is important to recognize that its sequelae can be managed so that the aviator may return to flight duties.

  12. The Attention-Getting Capacity of Whines and Child-Directed Speech

    Directory of Open Access Journals (Sweden)

    Rosemarie Sokol Chang

    2010-04-01

    Full Text Available The current study tested the ability of whines and child-directed speech to attract the attention of listeners involved in a story repetition task. Twenty non-parents and 17 parents were presented with two dull stories, each playing to a separate ear, and asked to repeat one of the stories verbatim. The story that participants were instructed to ignore was interrupted occasionally with the reader whining and using child-directed speech. While repeating the passage, participants were monitored for Galvanic skin response, heart rate, and blood pressure. Based on 4 measures, participants tuned in more to whining, and to a lesser extent child-directed speech, than neutral speech segments that served as a control. Participants, regardless of gender or parental status, made more mistakes when presented with the whine or child-directed speech, they recalled hearing those vocalizations, they recognized more words from the whining segment than the neutral control segment, and they exhibited higher Galvanic skin response during the presence of whines and child-directed speech than neutral speech segments. Whines and child-directed speech appear to be integral members of a suite of vocalizations designed to get the attention of attachment partners by playing to an auditory sensitivity among humans. Whines in particular may serve the function of eliciting care at a time when caregivers switch from primarily mothers to greater care from other caregivers.

  13. Incomplete and inaccurate vocal imitation after knockdown of FoxP2 in songbird basal ganglia nucleus Area X.

    Directory of Open Access Journals (Sweden)

    Sebastian Haesler

    2007-12-01

    Full Text Available The gene encoding the forkhead box transcription factor, FOXP2, is essential for developing the full articulatory power of human language. Mutations of FOXP2 cause developmental verbal dyspraxia (DVD), a speech and language disorder that compromises the fluent production of words and the correct use and comprehension of grammar. FOXP2 patients have structural and functional abnormalities in the striatum of the basal ganglia, which also express high levels of FOXP2. Since human speech and learned vocalizations in songbirds bear behavioral and neural parallels, songbirds provide a genuine model for investigating the basic principles of speech and its pathologies. In zebra finch Area X, a basal ganglia structure necessary for song learning, FoxP2 expression increases during the time when song learning occurs. Here, we used lentivirus-mediated RNA interference (RNAi) to reduce FoxP2 levels in Area X during song development. Knockdown of FoxP2 resulted in an incomplete and inaccurate imitation of tutor song. Inaccurate vocal imitation was already evident early during song ontogeny and persisted into adulthood. The acoustic structure and the duration of adult song syllables were abnormally variable, similar to word production in children with DVD. Our findings provide the first example of a functional gene analysis in songbirds and suggest that normal auditory-guided vocal motor learning requires FoxP2.

  14. Verb inflection in German-learning children with typical and atypical language acquisition: the impact of subsyllabic frequencies.

    Science.gov (United States)

    Ott, Susan; Höhle, Barbara

    2013-01-01

    Previous research has shown that high phonotactic frequencies facilitate the production of regularly inflected verbs in English-learning children with specific language impairment (SLI) but not with typical development (TD). We asked whether this finding can be replicated for German, a language with a much more complex inflectional verb paradigm than English. Using an elicitation task, the production of inflected nonce verb forms (3rd person singular with -t suffix) with either high- or low-frequency subsyllables was tested in sixteen German-learning children with SLI (ages 4;1-5;1), sixteen TD-children matched for chronological age (CA) and fourteen TD-children matched for verbal age (VA) (ages 3;0-3;11). The findings revealed that children with SLI, but not CA- or VA-children, showed differential performance between the two types of verbs, producing more inflectional errors when the verb forms resulted in low-frequency subsyllables than when they resulted in high-frequency subsyllables, replicating the results from English-learning children.

  15. Learning an invented inflectional morpheme in Spanish by children with typical language skills and with specific language impairment (SLI).

    Science.gov (United States)

    Anderson, R T

    2001-01-01

    Cross-linguistic research on SLI has suggested that how the disorder is manifested depends on the ambient language. For example, research on Italian indicates that SLI children do not present difficulties with verb inflection, when compared with MLU-matched peers. This pattern contrasts with what has been reported for English-speaking children. The present investigation sought to examine SLI children's use of inflectional morphology through a language teaching task similar to that used by Connell (1987) and Connell and Stone (1992). To address cross-linguistic differences, children were speakers of a language similar to Italian in its verb agreement paradigm. Sixteen Puerto Rican Spanish-speaking children with SLI and 16 age-matched controls were taught a subject-verb agreement suffix that established the subject's gender. Half the children in each group were taught the new form via imitation. The rest of the participants were trained via a modeling procedure. Both comprehension and production of the target form were assessed. Results indicated significant differences across the SLI and typical groups for both comprehension and production of the inflectional morpheme, regardless of instructional strategy. These findings contradict what has been observed in previous studies on teaching an invented rule to children with SLI. They also suggest that inflectional morphology may be problematic even for children who are learning a morphologically rich language. The explanatory power of the process account and the linguistic account of SLI are explored as these pertain to the present findings, and suggestions for further research are discussed.

  16. Speech processing system demonstrated by positron emission tomography (PET). A review of the literature

    Energy Technology Data Exchange (ETDEWEB)

    Hirano, Shigeru; Naito, Yasushi; Kojima, Hisayoshi [Kyoto Univ. (Japan)]

    1996-03-01

    We review the literature on speech processing in the central nervous system as demonstrated by positron emission tomography (PET). Activation studies using PET have proved to be a useful and non-invasive method of investigating the speech processing system in normal subjects. In speech recognition, the auditory association areas and the lexico-semantic areas called Wernicke's area play important roles. Broca's area, motor areas, supplementary motor cortices and the prefrontal area have been shown to be related to speech output. Visual speech stimulation activates not only the visual association areas but also the temporal region and prefrontal area, especially in lexico-semantic processing. Higher level speech processing, such as conversation, which includes auditory processing, vocalization and thinking, activates broad areas in both hemispheres. This paper also discusses problems to be resolved in the future. (author) 42 refs.

  17. Effect of Noise on Vocal Loudness and Pitch in Natural Environments: An Accelerometer (Ambulatory Phonation Monitor) Study.

    Science.gov (United States)

    Yiu, Edwin M-L; Yip, Priscilla P S

    2016-07-01

    This study investigated the effects of environmental noise on the production of vocal intensity and fundamental frequency using an accelerometer. Twenty-four vocally healthy young adults (12 men and 12 women, aged 19-22 years) recorded a monologue passage using the KayPENTAX (Montvale, NJ, USA) Ambulatory Phonation Monitor (model 3200) under three natural environmental conditions in a randomized order: a quiet room (mean noise, 35.5 dBA), a room with a moderate level of noise (mean noise, 54.5 dBA), and a room with high noise (mean noise, 67.5 dBA). Both gender groups showed significant increases in mean vocal intensity, fundamental frequency, and perceived vocal effort in the high-noise environment compared with the other two conditions. No significant difference was found in vocal intensity between the quiet and moderately noisy environments, except in the fundamental frequency in the female group. This study showed that the use of an accelerometer for laryngeal signal recording could be a useful tool for measuring phonation without being affected by background noise. The findings also support the recommendation that noise levels for conversation should be kept <50-55 dB to maintain speech intelligibility. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  18. Towards Artificial Speech Therapy: A Neural System for Impaired Speech Segmentation.

    Science.gov (United States)

    Iliya, Sunday; Neri, Ferrante

    2016-09-01

    This paper presents a neural system-based technique for segmenting short impaired speech utterances into silent, unvoiced, and voiced sections. Moreover, the proposed technique identifies those points of the (voiced) speech where the spectrum becomes steady. The resulting technique thus aims at detecting that limited section of the speech which contains the information about the potential impairment of the speech. This section is of interest to the speech therapist as it corresponds to the possibly incorrect movements of speech organs (lower lip and tongue with respect to the vocal tract). Two segmentation models to detect and identify the various sections of the disordered (impaired) speech signals have been developed and compared. The first makes use of a combination of four artificial neural networks. The second is based on a support vector machine (SVM). The SVM has been trained by means of an ad hoc nested algorithm whose outer layer is a metaheuristic while the inner layer is a convex optimization algorithm. Several metaheuristics have been tested and compared, leading to the conclusion that some variants of the compact differential evolution (CDE) algorithm appear to be well-suited to address this problem. Numerical results show that the SVM model with a radial basis function is capable of effective detection of the portion of speech that is of interest to a therapist. The best performance has been achieved when the system is trained by the nested algorithm whose outer layer is hybrid-population-based/CDE. A population-based approach displays the best performance for the isolation of silence/noise sections, and the detection of unvoiced sections. On the other hand, a compact approach appears to be clearly well-suited to detect the beginning of the steady state of the voiced signal. Both of the proposed segmentation models outperformed two modern segmentation techniques based on Gaussian mixture models and deep learning.
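
    For orientation only, a frame-wise silence/unvoiced/voiced classifier in the same spirit can be sketched with short-time energy and zero-crossing-rate features feeding a standard RBF-kernel SVM (ordinary training, not the nested metaheuristic described above; the feature choice and labels are assumptions of the sketch).

      import numpy as np
      from sklearn.svm import SVC

      def frame_features(signal, sr, frame_len=0.025, hop=0.010):
          """Short-time log-energy and zero-crossing rate for each analysis frame."""
          n, h = int(frame_len * sr), int(hop * sr)
          feats = []
          for start in range(0, len(signal) - n, h):
              frame = signal[start:start + n]
              log_energy = np.log10(np.sum(frame ** 2) + 1e-10)
              zcr = np.mean(np.abs(np.diff(np.sign(frame)))) / 2.0
              feats.append([log_energy, zcr])
          return np.array(feats)

      # X_train: stacked frame features from annotated recordings;
      # y_train: per-frame labels (0 = silence/noise, 1 = unvoiced, 2 = voiced).
      clf = SVC(kernel="rbf", C=1.0, gamma="scale")
      # clf.fit(X_train, y_train)
      # labels = clf.predict(frame_features(new_signal, sr))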

  19. Childhood dysphonia: do parents' harmful vocal habits interfere with their children's vocal health?

    Directory of Open Access Journals (Sweden)

    Carla Lucélia Bessani Paixão

    2012-08-01

    Full Text Available PURPOSE: to investigate harmful vocal habits reported by dysphonic children and their parents and to compare them with data gathered from a control group made up of children with no vocal alterations and their parents. METHOD: twenty-eight dysphonic children aged 6 to 12 years and their parents (Study Group - SG), along with 22 children with no vocal alterations from the same age group and their parents (Control Group - CG), were investigated. The voices were classified as "healthy" or "unhealthy" by perceptual-auditory analysis of a spontaneous speech sample. Subjects answered a questionnaire about harmful vocal habits. Tests for comparing two proportions (p < 0.05) were used to analyze the results. RESULTS: SG children showed a significantly higher number of habits such as speaking with effort, speaking without resting and imitating voices. SG fathers showed a significantly higher number of habits such as clearing the throat, shouting, speaking at the same time as others, in addition to living in smokers' environments. SG mothers significantly showed more habits such as speaking with effort, speaking in noisy environments

  20. Human cerebral response to animal affective vocalizations

    National Research Council Canada - National Science Library

    Pascal Belin; Shirley Fecteau; Ian Charest; Nicholas Nicastro; Marc D Hauser; Jorge L Armony

    2008-01-01

    .... Here, we used functional magnetic resonance imaging in normal participants to measure cerebral activity during auditory stimulation with affectively valenced animal vocalizations, some familiar (cats) and others not (rhesus monkeys...

  1. Going to a Speech Therapist

    Science.gov (United States)

    Speech therapists (also called speech-language pathologists) help people of all …

  2. The evolution of coordinated vocalizations before language.

    Science.gov (United States)

    Bryant, Gregory A

    2014-12-01

    Ackermann et al. briefly point out the potential significance of coordinated vocal behavior in the dual pathway model of acoustic communication. Rhythmically entrained and articulated pre-linguistic vocal activity in early hominins might have set the evolutionary stage for later refinements that manifest in modern humans as language-based conversational turn-taking, joint music-making, and other behaviors associated with prosociality.

  3. Ultrasonic Vocalizations by Adult Rats (Rattus norvegicus)

    Science.gov (United States)

    1991-12-01

    begun. Diazepam, chlordiazepoxide, morphine, or naloxone was administered I.P. prior to placing the rat in the tailshock apparatus. Four different...by chlordiazepoxide and diazepam. Drug Dev. Res., 5, 185-193 (1985). Gardner, C.R., and Budhram, P. Effects of agents which interact with central... diazepam, and chlorpromazine, attenuate these vocalizations. Recent work by Kaltwasser (1990) examined the occurrence of vocalizations in response to

  4. VOCAL SEGMENT CLASSIFICATION IN POPULAR MUSIC

    OpenAIRE

    Feng, Ling; Nielsen, Andreas Brinch; Hansen, Lars Kai

    2008-01-01

    This paper explores the vocal and non-vocal music classification problem within popular songs. A newly built labeled database covering 147 popular songs is announced. It is designed for classifying signals from 1-second time windows. Features are selected for this particular task, in order to capture both the temporal correlations and the dependencies among the feature dimensions. We systematically study the performance of a set of classifiers, including linear regression, generalized linear mode...

  5. Fluid Dynamics of Human Phonation and Speech

    Science.gov (United States)

    Mittal, Rajat; Erath, Byron D.; Plesniak, Michael W.

    2013-01-01

    This article presents a review of the fluid dynamics, flow-structure interactions, and acoustics associated with human phonation and speech. Our voice is produced through the process of phonation in the larynx, and an improved understanding of the underlying physics of this process is essential to advancing the treatment of voice disorders. Insights into the physics of phonation and speech can also contribute to improved vocal training and the development of new speech compression and synthesis schemes. This article introduces the key biomechanical features of the laryngeal physiology, reviews the basic principles of voice production, and summarizes the progress made over the past half-century in understanding the flow physics of phonation and speech. Laryngeal pathologies, which significantly enhance the complexity of phonatory dynamics, are discussed. After a thorough examination of the state of the art in computational modeling and experimental investigations of phonatory biomechanics, we present a synopsis of the pacing issues in this arena and an outlook for research in this fascinating subject.

  6. Kannada Phonemes to Speech Dictionary: Statistical Approach

    Directory of Open Access Journals (Sweden)

    Mallamma V. Reddy

    2017-01-01

    Full Text Available The input or output of a natural language processing system can be either written text or speech. To process written text we need lexical, syntactic, and semantic knowledge about the language, discourse information, and real-world knowledge. To process spoken language, we need everything required to process written text, along with the challenges of speech recognition and speech synthesis. This paper describes how the articulatory phonetics of Kannada is used to generate a phoneme-to-speech dictionary for Kannada; a statistical computational approach is used to map the elements taken from the input query or documents. In articulatory phonetics, the place of articulation of a consonant is the point of contact where an obstruction occurs in the vocal tract between an articulatory gesture, an active articulator (typically some part of the tongue), and a passive location (typically some part of the roof of the mouth). Along with the manner of articulation and the phonation, this gives the consonant its distinctive sound. Results are presented for the same.

  7. Recommendations for Real-Time Speech MRI

    Science.gov (United States)

    Lingala, Sajan Goud; Sutton, Brad P.; Miquel, Marc E.; Nayak, Krishna S.

    2016-01-01

    Real-time magnetic resonance imaging (RT-MRI) is being increasingly used for speech and vocal production research studies. Several imaging protocols have emerged based on advances in RT-MRI acquisition, reconstruction, and audio-processing methods. This review summarizes the state-of-the-art, discusses technical considerations, and provides specific guidance for new groups entering this field. We provide recommendations for performing RT-MRI of the upper airway. This is a consensus statement stemming from the ISMRM-endorsed Speech MRI summit held in Los Angeles, February 2014. A major unmet need identified at the summit was the need for consensus on protocols that can be easily adapted by researchers equipped with conventional MRI systems. To this end, we provide a discussion of tradeoffs in RT-MRI in terms of acquisition requirements, a priori assumptions, artifacts, computational load, and performance for different speech tasks. We provide four recommended protocols and identify appropriate acquisition and reconstruction tools. We list pointers to open-source software that facilitate implementation. We conclude by discussing current open challenges in the methodological aspects of RT-MRI of speech. PMID:26174802

  8. A generative model of speech production in Broca’s and Wernicke’s areas

    Directory of Open Access Journals (Sweden)

    Cathy J Price

    2011-09-01

    Full Text Available Speech production involves the generation of an auditory signal from the articulators and vocal tract. When the intended auditory signal does not match the produced sounds, subsequent articulatory commands can be adjusted to reduce the difference between the intended and produced sounds. This requires an internal model of the intended speech output that can be compared to the produced speech. The aim of this functional imaging study was to identify brain activation related to the internal model of speech production after activation related to vocalisation, auditory feedback and movement in the articulators had been controlled. There were four conditions: silent articulation of speech, non-speech mouth movements, finger tapping and visual fixation. In the speech conditions, participants produced the mouth movements associated with the words one and three. We eliminated auditory feedback from the spoken output by instructing participants to articulate these words without producing any sound. The non-speech mouth movement conditions involved lip pursing and tongue protrusions to control for movement in the articulators. The main difference between our speech and non-speech mouth movement conditions is that prior experience producing speech sounds leads to the automatic and covert generation of auditory and phonological associations that may play a role in predicting auditory feedback. We found that, relative to non-speech mouth movements, silent speech activated Broca's area in the left dorsal pars opercularis and Wernicke's area in the left posterior superior temporal sulcus. We discuss these results in the context of a generative model of speech production and propose that Broca’s and Wernicke’s areas may be involved in predicting the speech output that follows articulation. These predictions could provide a mechanism by which rapid movement of the articulators is precisely matched to the intended speech outputs during future articulations.

  9. Group delay functions and its applications in speech technology

    Indian Academy of Sciences (India)

    Hema A Murthy; B Yegnanarayana

    2011-10-01

    Traditionally, the information in speech signals is represented in terms of features derived from short-time Fourier analysis. In this analysis the features extracted from the magnitude of the Fourier transform (FT) are considered, ignoring the phase component. Although the significance of the FT phase was highlighted in several studies over the recent three decades, the features of the FT phase were not exploited fully due to difficulty in computing the phase and also in processing the phase function. The information in the short-time FT phase function can be extracted by processing the derivative of the FT phase, i.e., the group delay function. In this paper, the properties of the group delay functions are reviewed, highlighting the importance of the FT phase for representing information in the speech signal. Methods to process the group delay function are discussed to capture the characteristics of the vocal-tract system in the form of formants or through a modified group delay function. Applications of group delay functions for speech processing are discussed in some detail. They include segmentation of speech into syllable boundaries, exploiting the additive and high resolution properties of the group delay functions. The effectiveness of segmentation of speech, and the features derived from the modified group delay are demonstrated in applications such as language identification, speech recognition and speaker recognition. The paper thus demonstrates the need to exploit the potential of the group delay functions for development of speech systems.
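
    As a small illustrative sketch (not the modified group delay function discussed by the authors), the group delay of a windowed speech frame can be computed without explicit phase unwrapping from the identity tau(w) = Re{Y(w)/X(w)}, where Y is the Fourier transform of n·x[n].

      import numpy as np

      def group_delay(frame, n_fft=512):
          """Group delay of one windowed frame (in samples), computed as
          tau(w) = (X_R*Y_R + X_I*Y_I) / |X|^2 with Y = FFT of n*x[n]."""
          x = np.asarray(frame, dtype=float) * np.hamming(len(frame))
          y = np.arange(len(x)) * x
          X = np.fft.rfft(x, n_fft)
          Y = np.fft.rfft(y, n_fft)
          return (X.real * Y.real + X.imag * Y.imag) / (np.abs(X) ** 2 + 1e-12)

    Peaks of this function across frequency tend to align with vocal-tract resonances (formants), which is what makes group delay attractive for the segmentation and recognition applications listed above.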

  10. IITKGP-SESC: Speech Database for Emotion Analysis

    Science.gov (United States)

    Koolagudi, Shashidhar G.; Maity, Sudhamay; Kumar, Vuppala Anil; Chakrabarti, Saswat; Rao, K. Sreenivasa

    In this paper, we are introducing the speech database for analyzing the emotions present in speech signals. The proposed database is recorded in Telugu language using the professional artists from All India Radio (AIR), Vijayawada, India. The speech corpus is collected by simulating eight different emotions using the neutral (emotion free) statements. The database is named as Indian Institute of Technology Kharagpur Simulated Emotion Speech Corpus (IITKGP-SESC). The proposed database will be useful for characterizing the emotions present in speech. Further, the emotion specific knowledge present in speech at different levels can be acquired by developing the emotion specific models using the features from vocal tract system, excitation source and prosody. This paper describes the design, acquisition, post processing and evaluation of the proposed speech database (IITKGP-SESC). The quality of the emotions present in the database is evaluated using subjective listening tests. Finally, statistical models are developed using prosodic features, and the discrimination of the emotions is carried out by performing the classification of emotions using the developed statistical models.

  11. Orienting asymmetries in dogs' responses to different communicatory components of human speech.

    Science.gov (United States)

    Ratcliffe, Victoria F; Reby, David

    2014-12-15

    It is well established that in human speech perception the left hemisphere (LH) of the brain is specialized for processing intelligible phonemic (segmental) content (e.g., [1-3]), whereas the right hemisphere (RH) is more sensitive to prosodic (suprasegmental) cues. Despite evidence that a range of mammal species show LH specialization when processing conspecific vocalizations, the presence of hemispheric biases in domesticated animals' responses to the communicative components of human speech has never been investigated. Human speech is familiar and relevant to domestic dogs (Canis familiaris), who are known to perceive both segmental phonemic cues and suprasegmental speaker-related and emotional prosodic cues. Using the head-orienting paradigm, we presented dogs with manipulated speech and tones differing in segmental or suprasegmental content and recorded their orienting responses. We found that dogs showed a significant LH bias when presented with a familiar spoken command in which the salience of meaningful phonemic (segmental) cues was artificially increased but a significant RH bias in response to commands in which the salience of intonational or speaker-related (suprasegmental) vocal cues was increased. Our results provide insights into mechanisms of interspecific vocal perception in a domesticated mammal and suggest that dogs may share ancestral or convergent hemispheric specializations for processing the different functional communicative components of speech with human listeners.

  12. Detection of high-frequency energy level changes in speech and singing.

    Science.gov (United States)

    Monson, Brian B; Lotto, Andrew J; Story, Brad H

    2014-01-01

    Previous work has shown that human listeners are sensitive to level differences in high-frequency energy (HFE) in isolated vowel sounds produced by male singers. Results indicated that sensitivity to HFE level changes increased with overall HFE level, suggesting that listeners would be more "tuned" to HFE in vocal production exhibiting higher levels of HFE. It follows that sensitivity to HFE level changes should be higher (1) for female vocal production than for male vocal production and (2) for singing than for speech. To test this hypothesis, difference limens for HFE level changes in male and female speech and singing were obtained. Listeners showed significantly greater ability to detect level changes in singing vs speech but not in female vs male speech. Mean difference limen scores for speech and singing were about 5 dB in the 8-kHz octave (5.6-11.3 kHz) but 8-10 dB in the 16-kHz octave (11.3-22 kHz). These scores are lower (better) than those previously reported for isolated vowels and some musical instruments.
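
    For reference, the HFE level in a given octave band (for example the 5.6-11.3 kHz band mentioned above) can be estimated from a recording roughly as follows; this is a sketch only, and windowing, calibration, and averaging details are omitted.

      import numpy as np

      def band_level_db(signal, sr, lo_hz=5600.0, hi_hz=11300.0):
          """Relative level (dB) of the energy falling in one frequency band,
          estimated from the power spectrum of the whole signal."""
          spectrum = np.abs(np.fft.rfft(signal)) ** 2
          freqs = np.fft.rfftfreq(len(signal), 1.0 / sr)
          in_band = (freqs >= lo_hz) & (freqs < hi_hz)
          return 10.0 * np.log10(np.sum(spectrum[in_band]) + 1e-12)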

  13. Electroglottographic evaluation of age and gender effects during sustained phonation and connected speech.

    Science.gov (United States)

    Ma, Estella P-M; Love, Amanda L

    2010-03-01

    The aim of the present study was to evaluate the effects of age and gender on selected vocal fold vibratory behaviors during vowel prolongation and connected speech using electroglottography (EGG). Forty-six young and older individuals (23 males and 23 females) with normal voices participated in this study. EGG parameters including fundamental frequency and contact quotient were measured during sustained vowel prolongation and connected speech tasks. Significant age-by-gender interactions were found for both parameters. Moreover, results from discriminant function analyses revealed that the overall accuracies of the parameters in predicting different age and gender groups were higher for the connected speech tasks than for the sustained vowel prolongation task (89.1% and 73.9% for passage and phrase tasks vs 71.7% for vowel prolongation). These findings suggest that reliability of EGG measures can be affected by the test stimuli. Therefore, one should carefully consider the use of the speech material when assessing vocal fold behaviors using EGG. The findings also support the use of connected speech stimulus, preferably at passage level, in electroglottographic evaluation for a better representation of vocal fold vibrating behaviors. Copyright (c) 2010 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
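
    As an illustration of how a contact quotient is often derived from an EGG cycle (a criterion-level sketch; the 25% threshold convention here is an assumption and may differ from the method used in the study), the quotient is the fraction of the cycle during which the EGG amplitude exceeds a level placed between the cycle's minimum and maximum.

      import numpy as np

      def contact_quotient(egg_cycle, level=0.25):
          """Fraction of one glottal cycle during which the EGG amplitude exceeds a
          threshold set at `level` of the cycle's peak-to-peak range."""
          cycle = np.asarray(egg_cycle, dtype=float)
          threshold = cycle.min() + level * (cycle.max() - cycle.min())
          return float(np.mean(cycle > threshold))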

  14. Automated analysis of connected speech reveals early biomarkers of Parkinson's disease in patients with rapid eye movement sleep behaviour disorder.

    Science.gov (United States)

    Hlavnička, Jan; Čmejla, Roman; Tykalová, Tereza; Šonka, Karel; Růžička, Evžen; Rusz, Jan

    2017-02-02

    For generations, the evaluation of speech abnormalities in neurodegenerative disorders such as Parkinson's disease (PD) has been limited to perceptual tests or user-controlled laboratory analysis based upon rather small samples of human vocalizations. Our study introduces a fully automated method that yields significant features related to respiratory deficits, dysphonia, imprecise articulation and dysrhythmia from acoustic microphone data of natural connected speech for predicting early and distinctive patterns of neurodegeneration. We compared speech recordings of 50 subjects with rapid eye movement sleep behaviour disorder (RBD), 30 newly diagnosed, untreated PD patients and 50 healthy controls, and showed that subliminal parkinsonian speech deficits can be reliably captured even in RBD patients, which are at high risk of developing PD or other synucleinopathies. Thus, automated vocal analysis should soon be able to contribute to screening and diagnostic procedures for prodromal parkinsonian neurodegeneration in natural environments.

  15. Classroom acoustics design for speakers’ comfort and speech intelligibility: a European perspective

    DEFF Research Database (Denmark)

    Garcia, David Pelegrin; Rasmussen, Birgit; Brunskog, Jonas

    2014-01-01

    Current European regulatory requirements or guidelines for reverberation time in classrooms have the goal of enhancing speech intelligibility for students and reducing noise levels in classrooms. At the same time, school teachers frequently suffer from voice problems due to the high vocal load experienced at work. With the aim of improving teachers' working conditions, this paper proposes adjustments to current regulatory requirements on classroom acoustics in Europe, based on novel insights into classroom acoustics design that simultaneously meet criteria of vocal comfort for teachers and speech … are combined with a model of speech intelligibility based on the useful-to-detrimental ratio and empirical models of signal-to-noise ratio in classrooms in order to derive classroom acoustic guidelines, taking into account physical volume restrictions linked to the number of students present in a classroom…
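
    For context, the useful-to-detrimental ratio mentioned above is commonly defined (for speech, as U50) as the ratio of early-arriving speech energy to late-arriving energy plus noise; in LaTeX form, a standard textbook definition rather than necessarily the exact variant used in the paper:

      U_{50} = 10 \log_{10} \frac{E_{0\text{--}50\,\mathrm{ms}}}{E_{>50\,\mathrm{ms}} + E_{\mathrm{noise}}}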

  16. Ever decreasing circles: Speech production in semantic dementia.

    Science.gov (United States)

    Meteyard, Lotte; Quain, Emma; Patterson, Karalyn

    2014-06-01

    We explored the impact of a degraded semantic system on lexical, morphological and syntactic complexity in language production. We analysed transcripts from connected speech samples from eight patients with semantic dementia (SD) and eight age-matched healthy speakers. The frequency distributions of nouns and verbs were compared for hand-scored data and data extracted using text-analysis software. Lexical measures showed the predicted pattern for nouns and verbs in hand-scored data, and for nouns in software-extracted data, with fewer low frequency items in the speech of the patients relative to controls. The distribution of complex morpho-syntactic forms for the SD group showed a reduced range, with fewer constructions that required multiple auxiliaries and inflections. Finally, the distribution of syntactic constructions also differed between groups, with a pattern that reflects the patients' characteristic anomia and constraints on morpho-syntactic complexity. The data are in line with previous findings of an absence of gross syntactic errors or violations in SD speech. Alterations in the distributions of morphology and syntax, however, support constraint satisfaction models of speech production in which there is no hard boundary between lexical retrieval and grammatical encoding.

  17. Losing track of time? Processing of time reference inflection in agrammatic and healthy speakers of German.

    Science.gov (United States)

    Bos, Laura S; Hanne, Sandra; Wartenburger, Isabell; Bastiaanse, Roelien

    2014-12-01

    Individuals with agrammatic aphasia (IWAs) have problems with grammatical decoding of tense inflection. However, these difficulties depend on the time frame that the tense refers to. Verb morphology with reference to the past is more difficult than with reference to the non-past, because a link needs to be made to the past event in discourse, as captured in the PAst DIscourse LInking Hypothesis (PADILIH; Bastiaanse, R., Bamyaci, E., Hsu, C., Lee, J., Yarbay Duman, T., Thompson, C. K., 2011. Time reference in agrammatic aphasia: A cross-linguistic study. J. Neurolinguist. 24, 652-673). With respect to reference to the (non-discourse-linked) future, data so far indicate that IWAs experience less difficulties as compared to past time reference (Bastiaanse, R., Bamyaci, E., Hsu, C., Lee, J., Yarbay Duman, T., Thompson, C. K., 2011. Time reference in agrammatic aphasia: A cross-linguistic study. J. Neurolinguist. 24, 652-673), supporting the assumptions of the PADILIH. Previous online studies of time reference in aphasia used methods such as reaction times analysis (e.g., Faroqi-Shah, Y., Dickey, M. W., 2009. On-line processing of tense and temporality in agrammatic aphasia. Brain Lang. 108, 97-111). So far, no such study used eye-tracking, even though this technique can bring additional insights (Burchert, F., Hanne, S., Vasishth, S., 2013. Sentence comprehension disorders in aphasia: the concept of chance performance revisited. Aphasiology 27, 112-125, doi:10.1080/02687038.2012.730603). This study investigated (1) whether processing of future and past time reference inflection differs between non-brain-damaged individuals (NBDs) and IWAs, and (2) underlying mechanisms of time reference comprehension failure by IWAs. A visual-world experiment combining sentence-picture matching and eye-tracking was administered to 12 NBDs and 6 IWAs, all native speakers of German. Participants heard German sentences with periphrastic future ('will+V') or periphrastic past ('has

  18. Do morphophonological rules impact both regular and irregular verb inflection? Evidence from acquired morphological impairment

    Directory of Open Access Journals (Sweden)

    Stacey Rimikis

    2015-05-01

    Full Text Available Introduction The role of morphophonological rules in production is a frequent point of contention in competing theories of morphological processing. Dual-mechanism theories have posited that a single default rule (stem+ed) is used to produce the regular past tense, while all other past-tense forms are memorized and retrieved whole. However, research has suggested that a series of stochastic morphophonological rules plays a role in morphological productivity for both regular and irregular novel verbs (e.g. Albright & Hayes, 2003). Under this view, the likelihood of a given rule applying to a verb is partially dependent on its lexical support, measured as the number of phonologically similar verbs in the lexicon which take the same inflectional change (e.g. weep→wept, sweep→swept, etc.). To date, most evidence supporting this view has come from the morphological productivity of novel forms (e.g. Albright & Hayes, 2003) and from visual word recognition (Fruchter, Stockall, & Marantz, 2013). The present study extends this work by demonstrating that morphophonological rules impact production more generally, including the production of both regular and irregular past tense in RMI, an aphasic individual with an established morphological deficit (Rimikis & Buchwald, 2014). While we previously reported differences in production for irregular verbs with either high or low levels of lexical support, the current study further examined this phenomenon, and we found that the production of the past-tense form for both regular and irregular verbs was affected by the support for that verb’s inflectional rule. Subject RMI, 39, right-handed male with aphasia secondary to L-MCA CVA. His production includes frequent morphological errors across tasks, with semantic and phonological errors also occurring. Procedure RMI was administered a past-tense elicitation task. Sentence frames including regular and irregular verbs were presented verbally and visually (e.g.

  19. High speed digital phonoscopy of selected extreme vocalization (Conference Presentation)

    Science.gov (United States)

    Izdebski, Krzysztof; Blanco, Matthew; Di Lorenzo, Enrico; Yan, Yuling

    2017-02-01

    We used HSDP (KayPENTAX Model 9710, NJ, USA) to capture the kinematics of the vocal folds during the extreme vocalizations used by heavy metal performers. The vibrations of the VF were captured at 4000 f/s using a transoral rigid scope. Growl, scream and inhalatory phonations were recorded. Results showed that these extreme sounds are produced predominantly by supraglottic tissues rather than by the true vocal folds, which explains why these sounds do not injure the mucosa of the true vocal folds. In addition, the HSDI recordings were processed using custom software (Vocalizer®) that clearly demonstrated the contribution of each vocal fold to the generation of the sound.

  20. Correlation between vocal functions and glottal measurements in patients with unilateral vocal fold paralysis.

    Science.gov (United States)

    Inagi, K; Khidr, A A; Ford, C N; Bless, D M; Heisey, D M

    1997-06-01

    Observations and analysis of glottal characteristics are critical in choosing the best modality for surgery in patients with unilateral vocal fold paralysis (UVP). This study suggests that multiple glottal characteristics influence the vocal product in patients with UVP. In addition to the horizontal position of the paralyzed vocal fold (deviation from the midline), the glottal area, degree of bowing of the paralyzed and contralateral vocal folds, maximum separation between vocal folds, compensatory glottal maneuvers, and the vertical glottic closure plane significantly influenced the quality of the voice. Clinicians should be aware of these observations to facilitate treatment planning and assessment of the results of surgical procedures used to improve voice quality in cases of UVP.

  1. Evaluation of the Vocal Tract Length Normalization Based Classifiers for Speaker Verification

    Directory of Open Access Journals (Sweden)

    Walid Hussein

    2016-12-01

    Full Text Available This paper proposes and evaluates classifiers based on Vocal Tract Length Normalization (VTLN) in a text-dependent speaker verification (SV) task with short testing utterances. This type of task is important in commercial applications and is not easily addressed with methods designed for long utterances such as JFA and i-Vectors. In contrast, VTLN is a speaker compensation scheme that can lead to significant improvements in speech recognition accuracy with just a few seconds of speech samples. A novel scheme to generate new classifiers is employed by incorporating the observation vector sequence compensated with VTLN. The modified sequence of feature vectors and the corresponding warping factors are used to generate classifiers whose scores are combined by a Support Vector Machine (SVM) based SV system. The proposed scheme can provide an average reduction in EER equal to 14% when compared with the baseline system based on the likelihood of observation vectors.
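
    The score-fusion stage can be sketched as follows; this is a toy illustration with scikit-learn and simulated scores, not the authors' implementation. Per-utterance scores from several VTLN-based classifiers are stacked into a vector, an SVM combines them into a single verification score, and the equal error rate is read off the resulting ROC curve.

        import numpy as np
        from sklearn.svm import SVC
        from sklearn.metrics import roc_curve

        rng = np.random.default_rng(0)
        n_classifiers = 5                       # one score per VTLN-based classifier (assumed)

        # Simulated per-utterance score vectors for genuine and impostor trials.
        genuine = rng.normal(1.0, 1.0, size=(200, n_classifiers))
        impostor = rng.normal(-1.0, 1.0, size=(200, n_classifiers))
        scores = np.vstack([genuine, impostor])
        labels = np.concatenate([np.ones(200), np.zeros(200)])

        # The SVM fuses the individual classifier scores into one verification score.
        svm = SVC(kernel="linear").fit(scores, labels)
        fused = svm.decision_function(scores)

        # Equal error rate: operating point where false-accept and false-reject rates meet.
        fpr, tpr, _ = roc_curve(labels, fused)
        fnr = 1.0 - tpr
        eer = fpr[np.argmin(np.abs(fnr - fpr))]
        print(f"EER on the simulated scores: {eer:.3f}")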

  2. Acoustic Features and Perceptive Cues of Songs and Dialogues in Whistled Speech: Convergences with Sung Speech

    CERN Document Server

    Meyer, Julien

    2007-01-01

    Whistled speech is a little studied local use of language shaped by several cultures of the world either for distant dialogues or for rendering traditional songs. This practice consists of an emulation of the voice thanks to a simple modulated pitch. It is therefore the result of a transformation of the vocal signal that implies simplifications in the frequency domain. The whistlers adapt their productions to the way each language combines the qualities of height perceived simultaneously by the human ear in the complex frequency spectrum of the spoken or sung voice (pitch, timbre). As a consequence, this practice underlines key acoustic cues for the intelligibility of the concerned languages. The present study provides an analysis of the acoustic and phonetic features selected by whistled speech in several traditions either in purely oral whistles (Spanish, Turkish, Mazatec) or in whistles produced with an instrument like a leaf (Akha, Hmong). It underlines the convergences with the strategies of the singing ...

  3. The contribution of phonation type to the perception of vocal emotions in German: an articulatory synthesis study.

    Science.gov (United States)

    Birkholz, Peter; Martin, Lucia; Willmes, Klaus; Kröger, Bernd J; Neuschaefer-Rube, Christiane

    2015-03-01

    Vocal emotions are signaled by specific patterns of prosodic parameters, most notably pitch, phone duration, intensity, and phonation type. Phonation type was so far the least accessible parameter in emotion research, because it was difficult to extract from speech signals and difficult to manipulate in natural or synthetic speech. The present study built on recent advances in articulatory speech synthesis to exclusively control phonation type in re-synthesized German sentences spoken with seven different emotions. The goal was to find out to what extent the sole change of phonation type affects the perception of these emotions. Therefore, portrayed emotional utterances were re-synthesized with their original phonation type, as well as with each purely breathy, modal, and pressed phonation, and then rated by listeners with respect to the perceived emotions. Highly significant effects of phonation type on the recognition rates of the original emotions were found, except for disgust. While fear, anger, and the neutral emotion require specific phonation types for correct perception, sadness, happiness, boredom, and disgust primarily rely on other prosodic parameters. These results can help to improve the expression of emotions in synthesized speech and facilitate the robust automatic recognition of vocal emotions.

  4. Speech production, Psychology of

    NARCIS (Netherlands)

    Schriefers, H.J.; Vigliocco, G.

    2015-01-01

    Research on speech production investigates the cognitive processes involved in transforming thoughts into speech. This article starts with a discussion of the methodological issues inherent to research in speech production that illustrates how empirical approaches to speech production must differ fr

  5. Stuttering: A novel bullfrog vocalization

    Science.gov (United States)

    Simmons, Andrea; Suggs, Dianne

    2004-05-01

    The advertisement call of male bullfrogs (Rana catesbeiana) consists of a series of individual croaks, each of which contains multiple harmonics with a missing or attenuated fundamental frequency of approximately 100 Hz. The envelope of individual croaks has typically been represented in the literature as smooth and unmodulated. From an analysis of 5251 advertisement calls from 17 different choruses over two mating seasons, we show that males add an extra modulation (around 4 Hz) to the envelope of individual croaks, following specific rules. We term these extra modulations stutters. Neither single croak calls nor the first croak in multiple croak calls contains stutters. When stuttering begins, it does so with a croak containing a single stutter, and the number of stutters increases linearly (plus or minus 1 stutter, up to 4 stutters) with the number of croaks. This pattern is stable across individual males (N=10). Playback experiments reveal that vocal responses to stuttered and nonstuttered calls vary with proximity to the stimulus. Close males respond with nonstuttered calls, while far males respond with stuttered calls. The data suggest that nonstuttered calls are used for aggressive or territorial purposes, while stuttered calls are used to attract females.

  6. Elaborate Mimetic Vocal Displays by Female Superb Lyrebirds

    Directory of Open Access Journals (Sweden)

    Anastasia H Dalziell

    2016-04-01

    Full Text Available Some of the most striking vocalizations in birds are made by males that incorporate vocal mimicry in their sexual displays. Mimetic vocalization in females is largely undescribed, but it is unclear whether this is because of a lack of selection for vocal mimicry in females, or whether the phenomenon has simply been overlooked. These issues are thrown into sharp relief in the superb lyrebird, Menura novaehollandiae, a basal oscine passerine with a lek-like mating system and female uniparental care. The spectacular mimetic song display produced by courting male lyrebirds is a textbook example of a sexually selected trait, but the vocalizations of female lyrebirds are largely unknown. Here, we provide the first analysis of the structure and context of the vocalizations of female lyrebirds. Female lyrebirds were completely silent during courtship; however, females regularly produced sophisticated vocal displays incorporating both lyrebird-specific vocalizations and imitations of sounds within their environment. The structure of female vocalizations varied significantly with context. While foraging, females mostly produced a complex lyrebird-specific song, whereas they gave lyrebird-specific alarm calls most often during nest defense. Within their vocal displays females also included a variety of mimetic vocalizations, including imitations of the calls of dangerous predators, and of alarm calls and song of harmless heterospecifics. Females gave more mimetic vocalizations during nest defense than while foraging, and the types of sounds they imitated varied between these contexts, suggesting that mimetic vocalizations have more than one function. These results are inconsistent with previous portrayals of vocalizations by female lyrebirds as rare, functionless by-products of sexual selection on males. Instead, our results support the hypotheses that complex female vocalizations play a role in nest defense and mediate female-female competition for

  7. 78 FR 49717 - Speech-to-Speech and Internet Protocol (IP) Speech-to-Speech Telecommunications Relay Services...

    Science.gov (United States)

    2013-08-15

    ... reasons that STS has not been more widely utilized. Are people with speech disabilities not connected to... COMMISSION 47 CFR Part 64 Speech-to-Speech and Internet Protocol (IP) Speech-to-Speech Telecommunications Relay Services; Telecommunications Relay Services and Speech-to-Speech Services for Individuals With...

  8. Double Fourier analysis for Emotion Identification in Voiced Speech

    Science.gov (United States)

    Sierra-Sosa, D.; Bastidas, M.; Ortiz P., D.; Quintero, O. L.

    2016-04-01

    We propose a novel analysis alternative, based on two Fourier transforms, for emotion recognition from speech. Fourier analysis allows different signals to be displayed and synthesized in terms of power spectral density distributions. A spectrogram of the voice signal is obtained by performing a short-time Fourier transform with Gaussian windows; this spectrogram portrays frequency-related features, such as vocal tract resonances and quasi-periodic excitations during voiced sounds. Emotions induce such characteristics in speech, which become apparent in the spectrogram's time-frequency distribution. The time-frequency representation from the spectrogram is then treated as an image and processed through a two-dimensional Fourier transform in order to perform a spatial Fourier analysis of it. Finally, features related to emotions in voiced speech are extracted and presented.
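
    As a rough illustration of the two-stage transform (not the authors' code), the sketch below computes a Gaussian-windowed short-time Fourier transform of a synthetic voiced-like signal and then applies a two-dimensional FFT to the spectrogram magnitude; the signal and all parameter values are assumptions.

        import numpy as np
        from scipy.signal import stft

        fs = 16000                                    # sampling rate in Hz (assumed)
        t = np.arange(0, 1.0, 1.0 / fs)
        # Synthetic voiced-like signal: a 120 Hz tone with slow pitch modulation.
        x = np.sin(2 * np.pi * 120 * t + 2.0 * np.sin(2 * np.pi * 3 * t))

        # First Fourier stage: Gaussian-windowed STFT giving a time-frequency spectrogram.
        f, frames, Zxx = stft(x, fs=fs, window=("gaussian", 64), nperseg=512, noverlap=384)
        spectrogram = np.abs(Zxx)

        # Second Fourier stage: treat the spectrogram as an image and take its 2-D FFT.
        double_fourier = np.fft.fftshift(np.fft.fft2(spectrogram))

        # Emotion-related features would then be extracted from this 2-D spectrum,
        # e.g. the location and spread of its dominant components.
        print(spectrogram.shape, np.abs(double_fourier).shape)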

  9. Modeling Co-evolution of Speech and Biology.

    Science.gov (United States)

    de Boer, Bart

    2016-04-01

    Two computer simulations are investigated that model interaction of cultural evolution of language and biological evolution of adaptations to language. Both are agent-based models in which a population of agents imitates each other using realistic vowels. The agents evolve under selective pressure for good imitation. In one model, the evolution of the vocal tract is modeled; in the other, a cognitive mechanism for perceiving speech accurately is modeled. In both cases, biological adaptations to using and learning speech evolve, even though the system of speech sounds itself changes at a more rapid time scale than biological evolution. However, the fact that the available acoustic space is used maximally (a self-organized result of cultural evolution) is constant, and therefore biological evolution does have a stable target. This work shows that when cultural and biological traits are continuous, their co-evolution may lead to cognitive adaptations that are strong enough to detect empirically.

  10. Speech Enhancement

    DEFF Research Database (Denmark)

    Benesty, Jacob; Jensen, Jesper Rindom; Christensen, Mads Græsbøll;

    Speech enhancement is a classical problem in signal processing, yet still largely unsolved. Two of the conventional approaches for solving this problem are linear filtering, like the classical Wiener filter, and subspace methods. These approaches have traditionally been treated as different classes of methods and have been introduced in somewhat different contexts. Linear filtering methods originate in stochastic processes, while subspace methods have largely been based on developments in numerical linear algebra and matrix approximation theory. This book bridges the gap between these two classes of methods by showing how the ideas behind subspace methods can be incorporated into traditional linear filtering. In the context of subspace methods, the enhancement problem can then be seen as a classical linear filter design problem. This means that various solutions can more easily be compared...
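
    To make the linear-filtering side of this picture concrete, here is a minimal single-channel spectral Wiener-type gain, sketched under the assumption that a noise power spectrum can be estimated from a speech-free stretch of the recording; it illustrates the classical approach in general, not code from the book.

        import numpy as np
        from scipy.signal import stft, istft

        def wiener_enhance(noisy, fs, noise_psd, nperseg=512):
            """Apply a simple spectral Wiener-type gain to a noisy speech signal."""
            _, _, Y = stft(noisy, fs=fs, nperseg=nperseg)
            noisy_psd = np.abs(Y) ** 2
            # Wiener gain: estimated clean-to-noisy power ratio, floored to limit musical noise.
            gain = np.maximum(1.0 - noise_psd[:, None] / np.maximum(noisy_psd, 1e-12), 0.1)
            _, enhanced = istft(gain * Y, fs=fs, nperseg=nperseg)
            return enhanced

        # Toy usage: white noise added to a tone; the noise PSD is estimated from a
        # noise-only stretch, which is assumed to be available.
        fs = 8000
        t = np.arange(0, 2.0, 1.0 / fs)
        clean = 0.5 * np.sin(2 * np.pi * 220 * t)
        noise = 0.2 * np.random.default_rng(0).standard_normal(t.size)

        _, _, N = stft(noise[: fs // 2], fs=fs, nperseg=512)
        noise_psd = np.mean(np.abs(N) ** 2, axis=1)
        enhanced = wiener_enhance(clean + noise, fs, noise_psd)
        print(enhanced.shape)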

  11. Speech therapy with obturator.

    Science.gov (United States)

    Shyammohan, A; Sreenivasulu, D

    2010-12-01

    Rehabilitation of speech is tantamount to closure of the defect in cases of velopharyngeal insufficiency. Often the importance of speech therapy is sidelined during the fabrication of obturators. Usually the speech part is taken up only at a later stage and relegated entirely to a speech therapist, without the active involvement of the prosthodontist. The article suggests a protocol for speech therapy in such cases, to be carried out in unison with the prosthodontist.

  12. Prosodic constraints on inflected words: an area of difficulty for German-speaking children with specific language impairment?

    Science.gov (United States)

    Kauschke, Christina; Renner, Lena; Domahs, Ulrike

    2013-08-01

    Recent studies suggest that morphosyntactic difficulties may result from prosodic problems. We therefore address the interface between inflectional morphology and prosody in typically developing children (TD) and children with SLI by testing whether these groups are sensitive to prosodic constraints that guide plural formation in German. A plural elicitation task was designed consisting of 60 words and 20 pseudowords. The performance of 14 German-speaking children with SLI (mean age 7.5) was compared to age-matched controls and to younger children matched for productive vocabulary. TD children performed significantly better than children with SLI. Error analyses revealed that children with SLI produced more forms that did not meet the optimal shape of a noun plural. Beyond the fact that children with SLI have deficits in plural marking, the findings suggest that they also show reduced sensitivity to prosodic requirements. In other words, the prosodic structure of inflected words seems to be vulnerable in children with SLI.

  13. Using Thorax Expansion to Detect a Ventilatory Inflection Point in the Field.

    Science.gov (United States)

    Heyde, C; Mahler, H; Gollhofer, A; Roecker, K

    2016-01-01

    Assessing an individual's physical fitness can usually be achieved through evaluating lactate or ventilatory thresholds. Unfortunately, the detection of ventilatory thresholds still requires uncomfortable mass flow sensors and a laboratory setting. Therefore, this study aimed to evaluate a ventilatory inflection point (VIP) derived from thorax expansion as a useful surrogate to assess an individual's physical fitness under field conditions. A total of 348 and 107 ramp tests were selected to examine the validity and the retest variability of the VIP, respectively. The individual anaerobic threshold (IAT), determined by means of blood lactate sampling, was used as a reliable basis for evaluation. Calibrated respiratory inductance plethysmography (RIP) was utilized to derive ventilation from thorax expansion during the ramp test. An automated software routine was applied to detect the VIP. Speed, heart rate and ventilation at the VIP correlated significantly with the corresponding values at the IAT (r=0.840, 0.876, 0.933). Non-systematic differences between repeated testing ranged within ±1.15 km·h(-1), ±8.74 b·min(-1) and ±12.69 l·min(-1) (±1.96 SD). The timing of the VIP is not solely dependent on aerobic capacity and might instead quantify an individual's physical fitness in terms of the efficiency of the compensative and supportive ventilatory response during increased exercise intensities. © Georg Thieme Verlag KG Stuttgart · New York.
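
    The automated detection step can be approximated by fitting a two-segment piecewise-linear model to the ventilation trace and taking the fitted breakpoint as the inflection point. The sketch below does this with scipy on synthetic data; it illustrates the idea only and is not the authors' software routine.

        import numpy as np
        from scipy.optimize import curve_fit

        def two_segment(x, breakpoint, intercept, slope1, slope2):
            """Continuous piecewise-linear model with a single breakpoint."""
            return np.where(
                x < breakpoint,
                intercept + slope1 * x,
                intercept + slope1 * breakpoint + slope2 * (x - breakpoint),
            )

        # Synthetic ramp-test ventilation trace (arbitrary units) with a steeper rise after ~6 min.
        time = np.linspace(0.0, 10.0, 300)
        rng = np.random.default_rng(1)
        ventilation = two_segment(time, 6.0, 20.0, 4.0, 12.0) + rng.normal(0.0, 2.0, time.size)

        params, _ = curve_fit(two_segment, time, ventilation, p0=[5.0, 20.0, 5.0, 10.0])
        print(f"Estimated ventilatory inflection point: {params[0]:.2f} min into the ramp")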

  14. The rise of complex phenomena in Cournot duopoly games due to demand functions without inflection points

    Science.gov (United States)

    Askar, Sameh

    2014-06-01

    In this paper, we propose Cournot duopoly games in which quantity-setting firms face non-linear demand functions that have no inflection points. Two different kinds of repeated games are introduced, based on the firms' rationality process and on Puu's incomplete-information approach. First, a model of two rational firms that compete and produce homogeneous commodities is introduced. The equilibrium points of this model are obtained and their dynamical characteristics, such as stability, bifurcation and chaos, are investigated. Under the rationality process, firms do not need to solve any optimization problem; they adjust their production based on an estimate of the marginal profit. A second model is then introduced using Puu's incomplete-information approach. As in the first model, the equilibrium points are obtained and their dynamical characteristics are investigated. Under Puu's approach, firms only need to know their profits and the quantities produced in the past two periods. We compare the properties of the two models under the two approaches. The paper extends and generalizes the results of other authors who consider similar processes.
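
    A minimal numerical sketch of the marginal-profit ("rationality") adjustment process for two quantity-setting firms is given below, using the iso-elastic demand p = 1/(q1 + q2), which has no inflection point for positive output; the cost and adjustment-speed values are illustrative assumptions, not the parameters studied in the paper.

        import numpy as np

        def marginal_profit(q_own, q_other, cost):
            """Marginal profit under iso-elastic demand p = 1/(q_own + q_other) and constant unit cost."""
            total = q_own + q_other
            return q_other / total ** 2 - cost

        def simulate(k1=0.3, k2=0.3, c1=0.4, c2=0.5, q1=0.3, q2=0.3, steps=200):
            """Gradient ('rationality') adjustment: each firm moves in the direction of its marginal profit."""
            path = []
            for _ in range(steps):
                q1_next = q1 + k1 * q1 * marginal_profit(q1, q2, c1)
                q2_next = q2 + k2 * q2 * marginal_profit(q2, q1, c2)
                q1, q2 = max(q1_next, 1e-9), max(q2_next, 1e-9)   # keep outputs positive
                path.append((q1, q2))
            return np.array(path)

        # For small adjustment speeds the map settles near the Cournot-Nash point
        # q1* = c2/(c1+c2)^2, q2* = c1/(c1+c2)^2; larger speeds can produce cycles and chaos.
        print(simulate()[-1])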

  15. The Riemann Problem for Hyperbolic Equations under a Nonconvex Flux with Two Inflection Points

    CERN Document Server

    Fossati, Marco

    2014-01-01

    This report addresses the solution of Riemann problems for hyperbolic equations when the nonlinear characteristic fields lose their genuine nonlinearity. In this context, exact solvers for nonconvex 1D Riemann problems are developed. First, a scalar conservation law for a nonconvex flux with two inflection points is studied. Then the P-system for an isothermal version of the van der Waals gas model is examined in a range of temperatures allowing for a nonconvex pressure function. Finally, the system of the Euler equations of gasdynamics is considered for the polytropic van der Waals gas. In this case, a suitably large specific heat is considered such that the isentropes display a local loss of convexity near the saturation curve and the critical point. Such a nonconvex physical model allows for nonclassical waves to appear as a result of the change of sign of the fundamental derivative of gasdynamics. The solution of the Riemann problem for the considered real gas model reduces to a system of two nonlinear ...
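
    As a concrete scalar illustration of this setting (an example chosen here, not necessarily the flux used in the report), consider

        \[
        u_t + f(u)_x = 0, \qquad f(u) = \tfrac{1}{4}u^4 - \tfrac{1}{2}u^2, \qquad f''(u) = 3u^2 - 1,
        \]

    which loses convexity between its two inflection points $u = \pm 1/\sqrt{3}$. For a Riemann problem with states $u_L$ and $u_R$, the entropy solution follows from the lower convex envelope of $f$ on $[u_L, u_R]$ when $u_L < u_R$ (the upper concave envelope when $u_L > u_R$): where the envelope coincides with $f$ the wave is a rarefaction fan, where it is a chord the wave is a shock, and composite shock-rarefaction waves appear across the nonconvex region.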

  16. Viscoelastic properties of the false vocal fold

    Science.gov (United States)

    Chan, Roger W.

    2004-05-01

    The biomechanical properties of vocal fold tissues have been the focus of many previous studies, as vocal fold viscoelasticity critically dictates the acoustics and biomechanics of phonation. However, not much is known about the viscoelastic response of the ventricular fold or false vocal fold. It has been shown both clinically and in computer simulations that the false vocal fold may contribute significantly to the aerodynamics and sound generation processes of human voice production, with or without flow-induced oscillation of the false fold. To better understand the potential role of the false fold in phonation, this paper reports some preliminary measurements on the linear and nonlinear viscoelastic behavior of false vocal fold tissues. Linear viscoelastic shear properties of human false fold tissue samples were measured by a high-frequency controlled-strain rheometer as a function of frequency, and passive uniaxial tensile stress-strain response of the tissue samples was measured by a muscle lever system as a function of strain and loading rate. Elastic moduli (Young's modulus and shear modulus) of the false fold tissues were calculated from the measured data. [Work supported by NIH.]

  17. Vocal effort and voice handicap among teachers.

    Science.gov (United States)

    Sampaio, Márcio Cardoso; dos Reis, Eduardo José Farias Borges; Carvalho, Fernando Martins; Porto, Lauro Antonio; Araújo, Tânia Maria

    2012-11-01

    The relationship between voice handicap and professional vocal effort was investigated among teachers in a cross-sectional study of census nature on 4496 teachers within the public elementary education network in Salvador, Bahia, Brazil. Voice handicap (the outcome of interest) was evaluated using the Voice Handicap Index 10. The main exposure, the lifetime vocal effort index, was obtained as the product of the number of years working as a teacher multiplied by the mean weekly working hours. The prevalence of voice handicap was 28.8% among teachers with high professional vocal effort and 21.3% among those with acceptable vocal effort, thus yielding a crude prevalence ratio (PR) of 1.36 (95% confidence interval [CI]=1.14-1.61). In the final logistic model, the prevalence of voice handicap was statistically associated with the professional vocal effort index (PR=1.47; 95% CI=1.19-1.82), adjusted according to sex, microphone availability in the classroom, excessive noise, pressure from the school management, heartburn, and rhinitis.

  18. Wavelet based detection of manatee vocalizations

    Science.gov (United States)

    Gur, Berke M.; Niezrecki, Christopher

    2005-04-01

    The West Indian manatee (Trichechus manatus latirostris) has become endangered partly because of watercraft collisions in Florida's coastal waterways. Several boater warning systems, based upon manatee vocalizations, have been proposed to reduce the number of collisions. Three detection methods based on the Fourier transform (threshold, harmonic content and autocorrelation methods) were previously suggested and tested. In the last decade, the wavelet transform has emerged as an alternative to the Fourier transform and has been successfully applied in various fields of science and engineering including the acoustic detection of dolphin vocalizations. As of yet, no prior research has been conducted in analyzing manatee vocalizations using the wavelet transform. Within this study, the wavelet transform is used as an alternative to the Fourier transform in detecting manatee vocalizations. The wavelet coefficients are analyzed and tested against a specified criterion to determine the existence of a manatee call. The performance of the method presented is tested on the same data previously used in the prior studies, and the results are compared. Preliminary results indicate that using the wavelet transform as a signal processing technique to detect manatee vocalizations shows great promise.
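
    A minimal sketch of such a criterion-based wavelet detector is given below (using PyWavelets, with the frame length, wavelet, band index and threshold factor chosen arbitrarily here rather than taken from the study): each frame is decomposed with a discrete wavelet transform and flagged when the energy in a selected detail band exceeds a multiple of the median band energy.

        import numpy as np
        import pywt

        def wavelet_detector(signal, fs, frame_len=0.25, wavelet="db4", level=4, band=2, factor=10.0):
            """Flag frames whose energy in one detail band exceeds `factor` times the median band energy."""
            frame = int(frame_len * fs)
            starts = range(0, len(signal) - frame + 1, frame)
            band_energy = []
            for start in starts:
                coeffs = pywt.wavedec(signal[start:start + frame], wavelet, level=level)
                # coeffs = [cA4, cD4, cD3, cD2, cD1]; index 2 (cD3) covers roughly 3-6 kHz at fs = 48 kHz.
                band_energy.append(np.sum(coeffs[band] ** 2))
            band_energy = np.array(band_energy)
            criterion = factor * np.median(band_energy)
            return [start / fs for start, e in zip(starts, band_energy) if e > criterion]

        # Toy usage: a 4 kHz tone burst standing in for a call, embedded in background noise.
        fs = 48000
        t = np.arange(0, 3.0, 1.0 / fs)
        rng = np.random.default_rng(0)
        noise = 0.05 * rng.standard_normal(t.size)
        call = 0.5 * np.sin(2 * np.pi * 4000 * t) * ((t > 1.2) & (t < 1.45))
        print(wavelet_detector(noise + call, fs))   # expected to flag the frames overlapping the call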

  19. Impact of Vocal Tract Resonance on the Perception of Voice Quality Changes Caused by Varying Vocal Fold Stiffness

    Science.gov (United States)

    Signorello, Rosario; Zhang, Zhaoyan; Gerratt, Bruce; Kreiman, Jody

    2016-01-01

    Summary Experiments using animal and human larynx models are often conducted without a vocal tract. While it is often assumed that the absence of a vocal tract has only small effects on vocal fold vibration, it is not actually known how sound production and quality are affected. In this study, the validity of using data obtained in the absence of a vocal tract for voice perception studies was investigated. Using a two-layer self-oscillating physical model, three series of voice stimuli were created: one produced with conditions of left-right symmetric vocal fold stiffness, and two with left-right asymmetries in vocal fold body stiffness. Each series included a set of stimuli created with a physical vocal tract, and a second set created without a physical vocal tract. Stimuli were re-synthesized to equalize the mean F0 for each series and normalized for amplitude. Listeners were asked to evaluate the three series in a sort-and-rate task. Multidimensional scaling analysis was applied to examine the perceptual interaction between the voice source and the vocal tract resonances. The results showed that the presence or absence of a vocal tract can significantly affect perception of voice quality changes due to parametric changes in vocal fold properties, except when the parametric changes in vocal fold properties produced an abrupt shift in vocal fold vibratory pattern resulting in a salient quality change. PMID:27134616

  20. Evaluation of Synthetic Self-Oscillating Models of the Vocal Folds

    Science.gov (United States)

    Hubler, Elizabeth P.; Weiland, Kelley S.; Hancock, Adrienne B.; Plesniak, Michael W.

    2013-11-01

    Approximately 30% of people will suffer from a voice disorder at some point in their lives. The probability doubles for those who rely heavily on their voice, such as teachers and singers. Synthetic vocal fold (VF) models are fabricated and evaluated experimentally in a vocal tract simulator to replicate physiological conditions. Pressure measurements are acquired along the vocal tract and high-speed images are captured at varying flow rates during VF oscillation to facilitate understanding of the characteristics of healthy and damaged VFs. The images are analyzed using a videokymography line-scan technique that has been used to examine VF motion and mucosal wave dynamics in vivo. Clinically relevant parameters calculated from the volume-velocity output of a circumferentially-vented mask (Rothenberg mask) are compared to patient data. This study integrates speech science with engineering and flow physics to overcome current limitations of synthetic VF models to properly replicate normal phonation in order to advance the understanding of resulting flow features, progression of pathological conditions, and medical techniques. Supported by the GW Institute for Biomedical Engineering (GWIBE) and GW Center for Biomimetics and Bioinspired Engineering (COBRE).

  1. Adaptation to delayed auditory feedback induces the temporal recalibration effect in both speech perception and production.

    Science.gov (United States)

    Yamamoto, Kosuke; Kawabata, Hideaki

    2014-12-01

    We ordinarily speak fluently, even though our perceptions of our own voices are disrupted by various environmental acoustic properties. The underlying mechanism of speech is supposed to monitor the temporal relationship between speech production and the perception of auditory feedback, as suggested by a reduction in speech fluency when the speaker is exposed to delayed auditory feedback (DAF). While many studies have reported that DAF influences speech motor processing, its relationship to the temporal tuning effect on multimodal integration, or temporal recalibration, remains unclear. We investigated whether the temporal aspects of both speech perception and production change due to adaptation to the delay between the motor sensation and the auditory feedback. This is a well-used method of inducing temporal recalibration. Participants continually read texts with specific DAF times in order to adapt to the delay. Then, they judged the simultaneity between the motor sensation and the vocal feedback. We measured the rates of speech with which participants read the texts in both the exposure and re-exposure phases. We found that exposure to DAF changed both the rate of speech and the simultaneity judgment, that is, participants' speech gained fluency. Although we also found that a delay of 200 ms appeared to be most effective in decreasing the rates of speech and shifting the distribution on the simultaneity judgment, there was no correlation between these measurements. These findings suggest that both speech motor production and multimodal perception are adaptive to temporal lag but are processed in distinct ways.

  2. Fonoterapia vocal e fisioterapia respiratória com idosos saudáveis: revisão de literatura

    Directory of Open Access Journals (Sweden)

    Carla Aparecida Cielo

    2016-04-01

    Full Text Available ABSTRACT This study addresses vocal speech therapy and respiratory physiotherapy in healthy elderly people. The aim of the present study was to review the literature on vocal speech therapy and on respiratory physiotherapy with healthy elderly people. A bibliographic survey was carried out of articles published between 2004 and 2014 in the Lilacs, Bireme, MedLine, PubMed and Scielo databases. Descriptors used: physical therapy specialty; breathing; speech therapy; aged; therapeutics and voice. The literature on vocal speech therapy with healthy elderly people showed that interventions have been carried out through vocal guidance; traditional vocal therapy for presbyphonia, regardless of the type of intervention; specific vocal exercises; and standardized therapeutic programs, with evidence of improvement in auditory-perceptual and acoustic vocal measures, vocal self-assessment and laryngeal imaging. Regarding respiratory physiotherapy, the literature showed that the techniques or procedures used with healthy elderly people were: use of the Threshold device, manual therapy techniques, incentive spirometry, breathing exercises combined with trunk and lower-limb movements, and physical activity in general, with evidence of improvement in respiratory muscle strength, pulmonary function and functional autonomy of the elderly.

  3. Condition of Vocal Production-Teacher questionnaire: comparison of responses on Likert scale and visual analog scale.

    Science.gov (United States)

    Giannini, Susana Pimentel Pinto; Latorre, Maria do Rosário Dias de Oliveira; Ferreira, Léslie Piccolotto

    2016-01-01

    To compare the responses related to vocal symptoms in two versions of the Vocal Production Condition - Teacher (CPV-T) questionnaire, with responses on a Likert scale and a Visual Analog Scale (VAS), in order to evaluate which is the better measurement method. A cross-sectional observational study was conducted with teachers with voice disorders during the period from July 2011 to July 2012. All teachers answered the CPV-T in two versions: with answers on a 4-point Likert scale and on a 50-mm VAS. The answers related to the vocal symptoms dimension were analyzed. Most of the symptoms showed good (hoarseness, high-pitched voice, unstable voice, weak voice, effort when speaking, throat clearing, burning throat, and pain when speaking) or regular concordance (loss of voice, failing voice, low-pitched voice, vocal fatigue, dry throat, lump in the throat, secretion in the throat, pain when swallowing, difficulty swallowing, and dry cough). The CPV-T questionnaire with answers on the Likert scale proved to be more suitable than the VAS owing to the ease of understanding and interpretation, in addition to facilitating the input of answers for the researcher. Therefore, the Likert scale was chosen for the CPV-T and is considered validated as the method for measuring the responses. The dimension of vocal aspects evaluated in the present study, the Voice Disorder Screening Index (ITDV), can be used in epidemiological studies to estimate the prevalence of vocal symptoms and in the Speech-Language Pathology and Audiology clinic routine or in monitoring teachers throughout their careers.

  4. Signs of a developing grammar : subject drop and inflection in early child Dutch

    NARCIS (Netherlands)

    Van Geert, P

    2004-01-01

    The objective of this article is to describe and explain changes in subject drop in early child language. The empirical basis of this study is provided by longitudinal spontaneous speech data from six monolingual children acquiring Dutch. The samples begin when the children enter the two-word stage,

  5. A case description of speech disturbance and treatment following corrective surgery for stress velopharyngeal incompetence.

    Science.gov (United States)

    Macrae, Toby; Stierwalt, Julie A G; Behel, Kensley A

    2015-01-01

    The purpose of this study was to determine the effectiveness of a motor learning guided (MLG) approach to speech treatment in a unique case of speech disturbance following surgery for stress velopharyngeal incompetence (SVPI). The patient was a 20-year-old female college student. Treatment took place over 6 sessions and focused on eliciting productions through a hierarchy of clinician support, with an emphasis on self-evaluation and -correction. Acoustic measurements and ratings from the treating clinician and unfamiliar listeners revealed a speech disturbance following surgery that was corrected following speech treatment. The patient's main difficulty appeared to be in producing the vocalic/postvocalic approximant, /r/, although vowel distortions were also noted. These difficulties may be explained by the structural alteration and formation of scar tissue as a result of surgery. The results provide initial support for an MLG approach to treating an acquired speech disturbance following SVPI surgery; however, additional research is warranted.

  6. Why the Left Hemisphere Is Dominant for Speech Production: Connecting the Dots

    Directory of Open Access Journals (Sweden)

    Harvey Martin Sussman

    2015-12-01

    Full Text Available Evidence from seemingly disparate areas of speech/language research is reviewed to form a unified theoretical account for why the left hemisphere is specialized for speech production. Research findings from studies investigating hemispheric lateralization of infant babbling, the primacy of the syllable in phonological structure, rhyming performance in split-brain patients, rhyming ability and phonetic categorization in children diagnosed with developmental apraxia of speech, rules governing exchange errors in spoonerisms, organizational principles of neocortical control of learned motor behaviors, and multi-electrode recordings of human neuronal responses to speech sounds are described and common threads highlighted. It is suggested that the emergence, in developmental neurogenesis, of a hard-wired, syllabically-organized, neural substrate representing the phonemic sound elements of one’s language, particularly the vocalic nucleus, is the crucial factor underlying the left hemisphere’s dominance for speech production.

  7. The effects of speech motor preparation on auditory perception

    Science.gov (United States)

    Myers, John

    Perception and action are coupled via bidirectional relationships between sensory and motor systems. Motor systems influence sensory areas by imparting a feedforward influence on sensory processing termed "motor efference copy" (MEC). MEC is suggested to occur in humans because speech preparation and production modulate neural measures of auditory cortical activity. However, it is not known if MEC can affect auditory perception. We tested the hypothesis that during speech preparation auditory thresholds will increase relative to a control condition, and that the increase would be most evident for frequencies that match the upcoming vocal response. Participants performed trials in a speech condition that contained a visual cue indicating a vocal response to prepare (one of two frequencies), followed by a go signal to speak. To determine threshold shifts, voice-matched or -mismatched pure tones were presented at one of three time points between the cue and target. The control condition was the same except the visual cues did not specify a response and subjects did not speak. For each participant, we measured f0 thresholds in isolation from the task in order to establish baselines. Results indicated that auditory thresholds were highest during speech preparation, relative to baselines and a non-speech control condition, especially at suprathreshold levels. Thresholds for tones that matched the frequency of planned responses gradually increased over time, but sharply declined for the mismatched tones shortly before targets. Findings support the hypothesis that MEC influences auditory perception by modulating thresholds during speech preparation, with some specificity relative to the planned response. The threshold increase in tasks vs. baseline may reflect attentional demands of the tasks.

  8. “I puts it away”—early proto morphological ways of inflecting verbs in a child acquiring Saami

    Directory of Open Access Journals (Sweden)

    Johanna Ijäs

    2010-05-01

    Full Text Available In this article, some protomorphological ways of inflecting verbs in the acquisition of Saami at age 1;8–3;0 are discussed. The main focus is the forms for the first person singular present indicative. The child uses adult-like forms throughout the period following the emergence of the first person singular at age 1;9–1;10, but in addition to the adult-like forms, there are three main ways of inflecting verbs in the child’s language. Using the stem without the -n suffix is a typical way of inflection at the age of 2;0–2;8. Almost at the same time, at 2;1–2;9, the forms for the third person singular present indicative are used instead of the adult-like forms for the first person singular. The third difference compared to adult language is the strong gradation of consonant centres in bisyllabic verbs. DOI: http://dx.doi.org/10.5128/ERYa6.06

  9. Vocal Hygiene Habits and Vocal Handicap Among Conservatory Students of Classical Singing.

    Science.gov (United States)

    Achey, Meredith A; He, Mike Z; Akst, Lee M

    2016-03-01

    This study sought to assess classical singing students' compliance with vocal hygiene practices identified in the literature and to explore the relationship between self-reported vocal hygiene practice and self-reported singing voice handicap in this population. The primary hypothesis was that increased attention to commonly recommended vocal hygiene practices would correlate with reduced singing voice handicap. This is a cross-sectional, survey-based study. An anonymous survey assessing demographics, attention to 11 common vocal hygiene recommendations in both performance and nonperformance periods, and the Singing Voice Handicap Index 10 (SVHI-10) was distributed to classical singing teachers to be administered to their students at two major schools of music. Of the 215 surveys distributed, 108 were returned (50.2%), of which 4 were incomplete and discarded from analysis. Conservatory students of classical singing reported a moderate degree of vocal handicap (mean SVHI-10, 12; range, 0-29). Singers reported considering all 11 vocal hygiene factors more frequently when preparing for performances than when not preparing for performances. Of these, significant correlations with increased handicap were identified for consideration of stress reduction in nonperformance (P = 0.01) and performance periods (P = 0.02) and with decreased handicap for consideration of singing voice use in performance periods alone (P = 0.02). Conservatory students of classical singing report more assiduous attention to vocal hygiene practices when preparing for performances and report moderate degrees of vocal handicap overall. These students may have elevated risk for dysphonia and voice disorders which is not effectively addressed through common vocal hygiene recommendations alone. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  10. Sequencing, speech production, and selective effects of aging on phonological and morphological speech errors.

    Science.gov (United States)

    MacKay, Donald G; James, Lori E

    2004-03-01

    To test age-linked predictions of node structure theory (NST) and other theories, young and older adults performed a task that elicited large numbers of phonological and morphological speech errors. Stimuli were visually presented words containing either /p/ or /b/, and participants changed the /p/ to /b/ or vice versa and produced the resulting word as quickly as possible. For example, the correct response was "bunk" for the stimulus PUNK, and "ripped" for RIBBED. Consistent with NST predictions, the elicited speech errors exhibited selective effects of aging. Some error types decreased with aging. For example, young adults produced more nonsequential substitution errors (as a percentage of total errors) than older adults (e.g., intended bills misproduced as "gills"). However, other error types remained constant or increased with aging. For example, older adults produced more omission errors than young adults, especially omissions involving inflectional endings (e.g., intended ripped misproduced as "rip"). In addition, older adults exhibited special difficulties with 2 types of phonological and morphological sequencing processes.

  11. Pre-linguistic children with cleft palate: growth of gesture, vocalization, and word use.

    Science.gov (United States)

    Scherer, Nancy J; Boyce, Sarah; Martin, Gerri

    2013-12-01

    Children with cleft lip and/or palate show early delays in speech and vocabulary development that may have an impact on later communication and social development. While delays in the complexity of babbling may put children at risk for later delays in speech and language development, there is considerable variability in development. This study focused on the rate of children's communication acts, canonical vocalizations, and word use as they made the transition from pre-linguistic to linguistic development. The study included 15 children with non-syndromic cleft lip and/or palate who were seen at three time points between 17 and 34 months of age. Communication rates were calculated from parent-child language samples collected during play activities. Assignment to linguistic stages was based on the children's expressive vocabulary, as reported on the MacArthur Communicative Development Inventory: Words and Sentences. From the pre-linguistic to the linguistic level, the children's average rates increased significantly: communicative acts overall from 1.49 to 3.07 per minute, canonical vocalizations from 0.21 to 0.90 per minute, and word use from 0.16 to 3.61 per minute. Rates of communicative acts were associated with later word use. It appears that children with clefts rely on non-verbal communicative acts when verbal development is delayed.

  12. Cost and complexity: selection for speech and language.

    Science.gov (United States)

    Locke, John L

    2008-04-21

    The handicap principle has been applied to a number of different traits in the last three decades, but it is difficult to characterize its record, or even its perceived relevance, when it comes to an important human attribute: spoken language. In some cases, assumptions regarding the energetic cost of speech, and the veracity of linguistically encoded messages, have failed to recognize critical aspects of human development, cognition, and social ecology. In other cases, the fact that speech contains honest (physiological) information, and tends to be used honestly with family and friends, has been overlooked. Speech and language are functionally related but they involve different resources. Individuals can increase the attractiveness of their speech, and of more stylized vocal and verbal performances, without enhancing linguistic structure or content; and they can modify their use of language without significant changes in the physical form of speech. That its production costs are normally low enables speech to be produced extravagantly in bids for status and mating relationships, and in evolution, may have allowed its content (linguistic knowledge and structure) to become complex.

  13. Representation Learning Based Speech Assistive System for Persons With Dysarthria.

    Science.gov (United States)

    Chandrakala, S; Rajeswari, Natarajan

    2017-09-01

    An assistive system for persons with vocal impairment due to dysarthria converts dysarthric speech to normal speech or text. Because of the articulatory deficits, dysarthric speech recognition needs a robust learning technique. Representation learning is significant for complex tasks such as dysarthric speech recognition. We focus on a robust representation for dysarthric speech recognition that involves recognizing sequential patterns of varying-length utterances. We propose a hybrid framework that uses a generative learning based data representation with a discriminative learning based classifier. In this hybrid framework, we propose to use Example Specific Hidden Markov Models (ESHMMs) to obtain log-likelihood scores for a dysarthric speech utterance to form a fixed-dimensional score vector representation. This representation is used as an input to a discriminative classifier such as a support vector machine. The performance of the proposed approach is evaluated using the UA-Speech database. The recognition accuracy is much better than the conventional hidden Markov model based approach and the Deep Neural Network-Hidden Markov Model (DNN-HMM). The efficiency of the discriminative nature of the score vector representation is demonstrated for "very low" intelligibility words.
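
    The score-vector representation can be sketched as follows (a toy illustration using hmmlearn and scikit-learn, neither of which is necessarily what the authors used): one Gaussian HMM is trained per word, each utterance is mapped to the vector of per-model log-likelihoods, and an SVM classifies those fixed-dimensional vectors.

        import numpy as np
        from hmmlearn import hmm
        from sklearn.svm import SVC

        rng = np.random.default_rng(0)

        def toy_utterances(mean, n, length=40, dim=13):
            """Generate toy MFCC-like sequences centred on `mean` (stand-ins for real features)."""
            return [mean + rng.normal(0.0, 1.0, size=(length, dim)) for _ in range(n)]

        # One example-specific HMM per word class, trained on that class's utterances.
        classes = {"yes": 0.0, "no": 2.0}          # hypothetical words and their toy feature offsets
        models, train_sets = {}, {}
        for word, offset in classes.items():
            train_sets[word] = toy_utterances(offset, n=10)
            X = np.vstack(train_sets[word])
            lengths = [len(u) for u in train_sets[word]]
            m = hmm.GaussianHMM(n_components=3, covariance_type="diag", n_iter=20, random_state=0)
            m.fit(X, lengths)
            models[word] = m

        def score_vector(utterance):
            """Fixed-dimensional representation: log-likelihood of the utterance under every HMM."""
            return np.array([models[w].score(utterance) for w in classes])

        # Train an SVM on score vectors and classify a held-out toy utterance.
        X_train = np.array([score_vector(u) for w in classes for u in train_sets[w]])
        y_train = np.array([w for w in classes for _ in train_sets[w]])
        svm = SVC(kernel="linear").fit(X_train, y_train)
        test = toy_utterances(2.0, n=1)[0]
        print(svm.predict([score_vector(test)]))    # expected to print ['no'] for this toy setup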

  14. VOCAL: Voice Oriented Curriculum Author Language. Technical Report No. 291.

    Science.gov (United States)

    Hinckley, Michael; And Others

    VOCAL (Voice Oriented Curriculum Author Language) is designed to facilitate the authoring of computer assisted curricula which incorporate highly interactive audio and text presentations. Lessons written in VOCAL are intended to be patterned after the style of informal classroom lectures. VOCAL contains features that allow the author to specify…

  15. Oral Breathing Challenge in Participants with Vocal Attrition

    Science.gov (United States)

    Sivasankar, Mahalakshmi; Fisher, Kimberly V.

    2003-01-01

    Vocal folds undergo osmotic challenge by mouth breathing during singing, exercising, and loud speaking. Just 15 min of obligatory oral breathing, to dry the vocal folds, increases phonation threshold pressure (Pth) and expiratory vocal effort in healthy speakers (M. Sivasankar & K. Fisher, 2002). We questioned whether oral breathing is…

  16. Tongue Reading: Comparing the Interpretation of Visual Information from inside the Mouth, from Electropalatographic and Ultrasound Displays of Speech Sounds

    Science.gov (United States)

    Cleland, Joanne; Mccron, Caitlin; Scobbie, James M.

    2013-01-01

    Speakers possess a natural capacity for lip reading; analogous to this, there may be an intuitive ability to "tongue-read." Although the ability of untrained participants to perceive aspects of the speech signal has been explored for some visual representations of the vocal tract (e.g. talking heads), it is not yet known to what extent…

  17. Patterns in Early Interaction between Young Preschool Children with Severe Speech and Physical Impairments and Their Parents

    Science.gov (United States)

    Sandberg, Annika Dahlgren; Liliedahl, Marie

    2008-01-01

    The aim of this study is to examine whether the asymmetrical pattern of communication usually found between people who use augmentative and alternative communication and their partners using natural speech was also found in the interaction between non-vocal young preschool children with cerebral palsy and their parents. Three parent-child dyads…

  18. Effect of Bilateral Stimulation of the Subthalamic Nucleus on Different Speech Subsystems in Patients with Parkinson's Disease

    Science.gov (United States)

    Putzer, Manfred; Barry, William J.; Moringlane, Jean Richard

    2008-01-01

    The effect of deep brain stimulation on the two speech-production subsystems, articulation and phonation, of nine Parkinsonian patients is examined. Production parameters (stop closure voicing; stop closure, VOT, vowel) in fast syllable-repetitions were defined and measured and quantitative, objective metrics of vocal fold function were obtained…

  19. Effects of an Extended Version of the Lee Silverman Voice Treatment on Voice and Speech in Parkinson's Disease

    Science.gov (United States)

    Spielman, Jennifer; Ramig, Lorraine O.; Mahler, Leslie; Halpern, Angela; Gavin, William J.

    2007-01-01

    Purpose: The present study examined vocal SPL, voice handicap, and speech characteristics in Parkinson's disease (PD) following an extended version of the Lee Silverman Voice Treatment (LSVT), to help determine whether current treatment dosages can be altered without compromising clinical outcomes. Method: Twelve participants with idiopathic PD…

  20. Auditory evoked fields to vocalization during passive listening and active generation in adults who stutter.

    Science.gov (United States)

    Beal, Deryk S; Cheyne, Douglas O; Gracco, Vincent L; Quraan, Maher A; Taylor, Margot J; De Nil, Luc F

    2010-10-01

    We used magnetoencephalography to investigate auditory evoked responses to speech vocalizations and non-speech tones in adults who do and do not stutter. Neuromagnetic field patterns were recorded as participants listened to a 1 kHz tone, playback of their own productions of the vowel /i/ and vowel-initial words, and actively generated the vowel /i/ and vowel-initial words. Activation of the auditory cortex at approximately 50 and 100 ms was observed during all tasks. A reduction in the peak amplitudes of the M50 and M100 components was observed during the active generation versus passive listening tasks dependent on the stimuli. Adults who stutter did not differ in the amount of speech-induced auditory suppression relative to fluent speakers. Adults who stutter had shorter M100 latencies for the actively generated speaking tasks in the right hemisphere relative to the left hemisphere but the fluent speakers showed similar latencies across hemispheres. During passive listening tasks, adults who stutter had longer M50 and M100 latencies than fluent speakers. The results suggest that there are timing, rather than amplitude, differences in auditory processing during speech in adults who stutter and are discussed in relation to hypotheses of auditory-motor integration breakdown in stuttering.