WorldWideScience

Sample records for voice recognition software

  1. Voice recognition software for clinical use.

    Science.gov (United States)

    Korn, K

    1998-11-01

    The current generation voice recognition products truly offer the promise of voice recognition systems, that are financially and operationally acceptable for use in a health care facility. Although the initial capital outlay for the purchase of such equipment may be substantial, the long-term benefit is felt to outweigh the expense. The ability to utilize computer equipment for educational purposes and information management alone helps to rationalize the cost. In addition, it is important to remember that the Internet has become a substantial source of information which provides another functional use for this equipment. Although one can readily see the implication for such a program in clinical practice, other uses for the program should not be overlooked. Uses far beyond the writing of clinic notes and correspondence can be easily envisioned. Utilization of voice recognition software offers clinical practices the ability to produce quality printed records in a timely and cost-effective manner. After learning procedures for the selected product and appropriately formatting word processing software and printers, printed progress notes should be able to be produced in less time than traditional dictation and transcription methods. Although certain procedures and practices may need to be altered, or may preclude optimal utilization of this type of system, many advantages are apparent. It is recommended that facilities consider utilization of Voice Recognition products such as Dragon Systems Naturally Speaking Software, or at least consider a trial of this method with one of the limited-feature products, if current dictation practices are unsatisfactory or excessively costly. Free downloadable trial software or single user software can provide a reduced-cost method for trial evaluation of such products if a major commitment is not felt to be desired. A list of voice recognition software manufacturer web sites may be accessed through the following: http

  2. Voice recognition software can be used for scientific articles

    DEFF Research Database (Denmark)

    Pommergaard, Hans-Christian; Huang, Chenxi; Burcharth, Jacob;

    2015-01-01

    INTRODUCTION: Dictation of scientific articles has been recognised as an efficient method for producing high-quality, first article drafts. However, standardised transcription service by a secretary may not be available for all researchers and voice recognition software (VRS) may therefore...

  3. Voice recognition software can be used for scientific articles

    DEFF Research Database (Denmark)

    Pommergaard, Hans-Christian; Huang, Chenxi; Burcharth, Jacob

    2015-01-01

    INTRODUCTION: Dictation of scientific articles has been recognised as an efficient method for producing high-quality, first article drafts. However, standardised transcription service by a secretary may not be available for all researchers and voice recognition software (VRS) may therefore...... be an alternative. The purpose of this study was to evaluate the out-of-the-box accuracy of VRS. METHODS: Eleven young researchers without dictation experience dictated the first draft of their own scientific article after thorough preparation according to a pre-defined schedule. The dictate transcribed by VRS...

  4. Voice recognition software can be used for scientific articles

    DEFF Research Database (Denmark)

    Pommergaard, Hans-Christian; Huang, Chenxi; Burcharth, Jacob;

    2015-01-01

    INTRODUCTION: Dictation of scientific articles has been recognised as an efficient method for producing high-quality, first article drafts. However, standardised transcription service by a secretary may not be available for all researchers and voice recognition software (VRS) may therefore...... be an alternative. The purpose of this study was to evaluate the out-of-the-box accuracy of VRS. METHODS: Eleven young researchers without dictation experience dictated the first draft of their own scientific article after thorough preparation according to a pre-defined schedule. The dictate transcribed by VRS...... was compared with the same dictate transcribed by an experienced research secretary, and the effect of adding words to the vocabulary of the VRS was investigated. The number of errors per hundred words was used as outcome. Furthermore, three experienced researchers assessed the subjective readability using...

  5. Voice recognition software can be used for scientific articles.

    Science.gov (United States)

    Pommergaard, Hans-Christian; Huang, Chenxi; Burcharth, Jacob; Rosenberg, Jacob

    2015-02-01

    Dictation of scientific articles has been recognised as an efficient method for producing high-quality, first article drafts. However, standardised transcription service by a secretary may not be available for all researchers and voice recognition software (VRS) may therefore be an alternative. The purpose of this study was to evaluate the out-of-the-box accuracy of VRS. Eleven young researchers without dictation experience dictated the first draft of their own scientific article after thorough preparation according to a pre-defined schedule. The dictate transcribed by VRS was compared with the same dictate transcribed by an experienced research secretary, and the effect of adding words to the vocabulary of the VRS was investigated. The number of errors per hundred words was used as outcome. Furthermore, three experienced researchers assessed the subjective readability using a Likert scale (0-10). Dragon Nuance Premium version 12.5 was used as VRS. The median number of errors per hundred words was 18 (range: 8.5-24.3), which improved when 15,000 words were added to the vocabulary. Subjective readability assessment showed that the texts were understandable with a median score of five (range: 3-9), which was improved with the addition of 5,000 words. The out-of-the-box performance of VRS was acceptable and improved after additional words were added. Further studies are needed to investigate the effect of additional software accuracy training.

  6. Embodied Transcription: A Creative Method for Using Voice-Recognition Software

    Science.gov (United States)

    Brooks, Christine

    2010-01-01

    Voice-recognition software is designed to be used by one user (voice) at a time, requiring a researcher to speak all of the words of a recorded interview to achieve transcription. Thus, the researcher becomes a conduit through which interview material is inscribed as written word. Embodied Transcription acknowledges performative and interpretative…

  7. Voice recognition software can be used for scientific articles

    DEFF Research Database (Denmark)

    Pommergaard, Hans-Christian; Huang, Chenxi; Burcharth, Jacob

    2015-01-01

    be an alternative. The purpose of this study was to evaluate the out-of-the-box accuracy of VRS. METHODS: Eleven young researchers without dictation experience dictated the first draft of their own scientific article after thorough preparation according to a pre-defined schedule. The dictate transcribed by VRS...... with a median score of five (range: 3-9), which was improved with the addition of 5,000 words. CONCLUSION: The out-of-the-box performance of VRS was acceptable and improved after additional words were added. Further studies are needed to investigate the effect of additional software accuracy training....

  8. Neural mechanisms for voice recognition

    NARCIS (Netherlands)

    Andics, A.V.; McQueen, J.M.; Petersson, K.M.; Gal, V.; Rudas, G.; Vidnyanszky, Z.

    2010-01-01

    We investigated neural mechanisms that support voice recognition in a training paradigm with fMRI. The same listeners were trained on different weeks to categorize the mid-regions of voice-morph continua as an individual's voice. Stimuli implicitly defined a voice-acoustics space, and training expli

  9. Voice Recognition Technology: Has It Come of Age?

    Directory of Open Access Journals (Sweden)

    Joseph R. Zumalt

    2005-12-01

    Full Text Available Voice recognition software allows computer users to bypass their keyboards and use their voices to enter text. While the library literature is somewhat silent about voice recognition technology, the medical and legal communities have reported some success using it. Voice recognition software was tested for dictation accuracy and usability within an agriculture library at the University of Illinois. Dragon NaturallySpeaking 8.0 was found to be more accurate than speech recognition within Microsoft Office 2003. Helpful Web sites and a short history regarding this breakthrough technology are included.

  10. FILTWAM and Voice Emotion Recognition

    NARCIS (Netherlands)

    Bahreini, Kiavash; Nadolski, Rob; Westera, Wim

    2014-01-01

    This paper introduces the voice emotion recognition part of our framework for improving learning through webcams and microphones (FILTWAM). This framework enables multimodal emotion recognition of learners during game-based learning. The main goal of this study is to validate the use of microphone d

  11. FILTWAM and Voice Emotion Recognition

    NARCIS (Netherlands)

    Bahreini, Kiavash; Nadolski, Rob; Westera, Wim

    2014-01-01

    This paper introduces the voice emotion recognition part of our framework for improving learning through webcams and microphones (FILTWAM). This framework enables multimodal emotion recognition of learners during game-based learning. The main goal of this study is to validate the use of microphone

  12. Building Domain Specific Languages for Voice Recognition Applications

    Directory of Open Access Journals (Sweden)

    Cristian IONITA

    2008-01-01

    Full Text Available This paper presents a method of implementing the voice recognition for the control of software applications. The solutions proposed are based on transforming a subset of the natural language in commands recognized by the application using a formal language defined by the means of a context free grammar. At the end of the paper is presented the modality of integration of voice recognition and of voice synthesis for the Romanian language in Windows applications.

  13. Voice congruency facilitates word recognition.

    Directory of Open Access Journals (Sweden)

    Sandra Campeanu

    Full Text Available Behavioral studies of spoken word memory have shown that context congruency facilitates both word and source recognition, though the level at which context exerts its influence remains equivocal. We measured event-related potentials (ERPs while participants performed both types of recognition task with words spoken in four voices. Two voice parameters (i.e., gender and accent varied between speakers, with the possibility that none, one or two of these parameters was congruent between study and test. Results indicated that reinstating the study voice at test facilitated both word and source recognition, compared to similar or no context congruency at test. Behavioral effects were paralleled by two ERP modulations. First, in the word recognition test, the left parietal old/new effect showed a positive deflection reflective of context congruency between study and test words. Namely, the same speaker condition provided the most positive deflection of all correctly identified old words. In the source recognition test, a right frontal positivity was found for the same speaker condition compared to the different speaker conditions, regardless of response success. Taken together, the results of this study suggest that the benefit of context congruency is reflected behaviorally and in ERP modulations traditionally associated with recognition memory.

  14. WHEEL CHAIR USING VOICE RECOGNITION

    OpenAIRE

    Manish Kumar Yadav*; Rajat Kumar; Santosh Yadav; Ravindra Prajapati; Prof. Kshirsagar

    2016-01-01

    The wide spread prevalence of lost limbs and sensing system is of major concern in present day due to wars, accident, age and health problems. This Omni-directional wheelchair was designed for the less able elderly to move more flexibly in narrow spaces, such as elevators or small aisle. The wheelchair is developed to help disabled patients by using speech recognition system to control the movement of wheelchair in different directions by using voice commands and also the simple movement of t...

  15. Human voice recognition depends on language ability.

    Science.gov (United States)

    Perrachione, Tyler K; Del Tufo, Stephanie N; Gabrieli, John D E

    2011-07-29

    The ability to recognize people by their voice is an important social behavior. Individuals differ in how they pronounce words, and listeners may take advantage of language-specific knowledge of speech phonology to facilitate recognizing voices. Impaired phonological processing is characteristic of dyslexia and thought to be a basis for difficulty in learning to read. We tested voice-recognition abilities of dyslexic and control listeners for voices speaking listeners' native language or an unfamiliar language. Individuals with dyslexia exhibited impaired voice-recognition abilities compared with controls only for voices speaking their native language. These results demonstrate the importance of linguistic representations for voice recognition. Humans appear to identify voices by making comparisons between talkers' pronunciations of words and listeners' stored abstract representations of the sounds in those words.

  16. Voice Recognition in Face-Blind Patients.

    Science.gov (United States)

    Liu, Ran R; Pancaroglu, Raika; Hills, Charlotte S; Duchaine, Brad; Barton, Jason J S

    2016-04-01

    Right or bilateral anterior temporal damage can impair face recognition, but whether this is an associative variant of prosopagnosia or part of a multimodal disorder of person recognition is an unsettled question, with implications for cognitive and neuroanatomic models of person recognition. We assessed voice perception and short-term recognition of recently heard voices in 10 subjects with impaired face recognition acquired after cerebral lesions. All 4 subjects with apperceptive prosopagnosia due to lesions limited to fusiform cortex had intact voice discrimination and recognition. One subject with bilateral fusiform and anterior temporal lesions had a combined apperceptive prosopagnosia and apperceptive phonagnosia, the first such described case. Deficits indicating a multimodal syndrome of person recognition were found only in 2 subjects with bilateral anterior temporal lesions. All 3 subjects with right anterior temporal lesions had normal voice perception and recognition, 2 of whom performed normally on perceptual discrimination of faces. This confirms that such lesions can cause a modality-specific associative prosopagnosia.

  17. Voice Recognition: A New Assessment Tool?

    Science.gov (United States)

    Jones, Darla

    2005-01-01

    This article presents the results of a study conducted in Anchorage, Alaska, that evaluated the accuracy and efficiency of using voice recognition (VR) technology to collect oral reading fluency data for classroom-based assessments. The primary research question was as follows: Is voice recognition technology a valid and reliable alternative to…

  18. Frequency and analysis of non-clinical errors made in radiology reports using the National Integrated Medical Imaging System voice recognition dictation software.

    Science.gov (United States)

    Motyer, R E; Liddy, S; Torreggiani, W C; Buckley, O

    2016-11-01

    Voice recognition (VR) dictation of radiology reports has become the mainstay of reporting in many institutions worldwide. Despite benefit, such software is not without limitations, and transcription errors have been widely reported. Evaluate the frequency and nature of non-clinical transcription error using VR dictation software. Retrospective audit of 378 finalised radiology reports. Errors were counted and categorised by significance, error type and sub-type. Data regarding imaging modality, report length and dictation time was collected. 67 (17.72 %) reports contained ≥1 errors, with 7 (1.85 %) containing 'significant' and 9 (2.38 %) containing 'very significant' errors. A total of 90 errors were identified from the 378 reports analysed, with 74 (82.22 %) classified as 'insignificant', 7 (7.78 %) as 'significant', 9 (10 %) as 'very significant'. 68 (75.56 %) errors were 'spelling and grammar', 20 (22.22 %) 'missense' and 2 (2.22 %) 'nonsense'. 'Punctuation' error was most common sub-type, accounting for 27 errors (30 %). Complex imaging modalities had higher error rates per report and sentence. Computed tomography contained 0.040 errors per sentence compared to plain film with 0.030. Longer reports had a higher error rate, with reports >25 sentences containing an average of 1.23 errors per report compared to 0-5 sentences containing 0.09. These findings highlight the limitations of VR dictation software. While most error was deemed insignificant, there were occurrences of error with potential to alter report interpretation and patient management. Longer reports and reports on more complex imaging had higher error rates and this should be taken into account by the reporting radiologist.

  19. Chemical recognition software

    Energy Technology Data Exchange (ETDEWEB)

    Wagner, J.S.; Trahan, M.W.; Nelson, W.E.; Hargis, P.H. Jr.; Tisone, G.C.

    1994-06-01

    We have developed a capability to make real time concentration measurements of individual chemicals in a complex mixture using a multispectral laser remote sensing system. Our chemical recognition and analysis software consists of three parts: (1) a rigorous multivariate analysis package for quantitative concentration and uncertainty estimates, (2) a genetic optimizer which customizes and tailors the multivariate algorithm for a particular application, and (3) an intelligent neural net chemical filter which pre-selects from the chemical database to find the appropriate candidate chemicals for quantitative analyses by the multivariate algorithms, as well as providing a quick-look concentration estimate and consistency check. Detailed simulations using both laboratory fluorescence data and computer synthesized spectra indicate that our software can make accurate concentration estimates from complex multicomponent mixtures, even when the mixture is noisy and contaminated with unknowns.

  20. Chemical recognition software

    Energy Technology Data Exchange (ETDEWEB)

    Wagner, J.S.; Trahan, M.W.; Nelson, W.E.; Hargis, P.J. Jr.; Tisone, G.C.

    1994-12-01

    We have developed a capability to make real time concentration measurements of individual chemicals in a complex mixture using a multispectral laser remote sensing system. Our chemical recognition and analysis software consists of three parts: (1) a rigorous multivariate analysis package for quantitative concentration and uncertainty estimates, (2) a genetic optimizer which customizes and tailors the multivariate algorithm for a particular application, and (3) an intelligent neural net chemical filter which pre-selects from the chemical database to find the appropriate candidate chemicals for quantitative analyses by the multivariate algorithms, as well as providing a quick-look concentration estimate and consistency check. Detailed simulations using both laboratory fluorescence data and computer synthesized spectra indicate that our software can make accurate concentration estimates from complex multicomponent mixtures. even when the mixture is noisy and contaminated with unknowns.

  1. Implicit multisensory associations influence voice recognition.

    Directory of Open Access Journals (Sweden)

    Katharina von Kriegstein

    2006-10-01

    Full Text Available Natural objects provide partially redundant information to the brain through different sensory modalities. For example, voices and faces both give information about the speech content, age, and gender of a person. Thanks to this redundancy, multimodal recognition is fast, robust, and automatic. In unimodal perception, however, only part of the information about an object is available. Here, we addressed whether, even under conditions of unimodal sensory input, crossmodal neural circuits that have been shaped by previous associative learning become activated and underpin a performance benefit. We measured brain activity with functional magnetic resonance imaging before, while, and after participants learned to associate either sensory redundant stimuli, i.e. voices and faces, or arbitrary multimodal combinations, i.e. voices and written names, ring tones, and cell phones or brand names of these cell phones. After learning, participants were better at recognizing unimodal auditory voices that had been paired with faces than those paired with written names, and association of voices with faces resulted in an increased functional coupling between voice and face areas. No such effects were observed for ring tones that had been paired with cell phones or names. These findings demonstrate that brief exposure to ecologically valid and sensory redundant stimulus pairs, such as voices and faces, induces specific multisensory associations. Consistent with predictive coding theories, associative representations become thereafter available for unimodal perception and facilitate object recognition. These data suggest that for natural objects effective predictive signals can be generated across sensory systems and proceed by optimization of functional connectivity between specialized cortical sensory modules.

  2. Pegembangan Game dengan Menggunakan Teknologi Voice Recognition Berbasis Android

    Directory of Open Access Journals (Sweden)

    Franky Hadinata Marpaung

    2014-06-01

    Full Text Available The purpose of this research is to create a new kind of game by using technology that rarely used in current games. It is developed as an entertainment media and also a social media in which the users can play the games together via multiplayer mode. This research uses Scrum development method since it supports small scaled developer and it supports software increment along the development. Using this game application, the users can play and watch interesting animations by controlling it with their voice, listen the character imitating the users’ voice, play various mini games both in single player or multiplayer mode via Bluetooth connection. The conclusion is that game application of My Name is Dug use voice recognition and inter-devices connection as its main features. It also has various mini games that support both single player and multiplayer.

  3. Machine Recognition vs Human Recognition of Voices

    Science.gov (United States)

    2012-05-01

    recognized. The accuracy of speaker recognition for disyllables was 87%. For monosyllables, it was 81%, consonant- vowel excerpts were 63%, and... vowel excerpts were 56%. Thus, they demonstrated that the identification performance decreased as the number of phonemes decreased. In [2], the...will still sound natural and the performance of listeners could be tied directly to the degradation of particular frequencies. If the performance

  4. Enhancing nursing practice by utilizing voice recognition for direct documentation.

    Science.gov (United States)

    Fratzke, Jason; Tucker, Sharon; Shedenhelm, Heidi; Arnold, Jackie; Belda, Tom; Petera, Michael

    2014-02-01

    Innovative strategies that preserve nursing time for direct patient care activities are needed. This study examined the utility, feasibility, and acceptability of voice recognition (VR) software to document nursing care and patient outcomes in an electronic health record in a simulated nursing care environment. A phase 1 trial included 5 iterative experiments with observations and nurse participant feedback to allow enhancements to the speech detection capabilities and refinement of the technology, software, and processes. Utility ratings improved over time; however, interference on nursing care remained a concern throughout. Nurse participants favored keyboard entry electronic health record, largely due to software and technical issues, but also relative to the culture shift the new technology brings to nursing practice. Successful adoption of VR technology by nursing will be dependent on receptiveness of the nurses and perceived benefits, timely access to education and training, and minimization of barriers to using the software.

  5. Improving Speaker Recognition by Biometric Voice Deconstruction

    Directory of Open Access Journals (Sweden)

    Luis Miguel eMazaira-Fernández

    2015-09-01

    Full Text Available Person identification, especially in critical environments, has always been a subject of great interest. However, it has gained a new dimension in a world threatened by a new kind of terrorism that uses social networks (e.g. YouTube to broadcast its message. In this new scenario, classical identification methods (such fingerprints or face recognition have been forcedly replaced by alternative biometric characteristics such as voice, as sometimes this is the only feature available. Through the present paper, a new methodology to characterize speakers will be shown. This methodology is benefiting from the advances achieved during the last years in understanding and modelling voice production. The paper hypothesizes that a gender dependent characterization of speakers combined with the use of a new set of biometric parameters extracted from the components resulting from the deconstruction of the voice into its glottal source and vocal tract estimates, will enhance recognition rates when compared to classical approaches. A general description about the main hypothesis and the methodology followed to extract gender-dependent extended biometric parameters are given. Experimental validation is carried out both on a highly controlled acoustic condition database, and on a mobile phone network recorded under non-controlled acoustic conditions.

  6. Improving Speaker Recognition by Biometric Voice Deconstruction.

    Science.gov (United States)

    Mazaira-Fernandez, Luis Miguel; Álvarez-Marquina, Agustín; Gómez-Vilda, Pedro

    2015-01-01

    Person identification, especially in critical environments, has always been a subject of great interest. However, it has gained a new dimension in a world threatened by a new kind of terrorism that uses social networks (e.g., YouTube) to broadcast its message. In this new scenario, classical identification methods (such as fingerprints or face recognition) have been forcedly replaced by alternative biometric characteristics such as voice, as sometimes this is the only feature available. The present study benefits from the advances achieved during last years in understanding and modeling voice production. The paper hypothesizes that a gender-dependent characterization of speakers combined with the use of a set of features derived from the components, resulting from the deconstruction of the voice into its glottal source and vocal tract estimates, will enhance recognition rates when compared to classical approaches. A general description about the main hypothesis and the methodology followed to extract the gender-dependent extended biometric parameters is given. Experimental validation is carried out both on a highly controlled acoustic condition database, and on a mobile phone network recorded under non-controlled acoustic conditions.

  7. Temporal voice areas exist in autism spectrum disorder but are dysfunctional for voice identity recognition

    Science.gov (United States)

    Borowiak, Kamila; von Kriegstein, Katharina

    2016-01-01

    The ability to recognise the identity of others is a key requirement for successful communication. Brain regions that respond selectively to voices exist in humans from early infancy on. Currently, it is unclear whether dysfunction of these voice-sensitive regions can explain voice identity recognition impairments. Here, we used two independent functional magnetic resonance imaging studies to investigate voice processing in a population that has been reported to have no voice-sensitive regions: autism spectrum disorder (ASD). Our results refute the earlier report that individuals with ASD have no responses in voice-sensitive regions: Passive listening to vocal, compared to non-vocal, sounds elicited typical responses in voice-sensitive regions in the high-functioning ASD group and controls. In contrast, the ASD group had a dysfunction in voice-sensitive regions during voice identity but not speech recognition in the right posterior superior temporal sulcus/gyrus (STS/STG)—a region implicated in processing complex spectrotemporal voice features and unfamiliar voices. The right anterior STS/STG correlated with voice identity recognition performance in controls but not in the ASD group. The findings suggest that right STS/STG dysfunction is critical for explaining voice recognition impairments in high-functioning ASD and show that ASD is not characterised by a general lack of voice-sensitive responses. PMID:27369067

  8. Software Based Supernova Recognition

    Science.gov (United States)

    Walters, Stephen M.

    2014-05-01

    This paper describes software for detecting Supernova (SN) in images. The software can operate in real-time to discover SN while data is being collected so the instrumentation can immediately be re-tasked to perform spectroscopy or photometry of a discovery. Because the instrumentation captures two images per minute, the realtime budget is constrained to 30 seconds per target, a challenging goal. Using a set of two to four images, the program creates a "Reference" (REF) image and a "New" (NEW) image where all images are used in both NEW and REF but any SN survives the combination process only in the NEW image. This process produces good quality images having similar noise characteristics but without artifacts that might be interpreted as SN. The images are then adjusted for seeing and brightness differences using a variant of Tomaney and Crotts method of Point Spread Function (PSF) matching after which REF is subtracted from NEW to produce a Difference (DIF) image. A Classifier is then trained on a grid of artificial SN to estimate the statistical properties of four attributes and used in a process to mask false positives that can be clearly identified as such. Further training to avoid any remaining false positives sets the range, in standard deviations for each attribute, that the Classifier will accept as a valid SN. This training enables the Classifier to discriminate between SN and most subtraction residue. Lastly, the DIF image is scanned and measured by the Classifier to find locations where all four properties fall within their acceptance ranges. If multiple locations are found, the one best conforming to the training estimates is chosen. This location is then declared as a Candidate SN, the instrumentation re-tasked and the operator notified.

  9. Familiar Person Recognition: Is Autonoetic Consciousness More Likely to Accompany Face Recognition Than Voice Recognition?

    Science.gov (United States)

    Barsics, Catherine; Brédart, Serge

    2010-11-01

    Autonoetic consciousness is a fundamental property of human memory, enabling us to experience mental time travel, to recollect past events with a feeling of self-involvement, and to project ourselves in the future. Autonoetic consciousness is a characteristic of episodic memory. By contrast, awareness of the past associated with a mere feeling of familiarity or knowing relies on noetic consciousness, depending on semantic memory integrity. Present research was aimed at evaluating whether conscious recollection of episodic memories is more likely to occur following the recognition of a familiar face than following the recognition of a familiar voice. Recall of semantic information (biographical information) was also assessed. Previous studies that investigated the recall of biographical information following person recognition used faces and voices of famous people as stimuli. In this study, the participants were presented with personally familiar people's voices and faces, thus avoiding the presence of identity cues in the spoken extracts and allowing a stricter control of frequency exposure with both types of stimuli (voices and faces). In the present study, the rate of retrieved episodic memories, associated with autonoetic awareness, was significantly higher from familiar faces than familiar voices even though the level of overall recognition was similar for both these stimuli domains. The same pattern was observed regarding semantic information retrieval. These results and their implications for current Interactive Activation and Competition person recognition models are discussed.

  10. Electrolarynx Voice Recognition Utilizing Pulse Coupled Neural Network

    Directory of Open Access Journals (Sweden)

    Fatchul Arifin

    2010-08-01

    Full Text Available The laryngectomies patient has no ability to speak normally because their vocal chords have been removed. The easiest option for the patient to speak again is by using electrolarynx speech. This tool is placed on the lower chin. Vibration of the neck while speaking is used to produce sound. Meanwhile, the technology of "voice recognition" has been growing very rapidly. It is expected that the technology of "voice recognition" can also be used by laryngectomies patients who use electrolarynx.This paper describes a system for electrolarynx speech recognition. Two main parts of the system are feature extraction and pattern recognition. The Pulse Coupled Neural Network – PCNN is used to extract the feature and characteristic of electrolarynx speech. Varying of β (one of PCNN parameter also was conducted. Multi layer perceptron is used to recognize the sound patterns. There are two kinds of recognition conducted in this paper: speech recognition and speaker recognition. The speech recognition recognizes specific speech from every people. Meanwhile, speaker recognition recognizes specific speech from specific person. The system ran well. The "electrolarynx speech recognition" has been tested by recognizing of “A” and "not A" voice. The results showed that the system had 94.4% validation. Meanwhile, the electrolarynx speaker recognition has been tested by recognizing of “saya” voice from some different speakers. The results showed that the system had 92.2% validation. Meanwhile, the best β parameter of PCNN for electrolarynx recognition is 3.

  11. (Almost) Word for Word: As Voice Recognition Programs Improve, Students Reap the Benefits

    Science.gov (United States)

    Smith, Mark

    2006-01-01

    Voice recognition software is hardly new--attempts at capturing spoken words and turning them into written text have been available to consumers for about two decades. But what was once an expensive and highly unreliable tool has made great strides in recent years, perhaps most recognized in programs such as Nuance's Dragon NaturallySpeaking…

  12. Noise Robust Speech Recognition Applied to Voice-Driven Wheelchair

    Science.gov (United States)

    Sasou, Akira; Kojima, Hiroaki

    2009-12-01

    Conventional voice-driven wheelchairs usually employ headset microphones that are capable of achieving sufficient recognition accuracy, even in the presence of surrounding noise. However, such interfaces require users to wear sensors such as a headset microphone, which can be an impediment, especially for the hand disabled. Conversely, it is also well known that the speech recognition accuracy drastically degrades when the microphone is placed far from the user. In this paper, we develop a noise robust speech recognition system for a voice-driven wheelchair. This system can achieve almost the same recognition accuracy as the headset microphone without wearing sensors. We verified the effectiveness of our system in experiments in different environments, and confirmed that our system can achieve almost the same recognition accuracy as the headset microphone without wearing sensors.

  13. When the face fits: recognition of celebrities from matching and mismatching faces and voices.

    Science.gov (United States)

    Stevenage, Sarah V; Neil, Greg J; Hamlin, Iain

    2014-01-01

    The results of two experiments are presented in which participants engaged in a face-recognition or a voice-recognition task. The stimuli were face-voice pairs in which the face and voice were co-presented and were either "matched" (same person), "related" (two highly associated people), or "mismatched" (two unrelated people). Analysis in both experiments confirmed that accuracy and confidence in face recognition was consistently high regardless of the identity of the accompanying voice. However accuracy of voice recognition was increasingly affected as the relationship between voice and accompanying face declined. Moreover, when considering self-reported confidence in voice recognition, confidence remained high for correct responses despite the proportion of these responses declining across conditions. These results converged with existing evidence indicating the vulnerability of voice recognition as a relatively weak signaller of identity, and results are discussed in the context of a person-recognition framework.

  14. The Neuropsychology of Familiar Person Recognition from Face and Voice

    OpenAIRE

    2014-01-01

    Prosopagnosia has been considered for a long period of time as the most important and almost exclusive disorder in the recognition of familiar people. In recent years, however, this conviction has been undermined by the description of patients showing a concomitant defect in the recognition of familiar faces and voices as a consequence of lesions encroaching upon the right anterior temporal lobe (ATL). These new data have obliged researchers to reconsider on one hand the construct of ‘associa...

  15. Emotional Recognition in Autism Spectrum Conditions from Voices and Faces

    Science.gov (United States)

    Stewart, Mary E.; McAdam, Clair; Ota, Mitsuhiko; Peppe, Sue; Cleland, Joanne

    2013-01-01

    The present study reports on a new vocal emotion recognition task and assesses whether people with autism spectrum conditions (ASC) perform differently from typically developed individuals on tests of emotional identification from both the face and the voice. The new test of vocal emotion contained trials in which the vocal emotion of the sentence…

  16. Protocol Software for a Packet Voice Terminal

    Science.gov (United States)

    1983-11-16

    III satellite. The PVTs with their attached telephone instrument serve as the interface with the voice user. The PVTs prepare speech for transmission...through a packet network by digitizing the speech, preparing speech data packets, and sending speech data messages. The PVT handles the speech coming...TOTALKin. Thes Foori alControle wilno trani speech message s unlTreessin ithas note rcenie s speehdfr ao sufficentpo toefl erAcofitt cdng atei t pfres

  17. Acoustic cues for the recognition of self-voice and other-voice

    Directory of Open Access Journals (Sweden)

    Mingdi eXu

    2013-10-01

    Full Text Available Self-recognition, being indispensable for successful social communication, has become a major focus in current social neuroscience. The physical aspects of the self are most typically manifested in the face and voice. Compared with the wealth of studies on self-face recognition, self-voice recognition (SVR has not gained much attention. Converging evidence has suggested that the fundamental frequency (F0 and formant structures serve as the key acoustic cues for other-voice recognition (OVR. However, little is known about which, and how, acoustic cues are utilized for SVR as opposed to OVR. To address this question, we independently manipulated the F0 and formant information of recorded voices and investigated their contributions to SVR and OVR. Japanese participants were presented with recorded vocal stimuli and were asked to identify the speaker—either themselves or one of their peers. Six groups of 5 peers of the same sex participated in the study. Under conditions where the formant information was fully preserved and where only the frequencies lower than the third formant (F3 were retained, accuracies of SVR deteriorated significantly with the modulation of the F0, and the results were comparable for OVR. By contrast, under a condition where only the frequencies higher than F3 were retained, the accuracy of SVR was significantly higher than that of OVR throughout the range of F0 modulations, and the F0 scarcely affected the accuracies of SVR and OVR. Our results indicate that while both F0 and formant information are involved in SVR, as well as in OVR, the advantage of SVR is manifested only when major formant information for speech intelligibility is absent. These findings imply the robustness of self-voice representation, possibly by virtue of auditory familiarity and other factors such as its association with motor/articulatory representation.

  18. Secure Recognition of Voice-Less Commands Using Videos

    Science.gov (United States)

    Yau, Wai Chee; Kumar, Dinesh Kant; Weghorn, Hans

    Interest in voice recognition technologies for internet applications is growing due to the flexibility of speech-based communication. The major drawback with the use of sound for internet access with computers is that the commands will be audible to other people in the vicinity. This paper examines a secure and voice-less method for recognition of speech-based commands using video without evaluating sound signals. The proposed approach represents mouth movements in the video data using 2D spatio-temporal templates (STT). Zernike moments (ZM) are computed from STT and fed into support vector machines (SVM) to be classified into one of the utterances. The experimental results demonstrate that the proposed technique produces a high accuracy of 98% in a phoneme classification task. The proposed technique is demonstrated to be invariant to global variations of illumination level. Such a system is useful for securely interpreting user commands for internet applications on mobile devices.

  19. The Neuropsychology of Familiar Person Recognition from Face and Voice

    Directory of Open Access Journals (Sweden)

    Guido Gainotti

    2014-05-01

    Full Text Available Prosopagnosia has been considered for a long period of time as the most important and almost exclusive disorder in the recognition of familiar people. In recent years, however, this conviction has been undermined by the description of patients showing a concomitant defect in the recognition of familiar faces and voices as a consequence of lesions encroaching upon the right anterior temporal lobe (ATL. These new data have obliged researchers to reconsider on one hand the construct of ‘associative prosopagnosia’ and on the other hand current models of people recognition. A systematic review of the patterns of familiar people recognition disorders observed in patients with right and left ATL lesions has shown that in patients with right ATL lesions face familiarity feelings and the retrieval of person-specific semantic information from faces are selectively affected, whereas in patients with left ATL lesions the defect selectively concerns famous people naming. Furthermore, some patients with right ATL lesions and intact face familiarity feelings show a defect in the retrieval of person-specific semantic knowledge greater from face than from name. These data are at variance with current models assuming: (a that familiarity feelings are generated at the level of person identity nodes (PINs where information processed by various sensory modalities converge, and (b that PINs provide a modality-free gateway to a single semantic system, where information about people is stored in an amodal format. They suggest, on the contrary: (a that familiarity feelings are generated at the level of modality-specific recognition units; (b that face and voice recognition units are represented more in the right than in the left ATLs; (c that in the right ATL are mainly stored person-specific information based on a convergence of perceptual information, whereas in the left ATLs are represented verbally-mediated person-specific information.

  20. Voice-Controlled Educational Software for Pre-School Education

    Directory of Open Access Journals (Sweden)

    Aysun Coşkun

    2014-02-01

    Full Text Available Nowadays communities are continuously evolving and changing socially, economically and culturally. Technology is certainly at the top of causative agent of development and change. The opportunities offered by technology especially in education accelerating this development. In recent years rapidly progress of training technology has led to use computers in education. It's a fact that audio-visual support provided by computers is efficient in pre-school children's education. But in pre-school period children don't know how to read and write and haven't completed their small muscle development, this fact limits the use of computers in the process. However, speech recognition technologies which is one of the opportunities offered by technology, is the solution to overcome this limitations. Speech recognition technologies are the systems, that developed for the recognition of sounds by computers are perceived by the devices like microphone. In this study with the facilities of speech recognition technology, supporting individual learning in pre-school education has been aimed. For this reason an educational software in which this technologies of speech recognition are integrated for the usage of pre-school education has been prepared. The most distinctive characteristic of this software is to be able to used with completely sounds. Thus, this software can be used easily by the children who can not to use their hands and arms.

  1. Software for Partly Automated Recognition of Targets

    Science.gov (United States)

    Opitz, David; Blundell, Stuart; Bain, William; Morris, Matthew; Carlson, Ian; Mangrich, Mark

    2003-01-01

    The Feature Analyst is a computer program for assisted (partially automated) recognition of targets in images. This program was developed to accelerate the processing of high-resolution satellite image data for incorporation into geographic information systems (GIS). This program creates an advanced user interface that embeds proprietary machine-learning algorithms in commercial image-processing and GIS software. A human analyst provides samples of target features from multiple sets of data, then the software develops a data-fusion model that automatically extracts the remaining features from selected sets of data. The program thus leverages the natural ability of humans to recognize objects in complex scenes, without requiring the user to explain the human visual recognition process by means of lengthy software. Two major subprograms are the reactive agent and the thinking agent. The reactive agent strives to quickly learn the user s tendencies while the user is selecting targets and to increase the user s productivity by immediately suggesting the next set of pixels that the user may wish to select. The thinking agent utilizes all available resources, taking as much time as needed, to produce the most accurate autonomous feature-extraction model possible.

  2. Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques

    CERN Document Server

    Muda, Lindasalwa; Elamvazuthi, I

    2010-01-01

    Digital processing of speech signal and voice recognition algorithm is very important for fast and accurate automatic voice recognition technology. The voice is a signal of infinite information. A direct analysis and synthesizing the complex voice signal is due to too much information contained in the signal. Therefore the digital signal processes such as Feature Extraction and Feature Matching are introduced to represent the voice signal. Several methods such as Liner Predictive Predictive Coding (LPC), Hidden Markov Model (HMM), Artificial Neural Network (ANN) and etc are evaluated with a view to identify a straight forward and effective method for voice signal. The extraction and matching process is implemented right after the Pre Processing or filtering signal is performed. The non-parametric method for modelling the human auditory perception system, Mel Frequency Cepstral Coefficients (MFCCs) are utilize as extraction techniques. The non linear sequence alignment known as Dynamic Time Warping (DTW) intro...

  3. Impact of PACS and Voice-Recognition Reporting on the Education of Radiology Residents

    OpenAIRE

    Gutierrez, Antonio J.; Mullins, Mark E.; Robert A. Novelline

    2005-01-01

    Rationale and Objectives: The introduction of picture archiving and communication system (PACS) has decreased the time needed to interpret radiology examinations resulting in an increased workflow. Because of concerns that the increase in exam throughput and the use of voice recognition may have a negative impact upon radiology resident education, a survey was conducted to assess the impact of PACS and voice recognition. Materials and Methods: Residents at four diagnostic radiology training p...

  4. Superior voice recognition in a patient with acquired prosopagnosia and object agnosia.

    Science.gov (United States)

    Hoover, Adria E N; Démonet, Jean-François; Steeves, Jennifer K E

    2010-11-01

    Anecdotally, it has been reported that individuals with acquired prosopagnosia compensate for their inability to recognize faces by using other person identity cues such as hair, gait or the voice. Are they therefore superior at the use of non-face cues, specifically voices, to person identity? Here, we empirically measure person and object identity recognition in a patient with acquired prosopagnosia and object agnosia. We quantify person identity (face and voice) and object identity (car and horn) recognition for visual, auditory, and bimodal (visual and auditory) stimuli. The patient is unable to recognize faces or cars, consistent with his prosopagnosia and object agnosia, respectively. He is perfectly able to recognize people's voices and car horns and bimodal stimuli. These data show a reverse shift in the typical weighting of visual over auditory information for audiovisual stimuli in a compromised visual recognition system. Moreover, the patient shows selectively superior voice recognition compared to the controls revealing that two different stimulus domains, persons and objects, may not be equally affected by sensory adaptation effects. This also implies that person and object identity recognition are processed in separate pathways. These data demonstrate that an individual with acquired prosopagnosia and object agnosia can compensate for the visual impairment and become quite skilled at using spared aspects of sensory processing. In the case of acquired prosopagnosia it is advantageous to develop a superior use of voices for person identity recognition in everyday life. Copyright © 2010 Elsevier Ltd. All rights reserved.

  5. Who gets credit for input? Demographic and structural status cues in voice recognition.

    Science.gov (United States)

    Howell, Taeya M; Harrison, David A; Burris, Ethan R; Detert, James R

    2015-11-01

    The authors investigate the employee features that, alongside overall voice expression, affect supervisors' voice recognition. Drawing primarily from status characteristics and network position theories, the authors propose and find in a study of 693 employees from 89 different credit union units that supervisors are more likely to credit those reporting the same amount of voice if the employees have higher ascribed or assigned (by the organization) status--cued by demographic variables such as majority ethnicity and full-time work hours. Further, supervisors are more likely to recognize voice from employees who have higher achieved status--cued by their centrality in informal social structures. The authors also find that even when certain groups of lower status employees speak up more, they cannot compensate for the negative effect of their demographic membership on voice recognition by their boss. The authors underscore how recognition of employee voice by supervisors matters for employees. It carries (mediates) the effects of voice expression and status onto performance evaluations 1 year later, which means that demographic differences in the assignment of credit for voice can serve as an implicit pathway for discrimination.

  6. Voice identity recognition: functional division of the right STS and its behavioral relevance.

    Science.gov (United States)

    Schall, Sonja; Kiebel, Stefan J; Maess, Burkhard; von Kriegstein, Katharina

    2015-02-01

    The human voice is the primary carrier of speech but also a fingerprint for person identity. Previous neuroimaging studies have revealed that speech and identity recognition is accomplished by partially different neural pathways, despite the perceptual unity of the vocal sound. Importantly, the right STS has been implicated in voice processing, with different contributions of its posterior and anterior parts. However, the time point at which vocal and speech processing diverge is currently unknown. Also, the exact role of the right STS during voice processing is so far unclear because its behavioral relevance has not yet been established. Here, we used the high temporal resolution of magnetoencephalography and a speech task control to pinpoint transient behavioral correlates: we found, at 200 msec after stimulus onset, that activity in right anterior STS predicted behavioral voice recognition performance. At the same time point, the posterior right STS showed increased activity during voice identity recognition in contrast to speech recognition whereas the left mid STS showed the reverse pattern. In contrast to the highly speech-sensitive left STS, the current results highlight the right STS as a key area for voice identity recognition and show that its anatomical-functional division emerges around 200 msec after stimulus onset. We suggest that this time point marks the speech-independent processing of vocal sounds in the posterior STS and their successful mapping to vocal identities in the anterior STS.

  7. Ability for voice recognition is a marker for dyslexia in children.

    Science.gov (United States)

    Perea, Manuel; Jiménez, María; Suárez-Coalla, Paz; Fernández, Nohemí; Viña, Cecilia; Cuetos, Fernando

    2014-01-01

    A recent voice recognition experiment conducted by Perrachione, Del Tufo, and Gabrieli (2011) revealed that, in normal adult readers, the accuracy at identifying human voices was better in the participants' mother tongue than in an unfamiliar language, while this difference was absent in a group of adults with dyslexia. This pattern favored a view of dyslexia as due to "fundamentally impoverished native-language phonological representations." To further examine this issue, we conducted two voice recognition experiments, one with children with/without dyslexia, and the other with adults with/without dyslexia. Results revealed that children/adults with dyslexia were less accurate at identifying voices than normal readers and, importantly, this effect was independent of language. These data are more consistent with the assumption of dyslexia as due to a deficit in multisensory integration rather than a deficit based on impoverished native-language phonologically based representations.

  8. Literature Review of Voice Recognition and Generation Technology for Army Helicopter Applications.

    Science.gov (United States)

    1984-08-01

    support up this conclusion (Jay, 1981; Coler , 1983). Based upon the research presented, the following statements can be made: a. When flight control...dB) must be overcome by the voice recognizer ( Coler , 1983). 11 55i The effects of noise on voice recognition were the topic of a study performed at...noise when the subject was also required to perform a tracking task and enter data ( Coler , 1983). Performance was evaluated for three different

  9. Analysis of the influence of sound signal processing parameters on the quality voice command recognition

    Directory of Open Access Journals (Sweden)

    L. P. Dyuzhayev

    2014-04-01

    Full Text Available Introduction. For the task of voice control over different devices recognition of single (isolated voice commands is required. Typically, this control method requires high reliability (at least 95% accuracy voice recognition. It should be noted that voice commands are often pronounced in high noisiness. All presently known methods and algorithms of speech recognition do not allow to clearly determine which parameters of sound signal can provide the best results. The main part. On the first level of voice recognition is about preprocessing and extracting of acoustic features that have a number of useful features – they are easily calculated, providing a compact representation of the voice commands that are resistant to noise interference; On the next level given command is looked for in the reference dictionary. To get MFCC coefficients input file has to be divided into frames. Each frame is measured by a window function and processed by discrete Fourier transform. The resulting representation of signal in the frequency domain is divided into ranges using a set of triangular filters. The last step is to perform discrete cosine transform. Method of dynamic time warping allows to get a value that is an inverse of degree of similarity between given command and a reference. Conclusions. Research has shown that in the field of voice commands recognition optimum results in terms of quality / performance can be achieved using the following parameters of sound signal processing:8 kHz sample rate, frame duration 70–120 ms, Hamming weighting function of a window, number of Fourier samples is 512.

  10. Body expressions influence recognition of emotions in the face and voice.

    Science.gov (United States)

    Van den Stock, Jan; Righart, Ruthger; de Gelder, Beatrice

    2007-08-01

    The most familiar emotional signals consist of faces, voices, and whole-body expressions, but so far research on emotions expressed by the whole body is sparse. The authors investigated recognition of whole-body expressions of emotion in three experiments. In the first experiment, participants performed a body expression-matching task. Results indicate good recognition of all emotions, with fear being the hardest to recognize. In the second experiment, two alternative forced choice categorizations of the facial expression of a compound face-body stimulus were strongly influenced by the bodily expression. This effect was a function of the ambiguity of the facial expression. In the third experiment, recognition of emotional tone of voice was similarly influenced by task irrelevant emotional body expressions. Taken together, the findings illustrate the importance of emotional whole-body expressions in communication either when viewed on their own or, as is often the case in realistic circumstances, in combination with facial expressions and emotional voices.

  11. Effects of emotional and perceptual-motor stress on a voice recognition system's accuracy: An applied investigation

    Science.gov (United States)

    Poock, G. K.; Martin, B. J.

    1984-02-01

    This was an applied investigation examining the ability of a speech recognition system to recognize speakers' inputs when the speakers were under different stress levels. Subjects were asked to speak to a voice recognition system under three conditions: (1) normal office environment, (2) emotional stress, and (3) perceptual-motor stress. Results indicate a definite relationship between voice recognition system performance and the type of low stress reference patterns used to achieve recognition.

  12. Speech Recognition: How Do We Teach It?

    Science.gov (United States)

    Barksdale, Karl

    2002-01-01

    States that growing use of speech recognition software has made voice writing an essential computer skill. Describes how to present the topic, develop basic speech recognition skills, and teach speech recognition outlining, writing, proofreading, and editing. (Contains 14 references.) (SK)

  13. Motorcycle Start-stop System based on Intelligent Biometric Voice Recognition

    Science.gov (United States)

    Winda, A.; E Byan, W. R.; Sofyan; Armansyah; Zariantin, D. L.; Josep, B. G.

    2017-03-01

    Current mechanical key in the motorcycle is prone to bulgary, being stolen or misplaced. Intelligent biometric voice recognition as means to replace this mechanism is proposed as an alternative. The proposed system will decide whether the voice is belong to the user or not and the word utter by the user is ‘On’ or ‘Off’. The decision voice will be sent to Arduino in order to start or stop the engine. The recorded voice is processed in order to get some features which later be used as input to the proposed system. The Mel-Frequency Ceptral Coefficient (MFCC) is adopted as a feature extraction technique. The extracted feature is the used as input to the SVM-based identifier. Experimental results confirm the effectiveness of the proposed intelligent voice recognition and word recognition system. It show that the proposed method produces a good training and testing accuracy, 99.31% and 99.43%, respectively. Moreover, the proposed system shows the performance of false rejection rate (FRR) and false acceptance rate (FAR) accuracy of 0.18% and 17.58%, respectively. In the intelligent word recognition shows that the training and testing accuracy are 100% and 96.3%, respectively.

  14. Recognition of voice commands using adaptation of foreign language speech recognizer via selection of phonetic transcriptions

    Science.gov (United States)

    Maskeliunas, Rytis; Rudzionis, Vytautas

    2011-06-01

    In recent years various commercial speech recognizers have become available. These recognizers provide the possibility to develop applications incorporating various speech recognition techniques easily and quickly. All of these commercial recognizers are typically targeted to widely spoken languages having large market potential; however, it may be possible to adapt available commercial recognizers for use in environments where less widely spoken languages are used. Since most commercial recognition engines are closed systems the single avenue for the adaptation is to try set ways for the selection of proper phonetic transcription methods between the two languages. This paper deals with the methods to find the phonetic transcriptions for Lithuanian voice commands to be recognized using English speech engines. The experimental evaluation showed that it is possible to find phonetic transcriptions that will enable the recognition of Lithuanian voice commands with recognition accuracy of over 90%.

  15. Near-infrared face recognition utilizing open CV software

    Science.gov (United States)

    Sellami, Louiza; Ngo, Hau; Fowler, Chris J.; Kearney, Liam M.

    2014-06-01

    Commercially available hardware, freely available algorithms, and authors' developed software are synergized successfully to detect and recognize subjects in an environment without visible light. This project integrates three major components: an illumination device operating in near infrared (NIR) spectrum, a NIR capable camera and a software algorithm capable of performing image manipulation, facial detection and recognition. Focusing our efforts in the near infrared spectrum allows the low budget system to operate covertly while still allowing for accurate face recognition. In doing so a valuable function has been developed which presents potential benefits in future civilian and military security and surveillance operations.

  16. The Pandora software development kit for pattern recognition

    Energy Technology Data Exchange (ETDEWEB)

    Marshall, J.S.; Thomson, M.A. [University of Cambridge, Cavendish Laboratory, Cambridge (United Kingdom)

    2015-09-15

    The development of automated solutions to pattern recognition problems is important in many areas of scientific research and human endeavour. This paper describes the implementation of the Pandora software development kit, which aids the process of designing, implementing and running pattern recognition algorithms. The Pandora Application Programming Interfaces ensure simple specification of the building-blocks defining a pattern recognition problem. The logic required to solve the problem is implemented in algorithms. The algorithms request operations to create or modify data structures and the operations are performed by the Pandora framework. This design promotes an approach using many decoupled algorithms, each addressing specific topologies. Details of algorithms addressing two pattern recognition problems in High Energy Physics are presented: reconstruction of events at a high-energy e{sup +}e{sup -} linear collider and reconstruction of cosmic ray or neutrino events in a liquid argon time projection chamber. (orig.)

  17. Application of Voice Recognition Input to Decision Support Systems

    Science.gov (United States)

    1988-12-01

    namely, a Bark-scale frequency warping and the incorporation of suprasegmental energy information. All distortion measures and their modifications were...lowest score; (2) Whereas the addition of suprasegmental energy information helped the recognition performance, the use of gain and absolute loudness

  18. Actuator prototype system by voice commands using free software

    Directory of Open Access Journals (Sweden)

    Jaime Andrango

    2016-06-01

    Full Text Available This prototype system is a software application that through the use of techniques of digital signal processing, extracts information from the user's speech, which is then used to manage the on/off actuator on a peripheral computer when vowels are pronounced. The method applies spectral differences. The application uses the parallel port as actuator, with the information recorded in the memory address 378H. This prototype was developed using free software tools for its versatility and dynamism, and to allow other researchers to base on it for further studies.

  19. Behavioral biometrics for verification and recognition of malicious software agents

    Science.gov (United States)

    Yampolskiy, Roman V.; Govindaraju, Venu

    2008-04-01

    Homeland security requires technologies capable of positive and reliable identification of humans for law enforcement, government, and commercial applications. As artificially intelligent agents improve in their abilities and become a part of our everyday life, the possibility of using such programs for undermining homeland security increases. Virtual assistants, shopping bots, and game playing programs are used daily by millions of people. We propose applying statistical behavior modeling techniques developed by us for recognition of humans to the identification and verification of intelligent and potentially malicious software agents. Our experimental results demonstrate feasibility of such methods for both artificial agent verification and even for recognition purposes.

  20. Proactiveness in entrepreneurial software firms: the executives' voice

    Directory of Open Access Journals (Sweden)

    Jean-Pierre Boissin

    2010-12-01

    Full Text Available This article approaches proactiveness in firms, considered to be one of the dimensions of the entrepreneurial orientation. Its goal is to introduce the results of an exploratory and qualitative study, which aimed to characterize the proactiveness in entrepreneurial software firms. The theory resumes the concepts of entrepreneurial firms, entrepreneurial orientation and proactiveness. The data gathering was accomplished through deeper interviews with executives from 13 software firms that stand out in terms of entrepreneurship in Rio Grande do Sul state. The results of the study demonstrate that firms are proactive and show a characterization regarding this behavior, starting from the conceptual base adopted in the present study. Among the proactiveness elements in the researched organizations, the onesrelated to environment monitoring and opportunities quest are highlighted. The study also consoliding a components’ set of proactiveness based on the theory and organizational practice reported by executives.

  1. Voice recognition through phonetic features with Punjabi utterances

    Science.gov (United States)

    Kaur, Jasdeep; Juglan, K. C.; Sharma, Vishal; Upadhyay, R. K.

    2017-07-01

    This paper deals with perception and disorders of speech in view of Punjabi language. Visualizing the importance of voice identification, various parameters of speaker identification has been studied. The speech material was recorded with a tape recorder in their normal and disguised mode of utterances. Out of the recorded speech materials, the utterances free from noise, etc were selected for their auditory and acoustic spectrographic analysis. The comparison of normal and disguised speech of seven subjects is reported. The fundamental frequency (F0) at similar places, Plosive duration at certain phoneme, Amplitude ratio (A1:A2) etc. were compared in normal and disguised speech. It was found that the formant frequency of normal and disguised speech remains almost similar only if it is compared at the position of same vowel quality and quantity. If the vowel is more closed or more open in the disguised utterance the formant frequency will be changed in comparison to normal utterance. The ratio of the amplitude (A1: A2) is found to be speaker dependent. It remains unchanged in the disguised utterance. However, this value may shift in disguised utterance if cross sectioning is not done at the same location.

  2. Combining speech recognition software with Digital Imaging and Communications in Medicine (DICOM) workstation software on a Microsoft Windows platform.

    Science.gov (United States)

    Ernst, R; Carpenter, W; Torres, W; Wheeler, S

    2001-06-01

    This presentation describes our experience in combining speech recognition software, clinical review software, and other software products on a single computer. Different processor speeds, random access memory (RAM), and computer costs were evaluated. We found that combining continuous speech recognition software with Digital Imaging and Communications in Medicine (DICOM) workstation software on the same platform is feasible and can lead to substantial savings of hardware cost. This combination optimizes use of limited workspace and can improve radiology workflow.

  3. Combining speech recognition software with digital imaging and communications in medicine (DICOM) workstation software on a microsoft windows platform

    OpenAIRE

    Ernst, Randy; Carpenter, Walter; Torres, William; Wheeler, Scott

    2001-01-01

    This presentation describes our experience in combining speech recognition software, clinical review software, and other software products on a single computer. Different processor speeds, random access memory (RAM), and computer costs were evaluated. We found that combining continuous speech recognition software with Digital Imaging and Communications in Medicine (DICOM) workstation software on the same platform is feasible and can lead to substantial savings of hardware cost. This combinati...

  4. VoiceForum, a software platform for spoken interaction: a model for the "Call Triangle"?

    OpenAIRE

    Fynn, John; Wigham, Ciara R.

    2011-01-01

    VoiceForum is a pedagogical project created as a response to learners' needs in the spoken language observed mainly at the Hypermedia Language Centre of Blaise Pascal University, France. It comprises a web-based forum approach for posting interactive audio and text with a dedicated unintrusive space for teacher feedback. The software platform (freely available via download), thus, offers a means of providing guidance through contextualised help to individual learners on their spoken discourse...

  5. Voice emotion recognition by cochlear-implanted children and their normally-hearing peers.

    Science.gov (United States)

    Chatterjee, Monita; Zion, Danielle J; Deroche, Mickael L; Burianek, Brooke A; Limb, Charles J; Goren, Alison P; Kulkarni, Aditya M; Christensen, Julie A

    2015-04-01

    Despite their remarkable success in bringing spoken language to hearing impaired listeners, the signal transmitted through cochlear implants (CIs) remains impoverished in spectro-temporal fine structure. As a consequence, pitch-dominant information such as voice emotion, is diminished. For young children, the ability to correctly identify the mood/intent of the speaker (which may not always be visible in their facial expression) is an important aspect of social and linguistic development. Previous work in the field has shown that children with cochlear implants (cCI) have significant deficits in voice emotion recognition relative to their normally hearing peers (cNH). Here, we report on voice emotion recognition by a cohort of 36 school-aged cCI. Additionally, we provide for the first time, a comparison of their performance to that of cNH and NH adults (aNH) listening to CI simulations of the same stimuli. We also provide comparisons to the performance of adult listeners with CIs (aCI), most of whom learned language primarily through normal acoustic hearing. Results indicate that, despite strong variability, on average, cCI perform similarly to their adult counterparts; that both groups' mean performance is similar to aNHs' performance with 8-channel noise-vocoded speech; that cNH achieve excellent scores in voice emotion recognition with full-spectrum speech, but on average, show significantly poorer scores than aNH with 8-channel noise-vocoded speech. A strong developmental effect was observed in the cNH with noise-vocoded speech in this task. These results point to the considerable benefit obtained by cochlear-implanted children from their devices, but also underscore the need for further research and development in this important and neglected area. This article is part of a Special Issue entitled .

  6. OBJECT ORIENTED SOFTWARE FOR MICRO WORK PIECE RECOGNITION IN MICROASSEMBLY

    Directory of Open Access Journals (Sweden)

    Toledo-Ramírez, G.K.

    2006-04-01

    Full Text Available The aim of this article is to describe object oriented software for the automatic micro work piece handling system.The general task of this system is the recognition of work pieces with neural classifier and detection of theirpositions. Other important functions of the system are work piece styles database administration, work piecedatabase administration for neural classifier training and testing, neural classifier interface between database, userand work piece finder. The software is object oriented and widely commented, that makes its modification,adaptation and improvement easier. Most of the software modules can be used in other research projects. Thesoftware was tested on image database. The results of experiments prove its effectiveness in chosen task.

  7. Recognition disorders for famous faces and voices: a review of the literature and normative data of a new test battery.

    Science.gov (United States)

    Quaranta, Davide; Piccininni, Chiara; Carlesimo, Giovanni Augusto; Luzzi, Simona; Marra, Camillo; Papagno, Costanza; Trojano, Luigi; Gainotti, Guido

    2016-03-01

    Several anatomo-clinical investigations have shown that familiar face recognition disorders not due to high level perceptual defects are often observed in patients with lesions of the right anterior temporal lobe (ATL). The meaning of these findings is, however, controversial, because some authors claim that these patients show pure instances of modality-specific 'associative prosopagnosia', whereas other authors maintain that in these patients voice recognition is also impaired and that these patients have a 'multimodal person recognition disorder'. To solve the problem of the nature of famous faces recognition disorders in patients affected by right ATL lesions, it is therefore very important to verify with formal tests if these patients are or are not able to recognize others by voice, but a direct comparison between the two modalities is hindered by the fact that voice recognition is more difficult than face recognition. To circumvent this difficulty, we constructed a test battery in which subjects were requested to recognize the same persons (well-known at the national level) through their faces and voices, evaluating familiarity and identification processes. The present paper describes the 'Famous People Recognition Battery' and reports the normative data necessary to clarify the nature of person recognition disorders observed in patients affected by right ATL lesions.

  8. Suggestions for Layout and Functional Behavior of Software-Based Voice Switch Keysets

    Science.gov (United States)

    Scott, David W.

    2010-01-01

    Marshall Space Flight Center (MSFC) provides communication services for a number of real time environments, including Space Shuttle Propulsion support and International Space Station (ISS) payload operations. In such settings, control team members speak with each other via multiple voice circuits or loops. Each loop has a particular purpose and constituency, and users are assigned listen and/or talk capabilities for a given loop based on their role in fulfilling the purpose. A voice switch is a given facility's hardware and software that supports such communication, and may be interconnected with other facilities switches to create a large network that, from an end user perspective, acts like a single system. Since users typically monitor and/or respond to several voice loops concurrently for hours on end and real time operations can be very dynamic and intense, it s vital that a control panel or keyset for interfacing with the voice switch be a servant that reduces stress, not a master that adds it. Implementing the visual interface on a computer screen provides tremendous flexibility and configurability, but there s a very real risk of overcomplication. (Remember how office automation made life easier, which led to a deluge of documents that made life harder?) This paper a) discusses some basic human factors considerations related to keysets implemented as application software windows, b) suggests what to standardize at the facility level and what to leave to the user's preference, and c) provides screen shot mockups for a robust but reasonably simple user experience. Concepts apply to keyset needs in almost any type of operations control or support center.

  9. Examining the effects of variation in emotional tone of voice on spoken word recognition.

    Science.gov (United States)

    Krestar, Maura L; McLennan, Conor T

    2013-09-01

    Emotional tone of voice (ETV) is essential for optimal verbal communication. Research has found that the impact of variation in nonlinguistic features of speech on spoken word recognition differs according to a time course. In the current study, we investigated whether intratalker variation in ETV follows the same time course in two long-term repetition priming experiments. We found that intratalker variability in ETVs affected reaction times to spoken words only when processing was relatively slow and difficult, not when processing was relatively fast and easy. These results provide evidence for the use of both abstract and episodic lexical representations for processing within-talker variability in ETV, depending on the time course of spoken word recognition.

  10. Pattern recognition software and techniques for biological image analysis.

    Directory of Open Access Journals (Sweden)

    Lior Shamir

    Full Text Available The increasing prevalence of automated image acquisition systems is enabling new types of microscopy experiments that generate large image datasets. However, there is a perceived lack of robust image analysis systems required to process these diverse datasets. Most automated image analysis systems are tailored for specific types of microscopy, contrast methods, probes, and even cell types. This imposes significant constraints on experimental design, limiting their application to the narrow set of imaging methods for which they were designed. One of the approaches to address these limitations is pattern recognition, which was originally developed for remote sensing, and is increasingly being applied to the biology domain. This approach relies on training a computer to recognize patterns in images rather than developing algorithms or tuning parameters for specific image processing tasks. The generality of this approach promises to enable data mining in extensive image repositories, and provide objective and quantitative imaging assays for routine use. Here, we provide a brief overview of the technologies behind pattern recognition and its use in computer vision for biological and biomedical imaging. We list available software tools that can be used by biologists and suggest practical experimental considerations to make the best use of pattern recognition techniques for imaging assays.

  11. Pattern recognition software and techniques for biological image analysis.

    Science.gov (United States)

    Shamir, Lior; Delaney, John D; Orlov, Nikita; Eckley, D Mark; Goldberg, Ilya G

    2010-11-24

    The increasing prevalence of automated image acquisition systems is enabling new types of microscopy experiments that generate large image datasets. However, there is a perceived lack of robust image analysis systems required to process these diverse datasets. Most automated image analysis systems are tailored for specific types of microscopy, contrast methods, probes, and even cell types. This imposes significant constraints on experimental design, limiting their application to the narrow set of imaging methods for which they were designed. One of the approaches to address these limitations is pattern recognition, which was originally developed for remote sensing, and is increasingly being applied to the biology domain. This approach relies on training a computer to recognize patterns in images rather than developing algorithms or tuning parameters for specific image processing tasks. The generality of this approach promises to enable data mining in extensive image repositories, and provide objective and quantitative imaging assays for routine use. Here, we provide a brief overview of the technologies behind pattern recognition and its use in computer vision for biological and biomedical imaging. We list available software tools that can be used by biologists and suggest practical experimental considerations to make the best use of pattern recognition techniques for imaging assays.

  12. It doesn't matter what you say: FMRI correlates of voice learning and recognition independent of speech content.

    Science.gov (United States)

    Zäske, Romi; Awwad Shiekh Hasan, Bashar; Belin, Pascal

    2017-09-01

    Listeners can recognize newly learned voices from previously unheard utterances, suggesting the acquisition of high-level speech-invariant voice representations during learning. Using functional magnetic resonance imaging (fMRI) we investigated the anatomical basis underlying the acquisition of voice representations for unfamiliar speakers independent of speech, and their subsequent recognition among novel voices. Specifically, listeners studied voices of unfamiliar speakers uttering short sentences and subsequently classified studied and novel voices as "old" or "new" in a recognition test. To investigate "pure" voice learning, i.e., independent of sentence meaning, we presented German sentence stimuli to non-German speaking listeners. To disentangle stimulus-invariant and stimulus-dependent learning, during the test phase we contrasted a "same sentence" condition in which listeners heard speakers repeating the sentences from the preceding study phase, with a "different sentence" condition. Voice recognition performance was above chance in both conditions although, as expected, performance was higher for same than for different sentences. During study phases activity in the left inferior frontal gyrus (IFG) was related to subsequent voice recognition performance and same versus different sentence condition, suggesting an involvement of the left IFG in the interactive processing of speaker and speech information during learning. Importantly, at test reduced activation for voices correctly classified as "old" compared to "new" emerged in a network of brain areas including temporal voice areas (TVAs) of the right posterior superior temporal gyrus (pSTG), as well as the right inferior/middle frontal gyrus (IFG/MFG), the right medial frontal gyrus, and the left caudate. This effect of voice novelty did not interact with sentence condition, suggesting a role of temporal voice-selective areas and extra-temporal areas in the explicit recognition of learned voice identity

  13. Educational Pedagogy Explored: Attachment, Voice, and Students’ Limited Recognition of the Purpose of Writing

    Directory of Open Access Journals (Sweden)

    Rebecca A. Fairchild

    2013-07-01

    Full Text Available The following teacher research case-study involved an exploration of educational pedagogy by working with a freshman composition student at a college university. All data collected for the study was gathered during the 2013 spring semester. The study was driven by an inquiry based approach where the researcher determined the center of focus that arose from an exploration of the student as a writer through a survey, a classroom observation, multiple one-on-one meetings, and email conversations. The focus area that arose was the student’s limited recognition that writing was done solely for school purposes. Related puzzlements stemming from this focus area included the student’s lack of attachment and lack of voice in her writing. The conclusive data provided insights for how to educate students in future classrooms regarding how vital it is for students to be able to attach themselves to their work.

  14. Voice Activity Detector of Wake-Up-Word Speech Recognition System Design on FPGA

    Directory of Open Access Journals (Sweden)

    Veton Z. Këpuska

    2014-12-01

    Full Text Available A typical speech recognition system is push-to-talk operated that requires activation. However for those who use hands-busy applications, movement may by restricted or impossible. One alternative is to use Speech-Only Interface. The proposed method that is called Wake-Up-Word Speech Recognition (WUW-SR that utilizes speech only interface. A WUW-SR system would allow the user to activate systems (Cell phone, Computer, etc. with only speech commands instead of manual activation. The trend in WUW-SR hardware design is towards implementing a complete system on a single chip intended for various applications. This paper presents an experimental FPGA design and implementation of a novel architecture of a real time feature extraction processor that includes: Voice Activity Detector (VAD, and features extraction, MFCC, LPC, and ENH_MFCC. In the WUW-SR system, the recognizer front-end with VAD is located at the terminal which is typically connected over a data network(e.g., serverfor remote back-end recognition. VAD is responsible for segmenting the signal into speech-like and non-speech-like segments. For any given frame VAD reports one of two possible states: VAD_ON or VAD_OFF. The back-end is then responsible to score the features that are being segmented during VAD_ON stage. The most important characteristic of the presented design is that it should guarantee virtually 100% correct rejection for non-WUW (out of vocabulary words - OOV while maintaining correct acceptance rate of 99.9% or higher (in vocabulary words - INV. This requirement sets apart WUW-SR from other speech recognition tasks because no existing system can guarantee 100% reliability by any measure.

  15. Scientific Bases of Human-Machine Communication by Voice

    Science.gov (United States)

    Schafer, Ronald W.

    1995-10-01

    The scientific bases for human-machine communication by voice are in the fields of psychology, linguistics, acoustics, signal processing, computer science, and integrated circuit technology. The purpose of this paper is to highlight the basic scientific and technological issues in human-machine communication by voice and to point out areas of future research opportunity. The discussion is organized around the following major issues in implementing human-machine voice communication systems: (i) hardware/software implementation of the system, (ii) speech synthesis for voice output, (iii) speech recognition and understanding for voice input, and (iv) usability factors related to how humans interact with machines.

  16. Automatic Speech Recognition Systems for the Evaluation of Voice and Speech Disorders in Head and Neck Cancer

    Directory of Open Access Journals (Sweden)

    Andreas Maier

    2010-01-01

    Full Text Available In patients suffering from head and neck cancer, speech intelligibility is often restricted. For assessment and outcome measurements, automatic speech recognition systems have previously been shown to be appropriate for objective and quick evaluation of intelligibility. In this study we investigate the applicability of the method to speech disorders caused by head and neck cancer. Intelligibility was quantified by speech recognition on recordings of a standard text read by 41 German laryngectomized patients with cancer of the larynx or hypopharynx and 49 German patients who had suffered from oral cancer. The speech recognition provides the percentage of correctly recognized words of a sequence, that is, the word recognition rate. Automatic evaluation was compared to perceptual ratings by a panel of experts and to an age-matched control group. Both patient groups showed significantly lower word recognition rates than the control group. Automatic speech recognition yielded word recognition rates which complied with experts' evaluation of intelligibility on a significant level. Automatic speech recognition serves as a good means with low effort to objectify and quantify the most important aspect of pathologic speech—the intelligibility. The system was successfully applied to voice and speech disorders.

  17. Advances in software engineering and their relations to pattern recognition and image processing

    Energy Technology Data Exchange (ETDEWEB)

    Tanimoto, S.L.

    1982-01-01

    In recent years software engineering has emerged as a discipline of programming. It includes the conceptualization, design, implementation, testing and modification of software systems. Related issues are languages, standards, distribution and parallel processing, and total programming environments. The fact that more than 80 percent of system development costs were in software rather than hardware helps one appreciate the importance of any effort to understand and enhance the software production process. Many pattern recognition projects involve fairly large software efforts. It makes sense not only for researchers to make use of the latest software tools and methodologies but also to anticipate future changes. 76 references.

  18. 病态嗓音的识别与研究%study and recognition of pathological voice

    Institute of Scientific and Technical Information of China (English)

    陈承义; 高俊芬

    2013-01-01

      通过分析嗓音的发音机理,提取正常与病态嗓音的传统声学参数:基频、共振峰、Mel 倒谱系数(MFCC),以及非线性特征参数:计盒维数与截距,作为病态嗓音识别的特征矢量集.应用高斯混合模型(GMM)对156例正常嗓音与146例病态嗓音进行建模与识别.结果表明:非线性特征参数计盒维数与截距能很好地区分正常与病态嗓音,它们与传统声学参数基频和共振峰的组合,能够取得92.60%的识别率.%By analyzing the mechanism of pronunciation, normal and pathological voice of traditional acoustic parameters:fun-damental frequency, formant, Mel Frequency Cepstrum Coefficient(MFCC), and non-linear feature parameters:box-counting dimension and intercept, are extracted as feature vectors of recognition of pathological voice. 156 normal voice samples and 146 pathological voice samples are recognized based on Gaussian Mixture Model(GMM). The results show that the nonlinear fea-ture parameters of box-counting dimension and intercept can well distinguish between normal and pathological voice. The com-bination of box-counting dimension, intercept and the traditional acoustic parameters-fundamental frequency and formant can achieve a better recognition rate of 92.60%.

  19. The Pandora Software Development Kit for Pattern Recognition

    CERN Document Server

    Marshall, J S

    2015-01-01

    running pattern recognition algorithms. The Pandora Application Programming Interfaces ensure simple specification of the building-blocks defining a pattern recognition problem. The logic required to solve the problem is implemented in algorithms. The algorithms request operations to create or modify data structures and the operations are performed by the Pandora framework. This design promotes an approach using many decoupled algorithms, each addressing specific topologies. Details of algorithms addressing two pattern recognition problems in High Energy Physics are presented: reconstruction of events at a high-energy e+e- linear collider and reconstruction of cosmic ray or neutrino events in a liquid argon time projection chamber.

  20. The software peculiarities of pattern recognition in track detectors

    Energy Technology Data Exchange (ETDEWEB)

    Starkov, N. [P. N. Lebedev Physical Institute of the Russian Academy of Sciences, Leninskii prosp. 53, 119991 Moscow (Russian Federation)

    2015-12-31

    The different kinds of nuclear track recognition algorithms are represented. Several complicated samples of use them in physical experiments are considered. The some processing methods of complicated images are described.

  1. A Unique Wavelet Steganography Based Voice Biometric Protection Scheme

    Directory of Open Access Journals (Sweden)

    Sanjaypande M. B

    2013-03-01

    Full Text Available Voice biometric is an easy and cost effective biometric technique which requires minimalistic hardware and software complexity. General voice biometric needs a voice phrase by user which is processed with Mel Filter and Vector Quantized features are extracted. Vector quantization reduces the codebook size but decreases the accuracy of recognition. Therefore we propose a voice biometric system where voice file's non quantized code books are matched with spoken phrase. In order to ensure security to such direct voice sample we embed the voice file in a randomly selected image using DWT technique. Imposters are exposed to only images and are unaware of the voice files. We show that the technique produces better efficiency in comparison to VQ based technique.

  2. Software for pattern recognition of the larvae of Aedes aegypti and Aedes albopictus

    Directory of Open Access Journals (Sweden)

    São Thiago André Iwersen de

    2002-01-01

    Full Text Available Software for pattern recognition of the larvae of mosquitoes Aedes aegypti and Aedes albopictus, biological vectors of dengue and yellow fever, has been developed. Rapid field identification of larva using a digital camera linked to a laptop computer equipped with this software may greatly help prevention campaigns.

  3. Software for pattern recognition of the larvae of Aedes aegypti and Aedes albopictus

    OpenAIRE

    São Thiago André Iwersen de; Kupek Emil; Ferreira Neto Joaquim Alves; São Thiago Paulo de Tarso

    2002-01-01

    Software for pattern recognition of the larvae of mosquitoes Aedes aegypti and Aedes albopictus, biological vectors of dengue and yellow fever, has been developed. Rapid field identification of larva using a digital camera linked to a laptop computer equipped with this software may greatly help prevention campaigns.

  4. A self-teaching image processing and voice-recognition-based, intelligent and interactive system to educate visually impaired children

    Science.gov (United States)

    Iqbal, Asim; Farooq, Umar; Mahmood, Hassan; Asad, Muhammad Usman; Khan, Akrama; Atiq, Hafiz Muhammad

    2010-02-01

    A self teaching image processing and voice recognition based system is developed to educate visually impaired children, chiefly in their primary education. System comprises of a computer, a vision camera, an ear speaker and a microphone. Camera, attached with the computer system is mounted on the ceiling opposite (on the required angle) to the desk on which the book is placed. Sample images and voices in the form of instructions and commands of English, Urdu alphabets, Numeric Digits, Operators and Shapes are already stored in the database. A blind child first reads the embossed character (object) with the help of fingers than he speaks the answer, name of the character, shape etc into the microphone. With the voice command of a blind child received by the microphone, image is taken by the camera which is processed by MATLAB® program developed with the help of Image Acquisition and Image processing toolbox and generates a response or required set of instructions to child via ear speaker, resulting in self education of a visually impaired child. Speech recognition program is also developed in MATLAB® with the help of Data Acquisition and Signal Processing toolbox which records and process the command of the blind child.

  5. Error Rates in Users of Automatic Face Recognition Software.

    Science.gov (United States)

    White, David; Dunn, James D; Schmid, Alexandra C; Kemp, Richard I

    2015-01-01

    In recent years, wide deployment of automatic face recognition systems has been accompanied by substantial gains in algorithm performance. However, benchmarking tests designed to evaluate these systems do not account for the errors of human operators, who are often an integral part of face recognition solutions in forensic and security settings. This causes a mismatch between evaluation tests and operational accuracy. We address this by measuring user performance in a face recognition system used to screen passport applications for identity fraud. Experiment 1 measured target detection accuracy in algorithm-generated 'candidate lists' selected from a large database of passport images. Accuracy was notably poorer than in previous studies of unfamiliar face matching: participants made over 50% errors for adult target faces, and over 60% when matching images of children. Experiment 2 then compared performance of student participants to trained passport officers-who use the system in their daily work-and found equivalent performance in these groups. Encouragingly, a group of highly trained and experienced "facial examiners" outperformed these groups by 20 percentage points. We conclude that human performance curtails accuracy of face recognition systems-potentially reducing benchmark estimates by 50% in operational settings. Mere practise does not attenuate these limits, but superior performance of trained examiners suggests that recruitment and selection of human operators, in combination with effective training and mentorship, can improve the operational accuracy of face recognition systems.

  6. Methods and Software Architecture for Activity Recognition from Position Data

    DEFF Research Database (Denmark)

    Godsk, Torben

    rates -- both in an offline setup as well as in real-time with continuously streamed data. In real-time, I am able to classify the five activities with success rates of: 97.8%, 85.2%, 84%, 93.7% and 72.2%, respectively, with a weighted average of 90.6%. Moreover, when doing the classification offline, I...... am able to classify the five activities with success rates of: 98.6%, 90.4%, 95.7%, 91% and 85.2%, respectively, with a weighted average of 94.8%. Furthermore, the software architecture integrates with "PerPos: a platform for pervasive positioning". PerPos is a software platform. Through cloud...

  7. Hardware/Software Co-Design for Spike Based Recognition

    CERN Document Server

    Ghani, Arfan; Maguire, Liam; Harkin, Jim

    2008-01-01

    The practical applications based on recurrent spiking neurons are limited due to their non-trivial learning algorithms. The temporal nature of spiking neurons is more favorable for hardware implementation where signals can be represented in binary form and communication can be done through the use of spikes. This work investigates the potential of recurrent spiking neurons implementations on reconfigurable platforms and their applicability in temporal based applications. A theoretical framework of reservoir computing is investigated for hardware/software implementation. In this framework, only readout neurons are trained which overcomes the burden of training at the network level. These recurrent neural networks are termed as microcircuits which are viewed as basic computational units in cortical computation. This paper investigates the potential of recurrent neural reservoirs and presents a novel hardware/software strategy for their implementation on FPGAs. The design is implemented and the functionality is ...

  8. The Cambridge Mindreading Face-Voice Battery for Children (CAM-C): complex emotion recognition in children with and without autism spectrum conditions.

    Science.gov (United States)

    Golan, Ofer; Sinai-Gavrilov, Yana; Baron-Cohen, Simon

    2015-01-01

    Difficulties in recognizing emotions and mental states are central characteristics of autism spectrum conditions (ASC). However, emotion recognition (ER) studies have focused mostly on recognition of the six 'basic' emotions, usually using still pictures of faces. This study describes a new battery of tasks for testing recognition of nine complex emotions and mental states from video clips of faces and from voice recordings taken from the Mindreading DVD. This battery (the Cambridge Mindreading Face-Voice Battery for Children or CAM-C) was given to 30 high-functioning children with ASC, aged 8 to 11, and to 25 matched controls. The ASC group scored significantly lower than controls on complex ER from faces and voices. In particular, participants with ASC had difficulty with six out of nine complex emotions. Age was positively correlated with all task scores, and verbal IQ was correlated with scores in the voice task. CAM-C scores were negatively correlated with parent-reported level of autism spectrum symptoms. Children with ASC show deficits in recognition of complex emotions and mental states from both facial and vocal expressions. The CAM-C may be a useful test for endophenotypic studies of ASC and is one of the first to use dynamic stimuli as an assay to reveal the ER profile in ASC. It complements the adult version of the CAM Face-Voice Battery, thus providing opportunities for developmental assessment of social cognition in autism.

  9. 语音情感识别研究现状综述%A General Summary of the Research Status Que about the Voice Emotion Recognition

    Institute of Scientific and Technical Information of China (English)

    何秉羲

    2015-01-01

    This article starts from the concept and process of voice emotion recognition, the phased research situation about the process of voice emotion recognition has carried on the comprehensive elaboration in recent years, and the fu-ture research and its development are prospected.%本文从语音情感识别的概念以及流程入手,对近些年来关于语音情感识别过程情况的阶段性研究成果进行了综合阐述,并对其未来研究及其发展进行了展望。

  10. SOFTWARE EFFORT ESTIMATION FRAMEWORK TO IMPROVE ORGANIZATION PRODUCTIVITY USING EMOTION RECOGNITION OF SOFTWARE ENGINEERS IN SPONTANEOUS SPEECH

    Directory of Open Access Journals (Sweden)

    B.V.A.N.S.S. Prabhakar Rao

    2015-10-01

    Full Text Available Productivity is a very important part of any organisation in general and software industry in particular. Now a day’s Software Effort estimation is a challenging task. Both Effort and Productivity are inter-related to each other. This can be achieved from the employee’s of the organization. Every organisation requires emotionally stable employees in their firm for seamless and progressive working. Of course, in other industries this may be achieved without man power. But, software project development is labour intensive activity. Each line of code should be delivered from software engineer. Tools and techniques may helpful and act as aid or supplementary. Whatever be the reason software industry has been suffering with success rate. Software industry is facing lot of problems in delivering the project on time and within the estimated budget limit. If we want to estimate the required effort of the project it is significant to know the emotional state of the team member. The responsibility of ensuring emotional contentment falls on the human resource department and the department can deploy a series of systems to carry out its survey. This analysis can be done using a variety of tools, one such, is through study of emotion recognition. The data needed for this is readily available and collectable and can be an excellent source for the feedback systems. The challenge of recognition of emotion in speech is convoluted primarily due to the noisy recording condition, the variations in sentiment in sample space and exhibition of multiple emotions in a single sentence. The ambiguity in the labels of training set also increases the complexity of problem addressed. The existing models using probabilistic models have dominated the study but present a flaw in scalability due to statistical inefficiency. The problem of sentiment prediction in spontaneous speech can thus be addressed using a hybrid system comprising of a Convolution Neural Network and

  11. Cleaver: software for identifying taxon specific restriction endonuclease recognition sites.

    Science.gov (United States)

    Jarman, Simon N

    2006-09-01

    Cleaver is an application for identifying restriction endonuclease recognition sites that occur in some taxa but not in others. Differences in DNA fragment restriction patterns among taxa are the basis for many diagnostic assays for taxonomic identification and are used in procedures for removing the DNA of some taxa from pools of DNA from mixed sources. Cleaver analyses restriction digestion of groups of orthologous DNA sequences simultaneously to allow identification of differences in restriction pattern among the fragments derived from different taxa. Cleaver is freely available without registration from its website (http://cleaver.sourceforge.net/) and can be copied, modified and re-distributed under the terms of the GNU general public licence version2 (http://www.gnu.org/licences/gpl). The program can be run as a script for computers that have Python 2.3 and necessary extra modules installed. This allows it to run on Gnu/Linux, Unix, MacOSX and Windows platforms. Stand-alone executable versions for Windows and MacOSX operating systems are available.

  12. Speech Recognition Software for Language Learning: Toward an Evaluation of Validity and Student Perceptions

    Science.gov (United States)

    Cordier, Deborah

    2009-01-01

    A renewed focus on foreign language (FL) learning and speech for communication has resulted in computer-assisted language learning (CALL) software developed with Automatic Speech Recognition (ASR). ASR features for FL pronunciation (Lafford, 2004) are functional components of CALL designs used for FL teaching and learning. The ASR features…

  13. Using Speech Recognition Software to Increase Writing Fluency for Individuals with Physical Disabilities

    Science.gov (United States)

    Garrett, Jennifer Tumlin; Heller, Kathryn Wolff; Fowler, Linda P.; Alberto, Paul A.; Fredrick, Laura D.; O'Rourke, Colleen M.

    2011-01-01

    Students with physical disabilities often have difficulty with writing fluency, despite the use of various strategies, adaptations, and assistive technology (AT). One possible intervention is the use of speech recognition software, although there is little research on its impact on students with physical disabilities. This study used an…

  14. The Voice Transcription Technique: Use of Voice Recognition Software to Transcribe Digital Interview Data in Qualitative Research

    Science.gov (United States)

    Matheson, Jennifer L.

    2007-01-01

    Transcribing interview data is a time-consuming task that most qualitative researchers dislike. Transcribing is even more difficult for people with physical limitations because traditional transcribing requires manual dexterity and the ability to sit at a computer for long stretches of time. Researchers have begun to explore using an automated…

  15. Revisiting vocal perception in non-human animals: a review of vowel discrimination, speaker voice recognition, and speaker normalization

    Directory of Open Access Journals (Sweden)

    Buddhamas eKriengwatana

    2015-01-01

    Full Text Available The extent to which human speech perception evolved by taking advantage of predispositions and pre-existing features of vertebrate auditory and cognitive systems remains a central question in the evolution of speech. This paper reviews asymmetries in vowel perception, speaker voice recognition, and speaker normalization in non-human animals – topics that have not been thoroughly discussed in relation to the abilities of non-human animals, but are nonetheless important aspects of vocal perception. Throughout this paper we demonstrate that addressing these issues in non-human animals is relevant and worthwhile because many non-human animals must deal with similar issues in their natural environment. That is, they must also discriminate between similar-sounding vocalizations, determine signaler identity from vocalizations, and resolve signaler-dependent variation in vocalizations from conspecifics. Overall, we find that, although plausible, the current evidence is insufficiently strong to conclude that directional asymmetries in vowel perception are specific to humans, or that non-human animals can use voice characteristics to recognize human individuals. However, we do find some indication that non-human animals can normalize speaker differences. Accordingly, we identify avenues for future research that would greatly improve and advance our understanding of these topics.

  16. Automatic speech recognition (ASR) and its use as a tool for assessment or therapy of voice, speech, and language disorders.

    Science.gov (United States)

    Kitzing, Peter; Maier, Andreas; Ahlander, Viveka Lyberg

    2009-01-01

    In general opinion computerized automatic speech recognition (ASR) seems to be regarded as a method only to accomplish transcriptions from spoken language to written text and as such quite insecure and rather cumbersome. However, due to great advances in computer technology and informatics methodology ASR has nowadays become quite dependable and easier to handle, and the number of applications has increased considerably. After some introductory background information on ASR a number of applications of great interest for professionals in voice, speech, and language therapy are pointed out. In the foreseeable future, the keyboard and mouse will by means of ASR technology be replaced in many functions by a microphone as the human-computer interface, and the computer will talk back via its loud-speaker. It seems important that professionals engaged in the care of oral communication disorders take part in this development so their clients may get the optimal benefit from this new technology.

  17. Higher-order neural network software for distortion invariant object recognition

    Science.gov (United States)

    Reid, Max B.; Spirkovska, Lilly

    1991-01-01

    The state-of-the-art in pattern recognition for such applications as automatic target recognition and industrial robotic vision relies on digital image processing. We present a higher-order neural network model and software which performs the complete feature extraction-pattern classification paradigm required for automatic pattern recognition. Using a third-order neural network, we demonstrate complete, 100 percent accurate invariance to distortions of scale, position, and in-plate rotation. In a higher-order neural network, feature extraction is built into the network, and does not have to be learned. Only the relatively simple classification step must be learned. This is key to achieving very rapid training. The training set is much smaller than with standard neural network software because the higher-order network only has to be shown one view of each object to be learned, not every possible view. The software and graphical user interface run on any Sun workstation. Results of the use of the neural software in autonomous robotic vision systems are presented. Such a system could have extensive application in robotic manufacturing.

  18. Batch metadata assignment to archival photograph collections using facial recognition software

    Directory of Open Access Journals (Sweden)

    Kyle Banerjee

    2013-07-01

    Full Text Available Useful metadata is essential to giving individual meaning and value within the context of a greater image collection as well as making them more discoverable. However, often little information is available about the photos themselves, so adding consistent metadata to large collections of digital and digitized photographs is a time consuming process requiring highly experienced staff. By using facial recognition software, staff can identify individuals more quickly and reliably. Knowledge of individuals in photos helps staff determine when and where photos are taken and also improves understanding of the subject matter. This article demonstrates simple techniques for using facial recognition software and command line tools to assign, modify, and read metadata for large archival photograph collections.

  19. DigitalVHI--a freeware open-source software application to capture the Voice Handicap Index and other questionnaire data in various languages.

    Science.gov (United States)

    Herbst, Christian T; Oh, Jinook; Vydrová, Jitka; Švec, Jan G

    2015-07-01

    In this short report we introduce DigitalVHI, a free open-source software application for obtaining Voice Handicap Index (VHI) and other questionnaire data, which can be put on a computer in clinics and used in clinical practice. The software can simplify performing clinical studies since it makes the VHI scores directly available for analysis in a digital form. It can be downloaded from http://www.christian-herbst.org/DigitalVHI/.

  20. Using commercial-off-the-shelf speech recognition software for conning U.S. warships

    OpenAIRE

    Tamez, Dorothy J.

    2003-01-01

    Approved for public release; distribution is unlimited The U.S. Navy's Transformation Roadmap is leading the fleet in a smaller, faster, and more technologically advanced direction. Smaller platforms and reduced manpower resources create opportunities to fill important positions, including ship-handling control, with technology. This thesis investigates the feasibility of using commercial-off-the-shelf (COTS) speech recognition software (SRS) for conning a Navy ship. Dragon NaturallySpeaki...

  1. Design and Implementation of Monophones and Triphones-Based Speech Recognition Systems for Voice Activated Telephony

    Directory of Open Access Journals (Sweden)

    Rupayan Das

    2013-07-01

    Full Text Available Speech recognition is the ability of a machine or program to convert spoken words into its equivalent text form. Nowadays, most recognition systems use Hidden Markov Models for modeling the spoken utterances. In this paper we have implemented two speaker independent speech recognition systems which include all the words required for dialing a phone. The systems contain 42 words including digits from zero to nine and also include names of 20 persons. A total of 16,800 utterances have been used for training each system. The two systems are able to recognize continuous speech and it is implemented with the help of monophones and triphones using HTK. Experimental results show an accuracy of 74.11% for monophones based models and 93.77% for triphones based models.

  2. A Real-Time Face Motion Based Approach towards Modeling Socially Assistive Wireless Robot Control with Voice Recognition

    Directory of Open Access Journals (Sweden)

    Abhinaba Bhattacharjee

    2015-10-01

    Full Text Available The robotics domain has a couple of specific general design requirements which requires the close integration of planning, sensing, control and modeling and for sure the robot must take into account the interactions between itself, its task and its environment surrounding it. Thus considering the fundamental configurations, the main motive is to design a system with user-friendly interfaces that possess the ability to control embedded robotic systems by natural means. While earlier works have focused primarily on issues such as manipulation and navigation only, this proposal presents a conceptual and intuitive approach towards man-machine interaction in order to provide a secured live biometric logical authorization to the user access, while making an intelligent interaction with the control station to navigate advanced gesture controlled wireless Robotic prototypes or mobile surveillance systems along desired directions through required displacements. The intuitions are based on tracking real-time 3-Dimensional Face Motions using skin tone segmentation and maximum area considerations of segmented face-like blobs, Or directing the system with voice commands using real-time speech recognition. The system implementation requires designing a user interface to communicate between the Control station and prototypes wirelessly, either by accessing the internet over an encrypted Wi-Fi Protected Access (WPA via a HTML web page for communicating with face motions or with the help of natural voice commands like “Trace 5 squares”, “Trace 10 triangles”, “Move 10 meters”, etc. evaluated on an iRobot Create over Bluetooth connectivity using a Bluetooth Access Module (BAM. Such an implementation can prove to be highly effective for designing systems of elderly aid and maneuvering the physically challenged.

  3. DLMS Voice Data Entry.

    Science.gov (United States)

    1980-06-01

    between operator and computer displayed on ADM-3A 20c A-I Possible Hardware Configuration for a Multistation Cartographic VDES ...this program a Voice Recognition System (VRS) which can be used to explore the use of voice data entry ( VDE ) in the DIMS or other cartographic data...Multi-Station Cartographic Voice Data Entry System An engineering development model voice data entry system ( VDES ) could be most efficiently

  4. TreeRipper web application: towards a fully automated optical tree recognition software

    Directory of Open Access Journals (Sweden)

    Hughes Joseph

    2011-05-01

    Full Text Available Abstract Background Relationships between species, genes and genomes have been printed as trees for over a century. Whilst this may have been the best format for exchanging and sharing phylogenetic hypotheses during the 20th century, the worldwide web now provides faster and automated ways of transferring and sharing phylogenetic knowledge. However, novel software is needed to defrost these published phylogenies for the 21st century. Results TreeRipper is a simple website for the fully-automated recognition of multifurcating phylogenetic trees (http://linnaeus.zoology.gla.ac.uk/~jhughes/treeripper/. The program accepts a range of input image formats (PNG, JPG/JPEG or GIF. The underlying command line c++ program follows a number of cleaning steps to detect lines, remove node labels, patch-up broken lines and corners and detect line edges. The edge contour is then determined to detect the branch length, tip label positions and the topology of the tree. Optical Character Recognition (OCR is used to convert the tip labels into text with the freely available tesseract-ocr software. 32% of images meeting the prerequisites for TreeRipper were successfully recognised, the largest tree had 115 leaves. Conclusions Despite the diversity of ways phylogenies have been illustrated making the design of a fully automated tree recognition software difficult, TreeRipper is a step towards automating the digitization of past phylogenies. We also provide a dataset of 100 tree images and associated tree files for training and/or benchmarking future software. TreeRipper is an open source project licensed under the GNU General Public Licence v3.

  5. A basic study on application of voice recognition input to an electronic nursing record system -evaluation of the function as an input interface-.

    Science.gov (United States)

    Marukami, Terutaka; Tani, Shoko; Matsuda, Atsuko; Takemoto, Keiko; Shindo, Akiko; Inada, Hiroshi

    2012-06-01

    As computerization in the nursing field has been recently progressing, an electronic nursing record system is gradually introduced in the medical institution in Japan. Although it is expected for the electronic nursing record system to reduce the load of nursing work, the conventional keyboard operation is used for information input of the present electronic nursing record system and it has some problems concerning the input time and the operationability for common nurses who are unfamiliar with the computer operation. In the present study, we conducted a basic study on application of voice recognition input to an electronic nursing record system. The voice input is recently introduced to an electronic medical record system in a few clinics. However, so far the entered information cannot be processed because the information of the medical record must be entered as a free sentence. Therefore, we contrived a template for an electronic nursing record system and introduced it to the system for simple information entry and easy processing of the entered information in this study. Furthermore, an input experiment for evaluation of the voice input with the template was carried out by voluntary subjects for evaluation of the function as an input interface of an electronic nursing record system. The results of the experiment revealed that the input time by the voice input is obviously fast compared with that by the keyboard input and operationability of the voice input was superior to the keyboard input although all subjects had inexperience of the voice input. As a result, it was suggested our method, the voice input using the template made by us, might be useful for an input interface of an electronic nursing record system.

  6. A Voice Operated Tour Planning System for Autonomous Mobile Robots

    Directory of Open Access Journals (Sweden)

    Charles V. Smith Iii

    2010-06-01

    Full Text Available Control systems driven by voice recognition software have been implemented before but lacked the context driven approach to generate relevant responses and actions. A partially voice activated control system for mobile robotics is presented that allows an autonomous robot to interact with people and the environment in a meaningful way, while dynamically creating customized tours. Many existing control systems also require substantial training for voice application. The system proposed requires little to no training and is adaptable to chaotic environments. The traversable area is mapped once and from that map a fully customized route is generated to the user

  7. Visual Recognition Software for Binary Classification and Its Application to Spruce Pollen Identification.

    Directory of Open Access Journals (Sweden)

    David K Tcheng

    Full Text Available Discriminating between black and white spruce (Picea mariana and Picea glauca is a difficult palynological classification problem that, if solved, would provide valuable data for paleoclimate reconstructions. We developed an open-source visual recognition software (ARLO, Automated Recognition with Layered Optimization capable of differentiating between these two species at an accuracy on par with human experts. The system applies pattern recognition and machine learning to the analysis of pollen images and discovers general-purpose image features, defined by simple features of lines and grids of pixels taken at different dimensions, size, spacing, and resolution. It adapts to a given problem by searching for the most effective combination of both feature representation and learning strategy. This results in a powerful and flexible framework for image classification. We worked with images acquired using an automated slide scanner. We first applied a hash-based "pollen spotting" model to segment pollen grains from the slide background. We next tested ARLO's ability to reconstruct black to white spruce pollen ratios using artificially constructed slides of known ratios. We then developed a more scalable hash-based method of image analysis that was able to distinguish between the pollen of black and white spruce with an estimated accuracy of 83.61%, comparable to human expert performance. Our results demonstrate the capability of machine learning systems to automate challenging taxonomic classifications in pollen analysis, and our success with simple image representations suggests that our approach is generalizable to many other object recognition problems.

  8. Comparison of post menopausal voice changes across professional and non-professional users of the voice

    Directory of Open Access Journals (Sweden)

    Pallavi Vishwas Sovani

    2010-12-01

    Full Text Available Menopause effects a permanent change in certain body functions, one of them being voice. Moreover, if the voice is used continuously as a part of one’s occupation, this may further impact postmenopausal voice changes. The present study investigated the impact of menopause and professional voice use, and their interaction effect, on the voice. 92 women were classified into reproductive (52 and postmenopausal (40. Each group was divided into Level II (teachers and Level IV (clerks of Koufman and Isaacson’s (1991 classification. Acoustic parameters were analyzed using the VisiPitch III software. Aerodynamic parameters were manually calculated. The VHI (Voice Handicap Index was also included to improve the face validity of the study. Results suggest that Fo, SFo and MPT reduce post menopause while NHR and VTI increase. Some changes are accelerated in teachers as compared to clerks while some are decelerated. VHI scores of teachers are significantly greater than clerks, though not significantly different across menopause. Thus the presence or absence of voice use in one’s profession differentially affects postmenopausal changes. The study has implications in improving the condition of teachers in India, developing norms for menopausal changes and modifying allowable limits for voice recognition systems in future.

  9. Evaluation of MPEG-7-Based Audio Descriptors for Animal Voice Recognition over Wireless Acoustic Sensor Networks.

    Science.gov (United States)

    Luque, Joaquín; Larios, Diego F; Personal, Enrique; Barbancho, Julio; León, Carlos

    2016-05-18

    Environmental audio monitoring is a huge area of interest for biologists all over the world. This is why some audio monitoring system have been proposed in the literature, which can be classified into two different approaches: acquirement and compression of all audio patterns in order to send them as raw data to a main server; or specific recognition systems based on audio patterns. The first approach presents the drawback of a high amount of information to be stored in a main server. Moreover, this information requires a considerable amount of effort to be analyzed. The second approach has the drawback of its lack of scalability when new patterns need to be detected. To overcome these limitations, this paper proposes an environmental Wireless Acoustic Sensor Network architecture focused on use of generic descriptors based on an MPEG-7 standard. These descriptors demonstrate it to be suitable to be used in the recognition of different patterns, allowing a high scalability. The proposed parameters have been tested to recognize different behaviors of two anuran species that live in Spanish natural parks; the Epidalea calamita and the Alytes obstetricans toads, demonstrating to have a high classification performance.

  10. Software requirements and support for image-algebraic analysis, detection, and recognition of small targets

    Science.gov (United States)

    Schmalz, Mark S.; Ritter, Gerhard X.; Forsman, Robert H.; Yang, Chyuan-Huei T.; Hu, Wen-Chen; Porter, Ryan A.; McTaggart, Gary; Hranicky, James F.; Davis, James F.

    1995-06-01

    The detection of hazardous targets frequently requires a multispectral approach to image acquisition and analysis, which we have implemented in a software system called MATRE (multispectral automated target recognition and enhancement). MATRE provides capabilities of image enhancement, image database management, spectral signature extraction and visualization, statistical analysis of greyscale imagery, as well as 2D and 3D image processing operations. Our system is based upon a client-server architecture that is amenable to distributed implementation. In this paper, we discuss salient issues and requirements for multispectral recognition of hazardous targets, and show that our software fulfills or exceeds such requirements. MATRE's capabilities, as well as statistical and morphological analysis results, are exemplified with emphasis upon computational cost, ease of installation, and maintenance on various Unix platforms. Additionally, MATRE's image processing functions can be coded in vector-parallel form, for ease of implementation of SIMD-parallel processors. Our algorithms are expressed in terms of image algebra, a concise, rigorous notation that unifies linear and nonlinear mathematics in the image domain. An image algebra class library for the C + + language has been incorporated into the our system, which facilitates fast algorithm prototyping without the numerous drawbacks of descrete coding.

  11. The Usefulness of Automatic Speech Recognition (ASR Eyespeak Software in Improving Iraqi EFL Students’ Pronunciation

    Directory of Open Access Journals (Sweden)

    Lina Fathi Sidig Sidgi

    2017-02-01

    Full Text Available The present study focuses on determining whether automatic speech recognition (ASR technology is reliable for improving English pronunciation to Iraqi EFL students. Non-native learners of English are generally concerned about improving their pronunciation skills, and Iraqi students face difficulties in pronouncing English sounds that are not found in their native language (Arabic. This study is concerned with ASR and its effectiveness in overcoming this difficulty. The data were obtained from twenty participants randomly selected from first-year college students at Al-Turath University College from the Department of English in Baghdad-Iraq. The students had participated in a two month pronunciation instruction course using ASR Eyespeak software. At the end of the pronunciation instruction course using ASR Eyespeak software, the students completed a questionnaire to get their opinions about the usefulness of the ASR Eyespeak in improving their pronunciation. The findings of the study revealed that the students found ASR Eyespeak software very useful in improving their pronunciation and helping them realise their pronunciation mistakes. They also reported that learning pronunciation with ASR Eyespeak enjoyable.

  12. A meta-analysis of in-vehicle and nomadic voice-recognition system interaction and driving performance.

    Science.gov (United States)

    Simmons, Sarah M; Caird, Jeff K; Steel, Piers

    2017-09-01

    Driver distraction is a growing and pervasive issue that requires multiple solutions. Voice-recognition (V-R) systems may decrease the visual-manual (V-M) demands of a wide range of in-vehicle system and smartphone interactions. However, the degree that V-R systems integrated into vehicles or available in mobile phone applications affect driver distraction is incompletely understood. A comprehensive meta-analysis of experimental studies was conducted to address this knowledge gap. To meet study inclusion criteria, drivers had to interact with a V-R system while driving and doing everyday V-R tasks such as dialing, initiating a call, texting, emailing, destination entry or music selection. Coded dependent variables included detection, reaction time, lateral position, speed and headway. Comparisons of V-R systems with baseline driving and/or a V-M condition were also coded. Of 817 identified citations, 43 studies involving 2000 drivers and 183 effect sizes (r) were analyzed in the meta-analysis. Compared to baseline, driving while interacting with a V-R system is associated with increases in reaction time and lane positioning, and decreases in detection. When V-M systems were compared to V-R systems, drivers had slightly better performance with the latter system on reaction time, lane positioning and headway. Although V-R systems have some driving performance advantages over V-M systems, they have a distraction cost relative to driving without any system at all. The pattern of results indicates that V-R systems impose moderate distraction costs on driving. In addition, drivers minimally engage in compensatory performance adjustments such as reducing speed and increasing headway while using V-R systems. Implications of the results for theory, design guidelines and future research are discussed. Copyright © 2017 Elsevier Ltd. All rights reserved.

  13. Voice integrated systems

    Science.gov (United States)

    Curran, P. Mike

    1977-01-01

    The program at Naval Air Development Center was initiated to determine the desirability of interactive voice systems for use in airborne weapon systems crew stations. A voice recognition and synthesis system (VRAS) was developed and incorporated into a human centrifuge. The speech recognition aspect of VRAS was developed using a voice command system (VCS) developed by Scope Electronics. The speech synthesis capability was supplied by a Votrax, VS-5, speech synthesis unit built by Vocal Interface. The effects of simulated flight on automatic speech recognition were determined by repeated trials in the VRAS-equipped centrifuge. The relationship of vibration, G, O2 mask, mission duration, and cockpit temperature and voice quality was determined. The results showed that: (1) voice quality degrades after 0.5 hours with an O2 mask; (2) voice quality degrades under high vibration; and (3) voice quality degrades under high levels of G. The voice quality studies are summarized. These results were obtained with a baseline of 80 percent recognition accuracy with VCS.

  14. A Transliteration Algorithm for Adapting a Japanese Voice Controlled Browser to English

    Science.gov (United States)

    Saito, Kuniko; Shinohara, Akio; Nagata, Masaaki; Ohara, Hisashi

    We propose a novel algorithm to transliterate English to Japanese and its application to a voice controlled browser, which enable ordinary Japanese people to browse English Web site by voice. Speech recognition software designed for native English speakers do not work for most Japanese because Japanese can't pronounce English as native English speakers do. Therefore, we combined Japanese speech recognition software with English-to-Japanese transliteration software. The accuracy of our transliteration algorithm is 80% recall for the top candidate, and 92% recall for the top three candidates. The browser using this transliteration algorithm makes it possible for Japanese to navigate English Web pages almost as accurate as Japanese pages by voice commands.

  15. Voice, Schooling, Inequality, and Scale

    Science.gov (United States)

    Collins, James

    2013-01-01

    The rich studies in this collection show that the investigation of voice requires analysis of "recognition" across layered spatial-temporal and sociolinguistic scales. I argue that the concepts of voice, recognition, and scale provide insight into contemporary educational inequality and that their study benefits, in turn, from paying attention to…

  16. Exploring expressivity and emotion with artificial voice and speech technologies.

    Science.gov (United States)

    Pauletto, Sandra; Balentine, Bruce; Pidcock, Chris; Jones, Kevin; Bottaci, Leonardo; Aretoulaki, Maria; Wells, Jez; Mundy, Darren P; Balentine, James

    2013-10-01

    Emotion in audio-voice signals, as synthesized by text-to-speech (TTS) technologies, was investigated to formulate a theory of expression for user interface design. Emotional parameters were specified with markup tags, and the resulting audio was further modulated with post-processing techniques. Software was then developed to link a selected TTS synthesizer with an automatic speech recognition (ASR) engine, producing a chatbot that could speak and listen. Using these two artificial voice subsystems, investigators explored both artistic and psychological implications of artificial speech emotion. Goals of the investigation were interdisciplinary, with interest in musical composition, augmentative and alternative communication (AAC), commercial voice announcement applications, human-computer interaction (HCI), and artificial intelligence (AI). The work-in-progress points towards an emerging interdisciplinary ontology for artificial voices. As one study output, HCI tools are proposed for future collaboration.

  17. Speech recognition software and electronic psychiatric progress notes: physicians' ratings and preferences

    Directory of Open Access Journals (Sweden)

    Derman Yaron D

    2010-08-01

    Full Text Available Abstract Background The context of the current study was mandatory adoption of electronic clinical documentation within a large mental health care organization. Psychiatric electronic documentation has unique needs by the nature of dense narrative content. Our goal was to determine if speech recognition (SR would ease the creation of electronic progress note (ePN documents by physicians at our institution. Methods Subjects: Twelve physicians had access to SR software on their computers for a period of four weeks to create ePN. Measurements: We examined SR software in relation to its perceived usability, data entry time savings, impact on the quality of care and quality of documentation, and the impact on clinical and administrative workflow, as compared to existing methods for data entry. Data analysis: A series of Wilcoxon signed rank tests were used to compare pre- and post-SR measures. A qualitative study design was used. Results Six of twelve participants completing the study favoured the use of SR (five with SR alone plus one with SR via hand-held digital recorder for creating electronic progress notes over their existing mode of data entry. There was no clear perceived benefit from SR in terms of data entry time savings, quality of care, quality of documentation, or impact on clinical and administrative workflow. Conclusions Although our findings are mixed, SR may be a technology with some promise for mental health documentation. Future investigations of this nature should use more participants, a broader range of document types, and compare front- and back-end SR methods.

  18. LABORATORY VOICE DATA ENTRY SYSTEM.

    Energy Technology Data Exchange (ETDEWEB)

    PRAISSMAN,J.L.SUTHERLAND,J.C.

    2003-04-01

    We have assembled a system using a personal computer workstation equipped with standard office software, an audio system, speech recognition software and an inexpensive radio-based wireless microphone that permits laboratory workers to enter or modify data while performing other work. Speech recognition permits users to enter data while their hands are holding equipment or they are otherwise unable to operate a keyboard. The wireless microphone allows unencumbered movement around the laboratory without a ''tether'' that might interfere with equipment or experimental procedures. To evaluate the potential of voice data entry in a laboratory environment, we developed a prototype relational database that records the disposal of radionuclides and/or hazardous chemicals Current regulations in our laboratory require that each such item being discarded must be inventoried and documents must be prepared that summarize the contents of each container used for disposal. Using voice commands, the user enters items into the database as each is discarded. Subsequently, the program prepares the required documentation.

  19. Voice Matching Using Genetic Algorithm

    Directory of Open Access Journals (Sweden)

    Abhishek Bal

    2014-03-01

    Full Text Available In this paper, the use of Genetic Algorithm (GA for voice recognition is described. The practical application of Genetic Algorithm (GA to the solution of engineering problem is a rapidly emerging approach in the field of control engineering and signal processing. Genetic algorithms are useful for searching a space in multi-directional way from large spaces and poorly defined space. Voice is a signal of infinite information. Digital processing of voice signal is very important for automatic voice recognition technology. Nowadays, voice processing is very much important in security mechanism due to mimicry characteristic. So studying the voice feature extraction in voice processing is very necessary in military, hospital, telephone system, investigation bureau and etc. In order to extract valuable information from the voice signal, make decisions on the process, and obtain results, the data needs to be manipulated and analyzed. In this paper, if the instant voice is not matched with same person’s reference voices in the database, then Genetic Algorithm (GA is applied between two randomly chosen reference voices. Again the instant voice is compared with the result of Genetic Algorithm (GA which is used, including its three main steps: selection, crossover and mutation. We illustrate our approach with different sample of voices from human in our institution.

  20. Visual Recognition Software for Binary Classification and its Application to Pollen Identification

    Science.gov (United States)

    Punyasena, S. W.; Tcheng, D. K.; Nayak, A.

    2014-12-01

    An underappreciated source of uncertainty in paleoecology is the uncertainty of palynological identifications. The confidence of any given identification is not regularly reported in published results, so cannot be incorporated into subsequent meta-analyses. Automated identifications systems potentially provide a means of objectively measuring the confidence of a given count or single identification, as well as a mechanism for increasing sample sizes and throughput. We developed the software ARLO (Automated Recognition with Layered Optimization) to tackle difficult visual classification problems such as pollen identification. ARLO applies pattern recognition and machine learning to the analysis of pollen images. The features that the system discovers are not the traditional features of pollen morphology. Instead, general purpose image features, such as pixel lines and grids of different dimensions, size, spacing, and resolution, are used. ARLO adapts to a given problem by searching for the most effective combination of feature representation and learning strategy. We present a two phase approach which uses our machine learning process to first segment pollen grains from the background and then classify pollen pixels and report species ratios. We conducted two separate experiments that utilized two distinct sets of algorithms and optimization procedures. The first analysis focused on reconstructing black and white spruce pollen ratios, training and testing our classification model at the slide level. This allowed us to directly compare our automated counts and expert counts to slides of known spruce ratios. Our second analysis focused on maximizing classification accuracy at the individual pollen grain level. Instead of predicting ratios of given slides, we predicted the species represented in a given image window. The resulting analysis was more scalable, as we were able to adapt the most efficient parts of the methodology from our first analysis. ARLO was able to

  1. Face and Voice Recognition Algorithms of Sign-in System for Underground Coalmine%人脸与声音结合的矿井人员签到识别

    Institute of Scientific and Technical Information of China (English)

    王君; 李成武; 杨茜; 刘世森

    2012-01-01

    矿井时有安全事故发生,签到管理系统可及时、准确掌握人员出入人员状况,保障矿井安全生产,方便及时救援.针对传统签到管理系统用于矿井,遇到光线昏暗、人脸易附着粉尘、干扰噪音等因素影响,签到识别方法检测率低,提出了—种根据KL变换(Karhunen-Loeve Transform)和TAN分类(Tree-Augmented Naive Bayesian network)相结合的人脸识别,并辅以声音识别的方法.通过形态学滤波变换快速去掉大部分无用背景,使处理更快速,特征点更突出;自动根据具体环境选择图像识别或声音识别,使识别准确率更高.仿真结果表明:结合声音的系统识别方法既减小了计算复杂度,又提高了人员识别率,还增强了适应性.%Coalmine accidents happen sometimes. It is significant to know the accurate statement of the miners in coalmine or outside, which is convenient for rescue. When the traditional Sign—in Management System was used in coal mine, the system meets new problems, such as black, hazy face, etc. Aiming at this issue, this paper put forward a face recognition algorithm based on the combination of Karhunen—Loeve Transform and Tree—Augmented Naive Bayesian network classifier, which uses the morphological filtering to remove most of useless transform background quickly. In addition, the voice recognition method was addede to that algorithm which makes feature point more outstanding and identification more accuracy, according to the specific environment automatic selection of face recognition or voice recognition. The simulation shows that this algorithm not only reduces the computational complexity and improves the human face recognition rate, but also enhances the adaptability.

  2. Validity of jitter measures in non-quasi-periodic voices. Part I: perceptual and computer performances in cycle pattern recognition.

    Science.gov (United States)

    Dejonckere, Philippe; Schoentgen, Jean; Giordano, Andrea; Fraj, Samia; Bocchi, Leonardo; Manfredi, Claudia

    2011-07-01

    The limit of about 5% for reliable quantification of jitter in sustained vowels of dysphonic voices-a widely accepted guideline-deserves critical analysis. The present study pertains to the effect of experience and training on the perceptual (visual) capability of correctly identifying periods in (highly) perturbed signals, and to a comparison of the performance of several programs for voice analysis. Synthesized realistic vowels (/a:/) with exactly known jitter (2.7%-31.5%) are used as material. After selection and training, experienced raters demonstrate excellent agreement in correctly identifying periods up to high values of jitter put in. Perceptual rating outperforms all computer programs in accuracy. Most remain reliable up to 10% jitter; one of them correctly measures up to the highest level.

  3. An automatic speech recognition system with speaker-independent identification support

    Science.gov (United States)

    Caranica, Alexandru; Burileanu, Corneliu

    2015-02-01

    The novelty of this work relies on the application of an open source research software toolkit (CMU Sphinx) to train, build and evaluate a speech recognition system, with speaker-independent support, for voice-controlled hardware applications. Moreover, we propose to use the trained acoustic model to successfully decode offline voice commands on embedded hardware, such as an ARMv6 low-cost SoC, Raspberry PI. This type of single-board computer, mainly used for educational and research activities, can serve as a proof-of-concept software and hardware stack for low cost voice automation systems.

  4. VoiceRelay: voice key operation using visual basic.

    Science.gov (United States)

    Abrams, Lise; Jennings, David T

    2004-11-01

    Using a voice key is a popular method for recording vocal response times in a variety of language production tasks. This article describes a class module called VoiceRelay that can be easily utilized in Visual Basic programs for voice key operation. This software-based voice key offers the precision of traditional voice keys (although accuracy is system dependent), as well as the flexibility of volume and sensitivity control. However, VoiceRelay is a considerably less expensive alternative for recording vocal response times because it operates with existing PC hardware and does not require the purchase of external response boxes or additional experiment-generation software. A sample project demonstrating implementation of the VoiceRelay class module may be downloaded from the Psychonomic Society Web archive, www.psychonomic.org/archive.

  5. Evaluation of an Intelligent Assistive Technology for Voice Navigation of Spreadsheets

    CERN Document Server

    Flood, Derek; Caffery, Fergal Mc; Bishop, Brian

    2008-01-01

    An integral part of spreadsheet auditing is navigation. For sufferers of Repetitive Strain Injury who need to use voice recognition technology this navigation can be highly problematic. To counter this the authors have developed an intelligent voice navigation system, iVoice, which replicates common spreadsheet auditing behaviours through simple voice commands. This paper outlines the iVoice system and summarizes the results of a study to evaluate iVoice when compared to a leading voice recognition technology.

  6. Advances in Speech Recognition

    CERN Document Server

    Neustein, Amy

    2010-01-01

    This volume is comprised of contributions from eminent leaders in the speech industry, and presents a comprehensive and in depth analysis of the progress of speech technology in the topical areas of mobile settings, healthcare and call centers. The material addresses the technical aspects of voice technology within the framework of societal needs, such as the use of speech recognition software to produce up-to-date electronic health records, not withstanding patients making changes to health plans and physicians. Included will be discussion of speech engineering, linguistics, human factors ana

  7. TU-C-17A-03: An Integrated Contour Evaluation Software Tool Using Supervised Pattern Recognition for Radiotherapy

    Energy Technology Data Exchange (ETDEWEB)

    Chen, H; Tan, J; Kavanaugh, J; Dolly, S; Gay, H; Thorstad, W; Anastasio, M; Altman, M; Mutic, S; Li, H [Washington University School of Medicine, Saint Louis, MO (United States)

    2014-06-15

    Purpose: Radiotherapy (RT) contours delineated either manually or semiautomatically require verification before clinical usage. Manual evaluation is very time consuming. A new integrated software tool using supervised pattern contour recognition was thus developed to facilitate this process. Methods: The contouring tool was developed using an object-oriented programming language C# and application programming interfaces, e.g. visualization toolkit (VTK). The C# language served as the tool design basis. The Accord.Net scientific computing libraries were utilized for the required statistical data processing and pattern recognition, while the VTK was used to build and render 3-D mesh models from critical RT structures in real-time and 360° visualization. Principal component analysis (PCA) was used for system self-updating geometry variations of normal structures based on physician-approved RT contours as a training dataset. The inhouse design of supervised PCA-based contour recognition method was used for automatically evaluating contour normality/abnormality. The function for reporting the contour evaluation results was implemented by using C# and Windows Form Designer. Results: The software input was RT simulation images and RT structures from commercial clinical treatment planning systems. Several abilities were demonstrated: automatic assessment of RT contours, file loading/saving of various modality medical images and RT contours, and generation/visualization of 3-D images and anatomical models. Moreover, it supported the 360° rendering of the RT structures in a multi-slice view, which allows physicians to visually check and edit abnormally contoured structures. Conclusion: This new software integrates the supervised learning framework with image processing and graphical visualization modules for RT contour verification. This tool has great potential for facilitating treatment planning with the assistance of an automatic contour evaluation module in avoiding

  8. The Effect of Automatic Speech Recognition Eyespeak Software on Iraqi Students' English Pronunciation: A Pilot Study

    Science.gov (United States)

    Sidgi, Lina Fathi Sidig; Shaari, Ahmad Jelani

    2017-01-01

    The use of technology, such as computer-assisted language learning (CALL), is used in teaching and learning in the foreign language classrooms where it is most needed. One promising emerging technology that supports language learning is automatic speech recognition (ASR). Integrating such technology, especially in the instruction of pronunciation…

  9. The Usefulness of Automatic Speech Recognition (ASR) Eyespeak Software in Improving Iraqi EFL Students' Pronunciation

    Science.gov (United States)

    Sidgi, Lina Fathi Sidig; Shaari, Ahmad Jelani

    2017-01-01

    The present study focuses on determining whether automatic speech recognition (ASR) technology is reliable for improving English pronunciation to Iraqi EFL students. Non-native learners of English are generally concerned about improving their pronunciation skills, and Iraqi students face difficulties in pronouncing English sounds that are not…

  10. 基于BP和ARM的发动机声音识别系统%Voice recognition engine based on BP's system in realization of ARM

    Institute of Scientific and Technical Information of China (English)

    姜愉

    2012-01-01

    Aimed at addressing automatic fee charging of highway toll stations and large-scale re- chargeable parking lots, this paper introduces the design of a embedded speech recognition system based on ARM9 and embedded Linux system of the engine sound by analyzing the BP neural network recognition theory. The design consisting of S3C2410 microprocessors and Linux operating systems involves trans- planting the C language of speech recognition program to the embedded Linux operating system's file system when cross-compiled. The paper describes the system s hardware and software framework, and offers the experiments results produced by real-time recognition of the car type by the engine sound. The results prove its accuracy, real-time and validity.%为解决高速公路收费站及大型停车收费场自动收费问题,依据BP神经网络识别理论,设计了一个基于ARM9及嵌入式Linux系统的发动机声音识别系统。选用S3C2410微处理器和嵌入式Linux操作系统,把交叉编译后的发动机声音识别C语言程序移植到操作系统的文件中,实现了发动机声音实时识别功能,给出了系统整体软硬件结构框架以及实时输入发动机声音判别汽车类型的识别结果。现场实验证实了该系统的准确性、实时性和有效性。

  11. Multimodal emotion recognition as assessment for learning in a game-based communication skills training

    NARCIS (Netherlands)

    Nadolski, Rob; Bahreini, Kiavash; Westera, Wim

    2014-01-01

    This paper presentation describes how our FILTWAM software artifacts for face and voice emotion recognition will be used for assessing learners' progress and providing adequate feedback in an online game-based communication skills training. This constitutes an example of in-game assessment for mainl

  12. Multimodal Emotion Recognition for Assessment of Learning in a Game-Based Communication Skills Training

    NARCIS (Netherlands)

    Bahreini, Kiavash; Nadolski, Rob; Westera, Wim

    2015-01-01

    This paper describes how our FILTWAM software artifacts for face and voice emotion recognition will be used for assessing learners' progress and providing adequate feedback in an online game-based communication skills training. This constitutes an example of in-game assessment for mainly formative p

  13. Multimodal emotion recognition as assessment for learning in a game-based communication skills training

    NARCIS (Netherlands)

    Nadolski, Rob; Bahreini, Kiavash; Westera, Wim

    2014-01-01

    This paper presentation describes how our FILTWAM software artifacts for face and voice emotion recognition will be used for assessing learners' progress and providing adequate feedback in an online game-based communication skills training. This constitutes an example of in-game assessment for mainl

  14. Practical applications of interactive voice technologies: Some accomplishments and prospects

    Science.gov (United States)

    Grady, Michael W.; Hicklin, M. B.; Porter, J. E.

    1977-01-01

    A technology assessment of the application of computers and electronics to complex systems is presented. Three existing systems which utilize voice technology (speech recognition and speech generation) are described. Future directions in voice technology are also described.

  15. Forensic Automatic Speaker Recognition Based on Likelihood Ratio Using Acoustic-phonetic Features Measured Automatically

    Directory of Open Access Journals (Sweden)

    Huapeng Wang

    2015-01-01

    Full Text Available Forensic speaker recognition is experiencing a remarkable paradigm shift in terms of the evaluation framework and presentation of voice evidence. This paper proposes a new method of forensic automatic speaker recognition using the likelihood ratio framework to quantify the strength of voice evidence. The proposed method uses a reference database to calculate the within- and between-speaker variability. Some acoustic-phonetic features are extracted automatically using the software VoiceSauce. The effectiveness of the approach was tested using two Mandarin databases: A mobile telephone database and a landline database. The experiment's results indicate that these acoustic-phonetic features do have some discriminating potential and are worth trying in discrimination. The automatic acoustic-phonetic features have acceptable discriminative performance and can provide more reliable results in evidence analysis when fused with other kind of voice features.

  16. The expression and recognition of emotions in the voice across five nations: A lens model analysis based on acoustic features.

    Science.gov (United States)

    Laukka, Petri; Elfenbein, Hillary Anger; Thingujam, Nutankumar S; Rockstuhl, Thomas; Iraki, Frederick K; Chui, Wanda; Althoff, Jean

    2016-11-01

    This study extends previous work on emotion communication across cultures with a large-scale investigation of the physical expression cues in vocal tone. In doing so, it provides the first direct test of a key proposition of dialect theory, namely that greater accuracy of detecting emotions from one's own cultural group-known as in-group advantage-results from a match between culturally specific schemas in emotional expression style and culturally specific schemas in emotion recognition. Study 1 used stimuli from 100 professional actors from five English-speaking nations vocally conveying 11 emotional states (anger, contempt, fear, happiness, interest, lust, neutral, pride, relief, sadness, and shame) using standard-content sentences. Detailed acoustic analyses showed many similarities across groups, and yet also systematic group differences. This provides evidence for cultural accents in expressive style at the level of acoustic cues. In Study 2, listeners evaluated these expressions in a 5 × 5 design balanced across groups. Cross-cultural accuracy was greater than expected by chance. However, there was also in-group advantage, which varied across emotions. A lens model analysis of fundamental acoustic properties examined patterns in emotional expression and perception within and across groups. Acoustic cues were used relatively similarly across groups both to produce and judge emotions, and yet there were also subtle cultural differences. Speakers appear to have a culturally nuanced schema for enacting vocal tones via acoustic cues, and perceivers have a culturally nuanced schema in judging them. Consistent with dialect theory's prediction, in-group judgments showed a greater match between these schemas used for emotional expression and perception. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

  17. Voice Disorders

    Science.gov (United States)

    Voice is the sound made by air passing from your lungs through your larynx, or voice box. In your larynx are your vocal cords, ... to make sound. For most of us, our voices play a big part in who we are, ...

  18. Every Voice

    Science.gov (United States)

    Patrick, Penny

    2008-01-01

    This article discusses how the author develops an approach that allows her students, who are part of the marginalized population, to learn the power of their own voices--not just their writing voices, but their oral voices as well. The author calls it "TWIST": Thoughts, Writing folder, Inquiring mind, Supplies, and Teamwork. It is where…

  19. Every Voice

    Science.gov (United States)

    Patrick, Penny

    2008-01-01

    This article discusses how the author develops an approach that allows her students, who are part of the marginalized population, to learn the power of their own voices--not just their writing voices, but their oral voices as well. The author calls it "TWIST": Thoughts, Writing folder, Inquiring mind, Supplies, and Teamwork. It is where…

  20. Voice restoration

    NARCIS (Netherlands)

    Hilgers, F.J.M.; Balm, A.J.M.; van den Brekel, M.W.M.; Tan, I.B.; Remacle, M.; Eckel, H.E.

    2010-01-01

    Surgical prosthetic voice restoration is the best possible option for patients to regain oral communication after total laryngectomy. It is considered to be the present "gold standard" for voice rehabilitation of laryngectomized individuals. Surgical prosthetic voice restoration, in essence, is alwa

  1. 3D Imaging for hand gesture recognition: Exploring the software-hardware interaction of current technologies

    Science.gov (United States)

    Periverzov, Frol; Ilieş, Horea T.

    2012-09-01

    Interaction with 3D information is one of the fundamental and most familiar tasks in virtually all areas of engineering and science. Several recent technological advances pave the way for developing hand gesture recognition capabilities available to all, which will lead to more intuitive and efficient 3D user interfaces (3DUI). These developments can unlock new levels of expression and productivity in all activities concerned with the creation and manipulation of virtual 3D shapes and, specifically, in engineering design. Building fully automated systems for tracking and interpreting hand gestures requires robust and efficient 3D imaging techniques as well as potent shape classifiers. We survey and explore current and emerging 3D imaging technologies, and focus, in particular, on those that can be used to build interfaces between the users' hands and the machine. The purpose of this paper is to categorize and highlight the relevant differences between these existing 3D imaging approaches in terms of the nature of the information provided, output data format, as well as the specific conditions under which these approaches yield reliable data. Furthermore we explore the impact of each of these approaches on the computational cost and reliability of the required image processing algorithms. Finally we highlight the main challenges and opportunities in developing natural user interfaces based on hand gestures, and conclude with some promising directions for future research. [Figure not available: see fulltext.

  2. Facial recognition software success rates for the identification of 3D surface reconstructed facial images: implications for patient privacy and security.

    Science.gov (United States)

    Mazura, Jan C; Juluru, Krishna; Chen, Joseph J; Morgan, Tara A; John, Majnu; Siegel, Eliot L

    2012-06-01

    Image de-identification has focused on the removal of textual protected health information (PHI). Surface reconstructions of the face have the potential to reveal a subject's identity even when textual PHI is absent. This study assessed the ability of a computer application to match research subjects' 3D facial reconstructions with conventional photographs of their face. In a prospective study, 29 subjects underwent CT scans of the head and had frontal digital photographs of their face taken. Facial reconstructions of each CT dataset were generated on a 3D workstation. In phase 1, photographs of the 29 subjects undergoing CT scans were added to a digital directory and tested for recognition using facial recognition software. In phases 2-4, additional photographs were added in groups of 50 to increase the pool of possible matches and the test for recognition was repeated. As an internal control, photographs of all subjects were tested for recognition against an identical photograph. Of 3D reconstructions, 27.5% were matched correctly to corresponding photographs (95% upper CL, 40.1%). All study subject photographs were matched correctly to identical photographs (95% lower CL, 88.6%). Of 3D reconstructions, 96.6% were recognized simply as a face by the software (95% lower CL, 83.5%). Facial recognition software has the potential to recognize features on 3D CT surface reconstructions and match these with photographs, with implications for PHI.

  3. Speech Recognition System For Robotic Control And Movement

    Directory of Open Access Journals (Sweden)

    Biraja Nalini Rout

    2015-08-01

    Full Text Available Abstract In a current scenario voice and data recognition is one of the most sought after field in the area of artificial intelligence and robotic 1 engineering. The idea specializes on deriving a voice to voice intelligent system which operates purely on audiovoice instructions using a specialized voice recognition module a micro controller a set of wheels and a movable arm to operate. The working involves real time voice inputs feeded to the VR module which equivalently processes the audio signals and produces the output in audio format. It consists an IDE for both Windows and UNIX based operating system for manipulating and processing instructions both at software and hardware levels. The system also can perform a basic set of manual operations decides through the expert system. The VR module processes the data using multilayer perceptron to generate the required result. Movable arm operates to pick and place objects as per the given voice instructions. Its usability involves substituting manual work at both personal and professional levels.

  4. 寻轨式语音播报送餐机器人系统的软硬件设计%Software and Hardware Design of Track Type Voice Broadcast Service Robot

    Institute of Scientific and Technical Information of China (English)

    苏杰仁; 张立; 程院莲

    2015-01-01

    设计了一种寻轨式语音播报送餐机器人,该机器人具有寻轨、语音播报、自动检测端盘、红外避障等功能。文中给出了送餐机器人的设计方案及软硬件设计,设计成本低、可靠性高,具有很高的实用价值。%The track type voice broadcast service robot is designed.The robot has the functions of track seeking,voice broadcast,plate au-tomatic detection and infrared obstacle avoidance.The design of hardware and software is given in this paper.The design is low cost,and has high reliability and high practical value.

  5. DoD Related Software Technology Requirements, Practices, and Prospects for the Future

    Science.gov (United States)

    1984-06-01

    Program Testing". IEEE Trans. on Software Engineering, 6:2, March 1980, 162-169. A • 21. S. T. Redwine, Jr. "An Engineering Approach to SoftwareTest Data... local and rapid programming capability - voice recognition - multiple path message routing - access to all supporting databases - common decision...o Local and wide area weather control o Topographic systems-position reporting and recording and terrain information and video terrain displays

  6. Air segmented amplitude modulated multiplexed flow analysis with software-based phase recognition: determination of phosphate ion.

    Science.gov (United States)

    Ogusu, Takeshi; Uchimoto, Katsuya; Takeuchi, Masaki; Tanaka, Hideji

    2014-01-01

    Amplitude modulated multiplexed flow analysis (AMMFA) has been improved by introducing air segmentation and software-based phase recognition. Sample solutions, the flow rates of which are respectively varied at different frequencies, are merged. Air is introduced to the merged liquid stream in order to limit the dispersion of analytes within each liquid segment separated by air bubbles. The stream is led to a detector with no physical deaeration. Air signals are distinguished from liquid signals through the analysis of detector output signals, and are suppressed down to the level of liquid signals. Resulting signals are smoothed based on moving average computation. Thus processed signals are analyzed by fast Fourier transform. The analytes in the samples are respectively determined from the amplitudes of the corresponding wave components obtained. The developed system has been applied to the simultaneous determinations of phosphate ions in water samples by a Malachite Green method. The linearity of the analytical curve (0.0-31.0 μmol dm(-3)) is good (r(2)>0.999) and the detection limit (3.3 σ) at the modulation period of 30s is 0.52 μmol dm(-3). Good recoveries around 100% have been obtained for phosphate ions spiked into real water samples.

  7. Tomato recognition based on feature analysis and fast development of recognition software%基于特征分析的番茄识别及其软件的快速开发

    Institute of Scientific and Technical Information of China (English)

    王玉飞; 尹建军; 仲苏玉; 陈树人

    2011-01-01

    In order to guide manipulator to pick tomato, an automatic tomato recognition based on shape and texture features analysis and development method of recognition software are researched. First, child images of tomato and disturbed objects are extracted respectively from tomato images captured under outdoor scene. Three shape parameters and five texture parameters are chosen to do statistical analyses. The algorithm of tomato recognition is verified quickly under the enviroment of Matlab. And then, OpenCV function library is used to reproduce the recognition process under VC++ development environment to improve real-time of tomato recognition. Fast development of software is realized. By processing of 674 tomato images, the tests show that right recognition ratio of harvesting mature tomatoes can reach more than 90 percent when growth state of tomatoes is seperated each other. By processing of different size of tomato images, the tests show that the implementation efficiency of the codes programmed with OpenCV is risen considerably in comparison with the codes programmed with Matlab. Morever, the software can capture real-time image and process synchronously binocular image, and may provide a reference for development of fruit-picking robotic vision software.%为导引机械手采摘番茄,提出了基于形状与纹理特征分析的番茄识别和软件实现方法.提取番茄和枯叶等干扰物的子图像,选择3个形状特征和5个纹理特征参数进行统计分析,在Matlab下快速验证番茄识别算法,再利用OpenCV函数库在Visual C++下对识别算法进行再现,实现了目标识别软件的快速开发.对674幅番茄图像进行识别试验的结果表明,对生长状态为相互分离的成熟番茄,正确识别率达到90%以上.不同分辨率的番茄图像处理实验结果表明,利用OpenCV函数库编制的程序比Matlab下的代码执行效率显著提高,而且能实时采集并同步处理双目图像,可为采摘机器人目

  8. Familiarity and Voice Representation: From Acoustic-Based Representation to Voice Averages

    Directory of Open Access Journals (Sweden)

    Maureen Fontaine

    2017-07-01

    Full Text Available The ability to recognize an individual from their voice is a widespread ability with a long evolutionary history. Yet, the perceptual representation of familiar voices is ill-defined. In two experiments, we explored the neuropsychological processes involved in the perception of voice identity. We specifically explored the hypothesis that familiar voices (trained-to-familiar (Experiment 1, and famous voices (Experiment 2 are represented as a whole complex pattern, well approximated by the average of multiple utterances produced by a single speaker. In experiment 1, participants learned three voices over several sessions, and performed a three-alternative forced-choice identification task on original voice samples and several “speaker averages,” created by morphing across varying numbers of different vowels (e.g., [a] and [i] produced by the same speaker. In experiment 2, the same participants performed the same task on voice samples produced by familiar speakers. The two experiments showed that for famous voices, but not for trained-to-familiar voices, identification performance increased and response times decreased as a function of the number of utterances in the averages. This study sheds light on the perceptual representation of familiar voices, and demonstrates the power of average in recognizing familiar voices. The speaker average captures the unique characteristics of a speaker, and thus retains the information essential for recognition; it acts as a prototype of the speaker.

  9. Comparing acoustic and perceptual voice parameters in female teachers based on voice complaints

    Directory of Open Access Journals (Sweden)

    Maryam Faghani Abukeili

    2014-04-01

    Full Text Available Background and Aim: Teachers are a large group of professional voice users that several risk factors and voice demands causes various voice complaints among them. As the voice is multidimensional, the aim of this study was acoustic and perceptual measurement of teachers’ voice and comparing the findings between two groups with many and few voice complaints.Methods: Sixty female teachers of high school in Sari, north of Iran, were chosen by available sampling to participate in this cross-sectional study. According to a voice complaints questionnaire, 21 subjects located in few voice complaints and 31 in many voice complaints group. After a working day, subjects completed a voice self-assessment questionnaire. Also, teachers’voice were recorded during three tasks including sustained vowels /a/ and /i/, text reading and conversational speech. Acoustic parameters were analyzed by Praat software and 2 speech-language pathalogists performed auditory-perceptual assessment by GRBAS ( Grade, Roughness, Breathiness, Asthenia, Strain scale. Results: Comparing of the voice self-assessment between the two groups demonstrated statistically significant difference (p<0.05; however results of the acoustic and auditory-perceptual measurement did not show significant diffrence.Conclusion: Despite prevalent voice problems in teachers, there are various conditions in terms of complaints and assessments methods. In this study, only a remarkable deviation documented in the client-based assessments in many voice compliants group in comparison with few voice compliants, which would be probably related to different individual’s perception of voice problem between two groups. These results support paying attention to self-assessments in clinical process of voice problems.

  10. The software for automatic creation of the formal grammars used by speech recognition, computer vision, editable text conversion systems, and some new functions

    Science.gov (United States)

    Kardava, Irakli; Tadyszak, Krzysztof; Gulua, Nana; Jurga, Stefan

    2017-02-01

    For more flexibility of environmental perception by artificial intelligence it is needed to exist the supporting software modules, which will be able to automate the creation of specific language syntax and to make a further analysis for relevant decisions based on semantic functions. According of our proposed approach, of which implementation it is possible to create the couples of formal rules of given sentences (in case of natural languages) or statements (in case of special languages) by helping of computer vision, speech recognition or editable text conversion system for further automatic improvement. In other words, we have developed an approach, by which it can be achieved to significantly improve the training process automation of artificial intelligence, which as a result will give us a higher level of self-developing skills independently from us (from users). At the base of our approach we have developed a software demo version, which includes the algorithm and software code for the entire above mentioned component's implementation (computer vision, speech recognition and editable text conversion system). The program has the ability to work in a multi - stream mode and simultaneously create a syntax based on receiving information from several sources.

  11. Keeping Your Voice Healthy

    Science.gov (United States)

    ... Find an ENT Doctor Near You Keeping Your Voice Healthy Keeping Your Voice Healthy Patient Health Information ... heavily voice-related. Key Steps for Keeping Your Voice Healthy Drink plenty of water. Moisture is good ...

  12. Multimodal recognition of emotions

    NARCIS (Netherlands)

    Datcu, D.

    2009-01-01

    This thesis proposes algorithms and techniques to be used for automatic recognition of six prototypic emotion categories by computer programs, based on the recognition of facial expressions and emotion patterns in voice. Considering the applicability in real-life conditions, the research is carried

  13. Research of Speech Recognition System Based on Matlab%基于Matlab的语音识别系统研究

    Institute of Scientific and Technical Information of China (English)

    王彪

    2011-01-01

    A speech recognition system based on Matlab software is designed, and record, broadcast, pretreat voice signals, subsection filtering, feature extraction and speech recognition are its main functions. This system has achieved discriminate simple voice requirements is verificated by the experiment, but some places are needed to improve, such as: whether complex voice coule be discriminated in complex environment.%设计了一个基于Matlab软件的语音识别系统,其主要功能有语音信号的录制、播放、预处理、分段滤波、特征提取以及识别语音.通过实验验证了本系统能够达到识别简单语音的要求,但仍有需改进的地方,如:能否在复杂环境下识别比较复杂的语音.

  14. Voice Collection under Different Spectrum

    Directory of Open Access Journals (Sweden)

    Min Li

    2013-05-01

    Full Text Available According to the short-time Fourier transform theory and principle of digital filtering, this paper established a mathematical model called collection of voice signal collection at different spectrum. The voice signal was a non-stationary process, while the standard Fourier transform only applied to the periodic signal, transient signals or stationary random signal. Therefore, the standard Fourier transform could not be directly used for the speech signal. By controlling the input different types and parameters, this paper analyzed the collected original voice signal spectrum with the use of MATLAB software platform. At the same time, it realized the extraction, recording and playback of the speech signal at different frequencies. Therefore, the waveforms could be displayed obviously on the graphic user interface and voice effect could be more clearly. Meanwhile, the result was verified by the hardware platforms, which consisted of TMS320VC5509A [1] chip and TLV320AIC23 voice chip. The results showed that the extraction of voice signal under different spectrum model was scientific, rational and effective.

  15. The Effect of Automatic Speech Recognition EyeSpeak Software on Iraqi Students’ English Pronunciation: A Pilot Study

    OpenAIRE

    Lina Fathi Sidig Sidgi; Ahmad Jelani Shaari

    2017-01-01

    The use of technology, such as computer-assisted language learning (CALL), is used in teaching and learning in the foreign language classrooms where it is most needed. One promising emerging technology that supports language learning is automatic speech recognition (ASR). Integrating such technology, especially in the instruction of pronunciation in the classroom, is important in helping students to achieve correct pronunciation. In Iraq, English is a foreign language, and it is not surprisin...

  16. Productivity, part 2: cloud storage, remote meeting tools, screencasting, speech recognition software, password managers, and online data backup.

    Science.gov (United States)

    Lackey, Amanda E; Pandey, Tarun; Moshiri, Mariam; Lalwani, Neeraj; Lall, Chandana; Bhargava, Puneet

    2014-06-01

    It is an opportune time for radiologists to focus on personal productivity. The ever increasing reliance on computers and the Internet has significantly changed the way we work. Myriad software applications are available to help us improve our personal efficiency. In this article, the authors discuss some tools that help improve collaboration and personal productivity, maximize e-learning, and protect valuable digital data.

  17. Improved sensitivity of wearable nanogenerators made of electrospun Eu3+ doped P(VDF-HFP)/graphene composite nanofibers for self-powered voice recognition

    Science.gov (United States)

    Adhikary, Prakriti; Biswas, Anirban; Mandal, Dipankar

    2016-12-01

    Composite nanofibers of Eu3+ doped poly(vinylidene fluoride-co-hexafluoropropylene) (P(VDF-HFP))/graphene are prepared by the electrospinning technique for the fabrication of ultrasensitive wearable piezoelectric nanogenerators (WPNGs) where the post-poling technique is not necessary. It is found that the complete conversion of the piezoelectric β-phase and the improvement of the degree of crystallinity is governed by the incorporation of Eu3+ and graphene sheets into P(VDF-HFP) nanofibers. The flexible nanocomposite fibers are associated with a hypersensitive electronic transition that results in an intense red light emission, and WPNGs also have the capability of detecting external pressure as low as ~23 Pa with a higher degree of acoustic sensitivity, ~11 V Pa-1, than has ever been previously reported. This means that ultrasensitive WPNGs can be utilized to recognize human voices, which suggests they could be a potential tool in the biomedical and national security sectors. The capacitor’s ability to charge from abundant environmental vibrations, such as music, wind, body motion, etc, drives WPNGs as a power source for portable electronics. This fact may open up the prospect of using the Eu3+ doped P(VDF-HFP)/graphene composite electrospun nanofibers, with their multifunctional properties such as vibration sensitivity, wearability, red light emission capability and piezoelectric energy harvesting, for various promising applications in portable electronics, health care monitoring, noise detection and security monitoring.

  18. Speaker Recognition

    DEFF Research Database (Denmark)

    Mølgaard, Lasse Lohilahti; Jørgensen, Kasper Winther

    2005-01-01

    Speaker recognition is basically divided into speaker identification and speaker verification. Verification is the task of automatically determining if a person really is the person he or she claims to be. This technology can be used as a biometric feature for verifying the identity of a person...... in applications like banking by telephone and voice mail. The focus of this project is speaker identification, which consists of mapping a speech signal from an unknown speaker to a database of known speakers, i.e. the system has been trained with a number of speakers which the system can recognize....

  19. Speech Recognition Technology Applied to Intelligent Mobile Navigation System

    Institute of Scientific and Technical Information of China (English)

    2002-01-01

    The capability of human-computer interaction reflects the intelligent degree of mobile navigation system.The navigation data and functions of mobile navigation system are divided into system commands and non-system commands in this paper.And then a group of speech commands are Abstracted.This paper applies speech recognition technology to intelligent mobile navigation system to process speech commands and does some deep research on the integration of speech recognition technology with mobile navigation system.The navigation operation can be performed by speech commands,which makes human-computer interaction easy during navigation.Speech command interface of navigation system is implemented by Dutty ++ Software,which is based on speech recognition system -Via Voice of IBM.Through navigation experiments,navigation can be done almost without keyboard,which proved that human-computer interaction is very convenient by speech commands and the reliability is also higher.

  20. A Joint Time-Frequency and Matrix Decomposition Feature Extraction Methodology for Pathological Voice Classification

    Directory of Open Access Journals (Sweden)

    Behnaz Ghoraani

    2009-01-01

    Full Text Available The number of people affected by speech problems is increasing as the modern world places increasing demands on the human voice via mobile telephones, voice recognition software, and interpersonal verbal communications. In this paper, we propose a novel methodology for automatic pattern classification of pathological voices. The main contribution of this paper is extraction of meaningful and unique features using Adaptive time-frequency distribution (TFD and nonnegative matrix factorization (NMF. We construct Adaptive TFD as an effective signal analysis domain to dynamically track the nonstationarity in the speech and utilize NMF as a matrix decomposition (MD technique to quantify the constructed TFD. The proposed method extracts meaningful and unique features from the joint TFD of the speech, and automatically identifies and measures the abnormality of the signal. Depending on the abnormality measure of each signal, we classify the signal into normal or pathological. The proposed method is applied on the Massachusetts Eye and Ear Infirmary (MEEI voice disorders database which consists of 161 pathological and 51 normal speakers, and an overall classification accuracy of 98.6% was achieved.

  1. Voice Interactive Classroom, a service-oriented software architecture to enable cross-platform multi-channel access to Internet-based learning

    OpenAIRE

    Álvarez García, Víctor Manuel

    2014-01-01

    La tecnología software está creando un contexto ubicuo para la vida y el aprendizaje humano en el que gradualmente se incorporan nuevos modos de interacción. Además, la interacción con los sistemas de aprendizaje basados en Internet ha evolucionado desde el acceso tradicional a través del navegador web de un ordenador personal o portátil a accesos más flexibles desde dispositivos móviles. Sin embargo, en ambos casos, los sistemas de e-learning han creado un contexto en el cual la interacción ...

  2. Simple and efficient method for region of interest value extraction from picture archiving and communication system viewer with optical character recognition software and macro program.

    Science.gov (United States)

    Lee, Young Han; Park, Eun Hae; Suh, Jin-Suck

    2015-01-01

    The objectives are: 1) to introduce a simple and efficient method for extracting region of interest (ROI) values from a Picture Archiving and Communication System (PACS) viewer using optical character recognition (OCR) software and a macro program, and 2) to evaluate the accuracy of this method with a PACS workstation. This module was designed to extract the ROI values on the images of the PACS, and created as a development tool by using open-source OCR software and an open-source macro program. The principal processes are as follows: (1) capture a region of the ROI values as a graphic file for OCR, (2) recognize the text from the captured image by OCR software, (3) perform error-correction, (4) extract the values including area, average, standard deviation, max, and min values from the text, (5) reformat the values into temporary strings with tabs, and (6) paste the temporary strings into the spreadsheet. This principal process was repeated for the number of ROIs. The accuracy of this module was evaluated on 1040 recognitions from 280 randomly selected ROIs of the magnetic resonance images. The input times of ROIs were compared between conventional manual method and this extraction module-assisted input method. The module for extracting ROI values operated successfully using the OCR and macro programs. The values of the area, average, standard deviation, maximum, and minimum could be recognized and error-corrected with AutoHotkey-coded module. The average input times using the conventional method and the proposed module-assisted method were 34.97 seconds and 7.87 seconds, respectively. A simple and efficient method for ROI value extraction was developed with open-source OCR and a macro program. Accurate inputs of various numbers from ROIs can be extracted with this module. The proposed module could be applied to the next generation of PACS or existing PACS that have not yet been upgraded. Copyright © 2015 AUR. Published by Elsevier Inc. All rights reserved.

  3. Voice parameters and videonasolaryngoscopy in children with vocal nodules: a longitudinal study, before and after voice therapy.

    Science.gov (United States)

    Valadez, Victor; Ysunza, Antonio; Ocharan-Hernandez, Esther; Garrido-Bustamante, Norma; Sanchez-Valerio, Araceli; Pamplona, Ma C

    2012-09-01

    Vocal Nodules (VN) are a functional voice disorder associated with voice misuse and abuse in children. There are few reports addressing vocal parameters in children with VN, especially after a period of vocal rehabilitation. The purpose of this study is to describe measurements of vocal parameters including Fundamental Frequency (FF), Shimmer (S), and Jitter (J), videonasolaryngoscopy examination and clinical perceptual assessment, before and after voice therapy in children with VN. Voice therapy was provided using visual support through Speech-Viewer software. Twenty patients with VN were studied. An acoustical analysis of voice was performed and compared with data from subjects from a control group matched by age and gender. Also, clinical perceptual assessment of voice and videonasolaryngoscopy were performed to all patients with VN. After a period of voice therapy, provided with visual support using Speech Viewer-III (SV-III-IBM) software, new acoustical analyses, perceptual assessments and videonasolaryngoscopies were performed. Before the onset of voice therapy, there was a significant difference (ptherapy period, a significant improvement (pvocal nodules were no longer discernible on the vocal folds in any of the cases. SV-III software seems to be a safe and reliable method for providing voice therapy in children with VN. Acoustic voice parameters, perceptual data and videonasolaryngoscopy were significantly improved after the speech therapy period was completed. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.

  4. The pattern recognition software for automatic treatment of track detector data at the PAVICOM completely automated measuring facility

    Energy Technology Data Exchange (ETDEWEB)

    Aleksandrov, A. [Lebedev Physical Institute, Russian Academy of Sciences, Leninskii pr., 53, Moscow, 119991 (Russian Federation); Kashkarov, L. [Vernadsky Institute of Geochemistry and Analytical Chemistry of Russian Academy of Sciences, Kosygina str., 19, Moscow, 119991 (Russian Federation); Polukhina, N. [Lebedev Physical Institute, Russian Academy of Sciences, Leninskii pr., 53, Moscow, 119991 (Russian Federation)], E-mail: poluhina@sci.lebedev.ru; Starkov, N. [Lebedev Physical Institute, Russian Academy of Sciences, Leninskii pr., 53, Moscow, 119991 (Russian Federation)], E-mail: starkov@sci.lebedev.ru

    2008-08-15

    A software for the PAVICOM completely automated measuring facility is developed. The facility is used to analyse charged-particle tracks in solid-state detectors (nuclear emulsions, CR-39 films, meteorite olivin, etc.) applied in high-energy physics, nuclear physics, and astrophysics. The scanning of the detectors is performed by a PC-controlled CCD camera. The track-image processing program implements filtering, clustering, and tracking procedures to determine the track parameters. Dedicated routines and procedures were developed to perform the track analysis for different kinds of scientific applications. The PAVICOM is capable of determining geometric characteristics of tracks (length, width, etc.), their spatial distribution, and the magnitude of the particle charge.

  5. GlycReSoft: A Software Package for Automated Recognition of Glycans from LC/MS Data

    Energy Technology Data Exchange (ETDEWEB)

    Maxwell, Evan; Tan, Yan; Tan, Yuxiang; Hu, Han; Benson, Gary; Aizikov, Konstantin; Conley, Shannon; Staples, Gregory O.; Slysz, Gordon W.; Smith, Richard D.; Zaia, Joseph

    2012-09-26

    Glycosylation modifies the physicochemical properties and protein binding functions of glycoconjugates. These modifications are biosynthesized in the endoplasmic reticulum and Golgi apparatus by a series of enzymatic transformations that are under complex control. As a result, mature glycans on a given site are heterogeneous mixtures of glycoforms. This gives rise to a spectrum of adhesive properties that strongly influences interactions with binding partners and resultant biological effects. In order to understand the roles glycosylation plays in normal and disease processes, efficient structural analysis tools are necessary. In the field of glycomics, liquid chromatography/mass spectrometry (LC/MS) is used to profile the glycans present in a given sample. This technology enables comparison of glycan compositions and abundances among different biological samples, i.e. normal versus disease, normal versus mutant, etc. Manual analysis of the glycan profiling LC/MS data is extremely time-consuming and efficient software tools are needed to eliminate this bottleneck. In this work, we have developed a tool to computationally model LC/MS data to enable efficient profiling of glycans. Using LC/MS data deconvoluted by Decon2LS/DeconTools, we built a list of unique neutral masses corresponding to candidate glycan compositions summarized over their various charge states, adducts and range of elution times. Our work aims to provide confident identification of true compounds in complex data sets that are not amenable to manual interpretation. This capability is an essential part of glycomics work flows. We demonstrate this tool, GlycReSoft, using an LC/MS dataset on tissue derived heparan sulfate oligosaccharides. The software, code and a test data set are publically archived under an open source license.

  6. The Cambridge Face Tracker: Accurate, Low Cost Measurement of Head Posture Using Computer Vision and Face Recognition Software

    Science.gov (United States)

    Thomas, Peter B. M.; Baltrušaitis, Tadas; Robinson, Peter; Vivian, Anthony J.

    2016-01-01

    Purpose We validate a video-based method of head posture measurement. Methods The Cambridge Face Tracker uses neural networks (constrained local neural fields) to recognize facial features in video. The relative position of these facial features is used to calculate head posture. First, we assess the accuracy of this approach against videos in three research databases where each frame is tagged with a precisely measured head posture. Second, we compare our method to a commercially available mechanical device, the Cervical Range of Motion device: four subjects each adopted 43 distinct head postures that were measured using both methods. Results The Cambridge Face Tracker achieved confident facial recognition in 92% of the approximately 38,000 frames of video from the three databases. The respective mean error in absolute head posture was 3.34°, 3.86°, and 2.81°, with a median error of 1.97°, 2.16°, and 1.96°. The accuracy decreased with more extreme head posture. Comparing The Cambridge Face Tracker to the Cervical Range of Motion Device gave correlation coefficients of 0.99 (P < 0.0001), 0.96 (P < 0.0001), and 0.99 (P < 0.0001) for yaw, pitch, and roll, respectively. Conclusions The Cambridge Face Tracker performs well under real-world conditions and within the range of normally-encountered head posture. It allows useful quantification of head posture in real time or from precaptured video. Its performance is similar to that of a clinically validated mechanical device. It has significant advantages over other approaches in that subjects do not need to wear any apparatus, and it requires only low cost, easy-to-setup consumer electronics. Translational Relevance Noncontact assessment of head posture allows more complete clinical assessment of patients, and could benefit surgical planning in future. PMID:27730008

  7. The Cambridge Face Tracker: Accurate, Low Cost Measurement of Head Posture Using Computer Vision and Face Recognition Software.

    Science.gov (United States)

    Thomas, Peter B M; Baltrušaitis, Tadas; Robinson, Peter; Vivian, Anthony J

    2016-09-01

    We validate a video-based method of head posture measurement. The Cambridge Face Tracker uses neural networks (constrained local neural fields) to recognize facial features in video. The relative position of these facial features is used to calculate head posture. First, we assess the accuracy of this approach against videos in three research databases where each frame is tagged with a precisely measured head posture. Second, we compare our method to a commercially available mechanical device, the Cervical Range of Motion device: four subjects each adopted 43 distinct head postures that were measured using both methods. The Cambridge Face Tracker achieved confident facial recognition in 92% of the approximately 38,000 frames of video from the three databases. The respective mean error in absolute head posture was 3.34°, 3.86°, and 2.81°, with a median error of 1.97°, 2.16°, and 1.96°. The accuracy decreased with more extreme head posture. Comparing The Cambridge Face Tracker to the Cervical Range of Motion Device gave correlation coefficients of 0.99 (P Cambridge Face Tracker performs well under real-world conditions and within the range of normally-encountered head posture. It allows useful quantification of head posture in real time or from precaptured video. Its performance is similar to that of a clinically validated mechanical device. It has significant advantages over other approaches in that subjects do not need to wear any apparatus, and it requires only low cost, easy-to-setup consumer electronics. Noncontact assessment of head posture allows more complete clinical assessment of patients, and could benefit surgical planning in future.

  8. The Glasgow Voice Memory Test: Assessing the ability to memorize and recognize unfamiliar voices.

    Science.gov (United States)

    Aglieri, Virginia; Watson, Rebecca; Pernet, Cyril; Latinus, Marianne; Garrido, Lúcia; Belin, Pascal

    2017-02-01

    One thousand one hundred and twenty subjects as well as a developmental phonagnosic subject (KH) along with age-matched controls performed the Glasgow Voice Memory Test, which assesses the ability to encode and immediately recognize, through an old/new judgment, both unfamiliar voices (delivered as vowels, making language requirements minimal) and bell sounds. The inclusion of non-vocal stimuli allows the detection of significant dissociations between the two categories (vocal vs. non-vocal stimuli). The distributions of accuracy and sensitivity scores (d') reflected a wide range of individual differences in voice recognition performance in the population. As expected, KH showed a dissociation between the recognition of voices and bell sounds, her performance being significantly poorer than matched controls for voices but not for bells. By providing normative data of a large sample and by testing a developmental phonagnosic subject, we demonstrated that the Glasgow Voice Memory Test, available online and accessible from all over the world, can be a valid screening tool (~5 min) for a preliminary detection of potential cases of phonagnosia and of "super recognizers" for voices.

  9. Speaking and Nonspeaking Voice Professionals: Who Has the Better Voice?

    Science.gov (United States)

    Chitguppi, Chandala; Raj, Anoop; Meher, Ravi; Rathore, P K

    2017-04-18

    Voice professionals can be classified into two major subgroups: the primarily speaking and the primarily nonspeaking voice professionals. Nonspeaking voice professionals mainly include singers, whereas speaking voice professionals include the rest of the voice professionals. Although both of these groups have high vocal demands, it is currently unknown whether both groups show similar voice changes after their daily voice use. Comparison of these two subgroups of voice professionals has never been done before. This study aimed to compare the speaking voice of speaking and nonspeaking voice professionals with no obvious vocal fold pathology or voice-related complaints on the day of assessment. After obtaining relevant voice-related history, voice analysis and videostroboscopy were performed in 50 speaking and 50 nonspeaking voice professionals. Speaking voice professionals showed significantly higher incidence of voice-related complaints as compared with nonspeaking voice professionals. Voice analysis revealed that most acoustic parameters including fundamental frequency, jitter percent, and harmonic-to-noise ratio were significantly higher in speaking voice professionals, whereas videostroboscopy did not show any significant difference between the two groups. This is the first study of its kind to analyze the effect of daily voice use in the two subgroups of voice professionals with no obvious vocal fold pathology. We conclude that voice professionals should not be considered as a homogeneous group. The detrimental effects of excessive voice use were observed to occur more significantly in speaking voice professionals than in nonspeaking voice professionals. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  10. Leveraging voice

    DEFF Research Database (Denmark)

    Frølunde, Lisbeth

    2017-01-01

    researchers improve our practices and how could digital online video help offer more positive stories about research and higher education? How can academics in higher education be better to tell about our research, thereby reclaiming and leveraging our voice in a post-factual era? As higher education......This paper speculates on how researchers share research without diluting our credibility and how to make strategies for the future. It also calls for consideration of new traditions and practices for communicating knowledge to a wider audience across multiple media platforms. How might we...... continues to engage with digital and networked technologies it becomes increasingly relevant to question why and how academics could (re) position research knowledge in the digital and online media landscape of today and the future. The paper highlights methodological issues that arise in relation...

  11. Feeling voices.

    Directory of Open Access Journals (Sweden)

    Paolo Ammirante

    Full Text Available Two experiments investigated deaf individuals' ability to discriminate between same-sex talkers based on vibrotactile stimulation alone. Nineteen participants made same/different judgments on pairs of utterances presented to the lower back through voice coils embedded in a conforming chair. Discrimination of stimuli matched for F0, duration, and perceived magnitude was successful for pairs of spoken sentences in Experiment 1 (median percent correct = 83% and pairs of vowel utterances in Experiment 2 (median percent correct = 75%. Greater difference in spectral tilt between "different" pairs strongly predicted their discriminability in both experiments. The current findings support the hypothesis that discrimination of complex vibrotactile stimuli involves the cortical integration of spectral information filtered through frequency-tuned skin receptors.

  12. DSP Based System for Real time Voice Synthesis Applications Development

    CERN Document Server

    Arsinte, Radu; Miron, Costin

    2008-01-01

    This paper describes an experimental system designed for development of real time voice synthesis applications. The system is composed from a DSP coprocessor card, equipped with an TMS320C25 or TMS320C50 chip, voice acquisition module (ADDA2),host computer (IBM-PC compatible), software specific tools.

  13. The "VoiceForum" Platform for Spoken Interaction

    Science.gov (United States)

    Fynn, Fohn; Wigham, Chiara R.

    2011-01-01

    Showcased in the courseware exhibition, "VoiceForum" is a web-based software platform for asynchronous learner interaction in threaded discussions using voice and text. A dedicated space is provided for the tutor who can give feedback on a posted message and dialogue with the participants at a separate level from the main interactional…

  14. Rapid identification of mycolic acid patterns of mycobacteria by high-performance liquid chromatography using pattern recognition software and a Mycobacterium library.

    Science.gov (United States)

    Glickman, S E; Kilburn, J O; Butler, W R; Ramos, L S

    1994-01-01

    Current methods for identifying mycobacteria by high-performance liquid chromatography (HPLC) require a visual assessment of the generated chromatographic data, which often involves time-consuming hand calculations and the use of flow charts. Our laboratory has developed a personal computer-based file containing patterns of mycolic acids detected in 45 species of Mycobacterium, including both slowly and rapidly growing species, as well as Tsukamurella paurometabolum and members of the genera Corynebacterium, Nocardia, Rhodococcus, and Gordona. The library was designed to be used in conjunction with a commercially available pattern recognition software package, Pirouette (Infometrix, Seattle, Wash.). Pirouette uses the K-nearest neighbor algorithm, a similarity-based classification method, to categorize unknown samples on the basis of their multivariate proximities to samples of a preassigned category. Multivariate proximity is calculated from peak height data, while peak heights are named by retention time matching. The system was tested for accuracy by using 24 species of Mycobacterium. Of the 1,333 strains evaluated, > or = 97% were correctly identified. Identification of M. tuberculosis (n = 649) was 99.85% accurate, and identification of the M. avium complex (n = 211) was > or = 98% accurate; > or = 95% of strains of both double-cluster and single-cluster M. gordonae (n = 47) were correctly identified. This system provides a rapid, highly reliable assessment of HPLC-generated chromatographic data for the identification of mycobacteria. PMID:8195387

  15. Exploring the feasibility of smart phone microphone for measurement of acoustic voice parameters and voice pathology screening.

    Science.gov (United States)

    Uloza, Virgilijus; Padervinskis, Evaldas; Vegiene, Aurelija; Pribuisiene, Ruta; Saferis, Viktoras; Vaiciukynas, Evaldas; Gelzinis, Adas; Verikas, Antanas

    2015-11-01

    The objective of this study is to evaluate the reliability of acoustic voice parameters obtained using smart phone (SP) microphones and investigate the utility of use of SP voice recordings for voice screening. Voice samples of sustained vowel/a/obtained from 118 subjects (34 normal and 84 pathological voices) were recorded simultaneously through two microphones: oral AKG Perception 220 microphone and SP Samsung Galaxy Note3 microphone. Acoustic voice signal data were measured for fundamental frequency, jitter and shimmer, normalized noise energy (NNE), signal to noise ratio and harmonic to noise ratio using Dr. Speech software. Discriminant analysis-based Correct Classification Rate (CCR) and Random Forest Classifier (RFC) based Equal Error Rate (EER) were used to evaluate the feasibility of acoustic voice parameters classifying normal and pathological voice classes. Lithuanian version of Glottal Function Index (LT_GFI) questionnaire was utilized for self-assessment of the severity of voice disorder. The correlations of acoustic voice parameters obtained with two types of microphones were statistically significant and strong (r = 0.73-1.0) for the entire measurements. When classifying into normal/pathological voice classes, the Oral-NNE revealed the CCR of 73.7% and the pair of SP-NNE and SP-shimmer parameters revealed CCR of 79.5%. However, fusion of the results obtained from SP voice recordings and GFI data provided the CCR of 84.60% and RFC revealed the EER of 7.9%, respectively. In conclusion, measurements of acoustic voice parameters using SP microphone were shown to be reliable in clinical settings demonstrating high CCR and low EER when distinguishing normal and pathological voice classes, and validated the suitability of the SP microphone signal for the task of automatic voice analysis and screening.

  16. Voice-controlled Debugging of Spreadsheets

    CERN Document Server

    Flood, Derek

    2008-01-01

    Developments in Mobile Computing are putting pressure on the software industry to research new modes of interaction that do not rely on the traditional keyboard and mouse combination. Computer users suffering from Repetitive Strain Injury also seek an alternative to keyboard and mouse devices to reduce suffering in wrist and finger joints. Voice-control is an alternative approach to spreadsheet development and debugging that has been researched and used successfully in other domains. While voice-control technology for spreadsheets is available its effectiveness has not been investigated. This study is the first to compare the performance of a set of expert spreadsheet developers that debugged a spreadsheet using voice-control technology and another set that debugged the same spreadsheet using keyboard and mouse. The study showed that voice, despite its advantages, proved to be slower and less accurate. However, it also revealed ways in which the technology might be improved to redress this imbalance.

  17. Fractal Dimension of Voice-Signal Waveforms

    Institute of Scientific and Technical Information of China (English)

    2002-01-01

    The fractal dimension is one important parameter that characterizes waveforms. In this paper, we derive a new method to calculate fractal dimension of digital voice-signal waveforms. We show that fractal dimension is an efficient tool for speaker recognition or speech recognition. It can be used to identify different speakers or distinguish speech. We apply our results to Chinese speaker recognition and numerical experiment shows that fractal dimension is an efficient parameter to characterize individual Chinese speakers. We have developed a semiautomatic voiceprint analysis system based on the theory of this paper and former researches.

  18. Objective Voice Parameters in Colombian School Workers with Healthy Voices

    Directory of Open Access Journals (Sweden)

    Lady Catherine Cantor Cutiva

    2015-09-01

    Full Text Available Objectives: To characterize the objective voice parameters among school workers, and to identi­fy associated factors of three objective voice parameters, namely fundamental frequency, sound pressure level and maximum phonation time. Materials and methods: We conducted a cross-sectional study among 116 Colombian teachers and 20 Colombian non-teachers. After signing the informed consent form, participants filled out a questionnaire. Then, a voice sample was recorded and evaluated perceptually by a speech therapist and by objective voice analysis with praat software. Short-term environmental measurements of sound level, temperature, humi­dity, and reverberation time were conducted during visits at the workplaces, such as classrooms and offices. Linear regression analysis was used to determine associations between individual and work-related factors and objective voice parameters. Results: Compared with men, women had higher fundamental frequency (201 Hz for teachers and 209 for non-teachers vs. 120 Hz for teachers and 127 for non-teachers and sound pressure level (82 dB vs. 80 dB, and shorter maximum phonation time (around 14 seconds vs. around 16 seconds. Female teachers younger than 50 years of age evidenced a significant tendency to speak with lower fundamental frequen­cy and shorter mpt compared with female teachers older than 50 years of age. Female teachers had significantly higher fundamental frequency (66 Hz, higher sound pressure level (2 dB and short phonation time (2 seconds than male teachers. Conclusion: Female teachers younger than 50 years of age had significantly lower F0 and shorter mpt compared with those older than 50 years of age. The multivariate analysis showed that gender was a much more important determinant of variations in F0, spl and mpt than age and teaching occupation. Objectively measured temperature also contributed to the changes on spl among school workers.

  19. Perceiving a stranger's voice as being one's own: a 'rubber voice' illusion?

    Directory of Open Access Journals (Sweden)

    Zane Z Zheng

    Full Text Available We describe an illusion in which a stranger's voice, when presented as the auditory concomitant of a participant's own speech, is perceived as a modified version of their own voice. When the congruence between utterance and feedback breaks down, the illusion is also broken. Compared to a baseline condition in which participants heard their own voice as feedback, hearing a stranger's voice induced robust changes in the fundamental frequency (F0 of their production. Moreover, the shift in F0 appears to be feedback dependent, since shift patterns depended reliably on the relationship between the participant's own F0 and the stranger-voice F0. The shift in F0 was evident both when the illusion was present and after it was broken, suggesting that auditory feedback from production may be used separately for self-recognition and for vocal motor control. Our findings indicate that self-recognition of voices, like other body attributes, is malleable and context dependent.

  20. Dimensionality in voice quality.

    Science.gov (United States)

    Bele, Irene Velsvik

    2007-05-01

    This study concerns speaking voice quality in a group of male teachers (n = 35) and male actors (n = 36), as the purpose was to investigate normal and supranormal voices. The goal was the development of a method of valid perceptual evaluation for normal to supranormal and resonant voices. The voices (text reading at two loudness levels) had been evaluated by 10 listeners, for 15 vocal characteristics using VA scales. In this investigation, the results of an exploratory factor analysis of the vocal characteristics used in this method are presented, reflecting four dimensions of major importance for normal and supranormal voices. Special emphasis is placed on the effects on voice quality of a change in the loudness variable, as two loudness levels are studied. Furthermore, the vocal characteristics Sonority and Ringing voice quality are paid special attention, as the essence of the term "resonant voice" was a basic issue throughout a doctoral dissertation where this study was included.

  1. Voice box (image)

    Science.gov (United States)

    The larynx, or voice box, is located in the neck and performs several important functions in the body. The larynx is involved in swallowing, breathing, and voice production. Sound is produced when the air which ...

  2. Voice and Aging

    Science.gov (United States)

    ... dramatic voice changes are those during childhood and adolescence. The larynx (or voice box) and vocal cord tissues do not fully mature until late teenage years. Hormone-related changes during adolescence are ...

  3. Voice and endocrinology

    OpenAIRE

    KVS Hari Kumar; Anurag Garg; Ajai Chandra, N. S.; Singh, S. P.; Rakesh Datta

    2016-01-01

    Voice is one of the advanced features of natural evolution that differentiates human beings from other primates. The human voice is capable of conveying the thoughts into spoken words along with a subtle emotion to the tone. This extraordinary character of the voice in expressing multiple emotions is the gift of God to the human beings and helps in effective interpersonal communication. Voice generation involves close interaction between cerebral signals and the peripheral apparatus consistin...

  4. Voice-Controlled Artificial Handspeak System

    Directory of Open Access Journals (Sweden)

    Carlo Fonda

    2014-01-01

    Full Text Available A man-machine interaction project is described which aims to establish an automated voice to sign language translator for communication with the deaf using integrated open technologies. The first prototype consists of a robotic hand designed with OpenSCAD and manufactured with a low-cost 3D printer ─which smoothly reproduces the alphabet of the sign language controlled by voice only. The core automation comprises an Arduino UNO controller used to activate a set of servo motors that follow instructions from a Raspberry Pi mini-computer having installed the open source speech recognition engine Julius. We discuss its features, limitations and possible future developments.

  5. Writing with Voice

    Science.gov (United States)

    Kesler, Ted

    2012-01-01

    In this Teaching Tips article, the author argues for a dialogic conception of voice, based in the work of Mikhail Bakhtin. He demonstrates a dialogic view of voice in action, using two writing examples about the same topic from his daughter, a fifth-grade student. He then provides five practical tips for teaching a dialogic conception of voice in…

  6. Tips for Healthy Voices

    Science.gov (United States)

    ... social interaction as well as for most people’s occupation. Proper care and use of your voice will give you the best chance for having a healthy voice for your entire lifetime. Hoarseness or roughness in your voice is often ...

  7. Voice Recognition Accuracy: What Is Acceptable?

    Science.gov (United States)

    1982-11-01

    127. Modesto Sumter 128. Worchester Catskills 129. Huntsville Janesville 130. Waterville Osage Beach 131. Baton Rouge Phoenix 132. Marquette Billings...133. New Orleans Antingua 134. Walla Walla Modesto 135. Tupelo Augusta 136. Astoria Greinville 137. Catskills Bermuda 138. Atlanta Huntsville 139...78 Sort Sos 79 Type Up 80 Debug 81 Papa Alpha 82 Quebec Stack 83 Romeo Tango 84 Sierra Alpha 85 Tango Romeo 86 Uniform Nine 87 Victor

  8. Controlling An Electric Car Starter System Through Voice

    Directory of Open Access Journals (Sweden)

    A.B. Muhammad Firdaus

    2015-04-01

    Full Text Available Abstract These days automotive has turned into a stand out amongst the most well-known modes of transportation on the grounds that a large number of Malaysians could bear to have an auto. There are numerous decisions of innovations in auto that have in the market. One of the engineering is voice controlled framework. Voice Recognition is the procedure of consequently perceiving a certain statement talked by a specific speaker focused around individual data included in discourse waves. This paper is to make an car controlled by voice of human. An essential pre-processing venture in Voice Recognition systems is to recognize the vicinity of noise. Sensitivity to speech variability lacking recognition precision and helplessness to mimic are among the principle specialized obstacles that keep the far reaching selection of speech-based recognition systems. Voice recognition systems work sensibly well with a quiet conditions however inadequately under loud conditions or in twisted channels. The key focus of the project is to control an electric car starter system.

  9. Voice handicap in singers.

    Science.gov (United States)

    Murry, Thomas; Zschommler, Anne; Prokop, Jan

    2009-05-01

    The study aimed to determine the differences in responses to the Voice Handicap Index (VHI-10) between singers and nonsingers and to evaluate the ranked order differences of the VHI-10 statements for both groups. The VHI-10 was modified to include statements related to the singing voice for comparison to the original VHI-10. Thirty-five nonsingers with documented voice disorders responded to the VHI-10. A second group, consisting of 35 singers with voice complaints, responded to the VHI-10 with three statements added specifically addressing the singing voice. Data from both groups were analyzed in terms of overall subject self-rating of voice handicap and the rank order of statements from least to most important. The difference between the mean VHI-10 for the singers and nonsingers was not statistically significant, thus, supporting the validity of the VHI-10. However, the 10 statements were ranked differently in terms of their importance by both groups. In addition, when three statements related specifically to the singing voice were substituted in the original VHI-10, the singers judged their voice problem to be more severe than when using the original VHI-10. The type of statements used to assess self-perception of voice handicap may be related to the subject population. Singers with voice problems do not rate their voices to be more handicapped than nonsingers unless statements related specifically to singing are included.

  10. METHODS FOR QUALITY ENHANCEMENT OF USER VOICE SIGNAL IN VOICE AUTHENTICATION SYSTEMS

    Directory of Open Access Journals (Sweden)

    O. N. Faizulaieva

    2014-03-01

    Full Text Available The reasonability for the usage of computer systems user voice in the authentication process is proved. The scientific task for improving the signal/noise ratio of the user voice signal in the authentication system is considered. The object of study is the process of input and output of the voice signal of authentication system user in computer systems and networks. Methods and means for input and extraction of voice signal against external interference signals are researched. Methods for quality enhancement of user voice signal in voice authentication systems are suggested. As modern computer facilities, including mobile ones, have two-channel audio card, the usage of two microphones is proposed in the voice signal input system of authentication system. Meanwhile, the task of forming a lobe of microphone array in a desired area of voice signal registration (100 Hz to 8 kHz is solved. The usage of directional properties of the proposed microphone array gives the possibility to have the influence of external interference signals two or three times less in the frequency range from 4 to 8 kHz. The possibilities for implementation of space-time processing of the recorded signals using constant and adaptive weighting factors are investigated. The simulation results of the proposed system for input and extraction of signals during digital processing of narrowband signals are presented. The proposed solutions make it possible to improve the value of the signal/noise ratio of the useful signals recorded up to 10, ..., 20 dB under the influence of external interference signals in the frequency range from 4 to 8 kHz. The results may be useful to specialists working in the field of voice recognition and speaker’s discrimination.

  11. Ageing Voices: The Effect of Changes in Voice Parameters on ASR Performance

    Directory of Open Access Journals (Sweden)

    Ravichander Vipperla

    2010-01-01

    Full Text Available With ageing, human voices undergo several changes which are typically characterized by increased hoarseness and changes in articulation patterns. In this study, we have examined the effect on Automatic Speech Recognition (ASR and found that the Word Error Rates (WER on older voices is 10% absolute higher compared to those of adult voices. Subsequently, we compared several voice source parameters including fundamental frequency, jitter, shimmer, harmonicity, and cepstral peak prominence of adult and older males. Several of these parameters show statistically significant difference for the two groups. However, artificially increasing jitter and shimmer measures do not effect the ASR accuracies significantly. Artificially lowering the fundamental frequency degrades the ASR performance marginally but this drop in performance can be overcome to some extent using Vocal Tract Length Normalisation (VTLN. Overall, we observe that the changes in the voice source parameters do not have a significant impact on ASR performance. Comparison of the likelihood scores of all the phonemes for the two age groups show that there is a systematic mismatch in the acoustic space of the two age groups. Comparison of the phoneme recognition rates show that mid vowels, nasals, and phonemes that depend on the ability to create constrictions with tongue tip for articulation are more affected by ageing than other phonemes.

  12. FUNDAMENTALS OF SPEAKER RECOGNITION

    OpenAIRE

    ERTAŞ, Figen

    2000-01-01

    The explosive growth of information technology in the last decade has made a considerable impact on the design and construction of systems for human-machine communication, which is becoming increasingly important in many aspects of life. Amongst other speech processing tasks, a great deal of attention has been devoted to developing procedures that identify people from their voices, and the design and construction of speaker recognition systems has been a fascinating enterprise pursued over ma...

  13. Smartphones Offer New Opportunities in Clinical Voice Research.

    Science.gov (United States)

    Manfredi, C; Lebacq, J; Cantarella, G; Schoentgen, J; Orlandi, S; Bandini, A; DeJonckere, P H

    2017-01-01

    Smartphone technology provides new opportunities for recording standardized voice samples of patients and sending the files by e-mail to the voice laboratory. This drastically improves the collection of baseline data, as used in research on efficiency of voice treatments. However, the basic requirement is the suitability of smartphones for recording and digitizing pathologic voices (mainly characterized by period perturbations and noise) without significant distortion. In this experiment, two smartphones (a very inexpensive one and a high-level one) were tested and compared with direct microphone recordings in a soundproof room. The voice stimuli consisted in synthesized deviant voice samples (median of fundamental frequency: 120 and 200 Hz) with three levels of jitter and three levels of added noise. All voice samples were analyzed using PRAAT software. The results show high correlations between jitter, shimmer, and noise-to-harmonics ratio measured on the recordings via both smartphones, the microphone, and measured directly on the sound files from the synthesizer. Smartphones thus appear adequate for reliable recording and digitizing of pathologic voices. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  14. Singing voice outcomes following singing voice therapy.

    Science.gov (United States)

    Dastolfo-Hromack, Christina; Thomas, Tracey L; Rosen, Clark A; Gartner-Schmidt, Jackie

    2016-11-01

    The objectives of this study were to describe singing voice therapy (SVT), describe referred patient characteristics, and document the outcomes of SVT. Retrospective. Records of patients receiving SVT between June 2008 and June 2013 were reviewed (n = 51). All diagnoses were included. Demographic information, number of SVT sessions, and symptom severity were retrieved from the medical record. Symptom severity was measured via the 10-item Singing Voice Handicap Index (SVHI-10). Treatment outcome was analyzed by diagnosis, history of previous training, and SVHI-10. SVHI-10 scores decreased following SVT (mean change = 11, 40% decrease) (P singing lessons (n = 10) also completed an average of three SVT sessions. Primary muscle tension dysphonia (MTD1) and benign vocal fold lesion (lesion) were the most common diagnoses. Most patients (60%) had previous vocal training. SVHI-10 decrease was not significantly different between MTD and lesion. This is the first outcome-based study of SVT in a disordered population. Diagnosis of MTD or lesion did not influence treatment outcomes. Duration of SVT was short (approximately three sessions). Voice care providers are encouraged to partner with a singing voice therapist to provide optimal care for the singing voice. This study supports the use of SVT as a tool for the treatment of singing voice disorders. 4 Laryngoscope, 126:2546-2551, 2016. © 2016 The American Laryngological, Rhinological and Otological Society, Inc.

  15. Multimodal user input to supervisory control systems - Voice-augmented keyboard

    Science.gov (United States)

    Mitchell, Christine M.; Forren, Michelle G.

    1987-01-01

    The use of a voice-augmented keyboard input modality is evaluated in a supervisory control application. An implementation of voice recognition technology in supervisory control is proposed: voice is used to request display pages, while the keyboard is used to input system reconfiguration commands. Twenty participants controlled GT-MSOCC, a high-fidelity simulation of the operator interface to a NASA ground control system, via a workstation equipped with either a single keyboard or a voice-augmented keyboard. Experimental results showed that in all cases where significant performance differences occurred, performance with the voice-augmented keyboard modality was inferior to and had greater variance than the keyboard-only modality. These results suggest that current moderately priced voice recognition systems are an inappropriate human-computer interaction technology in supervisory control systems.

  16. FUNDAMENTALS OF SPEAKER RECOGNITION

    Directory of Open Access Journals (Sweden)

    Figen ERTAŞ

    2000-02-01

    Full Text Available The explosive growth of information technology in the last decade has made a considerable impact on the design and construction of systems for human-machine communication, which is becoming increasingly important in many aspects of life. Amongst other speech processing tasks, a great deal of attention has been devoted to developing procedures that identify people from their voices, and the design and construction of speaker recognition systems has been a fascinating enterprise pursued over many decades. This paper introduces speaker recognition in general and discusses its relevant parameters in relation to system performance.

  17. Design of a digital voice data compression technique for orbiter voice channels

    Science.gov (United States)

    1975-01-01

    Candidate techniques were investigated for digital voice compression to a transmission rate of 8 kbps. Good voice quality, speaker recognition, and robustness in the presence of error bursts were considered. The technique of delayed-decision adaptive predictive coding is described and compared with conventional adaptive predictive coding. Results include a set of experimental simulations recorded on analog tape. The two FM broadcast segments produced show the delayed-decision technique to be virtually undegraded or minimally degraded at .001 and .01 Viterbi decoder bit error rates. Preliminary estimates of the hardware complexity of this technique indicate potential for implementation in space shuttle orbiters.

  18. Recognition measured values

    OpenAIRE

    LEITKEP, Zdeněk

    2012-01-01

    This work deals recognition measured values. The main task is to find suitable method for preprocessing images and create interface to software performing recognition. Created application will be used primarily to analyze the photos on site acquisition. Application is developed in Java and properly documented on javadoc level.

  19. Handwritten Digits Recognition

    OpenAIRE

    Grand, Eric

    2000-01-01

    My work of diploma consisted in developing a Windows application for the recognition of the handwritten digits. The source images come from a pen-scanner. The user can also draw the digits directly with the mouse and do the recognition of it. In this software, I integrated the SVM Light reconizer.

  20. Clinical Voices - an update

    DEFF Research Database (Denmark)

    Fusaroli, Riccardo; Weed, Ethan

    Anomalous aspects of speech and voice, including pitch, fluency, and voice quality, are reported to characterise many mental disorders. However, it has proven difficult to quantify and explain this oddness of speech by employing traditional statistical methods. In this talk we will show how the t...... the temporal dynamics of the voice in Asperger's patients enable us to automatically reconstruct the diagnosis, and assess the Autism quotient score. We then generalise the findings to Danish and American children with autism....

  1. Effects of Medications on Voice

    Science.gov (United States)

    ... ENT Doctor Near You Effects of Medications on Voice Effects of Medications on Voice Patient Health Information ... entnet.org . Could Your Medication Be Affecting Your Voice? Some medications including prescription, over-the-counter, and ...

  2. Voice-based assessments of trustworthiness, competence, and warmth in blind and sighted adults

    OpenAIRE

    Oleszkiewicz, Anna; Pisanski, Katarzyna; Lachowicz-Tabaczek, Kinga; Sorokowska, Agnieszka

    2016-01-01

    The study of voice perception in congenitally blind individuals allows researchers rare insight into how a lifetime of visual deprivation affects the development of voice perception. Previous studies have suggested that blind adults outperform their sighted counterparts in low-level auditory tasks testing spatial localization and pitch discrimination, as well as in verbal speech processing; however, blind persons generally show no advantage in nonverbal voice recognition or discrimination tas...

  3. Onset and Maturation of Fetal Heart Rate Response to the Mother's Voice over Late Gestation

    Science.gov (United States)

    Kisilevsky, Barbara S.; Hains, Sylvia M. J.

    2011-01-01

    Background: Term fetuses discriminate their mother's voice from a female stranger's, suggesting recognition/learning of some property of her voice. Identification of the onset and maturation of the response would increase our understanding of the influence of environmental sounds on the development of sensory abilities and identify the period when…

  4. EXPERIMENTAL STUDY OF FIRMWARE FOR INPUT AND EXTRACTION OF USER’S VOICE SIGNAL IN VOICE AUTHENTICATION SYSTEMS

    Directory of Open Access Journals (Sweden)

    O. N. Faizulaieva

    2014-09-01

    Full Text Available Scientific task for improving the signal-to-noise ratio for user’s voice signal in computer systems and networks during the process of user’s voice authentication is considered. The object of study is the process of input and extraction of the voice signal of authentication system user in computer systems and networks. Methods and means for input and extraction of the voice signal on the background of external interference signals are investigated. Ways for quality improving of the user’s voice signal in systems of voice authentication are investigated experimentally. Firmware means for experimental unit of input and extraction of the user’s voice signal against external interference influence are considered. As modern computer means, including mobile, have two-channel audio card, two microphones are used in the voice signal input. The distance between sonic-wave sensors is 20 mm and it provides forming one direction pattern lobe of microphone array in a desired area of voice signal registration (from 100 Hz to 8 kHz. According to the results of experimental studies, the usage of directional properties of the proposed microphone array and space-time processing of the recorded signals with implementation of constant and adaptive weighting factors has made it possible to reduce considerably the influence of interference signals. The results of firmware experimental studies for input and extraction of the user’s voice signal against external interference influence are shown. The proposed solutions will give the possibility to improve the value of the signal/noise ratio of the useful signals recorded up to 20 dB under the influence of external interference signals in the frequency range from 4 to 8 kHz. The results may be useful to specialists working in the field of voice recognition and speaker discrimination.

  5. Interface Everywhere: Further Development of a Gesture and Voice Commanding Interface Project

    Data.gov (United States)

    National Aeronautics and Space Administration — Natural User Interface (NUI) is a term used to describe a number of technologies such as speech recognition, multi-touch, and kinetic interfaces. Gesture and voice...

  6. Assessment voice synthesizers for reading in digital books

    Directory of Open Access Journals (Sweden)

    Sérvulo Fernandes da Silva Neto

    2013-07-01

    Full Text Available The digital accessibility shows ways to information access in digital media that assist people with different types of disabilities to a better interaction with the computer independent of its limitations. Of these tools are composed by voice synthesizers, that supposedly simplifying their access to any recorded knowledge through digital technologies. However such tools have emerged originally in countries foreign language. Which brings us to the following research problem: the voice synthesizers are appropriate for reading digital books in the Portuguese language? The objective of this study was to analyze and classify different software tools voice synthesizers in combination with software digital book readers to support accessibility to e-books in Portuguese. Through literature review were identified applications software voice synthesizers, composing the sample analyzed in this work. We used a simplified version of the method of Multiple Criteria Decision Support - MMDA, to assess these. In the research 12 were considered readers of e-books and 11 software voice synthesizer, tested with six formats of e-books (E-pub, PDF, HTML, DOC, TXT, and Mobi. In accordance with the results, the software Virtual Vision achieved the highest score. Relative to formats, it was found that the PDF has measured a better score when summed the results of the three synthesizers. In the studied universe contacted that many synthesizers simply cannot be used because they did not support the Portuguese language.

  7. Voiced Reading and Rhythm

    Institute of Scientific and Technical Information of China (English)

    詹艳萍

    2007-01-01

    Since voiced reading is an important way in learning English,rhythm is the most critical factor that enables to read beautifully.This article illustrates the relationship between rhythm and voiced reading,the importance of rhythm,and the methods to develop the sense of rhythm.

  8. Clinical Voices - an update

    DEFF Research Database (Denmark)

    Fusaroli, Riccardo; Weed, Ethan

    Anomalous aspects of speech and voice, including pitch, fluency, and voice quality, are reported to characterise many mental disorders. However, it has proven difficult to quantify and explain this oddness of speech by employing traditional statistical methods. In this talk we will show how...

  9. Borderline Space for Voice

    Science.gov (United States)

    Batchelor, Denise

    2012-01-01

    Being on the borderline as a student in higher education is not always negative, to do with marginalisation, exclusion and having a voice that is vulnerable. Paradoxically, being on the edge also has positive connections with integration, inclusion and having a voice that is strong. Alternative understandings of the concept of borderline space can…

  10. Voice and endocrinology

    Directory of Open Access Journals (Sweden)

    KVS Hari Kumar

    2016-01-01

    Full Text Available Voice is one of the advanced features of natural evolution that differentiates human beings from other primates. The human voice is capable of conveying the thoughts into spoken words along with a subtle emotion to the tone. This extraordinary character of the voice in expressing multiple emotions is the gift of God to the human beings and helps in effective interpersonal communication. Voice generation involves close interaction between cerebral signals and the peripheral apparatus consisting of the larynx, vocal cords, and trachea. The human voice is susceptible to the hormonal changes throughout life right from the puberty until senescence. Thyroid, gonadal and growth hormones have tremendous impact on the structure and function of the vocal apparatus. The alteration of voice is observed even in physiological states such as puberty and menstruation. Astute clinical observers make out the changes in the voice and refer the patients for endocrine evaluation. In this review, we shall discuss the hormonal influence on the voice apparatus in normal and endocrine disorders.

  11. Face the voice

    DEFF Research Database (Denmark)

    Lønstrup, Ansa

    2014-01-01

    will be based on a reception aesthetic and phenomenological approach, the latter as presented by Don Ihde in his book Listening and Voice. Phenomenologies of Sound , and my analytical sketches will be related to theoretical statements concerning the understanding of voice and media (Cavarero, Dolar, La...

  12. Ontario's Student Voice Initiative

    Science.gov (United States)

    Courtney, Jean

    2014-01-01

    This article describes in some detail aspects of the Student Voice initiative funded and championed by Ontario's Ministry of Education since 2008. The project enables thousands of students to make their voices heard in meaningful ways and to participate in student-led research. Some students from grades 7 to 12 become members of the Student…

  13. EasyVoice: Integrating voice synthesis with Skype

    CERN Document Server

    Condado, Paulo A

    2007-01-01

    This paper presents EasyVoice, a system that integrates voice synthesis with Skype. EasyVoice allows a person with voice disabilities to talk with another person located anywhere in the world, removing an important obstacle that affect these people during a phone or VoIP-based conversation.

  14. Objective and Subjective Voice Examination in Korean Medicine

    Directory of Open Access Journals (Sweden)

    Junsang Yu

    2014-09-01

    Full Text Available Objectives: When a person speaks, voice problems usually include pain or discomfort and/or difficulties in terms of the pitch, the loudness and the quality of the voice. When patients with voice problems induced by stroke, Parkinson’s disease, and systemic diseases involving the voice are examined, generally, of the Four Diagnoses (四診, a Diagnosis of Hearing can be used in current Korean medicine. The effects of acupuncture and herb medicine on voice problems have been reported for over 20 years. However, when it comes to improvements, objective and subjective evaluation methods need to be explained. Methods: Subjective methods for evaluating voice were studied through a literature search of old medicinal books containing Korean medicine diagnostics, and an objective evaluation method using Praat software is presented. Results: Korean medicine doctors analyze the patient’s voice in clinical settings unconsciously on a daily basis. However, most voice diagnoses depend on the doctor’s subjective evaluation. Voice qualities can be evaluated by using the Eight Principles (八綱, including Yin-Yang; the Five Elements (Phases; the Grade, Roughness, Breathy, Asthenic, Strained (GRBAS score, and the Visual Analogue Scale (VAS as subjective methods, and an acoustic analysis using the Praat program can be used as an objective method. Conclusion: A more complete voice examination can be achieved by using subjective and objective methods at the same time. For an objective explanation and management of patient’s voice problems or systemic disorders, an objective method should be used in Korean medicine, which already has many subjective diagnostic methods. More research needs to be conducted, and more clinical evidence needs to be collected in the future.

  15. Voice Savers for Music Teachers

    Science.gov (United States)

    Cookman, Starr

    2012-01-01

    Music teachers are in a class all their own when it comes to voice use. These elite vocal athletes require stamina, strength, and flexibility from their voices day in, day out for hours at a time. Voice rehabilitation clinics and research show that music education ranks high among the professionals most commonly affected by voice problems.…

  16. Mandarin recognition over the telephone

    Science.gov (United States)

    Kao, Yuhung

    1996-06-01

    Mandarin Chinese is the official language in China and Taiwan, it is the native language of a quarter of the world population. As the services enabled by speech recognition technology (e.g. telephone voice dialing, information query) become more popular in English, we would like to extend this capability to other languages. Mandarin is one of the major languages under research in our laboratory. This paper describes how we extend our work in English speech recognition into Mandarin. We will described the corpus: Voice Across Taiwan, the training of a complete set of Mandarin syllable models, preliminary performance results and error analysis. A fast prototyping system was built, where a user can write any context free grammar with no restriction of vocabulary, then the grammar can be compiled into recognition models. It enables user to quickly test the performance of a new vocabulary.

  17. Speech Rate Control for Improving Elderly Speech Recognition of Smart Devices

    Directory of Open Access Journals (Sweden)

    SON, G.

    2017-05-01

    Full Text Available Although smart devices have become a widely-adopted tool for communication in modern society, it still requires a steep learning curve among the elderly. By introducing a voice-based interface for smart devices using voice recognition technology, smart devices can become more user-friendly and useful to the elderly. However, the voice recognition technology used in current devices is attuned to the voice patterns of the young. Therefore, speech recognition falters when an elderly user speaks into the device. This paper has identified that the elderly's improper speech rate by each syllable contributes to the failure in the voice recognition system. Thus, upon modifying the speech rate by each syllable, the voice recognition rate saw an increase of 12.3%. This paper demonstrates that by simply modifying the speech rate by each syllable, which is one of the factors that causes errors in voice recognition, the recognition rate can be substantially increased. Such improvements in voice recognition technology can make it easier for the elderly to operate smart devices that will allow them to be more socially connected in a mobile world and access information at their fingertips. It may also be helpful in bridging the communication divide between generations.

  18. Voice-to-Phoneme Conversion Algorithms for Voice-Tag Applications in Embedded Platforms

    Directory of Open Access Journals (Sweden)

    Yan Ming Cheng

    2008-08-01

    Full Text Available We describe two voice-to-phoneme conversion algorithms for speaker-independent voice-tag creation specifically targeted at applications on embedded platforms. These algorithms (batch mode and sequential are compared in speech recognition experiments where they are first applied in a same-language context in which both acoustic model training and voice-tag creation and application are performed on the same language. Then, their performance is tested in a cross-language setting where the acoustic models are trained on a particular source language while the voice-tags are created and applied on a different target language. In the same-language environment, both algorithms either perform comparably to or significantly better than the baseline where utterances are manually transcribed by a phonetician. In the cross-language context, the voice-tag performances vary depending on the source-target language pair, with the variation reflecting predicted phonological similarity between the source and target languages. Among the most similar languages, performance nears that of the native-trained models and surpasses the native reference baseline.

  19. Additive attacks on speaker recognition

    Science.gov (United States)

    Farrokh Baroughi, Alireza; Craver, Scott

    2014-02-01

    Speaker recognition is used to identify a speaker's voice from among a group of known speakers. A common method of speaker recognition is a classification based on cepstral coefficients of the speaker's voice, using a Gaussian mixture model (GMM) to model each speaker. In this paper we try to fool a speaker recognition system using additive noise such that an intruder is recognized as a target user. Our attack uses a mixture selected from a target user's GMM model, inverting the cepstral transformation to produce noise samples. In our 5 speaker data base, we achieve an attack success rate of 50% with a noise signal at 10dB SNR, and 95% by increasing noise power to 0dB SNR. The importance of this attack is its simplicity and flexibility: it can be employed in real time with no processing of an attacker's voice, and little computation is needed at the moment of detection, allowing the attack to be performed by a small portable device. For any target user, knowing that user's model or voice sample is sufficient to compute the attack signal, and it is enough that the intruder plays it while he/she is uttering to be classiffed as the victim.

  20. Dominant Voice in Hamlet

    Institute of Scientific and Technical Information of China (English)

    李丹

    2015-01-01

    <正>The Tragedy of Hamlet dramatizes the revenge Prince Hamlet exacts on his uncle Claudius for murdering King Hamlet,Claudius’s brother and Prince Hamlet’s father,and then succeeding to the throne and taking as his wife Gertrude,the old king’s widow and Prince Hamlet’s mother.This paper will discuss something about dominant voice in the play.Dominant voice is the major voice in the country,the society,or the whole world.Those people who have the power or

  1. A Single Case Design Evaluation of a Software and Tutor Intervention Addressing Emotion Recognition and Social Interaction in Four Boys with ASD

    Science.gov (United States)

    Lacava, Paul G.; Rankin, Ana; Mahlios, Emily; Cook, Katie; Simpson, Richard L.

    2010-01-01

    Many students with Autism Spectrum Disorders (ASD) have delays learning to recognize emotions. Social behavior is also challenging, including initiating interactions, responding to others, developing peer relationships, and so forth. In this single case design study we investigated the relationship between use of computer software ("Mind Reading:…

  2. A Comparison of Text, Voice, and Screencasting Feedback to Online Students

    Science.gov (United States)

    Orlando, John

    2016-01-01

    The emergence of simple video and voice recording software has allowed faculty to deliver online course content in a variety of rich formats. But most faculty are still using traditional text comments for feedback to students. The author launched a study comparing student and faculty perceptions of text, voice, and screencasting feedback. The…

  3. Ethnographic Voice Memo Narratives

    DEFF Research Database (Denmark)

    Rasmussen, Mette Apollo; Conradsen, Maria Bosse

    1800-01-01

    -based technique which actively involves actors in producing ethnography-based data concerning their everyday practice. With the help from smartphone technology it is possible to complement ethnography-based research methods when involving the actors and having them create small voice memo narratives. The voice...... memos create insights of actors‟ everyday practice, without the direct presence of a researcher and could be considered a step towards meeting the dilemmas of research in complex fieldwork settings....

  4. Voice-Controlled Artificial Handspeak System

    Directory of Open Access Journals (Sweden)

    Jonathan Gatti

    2014-04-01

    Full Text Available A man-machine interaction project is described whic h aims to establish an automated voice to sign language translator for communication with the deaf using integrated open technologies. The first prototype consists of a robotic hand designed with OpenSCAD and manufactured with a low-cost 3D printer which smoothly reproduces the alphabet of the sign language controlled by voice only. The core automation comprises an Arduino UNO controller used to activate a set of servo motors that follow instructions from a Raspberry Pi mini-computer havi ng installed the open source speech recognition eng ine Julius. We discuss its features, limitations and po ssible future developmen

  5. Speech Recognition: Its Place in Business Education.

    Science.gov (United States)

    Szul, Linda F.; Bouder, Michele

    2003-01-01

    Suggests uses of speech recognition devices in the classroom for students with disabilities. Compares speech recognition software packages and provides guidelines for selection and teaching. (Contains 14 references.) (SK)

  6. SPEECH EMOTION RECOGNITION USING MODIFIED QUADRATIC DISCRIMINATION FUNCTION

    Institute of Scientific and Technical Information of China (English)

    2008-01-01

    Quadratic Discrimination Function(QDF)is commonly used in speech emotion recognition,which proceeds on the premise that the input data is normal distribution.In this Paper,we propose a transformation to normalize the emotional features,then derivate a Modified QDF(MQDF) to speech emotion recognition.Features based on prosody and voice quality are extracted and Principal Component Analysis Neural Network (PCANN) is used to reduce dimension of the feature vectors.The results show that voice quality features are effective supplement for recognition.and the method in this paper could improve the recognition ratio effectively.

  7. Voice disorders without organic diseases of the larynx. A 10-year review of 62 patients.

    Science.gov (United States)

    Watanabe, Y; Miura, M; Shoji, H

    1983-01-01

    We reviewed the clinical records of 62 patients with voice disorders without organic diseases of the larynx who were examined in the Department of Otolaryngology, Kurume University Hospital during the 10 years from 1971 to 1980. There were 9 patients with psychogenic dysphonia, 24 with vocal abuse, 4 with spastic dysphonia, 9 with mutational voice disturbance, 8 with virilization of voice, 7 with dysphonia attributed to diseases of other organs than the larynx, and 1 with senile change of voice. The clinical service to the patients with these kinds of dysphonia has been poor in Japan for lack of trained voice pathologists. This has been also the case with our department. Recognition for the necessity of voice pathologists is strongly demanded.

  8. Multidimensional assessment of strongly irregular voices such as in substitution voicing and spasmodic dysphonia: a compilation of own research.

    Science.gov (United States)

    Moerman, Mieke; Martens, Jean-Pierre; Dejonckere, Philippe

    2015-04-01

    This article is a compilation of own research performed during the European COoperation in Science and Technology (COST) action 2103: 'Advance Voice Function Assessment', an initiative of voice and speech processing teams consisting of physicists, engineers, and clinicians. This manuscript concerns analyzing largely irregular voicing types, namely substitution voicing (SV) and adductor spasmodic dysphonia (AdSD). A specific perceptual rating scale (IINFVo) was developed, and the Auditory Model Based Pitch Extractor (AMPEX), a piece of software that automatically analyses running speech and generates pitch values in background noise, was applied. The IINFVo perceptual rating scale has been shown to be useful in evaluating SV. The analysis of strongly irregular voices stimulated a modification of the European Laryngological Society's assessment protocol which was originally designed for the common types of (less severe) dysphonia. Acoustic analysis with AMPEX demonstrates that the most informative features are, for SV, the voicing-related acoustic features and, for AdSD, the perturbation measures. Poor correlations between self-assessment and acoustic and perceptual dimensions in the assessment of highly irregular voices argue for a multidimensional approach.

  9. Arabic Speech Recognition System using CMU-Sphinx4

    CERN Document Server

    Satori, H; Chenfour, N

    2007-01-01

    In this paper we present the creation of an Arabic version of Automated Speech Recognition System (ASR). This system is based on the open source Sphinx-4, from the Carnegie Mellon University. Which is a speech recognition system based on discrete hidden Markov models (HMMs). We investigate the changes that must be made to the model to adapt Arabic voice recognition. Keywords: Speech recognition, Acoustic model, Arabic language, HMMs, CMUSphinx-4, Artificial intelligence.

  10. AN APPLICATION OF SPEAKER RECOGNITION USING ARTIFICIAL NEURAL NETWORKS

    Directory of Open Access Journals (Sweden)

    Murat CANER

    2006-02-01

    Full Text Available In this study an artificial neural network (ANN is implemented, which has been used frequently as an implementation model in recent years, to recognize speaker identification. Generally, recognition is consist of three stages that, processing of signal, obtaining attributes and comparing them. Speech samples are transformed into digital data according to voice card of PC. In the analysis of voice stage, recurrent periods and white noise of voice data are trimmed by hamming window method and voice attribute part of the digital data is obtained. For obtaining attribute of voice data LPC (linear predictive coding and DFT (discrete fourier transform methods are used. Of those 28 coefficents, that is used for speaker recognition, 16 were obtained by the analysis of DFT and 12 were obtained by the analysis of LPC. The parameters that represent speaker voice, is used for training and test of ANN. Multilayer perceptron model is used as an architecture of ANN and backpropagation algorithm is used for training method. Voices of "a" is taken from 7 different person and their attributes are found. ANN is trained with these features to find the speaker who is the owner of the sample voice. And then using the test data that is not used for training part, recognition achievement of ANN is tested. As a result, good results were obtained with low failure rate.

  11. Speech recognition systems on the Cell Broadband Engine

    Energy Technology Data Exchange (ETDEWEB)

    Liu, Y; Jones, H; Vaidya, S; Perrone, M; Tydlitat, B; Nanda, A

    2007-04-20

    In this paper we describe our design, implementation, and first results of a prototype connected-phoneme-based speech recognition system on the Cell Broadband Engine{trademark} (Cell/B.E.). Automatic speech recognition decodes speech samples into plain text (other representations are possible) and must process samples at real-time rates. Fortunately, the computational tasks involved in this pipeline are highly data-parallel and can receive significant hardware acceleration from vector-streaming architectures such as the Cell/B.E. Identifying and exploiting these parallelism opportunities is challenging, but also critical to improving system performance. We observed, from our initial performance timings, that a single Cell/B.E. processor can recognize speech from thousands of simultaneous voice channels in real time--a channel density that is orders-of-magnitude greater than the capacity of existing software speech recognizers based on CPUs (central processing units). This result emphasizes the potential for Cell/B.E.-based speech recognition and will likely lead to the future development of production speech systems using Cell/B.E. clusters.

  12. Voice Therapy Practices and Techniques: A Survey of Voice Clinicians.

    Science.gov (United States)

    Mueller, Peter B.; Larson, George W.

    1992-01-01

    Eighty-three voice disorder therapists' ratings of statements regarding voice therapy practices indicated that vocal nodules are the most frequent disorder treated; vocal abuse and hard glottal attack elimination, counseling, and relaxation were preferred treatment approaches; and voice therapy is more effective with adults than with children.…

  13. Voice in early glottic cancer compared to benign voice pathology

    NARCIS (Netherlands)

    Van Gogh, C. D. L.; Mahieu, H. F.; Kuik, D. J.; Rinkel, R. N. P. M.; Langendijk, J. A.; Verdonck-de Leeuw, I. M.

    2007-01-01

    The purpose of this study is to compare (Dutch) Voice Handicap Index (VHIvumc) scores from a selected group of patients with voice problems after treatment for early glottic cancer with patients with benign voice disorders and subjects from the normal population. The study included a group of 35 pat

  14. The inner voice

    Directory of Open Access Journals (Sweden)

    Anthony James Ridgway

    2009-12-01

    Full Text Available The inner voice- we all know what it is because we all have it and use it when we are thinking or reading, for example. Little work has been done on it in our field, with the notable exception of Brian Tomlinson, but presumably it must be a cognitive phenomenon which is of great importance in thinking, language learning, and reading in a foreign language. The inner voice will be discussed as a cognitive psychological phenomenon associated with short-term memory, and distinguished from the inner ear. The process of speech recoding will be examined (the process of converting written language into the inner voice and the importance of developing the inner voice, as a means of both facilitating the production of a new language and enhancing the comprehension of a text in a foreign language, will be emphasized. Finally, ways of developing the inner voice in beginning and intermediate readers of a foreign language will be explored and recommended.

  15. Smartphone App for Voice Disorders

    Science.gov (United States)

    ... on. Feature: Taste, Smell, Hearing, Language, Voice, Balance Smartphone App for Voice Disorders Past Issues / Fall 2013 ... developed a mobile monitoring device that relies on smartphone technology to gather a week's worth of talking, ...

  16. The effects of thematic context and presentation mode on memory for sentence voice.

    Science.gov (United States)

    Kerr, N H; Butler, S F; Maykuth, P L; Delis, D

    1982-05-01

    A sentence in discourse may appear in the passive voice to emphasize the logical object rather than the logical subject when it is thematically more important. Two experiments are reported that explore the impact of this textual function of voice on sentence memory. The first experiment required subjects to listen to prose passages and then recall them. Sentences were recalled predominantly in the active voice regardless of voice or thematic focus in the prose passage, showing that the English-language bias for the active voice was a more important determinant of sentence reconstruction than was the experiment manipulation of thematic context. The second experiment required subjects to listen to or read either prose passages or lists of unrelated sentences and then to try to recognize "key" sentences that were either unchanged or changed lexically, semantically, or in voice. Recognition, both overall and specifically for voice, was better for sentences that were read than for those that were heard, and recognition for semantic change was consistently higher than for any other. Only when passages were read was there evidence in support of a thematic textual influence on memory for sentence voice.

  17. Voice-based assessments of trustworthiness, competence, and warmth in blind and sighted adults.

    Science.gov (United States)

    Oleszkiewicz, Anna; Pisanski, Katarzyna; Lachowicz-Tabaczek, Kinga; Sorokowska, Agnieszka

    2017-06-01

    The study of voice perception in congenitally blind individuals allows researchers rare insight into how a lifetime of visual deprivation affects the development of voice perception. Previous studies have suggested that blind adults outperform their sighted counterparts in low-level auditory tasks testing spatial localization and pitch discrimination, as well as in verbal speech processing; however, blind persons generally show no advantage in nonverbal voice recognition or discrimination tasks. The present study is the first to examine whether visual experience influences the development of social stereotypes that are formed on the basis of nonverbal vocal characteristics (i.e., voice pitch). Groups of 27 congenitally or early-blind adults and 23 sighted controls assessed the trustworthiness, competence, and warmth of men and women speaking a series of vowels, whose voice pitches had been experimentally raised or lowered. Blind and sighted listeners judged both men's and women's voices with lowered pitch as being more competent and trustworthy than voices with raised pitch. In contrast, raised-pitch voices were judged as being warmer than were lowered-pitch voices, but only for women's voices. Crucially, blind and sighted persons did not differ in their voice-based assessments of competence or warmth, or in their certainty of these assessments, whereas the association between low pitch and trustworthiness in women's voices was weaker among blind than sighted participants. This latter result suggests that blind persons may rely less heavily on nonverbal cues to trustworthiness compared to sighted persons. Ultimately, our findings suggest that robust perceptual associations that systematically link voice pitch to the social and personal dimensions of a speaker can develop without visual input.

  18. The Voice of the Computer Is Heard in the Land

    Institute of Scientific and Technical Information of China (English)

    刘堃

    2002-01-01

    In the past year, products capable of turning nonstop speech into text have entered the commercial marketplace. Via Voice from IBM Corp. is one instance, and Naturally Speaking from Dragon Systems Inc.…, is another. Both systems consist of software written for IBM-compatible PCs with Windows 95 or NT,

  19. Sustainable Consumer Voices

    DEFF Research Database (Denmark)

    Klitmøller, Anders; Rask, Morten; Jensen, Nevena

    2011-01-01

    Aiming to explore how user driven innovation can inform high level design strategies, an in-depth empirical study was carried out, based on data from 50 observations of private vehicle users. This paper reports the resulting 5 consumer voices: Technology Enthusiast, Environmentalist, Design Lover......, Pragmatist and Status Seeker. Expedient use of the voices in creating design strategies is discussed, thus contributing directly to the practice of high level design managers. The main academic contribution of this paper is demonstrating how applied anthropology can be used to generate insights...... into disruptive emergence of product service systems, where quantitative user analyses rely on historical continuation....

  20. Voice application development for Android

    CERN Document Server

    McTear, Michael

    2013-01-01

    This book will give beginners an introduction to building voice-based applications on Android. It will begin by covering the basic concepts and will build up to creating a voice-based personal assistant. By the end of this book, you should be in a position to create your own voice-based applications on Android from scratch in next to no time.Voice Application Development for Android is for all those who are interested in speech technology and for those who, as owners of Android devices, are keen to experiment with developing voice apps for their devices. It will also be useful as a starting po

  1. Evaluation of a blood glucose monitoring system with automatic high- and low-pattern recognition software in insulin-using patients: pattern detection and patient-reported insights.

    Science.gov (United States)

    Grady, Mike; Campbell, Denise; MacLeod, Kirsty; Srinivasan, Aparna

    2013-07-01

    This study aimed to evaluate the performance of a glucose pattern recognition tool incorporated in a blood glucose monitoring system (BGMS) and its association with clinical measures, and to assess user perception and understanding of the pattern messages they receive. Participants had type 1 or type 2 diabetes mellitus and were self-adjusting insulin doses for ≥1 year. During a 4-week home testing period, participants performed ≥6 daily self-tests, adjusted their insulin regimen based on BGMS results, and recorded pattern messages in the logbook. Participants reflected on usability of the pattern tool in a questionnaire. Study participants (n = 101) received a mean ± standard deviation of 4.5 ± 1.9 pattern messages per week (3.6 ± 1.8 high glucose patterns and 0.9 ± 1.3 low glucose patterns). Most received ≥1 high (96.5%) and/or ≥1 low (46.0%) pattern message per week. The average number of high- and low-pattern messages per week was associated with higher and lower, respectively, baseline hemoglobin A1c (p < .01) and fasting plasma glucose (p < .05). Participants found high- and low-pattern messages clear and easy to understand (84.2% and 83.2%, respectively) and considered the frequency of low (82.0%) and high (63.4%) pattern messages about right. Overall, 71.3% of participants indicated they preferred to use a meter with pattern messages. The on-device Pattern tool identified meaningful blood glucose patterns, highlighting potential opportunities for improving glycemic control in patients who self-adjust their insulin. © 2013 Diabetes Technology Society.

  2. Internet-Based System for Voice Communication With the ISS

    Science.gov (United States)

    Chamberlain, James; Myers, Gerry; Clem, David; Speir, Terri

    2005-01-01

    The Internet Voice Distribution System (IVoDS) is a voice-communication system that comprises mainly computer hardware and software. The IVoDS was developed to supplement and eventually replace the Enhanced Voice Distribution System (EVoDS), which, heretofore, has constituted the terrestrial subsystem of a system for voice communications among crewmembers of the International Space Station (ISS), workers at the Payloads Operations Center at Marshall Space Flight Center, principal investigators at diverse locations who are responsible for specific payloads, and others. The IVoDS utilizes a communication infrastructure of NASA and NASArelated intranets in addition to, as its name suggests, the Internet. Whereas the EVoDS utilizes traditional circuitswitched telephony, the IVoDS is a packet-data system that utilizes a voice over Internet protocol (VOIP). Relative to the EVoDS, the IVoDS offers advantages of greater flexibility and lower cost for expansion and reconfiguration. The IVoDS is an extended version of a commercial Internet-based voice conferencing system that enables each user to participate in only one conference at a time. In the IVoDS, a user can receive audio from as many as eight conferences simultaneously while sending audio to one of them. The IVoDS also incorporates administrative controls, beyond those of the commercial system, that provide greater security and control of the capabilities and authorizations for talking and listening afforded to each user.

  3. Dissociating the cortical basis of memory for voices, words and tones.

    Science.gov (United States)

    Stevens, Alexander A

    2004-01-01

    Human speech carries both linguistic content and information about the speaker's identity and affect. While neuroimaging has been used extensively to study verbal memory, there has been little attention to the neural basis of memory for voices. Evidence from studies of aphasia and auditory agnosia suggests that voice memory may rely on anatomically distinct areas in the right temporal and parietal lobes regions, but there is little data on the broader neural systems involved in voice memory. The present study tested the hypothesis that the neural systems involved in voice memory are functionally distinct from the systems involved in word recognition and are primarily located in the right cerebral hemisphere. Subjects performed two-back tasks in which they were required to alternately remember the voices speaking (Voice condition), and the words they produced (Word condition). A tone memory condition was also included, as a non-speech comparison. The contrast between the Voice and Word conditions revealed greater Voice-related effects in left temporal, right frontal and right medial parietal areas, while the Word-related effects appeared in left frontal and bilateral parietal areas. These findings map out a partially right-lateralized fronto-parietal network associated with voice memory, which can be distinguished from predominantly left-hemisphere regions associated with verbal working memory. These results provide further evidence that distinct neural systems are associated with the carrier waves of speech and word identity.

  4. Sustainable Consumer Voices

    DEFF Research Database (Denmark)

    Klitmøller, Anders; Rask, Morten; Jensen, Nevena

    2011-01-01

    Aiming to explore how user driven innovation can inform high level design strategies, an in-depth empirical study was carried out, based on data from 50 observations of private vehicle users. This paper reports the resulting 5 consumer voices: Technology Enthusiast, Environmentalist, Design Lover...

  5. Voices of courage

    Directory of Open Access Journals (Sweden)

    Noraida Abdullah Karim

    2007-07-01

    Full Text Available In May 2007 the Women’s Commission for Refugee Women and Children1 presented its annual Voices of Courage awards to three displaced people who have dedicated their lives to promoting economic opportunities for refugee and displaced women and youth. These are their (edited testimonies.

  6. Listen to a voice

    DEFF Research Database (Denmark)

    Hølge-Hazelton, Bibi

    2001-01-01

    Listen to the voice of a young girl Lonnie, who was diagnosed with Type 1 diabetes at 16. Imagine that she is deeply involved in the social security system. She lives with her mother and two siblings in a working class part of a small town. She is at a special school for problematic youth, and he...

  7. Political animal voices

    NARCIS (Netherlands)

    Meijer, E.R.

    2017-01-01

    In this thesis, I develop a theory of political animal voices. The first part of the thesis focuses on non-human animal languages and forming interspecies worlds. I first investigate the relation between viewing language as exclusively human and seeing humans as categorically different from other

  8. Finding a Voice

    Science.gov (United States)

    Stuart, Shannon

    2012-01-01

    Schools have struggled for decades to provide expensive augmentative and alternative communication (AAC) resources for autistic students with communication challenges. Clunky voice output devices, often included in students' individualized education plans, cost about $8,000, a difficult expense to cover in hard times. However, mobile technology is…

  9. the Voice of Tomorrow

    Institute of Scientific and Technical Information of China (English)

    AlanBurdick

    2003-01-01

    Have you heard Mide? Coule be.Mike is a professional reader,and he's everywhere these days. On MapQuest, the Web-based map service,he'll read aloud whatever directions you ask for. If you like to have AOL or Yahoo! e-mail read aloud to you over the phone, that's Mike's voice you 're hearing. Soon

  10. What the voice reveals.

    NARCIS (Netherlands)

    Ko, Sei Jin

    2007-01-01

    Given that the voice is our main form of communication, we know surprisingly little about how it impacts judgment and behavior. Furthermore, the modern advancement in telecommunication systems, such as cellular phones, has meant that a large proportion of our everyday interactions are conducted voca

  11. The Inner Voice

    Science.gov (United States)

    Ridgway, Anthony James

    2009-01-01

    The inner voice- we all know what it is because we all have it and use it when we are thinking or reading, for example. Little work has been done on it in our field, with the notable exception of Brian Tomlinson, but presumably it must be a cognitive phenomenon which is of great importance in thinking, language learning, and reading in a foreign…

  12. Moving beyond Youth Voice

    Science.gov (United States)

    Serido, Joyce; Borden, Lynne M.; Perkins, Daniel F.

    2011-01-01

    This study combines research documenting the benefits of positive relationships between youth and caring adults on a young person's positive development with studies on youth voice to examine the mechanisms through which participation in youth programs contributes to positive developmental outcomes. Specifically, the study explores whether youth's…

  13. Bodies and Voices

    DEFF Research Database (Denmark)

    A wide-ranging collection of essays centred on readings of the body in contemporary literary and socio-anthropological discourse, from slavery and rape to female genital mutilation, from clothing, ocular pornography, voice, deformation and transmutation to the imprisoned, dismembered, remembered...

  14. Voices for Careers.

    Science.gov (United States)

    York, Edwin G.; Kapadia, Madhu

    Listed in this annotated bibliography are 502 cassette tapes of value to career exploration for Grade 7 through the adult level, whether as individualized instruction, small group study, or total class activity. Available to New Jersey educators at no charge, this Voices for Careers System is also available for duplication on request from the New…

  15. What the voice reveals

    NARCIS (Netherlands)

    Ko, Sei Jin

    2007-01-01

    Given that the voice is our main form of communication, we know surprisingly little about how it impacts judgment and behavior. Furthermore, the modern advancement in telecommunication systems, such as cellular phones, has meant that a large proportion of our everyday interactions are conducted voca

  16. Bodies and Voices

    DEFF Research Database (Denmark)

    A wide-ranging collection of essays centred on readings of the body in contemporary literary and socio-anthropological discourse, from slavery and rape to female genital mutilation, from clothing, ocular pornography, voice, deformation and transmutation to the imprisoned, dismembered, remembered...

  17. Training Implications of Airborne Applications of Automated Speech Recognition Technology.

    Science.gov (United States)

    1980-10-01

    Coler , C. R. Automated speech recognition and man- computer interaction research at NASA Ames Research Center. In S. Harris (Ed.), Proceedings: Voice...Sons, Inc., 1964. 56 NAVTRAEQUIPCEN 80-D-0009-0155-1 Coler , C. R. Automated speech recognition and man- computer interaction research at NASA Ames

  18. Incorporating Speech Recognition into a Natural User Interface

    Science.gov (United States)

    Chapa, Nicholas

    2017-01-01

    The Augmented/ Virtual Reality (AVR) Lab has been working to study the applicability of recent virtual and augmented reality hardware and software to KSC operations. This includes the Oculus Rift, HTC Vive, Microsoft HoloLens, and Unity game engine. My project in this lab is to integrate voice recognition and voice commands into an easy to modify system that can be added to an existing portion of a Natural User Interface (NUI). A NUI is an intuitive and simple to use interface incorporating visual, touch, and speech recognition. The inclusion of speech recognition capability will allow users to perform actions or make inquiries using only their voice. The simplicity of needing only to speak to control an on-screen object or enact some digital action means that any user can quickly become accustomed to using this system. Multiple programs were tested for use in a speech command and recognition system. Sphinx4 translates speech to text using a Hidden Markov Model (HMM) based Language Model, an Acoustic Model, and a word Dictionary running on Java. PocketSphinx had similar functionality to Sphinx4 but instead ran on C. However, neither of these programs were ideal as building a Java or C wrapper slowed performance. The most ideal speech recognition system tested was the Unity Engine Grammar Recognizer. A Context Free Grammar (CFG) structure is written in an XML file to specify the structure of phrases and words that will be recognized by Unity Grammar Recognizer. Using Speech Recognition Grammar Specification (SRGS) 1.0 makes modifying the recognized combinations of words and phrases very simple and quick to do. With SRGS 1.0, semantic information can also be added to the XML file, which allows for even more control over how spoken words and phrases are interpreted by Unity. Additionally, using a CFG with SRGS 1.0 produces a Finite State Machine (FSM) functionality limiting the potential for incorrectly heard words or phrases. The purpose of my project was to

  19. Double Fourier analysis for Emotion Identification in Voiced Speech

    Science.gov (United States)

    Sierra-Sosa, D.; Bastidas, M.; Ortiz P., D.; Quintero, O. L.

    2016-04-01

    We propose a novel analysis alternative, based on two Fourier Transforms for emotion recognition from speech. Fourier analysis allows for display and synthesizes different signals, in terms of power spectral density distributions. A spectrogram of the voice signal is obtained performing a short time Fourier Transform with Gaussian windows, this spectrogram portraits frequency related features, such as vocal tract resonances and quasi-periodic excitations during voiced sounds. Emotions induce such characteristics in speech, which become apparent in spectrogram time-frequency distributions. Later, the signal time-frequency representation from spectrogram is considered an image, and processed through a 2-dimensional Fourier Transform in order to perform the spatial Fourier analysis from it. Finally features related with emotions in voiced speech are extracted and presented.

  20. Speech Recognition: A World of Opportunities

    Science.gov (United States)

    PACER Center, 2004

    2004-01-01

    Speech recognition technology helps people with disabilities interact with computers more easily. People with motor limitations, who cannot use a standard keyboard and mouse, can use their voices to navigate the computer and create documents. The technology is also useful to people with learning disabilities who experience difficulty with spelling…

  1. Forensic Speaker Recognition Law Enforcement and Counter-Terrorism

    CERN Document Server

    Patil, Hemant

    2012-01-01

    Forensic Speaker Recognition: Law Enforcement and Counter-Terrorism is an anthology of the research findings of 35 speaker recognition experts from around the world. The volume provides a multidimensional view of the complex science involved in determining whether a suspect’s voice truly matches forensic speech samples, collected by law enforcement and counter-terrorism agencies, that are associated with the commission of a terrorist act or other crimes. While addressing such topics as the challenges of forensic case work, handling speech signal degradation, analyzing features of speaker recognition to optimize voice verification system performance, and designing voice applications that meet the practical needs of law enforcement and counter-terrorism agencies, this material all sounds a common theme: how the rigors of forensic utility are demanding new levels of excellence in all aspects of speaker recognition. The contributors are among the most eminent scientists in speech engineering and signal process...

  2. VOICE OVER INTERNET PROTOCOL (VOIP: FUTURE POTENTIAL

    Directory of Open Access Journals (Sweden)

    Deepti Kumari

    2015-10-01

    Full Text Available VoIP (voice over IP delivers standard voice over telephone services over Internet Protocol (IP. VoIP is the technology of digitizing sound, compressing it, breaking it up into data packets, and sending it over an IP (internet protocol network where it is reassembled, decompressed, and converted back into an analog wave form. Gateways are the key component required to facilitate IP Telephony. A gateway is used to bridge the traditional circuit switched PSTN with the packet switched Internet. The paper covers software, hardware and protocol requirements followed by weighing the VoIP advantages such as low cost, portability, free and advanced features, bandwidth efficiency, call recording and monitoring against the VoIP disadvantages such as power dependency, quality of voice and service, security, and reliability. With ever increasing internet penetration and better broadband connectivity, VoIP is going to expand further with businesses already using VoIP standalone or in a hybrid format, although our focus and scope here remains VoIP. Mobile VoIP, an infant with less than 4% market share, has so far been focusing on increasing active subscriptions without a sustainable revenue model, but has the potential and is going to see tussle with static VoIP for space in days ahead.

  3. Voice over Internet Protocol (VOIP: Future Potential

    Directory of Open Access Journals (Sweden)

    Ms. Deepti

    2014-11-01

    Full Text Available VoIP (voice over IP delivers standard voice over telephone services over Internet Protocol (IP. VoIP is the technology of digitizing sound, compressing it, breaking it up into data packets, and sending it over an IP (internet protocol network where it is reassembled, decompressed, and converted back into an analog wave form. Gateways are the key component required to facilitate IP Telephony. A gateway is used to bridge the traditional circuit switched PSTN with the packet switched Internet. The paper covers software, hardware and protocol requirements followed by weighing the VoIP advantages such as low cost, portability, free and advanced features, bandwidth efficiency, call recording and monitoring against the VoIP disadvantages such as power dependency, quality of voice and service, security, and reliability. With ever increasing internet penetration and better broadband connectivity, VoIP is going to expand further with businesses already using VoIP standalone or in a hybrid format, although our focus and scope here remains VoIP. Mobile VoIP, an infant with less than 4% market share, has so far been focusing on increasing active subscriptions without a sustainable revenue model, but has the potential and is going to see tussle with static VoIP for space in days ahead.

  4. April 16th : The World Voice Day

    NARCIS (Netherlands)

    Svec, Jan G.; Behlau, Mara

    2007-01-01

    Although the voice is used as an everyday basis of speech, most people realize its importance only when a voice problem arises. Increasing public awareness of the importance of the voice and alertness to voice problems are the main goals of the World Voice Day, which is celebrated yearly on April 16

  5. Risk factors for voice problems in teachers

    NARCIS (Netherlands)

    Kooijman, P. G. C.; de Jong, F. I. C. R. S.; Thomas, G.; Huinck, W.; Donders, R.; Graamans, K.; Schutte, H. K.

    2006-01-01

    In order to identify factors that are associated with voice problems and voice-related absenteeism in teachers, 1,878 questionnaires were analysed. The questionnaires inquired about personal data, voice complaints, voice-related absenteeism from work and conditions that may lead to voice complaints

  6. You're a What? Voice Actor

    Science.gov (United States)

    Liming, Drew

    2009-01-01

    This article talks about voice actors and features Tony Oliver, a professional voice actor. Voice actors help to bring one's favorite cartoon and video game characters to life. They also do voice-overs for radio and television commercials and movie trailers. These actors use the sound of their voice to sell a character's emotions--or an advertised…

  7. Risk factors for voice problems in teachers

    NARCIS (Netherlands)

    Kooijman, P. G. C.; de Jong, F. I. C. R. S.; Thomas, G.; Huinck, W.; Donders, R.; Graamans, K.; Schutte, H. K.

    2006-01-01

    In order to identify factors that are associated with voice problems and voice-related absenteeism in teachers, 1,878 questionnaires were analysed. The questionnaires inquired about personal data, voice complaints, voice-related absenteeism from work and conditions that may lead to voice complaints

  8. Risk factors for voice problems in teachers.

    NARCIS (Netherlands)

    Kooijman, P.G.C.; Jong, F.I.C.R.S. de; Thomas, G.; Huinck, W.J.; Donders, A.R.T.; Graamans, K.; Schutte, H.K.

    2006-01-01

    In order to identify factors that are associated with voice problems and voice-related absenteeism in teachers, 1,878 questionnaires were analysed. The questionnaires inquired about personal data, voice complaints, voice-related absenteeism from work and conditions that may lead to voice complaints

  9. Voice and GPS Based Navigation System For Visually Impaired

    Directory of Open Access Journals (Sweden)

    Harsha Gawari

    2014-04-01

    Full Text Available The paper represents the architecture and implementation of a system that will help to navigate the visually impaired people. The system designed uses GPS and voice recognition along with obstacle avoidance for the purpose of guiding visually impaired. The visually impaired person issues the command and receives the direction response using audio signals. The latitude and longitude values are received continuously from the GPS receiver. The directions are given to the user with the help of audio signals. An obstacle detector is used to help the user to avoid obstacles by sending an audio message.GPS receivers use NMEA standard. With the advancement in voice recognition it becomes easier to issue commands regarding directions to the visually impaired.

  10. Speech recognition with amplitude and frequency modulations

    Science.gov (United States)

    Zeng, Fan-Gang; Nie, Kaibao; Stickney, Ginger S.; Kong, Ying-Yee; Vongphoe, Michael; Bhargave, Ashish; Wei, Chaogang; Cao, Keli

    2005-02-01

    Amplitude modulation (AM) and frequency modulation (FM) are commonly used in communication, but their relative contributions to speech recognition have not been fully explored. To bridge this gap, we derived slowly varying AM and FM from speech sounds and conducted listening tests using stimuli with different modulations in normal-hearing and cochlear-implant subjects. We found that although AM from a limited number of spectral bands may be sufficient for speech recognition in quiet, FM significantly enhances speech recognition in noise, as well as speaker and tone recognition. Additional speech reception threshold measures revealed that FM is particularly critical for speech recognition with a competing voice and is independent of spectral resolution and similarity. These results suggest that AM and FM provide independent yet complementary contributions to support robust speech recognition under realistic listening situations. Encoding FM may improve auditory scene analysis, cochlear-implant, and audiocoding performance. auditory analysis | cochlear implant | neural code | phase | scene analysis

  11. Voice input/output capabilities at Perception Technology Corporation

    Science.gov (United States)

    Ferber, Leon A.

    1977-01-01

    Condensed resumes of key company personnel at the Perception Technology Corporation are presented. The staff possesses recognition, speech synthesis, speaker authentication, and language identification. Hardware and software engineers' capabilities are included.

  12. Keyboard With Voice Output

    Science.gov (United States)

    Huber, W. C.

    1986-01-01

    Voice synthesizer tells what key is about to be depressed. Verbal feedback useful for blind operators or where dim light prevents sighted operator from seeing keyboard. Also used where operator is busy observing other things while keying data into control system. Used as training aid for touch typing, and to train blind operators to use both standard and braille keyboards. Concept adapted to such equipment as typewriters, computers, calculators, telephones, cash registers, and on/off controls.

  13. Why Is My Voice Changing? (For Teens)

    Science.gov (United States)

    ... Week of Healthy Breakfasts Shyness Why Is My Voice Changing? KidsHealth > For Teens > Why Is My Voice ... deeper than a girl's, though. What Causes My Voice to Change? At puberty, guys' bodies begin producing ...

  14. Common Problems That Can Affect Your Voice

    Science.gov (United States)

    ... near you Common Problems That Can Affect Your Voice Common Problems That Can Affect Your Voice Patient ... that traditionally accompany gastro esophageal reflux disease (GERD). Voice Misuse and Overuse Speaking is a physical task ...

  15. Pattern recognition

    CERN Document Server

    Theodoridis, Sergios

    2003-01-01

    Pattern recognition is a scientific discipline that is becoming increasingly important in the age of automation and information handling and retrieval. Patter Recognition, 2e covers the entire spectrum of pattern recognition applications, from image analysis to speech recognition and communications. This book presents cutting-edge material on neural networks, - a set of linked microprocessors that can form associations and uses pattern recognition to ""learn"" -and enhances student motivation by approaching pattern recognition from the designer's point of view. A direct result of more than 10

  16. Voice and silence in organizations

    Directory of Open Access Journals (Sweden)

    Moaşa, H.

    2011-01-01

    Full Text Available Unlike previous research on voice and silence, this article breaksthe distance between the two and declines to treat them as opposites. Voice and silence are interrelated and intertwined strategic forms ofcommunication which presuppose each other in such a way that the absence of one would minimize completely the other’s presence. Social actors are not voice, or silence. Social actors can have voice or silence, they can do both because they operate at multiple levels and deal with multiple issues at different moments in time.

  17. VOICE REHABILITATION FOLLOWING TOTAL LARYNGECTOMY

    Directory of Open Access Journals (Sweden)

    Balasubramanian Thiagarajan

    2015-03-01

    Full Text Available Despite continuing advances in surgical management of laryngeal malignancy, total laryngectomy is still the treatment of choice in advanced laryngeal malignancies. Considering the longevity of the patient following total laryngectomy, various measures have been adopted in order to provide voice function to the patient. Significant advancements have taken place in voice rehabilitation of post laryngectomy patients. Advancements in oncological surgical techniques and irradiation techniques have literally cured laryngeal malignancies. Among the various voice rehabilitation techniques available TEP (Tracheo oesophageal puncture is considered to be the gold standard. This article attempts to explore the various voice rehabilitation technique available with primary focus on TEP.

  18. Speech emotion recognition based on statistical pitch model

    Institute of Scientific and Technical Information of China (English)

    WANG Zhiping; ZHAO Li; ZOU Cairong

    2006-01-01

    A modified Parzen-window method, which keep high resolution in low frequencies and keep smoothness in high frequencies, is proposed to obtain statistical model. Then, a gender classification method utilizing the statistical model is proposed, which have a 98% accuracy of gender classification while long sentence is dealt with. By separation the male voice and female voice, the mean and standard deviation of speech training samples with different emotion are used to create the corresponding emotion models. Then the Bhattacharyya distance between the test sample and statistical models of pitch, are utilized for emotion recognition in speech.The normalization of pitch for the male voice and female voice are also considered, in order to illustrate them into a uniform space. Finally, the speech emotion recognition experiment based on K Nearest Neighbor shows that, the correct rate of 81% is achieved, where it is only 73.85%if the traditional parameters are utilized.

  19. The impact of voice on speech realization

    OpenAIRE

    Jelka Breznik

    2014-01-01

    The study discusses spoken literary language and the impact of voice on speech realization. The voice consists of a sound made by a human being using the vocal folds for talking, singing, laughing, crying, screaming… The human voice is specifically the part of human sound production in which the vocal folds (vocal cords) are the primary sound source. Our voice is our instrument and identity card. How does the voice (voice tone) affect others and how do they respond, positively or negatively? ...

  20. Vocal registers of the countertenor voice: Based on signals recorded and analyzed in VoceVista

    Science.gov (United States)

    Chenez, Raymond

    Today's countertenors possess vocal ranges similar to the mezzo-soprano, and are trained to sing with a vibrant, focused tone. Little research has been conducted on the registers of the countertenor voice. Advancement in vocal techniques in the countertenor voice from the late 20th century to the present has been rapid. This treatise attempts to define the registers of the countertenor voice, and is intended as a resource for singers and teachers. The voices of eleven North American countertenors were recorded and analyzed using VoceVista Pro software, which was developed and designed by Donald Miller. Through spectrographic and electroglottographic analysis, the registers of the countertenor voice were identified and outlined.

  1. The Voice Handicap Index with Post-Laryngectomy Male Voices

    Science.gov (United States)

    Evans, Eryl; Carding, Paul; Drinnan, Michael

    2009-01-01

    Background: Surgical treatment for advanced laryngeal cancer involves complete removal of the larynx ("laryngectomy") and initial total loss of voice. Post-laryngectomy rehabilitation involves implementation of different means of "voicing" for these patients wherever possible. There is little information about laryngectomees'…

  2. Pedagogic Voice: Student Voice in Teaching and Engagement Pedagogies

    Science.gov (United States)

    Baroutsis, Aspa; McGregor, Glenda; Mills, Martin

    2016-01-01

    In this paper, we are concerned with the notion of "pedagogic voice" as it relates to the presence of student "voice" in teaching, learning and curriculum matters at an alternative, or second chance, school in Australia. This school draws upon many of the principles of democratic schooling via its utilisation of student voice…

  3. Chord Recognition Based on Temporal Correlation Support Vector Machine

    OpenAIRE

    Zhongyang Rao; Xin Guan; Jianfu Teng

    2016-01-01

    In this paper, we propose a method called temporal correlation support vector machine (TCSVM) for automatic major-minor chord recognition in audio music. We first use robust principal component analysis to separate the singing voice from the music to reduce the influence of the singing voice and consider the temporal correlations of the chord features. Using robust principal component analysis, we expect the low-rank component of the spectrogram matrix to contain the musical accompaniment and...

  4. Popstjerne af lys, lyd og software

    DEFF Research Database (Denmark)

    Hasse Jørgensen, Stina

    2016-01-01

    Hatsune Miku is a 3D animated hologram, her voice is a vocaloid. In other words she is a software application. Nevertheless she is a worldstar with stadion concerts and an astronomical number of fans. She is a crowdsourced Internet phenomena: her fans composes her hits and choreographs her...

  5. Design of household control system based on speech recognition%基于语音识别的家居控制系统设计

    Institute of Scientific and Technical Information of China (English)

    黄辉健; 程良鸿; 黄明杰; 林垣华; 李志杰

    2014-01-01

    This paper studied the technology of speaker-dependent recognition based on Sunplus SPCE061A, voice recognition technology will be applied to the home control system. Proposed a control scheme which is convenient operation,easy to expand, and applicable to home applications. The system will be analyzed from the perspective of hardware circuit and software design. Also in the Google App Inventer platform, built out a control software based on Android smartphone’s Bluetooth communication.The tested results showed that the system has successfully realized the voice technology appliances and Android smartphones remote control technology.%本文研究了凌阳SPCE061A的特定人的语音识别与控制技术,将语音识别技术应用到家居控制系统中。提出一种操作简便、易扩展、适用于家庭应用的控制方案。分析了系统的硬件组成和软件设计流程。同时在Google App Inventer平台下,介绍了基于蓝牙通信的Android智能手机控制软件的搭建。经实际测试表明,本系统成功地实现对家电的声控技术和Android智能手机远程控制。

  6. Air bridge docking -- on voice command recognition and synthesis technology in ATC%陆空对接的桥梁--论空管中的语音指令识别与合成技术

    Institute of Scientific and Technical Information of China (English)

    马林南

    2015-01-01

    空中交通管制指令标准用语的训练是非常重要的、不可或缺的内容,在我国民航运输业不断发展的时代,空中交通流量急剧增长,陆空通话标准用语的训练是空中管制模拟训练中的主要技术。为了改变当前训练系统依赖于专人飞行员席位的设置状况,本文对空中管制中的语音指令识别与合成技术进行研究,针对其中的多个关键技术开展讨论,以实现空中控制模拟训练机中自动飞行员席位替代专人飞行员席位。%The training of air traffic control instructions standard terminology is very important and indispensable content,in the era of the continuous development of China's civil aviation transportation industry,the rapid growth of air traffic flow,radiotelephony communication standard language training is training in the main technology of air traffic control simulation.In order to change the current training system depends on the special pilot seats in,this paper of air traffic control in the speech command recognition and synthesis technology research, for which a number of key technology to launch the discussion,in order to achieve the air control simulation training machine automatic pilot seat replacement pilot special seats.

  7. Software engineering

    CERN Document Server

    Sommerville, Ian

    2010-01-01

    The ninth edition of Software Engineering presents a broad perspective of software engineering, focusing on the processes and techniques fundamental to the creation of reliable, software systems. Increased coverage of agile methods and software reuse, along with coverage of 'traditional' plan-driven software engineering, gives readers the most up-to-date view of the field currently available. Practical case studies, a full set of easy-to-access supplements, and extensive web resources make teaching the course easier than ever.

  8. Voice and Speech after Laryngectomy

    Science.gov (United States)

    Stajner-Katusic, Smiljka; Horga, Damir; Musura, Maja; Globlek, Dubravka

    2006-01-01

    The aim of the investigation is to compare voice and speech quality in alaryngeal patients using esophageal speech (ESOP, eight subjects), electroacoustical speech aid (EACA, six subjects) and tracheoesophageal voice prosthesis (TEVP, three subjects). The subjects reading a short story were recorded in the sound-proof booth and the speech samples…

  9. Voice Quality of Psychological Origin

    Science.gov (United States)

    Teixeira, Antonio; Nunes, Ana; Coimbra, Rosa Lidia; Lima, Rosa; Moutinho, Lurdes

    2008-01-01

    Variations in voice quality are essentially related to modifications of the glottal source parameters, such as: F[subscript 0], jitter, and shimmer. Voice quality is affected by prosody, emotional state, and vocal pathologies. Psychogenic vocal pathology is particularly interesting. In the present case study, the speaker naturally presented a…

  10. Voice handicap index in Swedish.

    Science.gov (United States)

    Ohlsson, Ann-Christine; Dotevall, Hans

    2009-01-01

    The objective of this study was to evaluate a Swedish version of the voice handicap index questionnaire (Sw-VHI). A total of 57 adult, dysphonic patients and 15 healthy controls completed the Sw-VHI and rated the degree of vocal fatigue and hoarseness on visual analogue scales. A perceptual voice evaluation was also performed. Test-retest reliability was analyzed in 38 subjects without voice complaints. Sw-VHI distinguished between dysphonic subjects and controls (P 0.84) and test-retest reliability (intraclass correlation coefficient >0.75) were good. Only moderate or weak correlations were found between Sw-VHI and the subjective and perceptual voice ratings. The data indicate that a difference above 13 points for the total Sw-VHI score and above 6 points for the Sw-VHI subscales is significant for an individual when comparing two different occasions. In conclusion, the Sw-VHI appears to be a robust instrument for assessment of the psycho-social impact of a voice disorder. However, Sw-VHI seems to, at least partly, capture different aspects of voice function to the subjective voice ratings and the perceptual voice evaluation.

  11. Enhancing Author's Voice through Scripting

    Science.gov (United States)

    Young, Chase J.; Rasinski, Timothy V.

    2011-01-01

    The authors suggest using scripting as a strategy to mentor and enhance author's voice in writing. Through gradual release, students use authentic literature as a model for writing with voice. The authors also propose possible extensions for independent practice, integration across content areas, and tips for evaluation.

  12. Voices in History

    Directory of Open Access Journals (Sweden)

    Ivan Leudar

    2001-06-01

    Full Text Available Experiences of “hearing voices” nowadays usually count as verbal hallucinations and they indicate serious mental illness. Some are first rank symptoms of schizophrenia, and the mass media, at least in Britain, tend to present them as antecedents of impulsive violence. They are, however, also found in other psychiatric conditions and epidemiological surveys reveal that even individuals with no need of psychiatric help can hear voices, sometimes following bereavement or abuse, but sometimes for no discernible reason. So do these experiences necessarily mean insanity and violence, and must they be thought of as pathogenic hallucinations; or are there other ways to understand them and live with them, and with what consequences?One way to make our thinking more flexible is to turn to history. We find that hearing voices was always an enigmatic experience, and the people who had it were rare. The gallery of voice hearers is, though, distinguished and it includes Galilei, Bunyan and St Teresa. Socrates heard a daemon who guided his actions, but in his time this did not signify madness, nor was it described as a hallucination. Yet in 19th century French psychological medicine the daemon became a hallucination and Socrates was retrospectively diagnosed as mentally ill. This paper examines the controversies which surrounded the experience at different points in history as well as the practice of retrospective psychiatry. The conclusion reached on the basis of the historical materials is that the experience and the ontological status it is ascribed are not trans-cultural or trans-historic but situated both in history and in the contemporary conflicts.

  13. Facing Sound - Voicing Art

    DEFF Research Database (Denmark)

    Lønstrup, Ansa

    2013-01-01

    This article is based on examples of contemporary audiovisual art, with a special focus on the Tony Oursler exhibition Face to Face at Aarhus Art Museum ARoS in Denmark in March-July 2012. My investigation involves a combination of qualitative interviews with visitors, observations of the audienc......´s interactions with the exhibition and the artwork in the museum space and short analyses of individual works of art based on reception aesthetics and phenomenology and inspired by newer writings on sound, voice and listening....

  14. Voice over IP Security

    CERN Document Server

    Keromytis, Angelos D

    2011-01-01

    Voice over IP (VoIP) and Internet Multimedia Subsystem technologies (IMS) are rapidly being adopted by consumers, enterprises, governments and militaries. These technologies offer higher flexibility and more features than traditional telephony (PSTN) infrastructures, as well as the potential for lower cost through equipment consolidation and, for the consumer market, new business models. However, VoIP systems also represent a higher complexity in terms of architecture, protocols and implementation, with a corresponding increase in the potential for misuse. In this book, the authors examine the

  15. Effects on vocal range and voice quality of singing voice training: the classically trained female voice.

    Science.gov (United States)

    Pabon, Peter; Stallinga, Rob; Södersten, Maria; Ternström, Sten

    2014-01-01

    A longitudinal study was performed on the acoustical effects of singing voice training under a given study program, using the voice range profile (VRP). Pretraining and posttraining recordings were made of students who participated in a 3-year bachelor singing study program. A questionnaire that included questions on optimal range, register use, classification, vocal health and hygiene, mixing technique, and training goals was used to rate and categorize self-assessed voice changes. Based on the responses, a subgroup of 10 classically trained female voices was selected, which was homogeneous enough for effects of training to be identified. The VRP perimeter contour was analyzed for effects of voice training. Also, a mapping within the VRP of voice quality, as expressed by the crest factor, was used to indicate the register boundaries and to monitor the acoustical consequences of the newly learned vocal technique of "mixed voice." VRPs were averaged across subjects. Findings were compared with the self-assessed vocal changes. Pre/post comparison of the average VRPs showed, in the midrange, (1) a decrease in the VRP area that was associated with the loud chest voice, (2) a reduction of the crest factor values, and (3) a reduction of maximum sound pressure level values. The students' self-evaluations of the voice changes appeared in some cases to contradict the VRP findings. VRPs of individual voices were seen to change over the course of a singing education. These changes were manifest also in the average group. High-resolution computerized recording, complemented with an acoustic register marker, allows a meaningful assessment of some effects of training, on an individual basis and for groups that comprise singers of a specific genre. It is argued that this kind of investigation is possible only within a focused training program, given by a faculty who has agreed on the goals. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.

  16. Questioning Photovoice Research: Whose Voice?

    Science.gov (United States)

    Evans-Agnew, Robin A; Rosemberg, Marie-Anne S

    2016-07-01

    Photovoice is an important participatory research tool for advancing health equity. Our purpose is to critically review how participant voice is promoted through the photovoice process of taking and discussing photos and adding text/captions. PubMed, Scopus, PsycINFO, and Web of Science databases were searched from the years 2008 to 2014 using the keywords photovoice, photonovella, photovoice and social justice, and photovoice and participatory action research. Research articles were reviewed for how participant voice was (a) analyzed, (b) exhibited in community forums, and (c) disseminated through published manuscripts. Of 21 studies, 13 described participant voice in the data analysis, 14 described participants' control over exhibiting photo-texts, seven manuscripts included a comprehensive set of photo-texts, and none described participant input on choice of manuscript photo-texts. Photovoice designs vary in the advancement of participant voice, with the least advancement occurring in manuscript publication. Future photovoice researchers should expand approaches to advancing participant voice.

  17. Voice quality of psychological origin.

    Science.gov (United States)

    Teixeira, Antonio; Nunes, Ana; Coimbra, Rosa Lídia; Lima, Rosa; Moutinho, Lurdes

    2008-01-01

    Variations in voice quality are essentially related to modifications of the glottal source parameters, such as: F0, jitter, and shimmer. Voice quality is affected by prosody, emotional state, and vocal pathologies. Psychogenic vocal pathology is particularly interesting. In the present case study, the speaker naturally presented a ventricular band voice whereas in a controlled production he was able to use a more normal phonation process. A small corpus was recorded which included sustained vowels and short sentences in both registers. A normal speaker was also recorded in similar tasks. Annotation and extraction of parameters were made using Praat's voice report function. Application of the Hoarseness Diagram to sustained productions situates this case in the pseudo-glottic phonation region. Analysis of several different parameters related to F0, jitter, shimmer, and harmonicity revealed that the speaker with psychogenic voice was capable of controlling certain parameters (e.g. F0 maximum) but was unable to correct others such as shimmer.

  18. Muscular tension and body posture in relation to voice handicap and voice quality in teachers with persistent voice complaints.

    NARCIS (Netherlands)

    Kooijman, P.G.C.; Jong, F.I.C.R.S. de; Oudes, M.J.; Huinck, W.J.; Acht, H. van; Graamans, K.

    2005-01-01

    The aim of this study was to investigate the relationship between extrinsic laryngeal muscular hypertonicity and deviant body posture on the one hand and voice handicap and voice quality on the other hand in teachers with persistent voice complaints and a history of voice-related absenteeism. The st

  19. Chaos tool implementation for non-singer and singer voice comparison (preliminary study)

    Science.gov (United States)

    Dajer, Me; Pereira, Jc; Maciel, Cd

    2007-11-01

    Voice waveform is linked to the stretch, shorten, widen or constrict vocal tract. The articulation effects of the singer's vocal tract modify the voice acoustical characteristics and differ from the non-singer voices. In the last decades, Chaos Theory has shown the possibility to explore the dynamic nature of voice signals from a different point of view. The purpose of this paper is to apply the chaos technique of phase space reconstruction to analyze non- singers and singer voices in order to explore the signal nonlinear dynamic, and correlate them with traditional acoustic parameters. Eight voice samples of sustained vowel /i/ from non-singers and eight from singers were analyzed with "ANL" software. The samples were also acoustically analyzed with "Analise de Voz 5.0" in order to extract acoustic perturbation measures jitter and shimmer, and the coefficient of excess - (EX). The results showed different visual patterns for the two groups correlated with different jitter, shimmer, and coefficient of excess values. We conclude that these results clearly indicate the potential of phase space reconstruction technique for analysis and comparison of non-singers and singer voices. They also show a promising tool for training voices application.

  20. Evaluation of Voice Parameters in People with Head and Neck Cancers: An Investigational Study

    Directory of Open Access Journals (Sweden)

    Thomas Zacharia

    2016-10-01

    Full Text Available Background: Head and neck cancer severely affects the voice and compromises a patient’s vocal independence. In the present study we have investigated the difference in three vital components of the voice - pitch, jitter, and shimmer in head and neck cancer patients and compared the findings with age matched healthy volunteers. Methods: Voice parameters were ascertained in newly diagnosed head and neck cancer patients and controls using voice analysis software (PRAAT and a Dynamic Unidirectional AUD-59 XLR microphone in a sound proof voice recording laboratory. Results: We observed considerable changes in the three voice indices of pitch, jitter and shimmer. According to gender, there were significant changes in all three indices for men (P<0.015 to 0.0001, whereas only pitch significantly changed in women (P=0.0001. Conclusion: The results have suggested that individuals with any form of head and neck cancer will have a degree of voice problems even before the initiation of cancer treatments. Voice assessment is a vital aspect to be considered before initiation of treatment.

  1. Chaos tool implementation for non-singer and singer voice comparison (preliminary study)

    Energy Technology Data Exchange (ETDEWEB)

    Dajer, Me; Pereira, Jc; Maciel, Cd [Department of Electric Engineering, School of Engineering of Sao Carlos, University of Sao Paulo, Sao Carlos (Brazil); Av. Trabalhador Sao-Carlesnse, 400. CEP 13566-590. Sao Carlos. SP (Brazil)

    2007-11-15

    Voice waveform is linked to the stretch, shorten, widen or constrict vocal tract. The articulation effects of the singer's vocal tract modify the voice acoustical characteristics and differ from the non-singer voices. In the last decades, Chaos Theory has shown the possibility to explore the dynamic nature of voice signals from a different point of view. The purpose of this paper is to apply the chaos technique of phase space reconstruction to analyze non- singers and singer voices in order to explore the signal nonlinear dynamic, and correlate them with traditional acoustic parameters. Eight voice samples of sustained vowel /i/ from non-singers and eight from singers were analyzed with 'ANL' software. The samples were also acoustically analyzed with 'Analise de Voz 5.0' in order to extract acoustic perturbation measures jitter and shimmer, and the coefficient of excess - (EX). The results showed different visual patterns for the two groups correlated with different jitter, shimmer, and coefficient of excess values. We conclude that these results clearly indicate the potential of phase space reconstruction technique for analysis and comparison of non-singers and singer voices. They also show a promising tool for training voices application.

  2. Objective voice and speech analysis of persons with chronic hoarseness by prosodic analysis of speech samples.

    Science.gov (United States)

    Haderlein, Tino; Döllinger, Michael; Matoušek, Václav; Nöth, Elmar

    2016-10-01

    Automatic voice assessment is often performed using sustained vowels. In contrast, speech analysis of read-out texts can be applied to voice and speech assessment. Automatic speech recognition and prosodic analysis were used to find regression formulae between automatic and perceptual assessment of four voice and four speech criteria. The regression was trained with 21 men and 62 women (average age 49.2 years) and tested with another set of 24 men and 49 women (48.3 years), all suffering from chronic hoarseness. They read the text 'Der Nordwind und die Sonne' ('The North Wind and the Sun'). Five voice and speech therapists evaluated the data on 5-point Likert scales. Ten prosodic and recognition accuracy measures (features) were identified which describe all the examined criteria. Inter-rater correlation within the expert group was between r = 0.63 for the criterion 'match of breath and sense units' and r = 0.87 for the overall voice quality. Human-machine correlation was between r = 0.40 for the match of breath and sense units and r = 0.82 for intelligibility. The perceptual ratings of different criteria were highly correlated with each other. Likewise, the feature sets modeling the criteria were very similar. The automatic method is suitable for assessing chronic hoarseness in general and for subgroups of functional and organic dysphonia. In its current version, it is almost as reliable as a randomly picked rater from a group of voice and speech therapists.

  3. Bodies, Spaces, Voices, Silences

    Directory of Open Access Journals (Sweden)

    Donatella Mazzoleni

    2013-07-01

    Full Text Available A good architecture should not only allow functional, formal and technical quality for urban spaces, but also let the voice of the city be perceived, listened, enjoyed. Every city has got its specific sound identity, or “ISO” (R. O. Benenzon, made up of a complex texture of background noises and fluctuation of sound figures emerging and disappearing in a game of continuous fadings. For instance, the ISO of Naples is characterized by a spread need of hearing the sound return of one’s/others voices, by a hate of silence. Cities may fall ill: illness from noise, within super-crowded neighbourhoods, or illness from silence, in the forced isolation of peripheries. The proposal of an urban music therapy denotes an unpublished and innovative enlarged interdisciplinary research path, where architecture, music, medicine, psychology, communication science may converge, in order to work for rebalancing spaces and relation life of the urban collectivity, through the care of body and sound dimensions.

  4. Crossing Cultures with Multi-Voiced Journals

    Science.gov (United States)

    Styslinger, Mary E.; Whisenant, Alison

    2004-01-01

    In this article, the authors discuss the benefits of using multi-voiced journals as a teaching strategy in reading instruction. Multi-voiced journals, an adaptation of dual-voiced journals, encourage responses to reading in varied, cultured voices of characters. It is similar to reading journals in that they prod students to connect to the lives…

  5. SOFTWARE OPEN SOURCE, SOFTWARE GRATIS?

    Directory of Open Access Journals (Sweden)

    Nur Aini Rakhmawati

    2006-01-01

    Full Text Available Normal 0 false false false IN X-NONE X-NONE MicrosoftInternetExplorer4 Berlakunya Undang – undang Hak Atas Kekayaan Intelektual (HAKI, memunculkan suatu alternatif baru untuk menggunakan software open source. Penggunaan software open source menyebar seiring dengan isu global pada Information Communication Technology (ICT saat ini. Beberapa organisasi dan perusahaan mulai menjadikan software open source sebagai pertimbangan. Banyak konsep mengenai software open source ini. Mulai dari software yang gratis sampai software tidak berlisensi. Tidak sepenuhnya isu software open source benar, untuk itu perlu dikenalkan konsep software open source mulai dari sejarah, lisensi dan bagaimana cara memilih lisensi, serta pertimbangan dalam memilih software open source yang ada. Kata kunci :Lisensi, Open Source, HAKI

  6. Software Reviews.

    Science.gov (United States)

    Smith, Richard L., Ed.

    1985-01-01

    Reviews software packages by providing extensive descriptions and discussions of their strengths and weaknesses. Software reviewed include (1) "VISIFROG: Vertebrate Anatomy" (grade seven-adult); (2) "Fraction Bars Computer Program" (grades three to six) and (3) four telecommunications utilities. (JN)

  7. Pattern Recognition Methods and Features Selection for Speech Emotion Recognition System

    Science.gov (United States)

    Partila, Pavol; Voznak, Miroslav; Tovarek, Jaromir

    2015-01-01

    The impact of the classification method and features selection for the speech emotion recognition accuracy is discussed in this paper. Selecting the correct parameters in combination with the classifier is an important part of reducing the complexity of system computing. This step is necessary especially for systems that will be deployed in real-time applications. The reason for the development and improvement of speech emotion recognition systems is wide usability in nowadays automatic voice controlled systems. Berlin database of emotional recordings was used in this experiment. Classification accuracy of artificial neural networks, k-nearest neighbours, and Gaussian mixture model is measured considering the selection of prosodic, spectral, and voice quality features. The purpose was to find an optimal combination of methods and group of features for stress detection in human speech. The research contribution lies in the design of the speech emotion recognition system due to its accuracy and efficiency. PMID:26346654

  8. Pattern Recognition Methods and Features Selection for Speech Emotion Recognition System

    Directory of Open Access Journals (Sweden)

    Pavol Partila

    2015-01-01

    Full Text Available The impact of the classification method and features selection for the speech emotion recognition accuracy is discussed in this paper. Selecting the correct parameters in combination with the classifier is an important part of reducing the complexity of system computing. This step is necessary especially for systems that will be deployed in real-time applications. The reason for the development and improvement of speech emotion recognition systems is wide usability in nowadays automatic voice controlled systems. Berlin database of emotional recordings was used in this experiment. Classification accuracy of artificial neural networks, k-nearest neighbours, and Gaussian mixture model is measured considering the selection of prosodic, spectral, and voice quality features. The purpose was to find an optimal combination of methods and group of features for stress detection in human speech. The research contribution lies in the design of the speech emotion recognition system due to its accuracy and efficiency.

  9. Lexical frequency and voice assimilation.

    Science.gov (United States)

    Ernestus, Mirjam; Lahey, Mybeth; Verhees, Femke; Baayen, R Harald

    2006-08-01

    Acoustic duration and degree of vowel reduction are known to correlate with a word's frequency of occurrence. The present study broadens the research on the role of frequency in speech production to voice assimilation. The test case was regressive voice assimilation in Dutch. Clusters from a corpus of read speech were more often perceived as unassimilated in lower-frequency words and as either completely voiced (regressive assimilation) or, unexpectedly, as completely voiceless (progressive assimilation) in higher-frequency words. Frequency did not predict the voice classifications over and above important acoustic cues to voicing, suggesting that the frequency effects on the classifications were carried exclusively by the acoustic signal. The duration of the cluster and the period of glottal vibration during the cluster decreased while the duration of the release noises increased with frequency. This indicates that speakers reduce articulatory effort for higher-frequency words, with some acoustic cues signaling more voicing and others less voicing. A higher frequency leads not only to acoustic reduction but also to more assimilation.

  10. Facial, vocal and musical emotion recognition is altered in paranoid schizophrenic patients.

    Science.gov (United States)

    Weisgerber, Anne; Vermeulen, Nicolas; Peretz, Isabelle; Samson, Séverine; Philippot, Pierre; Maurage, Pierre; De Graeuwe D'Aoust, Catherine; De Jaegere, Aline; Delatte, Benoît; Gillain, Benoît; De Longueville, Xavier; Constant, Eric

    2015-09-30

    Disturbed processing of emotional faces and voices is typically observed in schizophrenia. This deficit leads to impaired social cognition and interactions. In this study, we investigated whether impaired processing of emotions also affects musical stimuli, which are widely present in daily life and known for their emotional impact. Thirty schizophrenic patients and 30 matched healthy controls evaluated the emotional content of musical, vocal and facial stimuli. Schizophrenic patients are less accurate than healthy controls in recognizing emotion in music, voices and faces. Our results confirm impaired recognition of emotion in voice and face stimuli in schizophrenic patients and extend this observation to the recognition of emotion in musical stimuli.

  11. Emotional cues during simultaneous face and voice processing: electrophysiological insights.

    Directory of Open Access Journals (Sweden)

    Taosheng Liu

    Full Text Available Both facial expression and tone of voice represent key signals of emotional communication but their brain processing correlates remain unclear. Accordingly, we constructed a novel implicit emotion recognition task consisting of simultaneously presented human faces and voices with neutral, happy, and angry valence, within the context of recognizing monkey faces and voices task. To investigate the temporal unfolding of the processing of affective information from human face-voice pairings, we recorded event-related potentials (ERPs to these audiovisual test stimuli in 18 normal healthy subjects; N100, P200, N250, P300 components were observed at electrodes in the frontal-central region, while P100, N170, P270 were observed at electrodes in the parietal-occipital region. Results indicated a significant audiovisual stimulus effect on the amplitudes and latencies of components in frontal-central (P200, P300, and N250 but not the parietal occipital region (P100, N170 and P270. Specifically, P200 and P300 amplitudes were more positive for emotional relative to neutral audiovisual stimuli, irrespective of valence, whereas N250 amplitude was more negative for neutral relative to emotional stimuli. No differentiation was observed between angry and happy conditions. The results suggest that the general effect of emotion on audiovisual processing can emerge as early as 200 msec (P200 peak latency post stimulus onset, in spite of implicit affective processing task demands, and that such effect is mainly distributed in the frontal-central region.

  12. Emotional cues during simultaneous face and voice processing: electrophysiological insights.

    Science.gov (United States)

    Liu, Taosheng; Pinheiro, Ana; Zhao, Zhongxin; Nestor, Paul G; McCarley, Robert W; Niznikiewicz, Margaret A

    2012-01-01

    Both facial expression and tone of voice represent key signals of emotional communication but their brain processing correlates remain unclear. Accordingly, we constructed a novel implicit emotion recognition task consisting of simultaneously presented human faces and voices with neutral, happy, and angry valence, within the context of recognizing monkey faces and voices task. To investigate the temporal unfolding of the processing of affective information from human face-voice pairings, we recorded event-related potentials (ERPs) to these audiovisual test stimuli in 18 normal healthy subjects; N100, P200, N250, P300 components were observed at electrodes in the frontal-central region, while P100, N170, P270 were observed at electrodes in the parietal-occipital region. Results indicated a significant audiovisual stimulus effect on the amplitudes and latencies of components in frontal-central (P200, P300, and N250) but not the parietal occipital region (P100, N170 and P270). Specifically, P200 and P300 amplitudes were more positive for emotional relative to neutral audiovisual stimuli, irrespective of valence, whereas N250 amplitude was more negative for neutral relative to emotional stimuli. No differentiation was observed between angry and happy conditions. The results suggest that the general effect of emotion on audiovisual processing can emerge as early as 200 msec (P200 peak latency) post stimulus onset, in spite of implicit affective processing task demands, and that such effect is mainly distributed in the frontal-central region.

  13. Memory for faces and voices varies as a function of sex and expressed emotion.

    Science.gov (United States)

    S Cortes, Diana; Laukka, Petri; Lindahl, Christina; Fischer, Håkan

    2017-01-01

    We investigated how memory for faces and voices (presented separately and in combination) varies as a function of sex and emotional expression (anger, disgust, fear, happiness, sadness, and neutral). At encoding, participants judged the expressed emotion of items in forced-choice tasks, followed by incidental Remember/Know recognition tasks. Results from 600 participants showed that accuracy (hits minus false alarms) was consistently higher for neutral compared to emotional items, whereas accuracy for specific emotions varied across the presentation modalities (i.e., faces, voices, and face-voice combinations). For the subjective sense of recollection ("remember" hits), neutral items received the highest hit rates only for faces, whereas for voices and face-voice combinations anger and fear expressions instead received the highest recollection rates. We also observed better accuracy for items by female expressers, and own-sex bias where female participants displayed memory advantage for female faces and face-voice combinations. Results further suggest that own-sex bias can be explained by recollection, rather than familiarity, rates. Overall, results show that memory for faces and voices may be influenced by the expressions that they carry, as well as by the sex of both items and participants. Emotion expressions may also enhance the subjective sense of recollection without enhancing memory accuracy.

  14. The voice of emotion across species: how do human listeners recognize animals' affective states?

    Directory of Open Access Journals (Sweden)

    Marina Scheumann

    Full Text Available Voice-induced cross-taxa emotional recognition is the ability to understand the emotional state of another species based on its voice. In the past, induced affective states, experience-dependent higher cognitive processes or cross-taxa universal acoustic coding and processing mechanisms have been discussed to underlie this ability in humans. The present study sets out to distinguish the influence of familiarity and phylogeny on voice-induced cross-taxa emotional perception in humans. For the first time, two perspectives are taken into account: the self- (i.e. emotional valence induced in the listener versus the others-perspective (i.e. correct recognition of the emotional valence of the recording context. Twenty-eight male participants listened to 192 vocalizations of four different species (human infant, dog, chimpanzee and tree shrew. Stimuli were recorded either in an agonistic (negative emotional valence or affiliative (positive emotional valence context. Participants rated the emotional valence of the stimuli adopting self- and others-perspective by using a 5-point version of the Self-Assessment Manikin (SAM. Familiarity was assessed based on subjective rating, objective labelling of the respective stimuli and interaction time with the respective species. Participants reliably recognized the emotional valence of human voices, whereas the results for animal voices were mixed. The correct classification of animal voices depended on the listener's familiarity with the species and the call type/recording context, whereas there was less influence of induced emotional states and phylogeny. Our results provide first evidence that explicit voice-induced cross-taxa emotional recognition in humans is shaped more by experience-dependent cognitive mechanisms than by induced affective states or cross-taxa universal acoustic coding and processing mechanisms.

  15. Software reliability

    CERN Document Server

    Bendell, A

    1986-01-01

    Software Reliability reviews some fundamental issues of software reliability as well as the techniques, models, and metrics used to predict the reliability of software. Topics covered include fault avoidance, fault removal, and fault tolerance, along with statistical methods for the objective assessment of predictive accuracy. Development cost models and life-cycle cost models are also discussed. This book is divided into eight sections and begins with a chapter on adaptive modeling used to predict software reliability, followed by a discussion on failure rate in software reliability growth mo

  16. Voice Habits and Behaviors: Voice Care Among Flamenco Singers.

    Science.gov (United States)

    Garzón García, Marina; Muñoz López, Juana; Y Mendoza Lara, Elvira

    2017-03-01

    The purpose of this study is to analyze the vocal behavior of flamenco singers, as compared with classical music singers, to establish a differential vocal profile of voice habits and behaviors in flamenco music. Bibliographic review was conducted, and the Singer's Vocal Habits Questionnaire, an experimental tool designed by the authors to gather data regarding hygiene behavior, drinking and smoking habits, type of practice, voice care, and symptomatology perceived in both the singing and the speaking voice, was administered. We interviewed 94 singers, divided into two groups: the flamenco experimental group (FEG, n = 48) and the classical control group (CCG, n = 46). Frequency analysis, a Likert scale, and discriminant and exploratory factor analysis were used to obtain a differential profile for each group. The FEG scored higher than the CCG in speaking voice symptomatology. The FEG scored significantly higher than the CCG in use of "inadequate vocal technique" when singing. Regarding voice habits, the FEG scored higher in "lack of practice and warm-up" and "environmental habits." A total of 92.6% of the subjects classified themselves correctly in each group. The Singer's Vocal Habits Questionnaire has proven effective in differentiating flamenco and classical singers. Flamenco singers are exposed to numerous vocal risk factors that make them more prone to vocal fatigue, mucosa dehydration, phonotrauma, and muscle stiffness than classical singers. Further research is needed in voice training in flamenco music, as a means to strengthen the voice and enable it to meet the requirements of this musical genre. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  17. Voices of the Unheard

    DEFF Research Database (Denmark)

    Matthiesen, Noomi Christine Linde

    2014-01-01

    . They were in two different classes at both schools, i.e. four classes in total. The families were followed for 18 months. Formal interviews were conducted with mothers and teachers, parent-teacher conferences were recorded, participant observations were conducted in classrooms and playgrounds, afterschool...... is that Somali diaspora parents (and with special focus on mothers as these where the parents who took most responsibility in the four cases of this research) have difficulty expressing their opinions as there are structural, historical and social dynamics that create conditions in which their voices...... are silenced, or at least restricted significantly, resulting in marginalizing consequences. The focus in each article is on here-and-now interactional dynamics but in order to understand these constitutive negotiations, it is argued that the analysis must be situated in a description of the constituted...

  18. Passing on power & voice

    DEFF Research Database (Denmark)

    Noer, Vibeke Røn; Nielsen, Cathrine Sand

    2014-01-01

    . The education lasts for 3,5 years and the landmark of the educational model is the continuously shifts between teaching in classroom and teaching in clinical practice. Clinical teaching takes place at approved clinical placement institutions in hospitals and in the social and health care services outside...... intention of gaining knowledge about other possible ways to perform the education. The class, named the E-class, followed what in the field was named ‘an experimental educational model based on experienced-based learning’ (Nielsen et al. 2011). The experiential educational model is argued as an experiment.......aspx Higher degree of student involvement in planning as well as teaching was in the field presented as a part of ‘the overall educational approach’. In the course ‘Acute, Critical Nursing & Terminal, Palliative Care’ this was transferred into an innovative pedagogy with intend to pass on power and voice...

  19. Voice stress analysis

    Science.gov (United States)

    Brenner, Malcolm; Shipp, Thomas

    1988-01-01

    In a study of the validity of eight candidate voice measures (fundamental frequency, amplitude, speech rate, frequency jitter, amplitude shimmer, Psychological Stress Evaluator scores, energy distribution, and the derived measure of the above measures) for determining psychological stress, 17 males age 21 to 35 were subjected to a tracking task on a microcomputer CRT while parameters of vocal production as well as heart rate were measured. Findings confirm those of earlier studies that increases in fundamental frequency, amplitude, and speech rate are found in speakers involved in extreme levels of stress. In addition, it was found that the same changes appear to occur in a regular fashion within a more subtle level of stress that may be characteristic, for example, of routine flying situations. None of the individual speech measures performed as robustly as did heart rate.

  20. Voice over IP

    OpenAIRE

    Mantula, Juha

    2006-01-01

    Tämä opinnäytetyö käsittelee Voice over Internet Protocol -tekniikkaa ja sen tuomia mahdollisuuksia yrityselämässä. Teoriaosa käsittelee VoIP:n kannalta tärkeitä pro-tokollia ja standardeja, VoIP:n ominaisuuksia sekä esittelee erilaisia puheohjelmia, jotka käyttävät VoIP-tekniikkaa hyväkseen. Empiirinen osuus tutkii Viestintä Ky Pitkärannan Skype-ohjelman käyttöä. Työn tarkoituksena on selvittää VoIP:n hyviä ja huonoja puolia ja sitä kuinka tek-niikkaa voidaan käyttää hyväksi päivittäisessä ...

  1. Dialogic Showcases Innovative Asian CT Solutions at Voice Asia '98

    Institute of Scientific and Technical Information of China (English)

    1998-01-01

    Dialogic Corporation showcases Computer Telephony (CT) solutions from some of Asia's leading CT developers at this Voice Asia'98 show. These vendors display the latest Asian solutions for IP Telephony,Speech Recognition, Telephone Company Enhanced Services Platform. Call Center and Unified Messaging, Open Switch and CT Servers.

  2. A Voice-Activated, Interactive Videodisc Case Study for Use in the Medical School Classroom.

    Science.gov (United States)

    Harless, William G.; And Others

    1986-01-01

    The Technological Innovations in Medical Education (TIME) Project of the Lister Hill National Center for Biomedical Communications is exploring the use of interactive videodisc, microcomputer, and voice recognition technology to create interactive case studies of simulated patients to train second-year medical students in the introduction to…

  3. A Voice-Activated, Interactive Videodisc Case Study for Use in the Medical School Classroom.

    Science.gov (United States)

    Harless, William G.; And Others

    1986-01-01

    The Technological Innovations in Medical Education (TIME) Project of the Lister Hill National Center for Biomedical Communications is exploring the use of interactive videodisc, microcomputer, and voice recognition technology to create interactive case studies of simulated patients to train second-year medical students in the introduction to…

  4. Computer software.

    Science.gov (United States)

    Rosenthal, L E

    1986-10-01

    Software is the component in a computer system that permits the hardware to perform the various functions that a computer system is capable of doing. The history of software and its development can be traced to the early nineteenth century. All computer systems are designed to utilize the "stored program concept" as first developed by Charles Babbage in the 1850s. The concept was lost until the mid-1940s, when modern computers made their appearance. Today, because of the complex and myriad tasks that a computer system can perform, there has been a differentiation of types of software. There is software designed to perform specific business applications. There is software that controls the overall operation of a computer system. And there is software that is designed to carry out specialized tasks. Regardless of types, software is the most critical component of any computer system. Without it, all one has is a collection of circuits, transistors, and silicone chips.

  5. 基于嵌入式Linux语音识别系统的设计%Design of Speech Recognition System Based on Embedded Linux

    Institute of Scientific and Technical Information of China (English)

    钟豪; 张常年; 徐成波

    2014-01-01

    该设计运用三星公司的S3C2440,结合ICRoute公司的高性能语音识别芯片LD3320,进行了语音识别系统的硬件和软件设计。在嵌入式Linux操作系统下,运用多进程机制完成了对语音识别芯片、超声波测距和云台的控制,并将语音识别技术应用于多角度超声波测距系统中。通过测试,系统可以通过识别语音指令控制测量方向,无需手动干预,最后将测量结果通过语音播放出来。%This paper fulfills the hardware and software design of the voice recognition system, using the Samsung’s S3C2440 and the high performance chip LD3320 designed by ICRoute. It uses multi-process mechanism to complete the speech recognition, ultrasonic ranging and PTZ control based on embedded Linux platform. At the same time, the system makes the speech recognition technology applied to multi-angle ultrasonic ranging. Through the actual testing, the system can control the direction of measure-ment by identifying the voice command, without manual intervention, and finally the measurement results play out through the voice.

  6. Introduction to Arabic Speech Recognition Using CMUSphinx System

    CERN Document Server

    Satori, H; Chenfour, N

    2007-01-01

    In this paper Arabic was investigated from the speech recognition problem point of view. We propose a novel approach to build an Arabic Automated Speech Recognition System (ASR). This system is based on the open source CMU Sphinx-4, from the Carnegie Mellon University. CMU Sphinx is a large-vocabulary; speaker-independent, continuous speech recognition system based on discrete Hidden Markov Models (HMMs). We build a model using utilities from the OpenSource CMU Sphinx. We will demonstrate the possible adaptability of this system to Arabic voice recognition.

  7. Voice and choice by delegation.

    Science.gov (United States)

    van de Bovenkamp, Hester; Vollaard, Hans; Trappenburg, Margo; Grit, Kor

    2013-02-01

    In many Western countries, options for citizens to influence public services are increased to improve the quality of services and democratize decision making. Possibilities to influence are often cast into Albert Hirschman's taxonomy of exit (choice), voice, and loyalty. In this article we identify delegation as an important addition to this framework. Delegation gives individuals the chance to practice exit/choice or voice without all the hard work that is usually involved in these options. Empirical research shows that not many people use their individual options of exit and voice, which could lead to inequality between users and nonusers. We identify delegation as a possible solution to this problem, using Dutch health care as a case study to explore this option. Notwithstanding various advantages, we show that voice and choice by delegation also entail problems of inequality and representativeness.

  8. The Christian voice in philosophy

    Directory of Open Access Journals (Sweden)

    Stuart Fowler

    1982-03-01

    Full Text Available In this paper the Rev. Stuart Fowler outlines a Christian voice in Philosophy and urges the Christian philosopher to investigate his position and his stance with integrity and honesty.

  9. Voice Force tulekul / Tõnu Ojala

    Index Scriptorium Estoniae

    Ojala, Tõnu, 1969-

    2005-01-01

    60. sünnipäeva tähistava Tallinna Tehnikaülikooli Akadeemilise Meeskoori juubelihooaja üritusest - a capella pop-gruppide festivalist Voice Force (kontserdid 12. nov. klubis Parlament ja 3. dets. Vene Kultuurikeskuses)

  10. Voice Force tulekul / Tõnu Ojala

    Index Scriptorium Estoniae

    Ojala, Tõnu, 1969-

    2005-01-01

    60. sünnipäeva tähistava Tallinna Tehnikaülikooli Akadeemilise Meeskoori juubelihooaja üritusest - a capella pop-gruppide festivalist Voice Force (kontserdid 12. nov. klubis Parlament ja 3. dets. Vene Kultuurikeskuses)

  11. Feature Extraction of Voice Segments Using Cepstral Analysis for Voice Regeneration

    OpenAIRE

    Banerjee, P. S.; Baisakhi Chakraborty; Jaya Banerjee

    2015-01-01

    Even though a lot of work has been done on areas of speech to text and vice versa or voice detection or similarity analysis of two voice samples but very less emphasis has be given to voice regeneration. General algorithms for distinct voice checking for two voice sources paved way for our endeavor in reconstructing the voice from the source voice samples provided. By utilizing these algorithms and putting further stress on the feature extraction part we tried to fabricate the source voice wi...

  12. Voice Simulation in Nursing Education.

    Science.gov (United States)

    Kepler, Britney B; Lee, Heeyoung; Kane, Irene; Mitchell, Ann M

    2016-01-01

    The goal of this study was to improve prelicensure nursing students' attitudes toward and self-efficacy related to delivering nursing care to patients with auditory hallucinations. Based on the Hearing Voices That Are Distressing curriculum, 87 participants were instructed to complete 3 tasks while wearing headphones delivering distressing voices. Comparing presimulation and postsimulation results, this study suggests that the simulation significantly improved attitudes toward patients with auditory hallucinations; however, self-efficacy related to caring for these patients remained largely unchanged.

  13. Work-related voice disorder

    OpenAIRE

    Paulo Eduardo Przysiezny; Luciana Tironi Sanson Przysiezny

    2015-01-01

    INTRODUCTION: Dysphonia is the main symptom of the disorders of oral communication. However, voice disorders also present with other symptoms such as difficulty in maintaining the voice (asthenia), vocal fatigue, variation in habitual vocal fundamental frequency, hoarseness, lack of vocal volume and projection, loss of vocal efficiency, and weakness when speaking. There are several proposals for the etiologic classification of dysphonia: functional, organofunctional, organic, and work-related...

  14. Tracheostomy cannulas and voice prosthesis.

    Science.gov (United States)

    Kramp, Burkhard; Dommerich, Steffen

    2009-01-01

    Cannulas and voice prostheses are mechanical aids for patients who had to undergo tracheotomy or laryngectomy for different reasons. For better understanding of the function of those artificial devices, first the indications and particularities of the previous surgical intervention are described in the context of this review. Despite the established procedure of percutaneous dilatation tracheotomy e.g. in intensive care units, the application of epithelised tracheostomas has its own position, especially when airway obstruction is persistent (e.g. caused by traumata, inflammations, or tumors) and a longer artificial ventilation or special care of the patient are required. In order to keep the airways open after tracheotomy, tracheostomy cannulas of different materials with different functions are available. For each patient the most appropriate type of cannula must be found. Voice prostheses are meanwhile the device of choice for rapid and efficient voice rehabilitation after laryngectomy. Individual sizes and materials allow adaptation of the voice prostheses to the individual anatomical situation of the patients. The combined application of voice prostheses with HME (Head and Moisture Exchanger) allows a good vocal as well as pulmonary rehabilitation. Precondition for efficient voice prosthesis is the observation of certain surgical principles during laryngectomy. The duration of the prosthesis mainly depends on material properties and biofilms, mostly consisting of funguses and bacteries. The quality of voice with valve prosthesis is clearly superior to esophagus prosthesis or electro-laryngeal voice. Whenever possible, tracheostoma valves for free-hand speech should be applied. Physicians taking care of patients with speech prostheses after laryngectomy should know exactly what to do in case the device fails or gets lost.

  15. Technical Evaluation Report 37: Assistive Software for Disabled Learners

    Directory of Open Access Journals (Sweden)

    Jon Baggaley

    2004-11-01

    Full Text Available Previous reports in this series (#32 and 36 have discussed online software features of value to disabled learners in distance education. The current report evaluates four specific assistive software products with useful features for visually and hearing impaired learners: ATutor, ACollab, Natural Voice, and Just Vanilla. The evaluative criteria discussed include the purpose, uses, costs, and features of each software product, all considered primarily from the accessibility perspective.

  16. Smart Homes with Voice Activated Systems for Disabled People

    Directory of Open Access Journals (Sweden)

    Bekir Busatlic

    2017-02-01

    Full Text Available Smart home refers to the application of various technologies to semi-unsupervised home control It refers to systems that control temperature, lighting, door locks, windows and many other appliances. The aim of this study was to design a system that will use existing technology to showcase how it can benefit people with disabilities. This work uses only off-the-shelf products (smart home devices and controllers, speech recognition technology, open-source code libraries. The Voice Activated Smart Home application was developed to demonstrate online grocery shopping and home control using voice comments and tested by measuring its effectiveness in performing tasks as well as its efficiency in recognizing user speech input.

  17. Challenging Institutional Conventions and Forming a Voice through Creativity

    DEFF Research Database (Denmark)

    Nielsen, Margit Saltofte

    2013-01-01

    This article explores and discusses examples of students’ everyday creativity that seem to be overlooked by teachers but are acknowledged by ‘peers’ in a 9th Grade (age 15–16) at a Danish free school. Creativity emerged as part of the everyday student interactions at school in ‘in-between’ social...... spaces, outside the formal teaching zones. Creative activities took place in the interstitial zones of time and space, where they gave voice to those students whose voice is not always heard in the formal teaching context. Creativity occurred also among students as a way to challenge institutional...... conditions and this practice gave them recognition by their peers. The argument is being made that students’ interactions in these zones draw on other forms of knowledge and ways of performing than those used in structured teaching zones. The creativity expressed in interstitial zones contributes to forming...

  18. Phoneme Recognition Using Acoustic Events

    CERN Document Server

    Huebener, K; Huebener, Kai; Carson-Berndsen, Julie

    1994-01-01

    This paper presents a new approach to phoneme recognition using nonsequential sub--phoneme units. These units are called acoustic events and are phonologically meaningful as well as recognizable from speech signals. Acoustic events form a phonologically incomplete representation as compared to distinctive features. This problem may partly be overcome by incorporating phonological constraints. Currently, 24 binary events describing manner and place of articulation, vowel quality and voicing are used to recognize all German phonemes. Phoneme recognition in this paradigm consists of two steps: After the acoustic events have been determined from the speech signal, a phonological parser is used to generate syllable and phoneme hypotheses from the event lattice. Results obtained on a speaker--dependent corpus are presented.

  19. The impact of voice on speech realization

    Directory of Open Access Journals (Sweden)

    Jelka Breznik

    2014-12-01

    Full Text Available The study discusses spoken literary language and the impact of voice on speech realization. The voice consists of a sound made by a human being using the vocal folds for talking, singing, laughing, crying, screaming… The human voice is specifically the part of human sound production in which the vocal folds (vocal cords are the primary sound source. Our voice is our instrument and identity card. How does the voice (voice tone affect others and how do they respond, positively or negatively? How important is voice (voice tone in communication process? The study presents how certain individuals perceive voice. The results of the research on the relationships between the spoken word, excellent speaker, voice and description / definition / identification of specific voices done by experts in the field of speech and voice as well as non-professionals are presented. The study encompasses two focus groups. One consists of amateurs (non-specialists in the field of speech or voice who have no knowledge in this field and the other consists of professionals who work with speech or language or voice. The questions were intensified from general to specific, directly related to the topic. The purpose of such a method of questioning was to create relaxed atmosphere, promote discussion, allow participants to interact, complement, and to set up self-listening and additional comments.

  20. Towards very large vocabulary word recognition

    Science.gov (United States)

    Waibel, A.

    1982-11-01

    In this paper, preliminary considerations and some experimental results are presented in an effort to design Very Large Vocabulary Recognition (VLVR) systems. We will first consider the applicability of current recognition techniques and argue their inadequacy for VLVR. Possible alternate strategies will be explored and their potential usefulness statistically evaluated. Our results indicate that suprasegmental cues such as syllabification, stress patterns, rhythmic patterns, rhythmic patterns and the voiced - unvoiced patterns in the syllables of a word provide powerful mechanisms for search space reduction. Suprasegmental feature could thus operate in a complementary fashion to segmental features.

  1. Does knowing speaker sex facilitate vowel recognition at short durations?

    Science.gov (United States)

    Smith, David R R

    2014-05-01

    A man, woman or child saying the same vowel do so with very different voices. The auditory system solves the complex problem of extracting what the man, woman or child has said despite substantial differences in the acoustic properties of their voices. Much of the acoustic variation between the voices of men and woman is due to changes in the underlying anatomical mechanisms for producing speech. If the auditory system knew the sex of the speaker then it could potentially correct for speaker sex related acoustic variation thus facilitating vowel recognition. This study measured the minimum stimulus duration necessary to accurately discriminate whether a brief vowel segment was spoken by a man or woman, and the minimum stimulus duration necessary to accuately recognise what vowel was spoken. Results showed that reliable vowel recognition precedesreliable speaker sex discrimination, thus questioning the use of speaker sex information in compensating for speaker sex related acoustic variation in the voice. Furthermore, the pattern of performance across experiments where the fundamental frequency and formant frequency information of speaker's voices were systematically varied, was markedly different depending on whether the task was speaker-sex discrimination or vowel recognition. This argues for there being little relationship between perception of speaker sex (indexical information) and perception of what has been said (linguistic information) at short durations. Copyright © 2014 Elsevier B.V. All rights reserved.

  2. On the definition and interpretation of voice selective activation in the temporal cortex

    Directory of Open Access Journals (Sweden)

    Anja eBethmann

    2014-07-01

    Full Text Available Regions along the superior temporal sulci and in the anterior temporal lobes have been found to be involved in voice processing. It has even been argued that parts of the temporal cortices serve as voice-selective areas. Yet, evidence for voice-selective activation in the strict sense is still missing. The current fMRI study aimed at assessing the degree of voice-specific processing in different parts of the superior and middle temporal cortices. To this end, voices of famous persons were contrasted with widely different categories, which were sounds of animals and musical instruments. The argumentation was that only brain regions with statistically proven absence of activation by the control stimuli may be considered as candidates for voice-selective areas. Neural activity was found to be stronger in response to human voices in all analyzed parts of the temporal lobes except for the middle and posterior STG. More importantly, the activation differences between voices and the other environmental sounds increased continuously from the mid-posterior STG to the anterior MTG. Here, only voices but not the control stimuli excited an increase of the BOLD response above a resting baseline level. The findings are discussed with reference to the function of the anterior temporal lobes in person recognition and the general question on how to define selectivity of brain regions for a specific class of stimuli or tasks. In addition, our results corroborate recent assumptions about the hierarchical organization of auditory processing building on a processing stream from the primary auditory cortices to anterior portions of the temporal lobes.

  3. Software piracy

    OpenAIRE

    Kráčmer, Stanislav

    2011-01-01

    The objective of the present thesis is to clarify the term of software piracy and to determine responsibility of individual entities as to actual realization of software piracy. First, the thesis focuses on a computer programme, causes, realization and pitfalls of its inclusion under copyright protection. Subsequently, it observes methods of legal usage of a computer programme. This is the point of departure for the following attempt to define software piracy, accompanied with methods of actu...

  4. Industrial Applications of Automatic Speech Recognition Systems

    Directory of Open Access Journals (Sweden)

    Dr. Jayashri Vajpai

    2016-03-01

    Full Text Available Current trends in developing technologies form important bridges to the future, fortified by the early and productive use of technology for enriching the human life. Speech signal processing, which includes automatic speech recognition, synthetic speech, and natural language processing, is beginning to have a significant impact on business, industry and ease of operation of personal computers. Apart from this, it facilitates the deeper understanding of complex mechanism of functioning of human brain. Advances in speech recognition technology, over the past five decades, have enabled a wide range of industrial applications. Yet today's applications provide a small preview of a rich future for speech and voice interface technology that will eventually replace keyboards with microphones for designing human machine interface for providing easy access to increasingly intelligent machines. It also shows how the capabilities of speech recognition systems in industrial applications are evolving over time to usher in the next generation of voice-enabled services. This paper aims to present an effective survey of the speech recognition technology described in the available literature and integrate the insights gained during the process of study of individual research and developments. The current applications of speech recognition for real world and industry have also been outlined with special reference to applications in the areas of medical, industrial robotics, forensic, defence and aviation

  5. Hands-free human-machine interaction with voice

    Science.gov (United States)

    Juang, B. H.

    2001-05-01

    Voice is natural communication interface between a human and a machine. The machine, when placed in today's communication networks, may be configured to provide automation to save substantial operating cost, as demonstrated in AT&T's VRCP (Voice Recognition Call Processing), or to facilitate intelligent services, such as virtual personal assistants, to enhance individual productivity. These intelligent services often need to be accessible anytime, anywhere (e.g., in cars when the user is in a hands-busy-eyes-busy situation or during meetings where constantly talking to a microphone is either undersirable or impossible), and thus call for advanced signal processing and automatic speech recognition techniques which support what we call ``hands-free'' human-machine communication. These techniques entail a broad spectrum of technical ideas, ranging from use of directional microphones and acoustic echo cancellatiion to robust speech recognition. In this talk, we highlight a number of key techniques that were developed for hands-free human-machine communication in the mid-1990s after Bell Labs became a unit of Lucent Technologies. A video clip will be played to demonstrate the accomplishement.

  6. Practical Voice Recognition for the Aircraft Cockpit Project

    Data.gov (United States)

    National Aeronautics and Space Administration — This proposal responds to the urgent need for improved pilot interfaces in the modern aircraft cockpit. Recent advances in aircraft equipment bring tremendous...

  7. Indonesian Automatic Speech Recognition For Command Speech Controller Multimedia Player

    Directory of Open Access Journals (Sweden)

    Vivien Arief Wardhany

    2014-12-01

    Full Text Available The purpose of multimedia devices development is controlling through voice. Nowdays voice that can be recognized only in English. To overcome the issue, then recognition using Indonesian language model and accousticc model and dictionary. Automatic Speech Recognizier is build using engine CMU Sphinx with modified english language to Indonesian Language database and XBMC used as the multimedia player. The experiment is using 10 volunteers testing items based on 7 commands. The volunteers is classifiedd by the genders, 5 Male & 5 female. 10 samples is taken in each command, continue with each volunteer perform 10 testing command. Each volunteer also have to try all 7 command that already provided. Based on percentage clarification table, the word “Kanan” had the most recognize with percentage 83% while “pilih” is the lowest one. The word which had the most wrong clarification is “kembali” with percentagee 67%, while the word “kanan” is the lowest one. From the result of Recognition Rate by male there are several command such as “Kembali”, “Utama”, “Atas “ and “Bawah” has the low Recognition Rate. Especially for “kembali” cannot be recognized as the command in the female voices but in male voice that command has 4% of RR this is because the command doesn’t have similar word in english near to “kembali” so the system unrecognize the command. Also for the command “Pilih” using the female voice has 80% of RR but for the male voice has only 4% of RR. This problem is mostly because of the different voice characteristic between adult male and female which male has lower voice frequencies (from 85 to 180 Hz than woman (165 to 255 Hz.The result of the experiment showed that each man had different number of recognition rate caused by the difference tone, pronunciation, and speed of speech. For further work needs to be done in order to improving the accouracy of the Indonesian Automatic Speech Recognition system

  8. Towards Real-Time Speech Emotion Recognition for Affective E-Learning

    Science.gov (United States)

    Bahreini, Kiavash; Nadolski, Rob; Westera, Wim

    2016-01-01

    This paper presents the voice emotion recognition part of the FILTWAM framework for real-time emotion recognition in affective e-learning settings. FILTWAM (Framework for Improving Learning Through Webcams And Microphones) intends to offer timely and appropriate online feedback based upon learner's vocal intonations and facial expressions in order…

  9. Word Intelligibility in Multi-voice Singing: The Influence of Chorus Size.

    Science.gov (United States)

    Condit-Schultz, Nathaniel; Huron, David

    2017-01-01

    This study investigated how the intelligibility of sung words is influenced by the number of singers in a choral music style. The study used repeated measures factorial. One hundred forty-nine participants listened to recordings of spoken and sung English words and attempted to identify the words. Each stimuli word was sung or spoken in sync by either one, four, eight, sixteen, or twenty-seven members of a high-quality Soprano Alto Tenor Bass (SATB) choir. In general, single-voice word recognition was higher than multi-voice word recognition in the sung condition. However, the difference between four concurrent singers and the full choir was negligible; that is, reduced intelligibility with multiple singers shows little sensitivity to the number of singers. The principal effect of voice density on intelligibility is found to occur with coda consonants-a result consistent with the importance many choral conductors attribute to coordinating word offsets. In particular, the plosives /b/, /d/, /g/, and /p/ are easily confused. Coda liquids (/l/,/r/) were also found to be a source of confusion. Finally, an increasing density of voices appears to have a facilitating effect for the coda nasal /m/. Groups of four or more choral singers do appear to be less intelligible than single singers, although the observed effect is modest. However, increasing the number of singers in a choral texture beyond four singers does not appear to further degrade intelligibility. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  10. [A comparative study of pathological voice based on traditional acoustic characteristics and nonlinear features].

    Science.gov (United States)

    Gan, Deying; Hu, Weiping; Zhao, Bingxin

    2014-10-01

    By analyzing the mechanism of pronunciation, traditional acoustic parameters, including fundamental frequency, Mel frequency cepstral coefficients (MFCC), linear prediction cepstrum coefficient (LPCC), frequency perturbation, amplitude perturbation, and nonlinear characteristic parameters, including entropy (sample entropy, fuzzy entropy, multi-scale entropy), box-counting dimension, intercept and Hurst, are extracted as feature vectors for identification of pathological voice. Seventy-eight normal voice samples and 73 pathological voice samples for /a/, and 78 normal samples and 80 pathological samples for /i/ are recognized based on support vector machine (SVM). The results showed that compared with traditional acoustic parameters, nonlinear characteristic parameters could be well used to distinguish between healthy and pathological voices, and the recognition rates for /a/ were all higher than those for /i/ except for multi-scale entropy. That is why the /a/ sound data is used widely in related research at home and abroad for obtaining better identification of pathological voices. Adopting multi-scale entropy for /i/ could obtain higher recognition rate than /a/ between healthy and pathological samples, which may provide some useful inspiration for evaluating vocal compensatory function.

  11. Mechanics of human voice production and control

    Science.gov (United States)

    Zhang, Zhaoyan

    2016-01-01

    As the primary means of communication, voice plays an important role in daily life. Voice also conveys personal information such as social status, personal traits, and the emotional state of the speaker. Mechanically, voice production involves complex fluid-structure interaction within the glottis and its control by laryngeal muscle activation. An important goal of voice research is to establish a causal theory linking voice physiology and biomechanics to how speakers use and control voice to communicate meaning and personal information. Establishing such a causal theory has important implications for clinical voice management, voice training, and many speech technology applications. This paper provides a review of voice physiology and biomechanics, the physics of vocal fold vibration and sound production, and laryngeal muscular control of the fundamental frequency of voice, vocal intensity, and voice quality. Current efforts to develop mechanical and computational models of voice production are also critically reviewed. Finally, issues and future challenges in developing a causal theory of voice production and perception are discussed. PMID:27794319

  12. Facial Recognition

    National Research Council Canada - National Science Library

    Mihalache Sergiu; Stoica Mihaela-Zoica

    2014-01-01

    .... From birth, faces are important in the individual's social interaction. Face perceptions are very complex as the recognition of facial expressions involves extensive and diverse areas in the brain...

  13. Software engineering

    CERN Document Server

    Sommerville, Ian

    2016-01-01

    For courses in computer science and software engineering The Fundamental Practice of Software Engineering Software Engineering introduces readers to the overwhelmingly important subject of software programming and development. In the past few years, computer systems have come to dominate not just our technological growth, but the foundations of our world's major industries. This text seeks to lay out the fundamental concepts of this huge and continually growing subject area in a clear and comprehensive manner. The Tenth Edition contains new information that highlights various technological updates of recent years, providing readers with highly relevant and current information. Sommerville's experience in system dependability and systems engineering guides the text through a traditional plan-based approach that incorporates some novel agile methods. The text strives to teach the innovators of tomorrow how to create software that will make our world a better, safer, and more advanced place to live.

  14. Fingerprint recognition

    OpenAIRE

    Diefenderfer, Graig T.

    2006-01-01

    The use of biometrics is an evolving component in today's society. Fingerprint recognition continues to be one of the most widely used biometric systems. This thesis explores the various steps present in a fingerprint recognition system. The study develops a working algorithm to extract fingerprint minutiae from an input fingerprint image. This stage incorporates a variety of image pre-processing steps necessary for accurate minutiae extraction and includes two different methods of ridge thin...

  15. Native voice, self-concept and the moral case for personalized voice technology.

    Science.gov (United States)

    Nathanson, Esther

    2017-01-01

    Purpose (1) To explore the role of native voice and effects of voice loss on self-concept and identity, and survey the state of assistive voice technology; (2) to establish the moral case for developing personalized voice technology. Methods This narrative review examines published literature on the human significance of voice, the impact of voice loss on self-concept and identity, and the strengths and limitations of current voice technology. Based on the impact of voice loss on self and identity, and voice technology limitations, the moral case for personalized voice technology is developed. Results Given the richness of information conveyed by voice, loss of voice constrains expression of the self, but the full impact is poorly understood. Augmentative and alternative communication (AAC) devices facilitate communication but, despite advances in this field, voice output cannot yet express the unique nuances of individual voice. The ethical principles of autonomy, beneficence and equality of opportunity establish the moral responsibility to invest in accessible, cost-effective, personalized voice technology. Conclusions Although further research is needed to elucidate the full effects of voice loss on self-concept, identity and social functioning, current understanding of the profoundly negative impact of voice loss establishes the moral case for developing personalized voice technology. Implications for Rehabilitation Rehabilitation of voice-disordered patients should facilitate self-expression, interpersonal connectedness and social/occupational participation. Proactive questioning about the psychological and social experiences of patients with voice loss is a valuable entry point for rehabilitation planning. Personalized voice technology would enhance sense of self, communicative participation and autonomy and promote shared healthcare decision-making. Further research is needed to identify the best strategies to preserve and strengthen identity and sense of

  16. Children's Voice or Children's Voices? How Educational Research Can Be at the Heart of Schooling

    Science.gov (United States)

    Stern, Julian

    2015-01-01

    There are problems with considering children and young people in schools as quite separate individuals, and with considering them as members of a single collectivity. The tension is represented in the use of "voice" and "voices" in educational debates. Voices in dialogue, in contrast to "children's voice", are…

  17. Voice complaints, risk factors for voice problems and history of voice problems in relation to puberty in female student teachers.

    NARCIS (Netherlands)

    Thomas, G.; Jong, F.I.C.R.S. de; Kooijman, P.G.C.; Donders, A.R.T.; Cremers, C.W.R.J.

    2006-01-01

    The aim of the study was to estimate voice complaints, risk factors for voice complaints and history of voice problems in student teachers before they embarked on their professional teaching career. A cross-sectional questionnaire survey was performed among female student teachers. The response rate

  18. Voice complaints, risk factors for voice problems and history of voice problems in relation to puberty in female student teachers.

    NARCIS (Netherlands)

    Thomas, G.; Jong, F.I.C.R.S. de; Kooijman, P.G.C.; Donders, A.R.T.; Cremers, C.W.R.J.

    2006-01-01

    The aim of the study was to estimate voice complaints, risk factors for voice complaints and history of voice problems in student teachers before they embarked on their professional teaching career. A cross-sectional questionnaire survey was performed among female student teachers. The response rate

  19. Quick Statistics about Voice, Speech, and Language

    Science.gov (United States)

    ... here Home » Health Info » Statistics and Epidemiology Quick Statistics About Voice, Speech, Language Voice, Speech, Language, and ... no 205. Hyattsville, MD: National Center for Health Statistics. 2015. Hoffman HJ, Li C-M, Losonczy K, ...

  20. Introduction: Textual and contextual voices of translation

    DEFF Research Database (Denmark)

    2017-01-01

    Voices – marks of the tangle of subjectivities involved in textual processes – constitute the very fabric of texts in general and translations in particular. The title of this book, Textual and Contextual Voices of Translation, refers both to textual voices, that is, the voices found within...... the translated texts, and to contextual voices, that is, the voices of those involved in shaping, commenting, or otherwise influencing the textual voices. The latter appear in prefaces, reviews, and other texts that surround the translated texts and provide them with a context. Our main claim is that studying...... both the textual and contextual voices helps us better understand and explain the complexity of both the translation process and the translation product. The dovetailed approach to translation research that is advocated in this book aims at highlighting the diversity of participants, power positions...

  1. Parameter Estimations for Signal Type Classification of Korean Disordered Voices

    Directory of Open Access Journals (Sweden)

    JiYeoun Lee

    2015-12-01

    Full Text Available Although many signal-typing studies have been published, they are primarily based on manual inspection and experts’ judgments of voice samples’ acoustic content. Software may be required to automatically and objectively classify pathological voices into the four signal types and to facilitate experts’ opinion formation by providing specific signal type determination criteria. This paper suggests the coefficient of normalized skewness variation (CSV, coefficient of normalized kurtosis variation (CKV, and bicoherence value (BV based on the linear predictive coding (LPC residual to categorize voice signals. Its objective is to improve the performances of acoustic parameters such as jitter, shimmer, and the signal-to-noise ratio (SNR in signal type classification. In this study, the classification and regression tree (CART was used to estimate the performances of the acoustic, CSV, CKV, and BV parameters by using the LPC residual. In the investigation of acoustic parameters such as jitter, shimmer, and the SNR, the optimal tree generated by jitter alone yielded an average accuracy of 78.6%. When the acoustic, CSV, CKV, and BV parameters together were used to generate the decision tree, the average accuracy was 82.1%. In this case, the optimal tree formed by jitter and the BV effectively discriminated between the signal types. To perform accurate acoustic pathological voice analysis, signal type quantification is of great interest. Automatic pathological voice classification can be an important objective tool as the signal type can be numerically measured. Future investigations will incorporate multiple pathological data in classification methods to improve their performance and implement more reliable detectors.

  2. Speaker's voice as a memory cue.

    Science.gov (United States)

    Campeanu, Sandra; Craik, Fergus I M; Alain, Claude

    2015-02-01

    Speaker's voice occupies a central role as the cornerstone of auditory social interaction. Here, we review the evidence suggesting that speaker's voice constitutes an integral context cue in auditory memory. Investigation into the nature of voice representation as a memory cue is essential to understanding auditory memory and the neural correlates which underlie it. Evidence from behavioral and electrophysiological studies suggest that while specific voice reinstatement (i.e., same speaker) often appears to facilitate word memory even without attention to voice at study, the presence of a partial benefit of similar voices between study and test is less clear. In terms of explicit memory experiments utilizing unfamiliar voices, encoding methods appear to play a pivotal role. Voice congruency effects have been found when voice is specifically attended at study (i.e., when relatively shallow, perceptual encoding takes place). These behavioral findings coincide with neural indices of memory performance such as the parietal old/new recollection effect and the late right frontal effect. The former distinguishes between correctly identified old words and correctly identified new words, and reflects voice congruency only when voice is attended at study. Characterization of the latter likely depends upon voice memory, rather than word memory. There is also evidence to suggest that voice effects can be found in implicit memory paradigms. However, the presence of voice effects appears to depend greatly on the task employed. Using a word identification task, perceptual similarity between study and test conditions is, like for explicit memory tests, crucial. In addition, the type of noise employed appears to have a differential effect. While voice effects have been observed when white noise is used at both study and test, using multi-talker babble does not confer the same results. In terms of neuroimaging research modulations, characterization of an implicit memory effect

  3. Software requirements

    CERN Document Server

    Wiegers, Karl E

    2003-01-01

    Without formal, verifiable software requirements-and an effective system for managing them-the programs that developers think they've agreed to build often will not be the same products their customers are expecting. In SOFTWARE REQUIREMENTS, Second Edition, requirements engineering authority Karl Wiegers amplifies the best practices presented in his original award-winning text?now a mainstay for anyone participating in the software development process. In this book, you'll discover effective techniques for managing the requirements engineering process all the way through the development cy

  4. Voicing Consciousness: The Mind in Writing

    Science.gov (United States)

    Luce-Kapler, Rebecca; Catlin, Susan; Sumara, Dennis; Kocher, Philomene

    2011-01-01

    In this paper, the authors investigate the enduring power of voice as a concept in writing pedagogy. They argue that one can benefit from considering Elbow's assertion that both text and voice be considered as important aspects of written discourse. In particular, voice is a powerful metaphor for the material, social and historical nature of…

  5. Understanding the 'Anorexic Voice' in Anorexia Nervosa.

    Science.gov (United States)

    Pugh, Matthew; Waller, Glenn

    2016-07-20

    In common with individuals experiencing a number of disorders, people with anorexia nervosa report experiencing an internal 'voice'. The anorexic voice comments on the individual's eating, weight and shape and instructs the individual to restrict or compensate. However, the core characteristics of the anorexic voice are not known. This study aimed to develop a parsimonious model of the voice characteristics that are related to key features of eating disorder pathology and to determine whether patients with anorexia nervosa fall into groups with different voice experiences. The participants were 49 women with full diagnoses of anorexia nervosa. Each completed validated measures of the power and nature of their voice experience and of their responses to the voice. Different voice characteristics were associated with current body mass index, duration of disorder and eating cognitions. Two subgroups emerged, with 'weaker' and 'stronger' voice experiences. Those with stronger voices were characterized by having more negative eating attitudes, more severe compensatory behaviours, a longer duration of illness and a greater likelihood of having the binge-purge subtype of anorexia nervosa. The findings indicate that the anorexic voice is an important element of the psychopathology of anorexia nervosa. Addressing the anorexic voice might be helpful in enhancing outcomes of treatments for anorexia nervosa, but that conclusion might apply only to patients with more severe eating psychopathology. Copyright © 2016 John Wiley & Sons, Ltd.

  6. Voice and culture: A prospect theory approach

    NARCIS (Netherlands)

    Paddock, E.L.; Ko, Junsu; Cropanzano, R.; Bagger, J.; El Akremi, A.; Camerman, A.; Greguras, G. J.; Mladinic, A.; Moliner, C.; Nam, K.; Törnblom, K.; Van den Bos, Kees

    2015-01-01

    The present study examines the congruence of individuals' minimum preferred amounts of voice with the prospect theory value function across nine countries. Accounting for previously ignored minimum preferred amounts of voice and actual voice amounts integral to testing the steepness of gain and loss

  7. Finding Voice: Learning about Language and Power

    Science.gov (United States)

    Christensen, Linda

    2011-01-01

    Christensen discusses why teachers need to teach students "voice" in its social and political context, to show the intersection of voice and power, to encourage students to ask, "Whose voices get heard? Whose are marginalized?" As Christensen writes, "Once students begin to understand that Standard English is one language among many, we can help…

  8. Analyzing the mediated voice - a datasession

    DEFF Research Database (Denmark)

    Lawaetz, Anna

    Broadcasted voices are technologically manipulated. In order to achieve a certain autencity or sound of “reality” paradoxically the voices are filtered and trained in order to reach the listeners. This “mis-en-scene” is important knowledge when it comes to the development of a consistent method o...... of analysis of the mediated voice...

  9. Analyzing the mediated voice - a datasession

    DEFF Research Database (Denmark)

    Lawaetz, Anna

    Broadcasted voices are technologically manipulated. In order to achieve a certain autencity or sound of “reality” paradoxically the voices are filtered and trained in order to reach the listeners. This “mis-en-scene” is important knowledge when it comes to the development of a consistent method...... of analysis of the mediated voice...

  10. Voice and culture: A prospect theory approach

    NARCIS (Netherlands)

    Paddock, E.L.; Ko, Junsu; Cropanzano, R.; Bagger, J.; El Akremi, A.; Camerman, A.; Greguras, G. J.; Mladinic, A.; Moliner, C.; Nam, K.; Törnblom, K.; Van den Bos, Kees

    2015-01-01

    The present study examines the congruence of individuals' minimum preferred amounts of voice with the prospect theory value function across nine countries. Accounting for previously ignored minimum preferred amounts of voice and actual voice amounts integral to testing the steepness of gain and loss

  11. "Voice Forum" The Human Voice as Primary Instrument in Music Therapy

    DEFF Research Database (Denmark)

    Pedersen, Inge Nygaard; Storm, Sanne

    2009-01-01

    Aspects will be drawn on the human voice as tool for embodying our psychological and physiological state, and attempting integration of feelings. Presentations and dialogues on different methods and techniques in "Therapy related body-and voice work.", as well as the human voice as a tool for non...... for nonverbal orientation and information both to our selves and others. Focus on training on the voice instrument, the effect and impact of the human voice, and listening perspectives...

  12. Software Innovation

    DEFF Research Database (Denmark)

    Rose, Jeremy

      Innovation is the forgotten key to modern systems development - the element that defines the enterprising engineer, the thriving software firm and the cutting edge software application.  Traditional forms of technical education pay little attention to creativity - often encouraging overly...... rationalistic ways of thinking which stifle the ability to innovate. Professional software developers are often drowned in commercial drudgery and overwhelmed by work pressure and deadlines. The topic that will both ensure success in the market and revitalize their work lives is never addressed. This book sets...... out the new field of software innovation. It organizes the existing scientific research into eight simple heuristics - guiding principles for organizing a system developer's work-life so that it focuses on innovation....

  13. Software Reviews.

    Science.gov (United States)

    Classroom Computer Learning, 1990

    1990-01-01

    Reviewed are three computer software packages including "Martin Luther King, Jr.: Instant Replay of History,""Weeds to Trees," and "The New Print Shop, School Edition." Discussed are hardware requirements, costs, grade levels, availability, emphasis, strengths, and weaknesses. (CW)

  14. Software Reviews.

    Science.gov (United States)

    Wulfson, Stephen, Ed.

    1987-01-01

    Reviews seven computer software programs that can be used in science education programs. Describes courseware which deals with muscles and bones, terminology, classifying animals without backbones, molecular structures, drugs, genetics, and shaping the earth's surface. (TW)

  15. Software Reviews.

    Science.gov (United States)

    Dwyer, Donna; And Others

    1989-01-01

    Reviewed are seven software packages for Apple and IBM computers. Included are: "Toxicology"; "Science Corner: Space Probe"; "Alcohol and Pregnancy"; "Science Tool Kit Plus"; Computer Investigations: Plant Growth"; "Climatrolls"; and "Animal Watch: Whales." (CW)

  16. Reusable Software.

    Science.gov (United States)

    1984-03-01

    overseeing reusable software, the Reusable Software Organization ( RUSO ). This author does not feel at this time that establishment of such a specific...49] have not been accompanied by establishment of RUSO -like activities. There is need, however, for assurance that functions which a RUSO might be...assurance 6. establishment and maintenance of reuse archival facilities and activities. Actual establishment of a RUSO is best dictated by size of the

  17. Software Epistemology

    Science.gov (United States)

    2016-03-01

    comprehensive approach for determining software epistemology which significantly advances the state of the art in automated vulnerability discovery...straightforward. First, internet -based repositories of open source software (e.g., FreeBSD ports, GitHub, SourceForge, etc.) are mined Approved for...the fix delta, we attempted to perform the same process to determine if the firmware release present in an Internet -of-Things (IoT) streaming camera

  18. Voice-Specialized Speech-Language Pathologist's Criteria for Discharge from Voice Therapy.

    Science.gov (United States)

    Gillespie, Amanda I; Gartner-Schmidt, Jackie

    2017-08-07

    No standard protocol exists to determine when a patient is ready and able to be discharged from voice therapy. The aim of the present study was to determine what factors speech-language pathologists (SLPs) deem most important when discharging a patient from voice therapy. A second aim was to determine if responses differed based on years of voice experience. Step 1: Seven voice-specialized SLPs generated a list of items thought to be relevant to voice therapy discharge. Step 2: Fifty voice-specialized SLPs rated each item on the list in terms of importance in determining discharge from voice therapy. Step 1: Four themes emerged-outcome measures, laryngeal appearance, SLP perceptions, and patient factors-as important items when determining discharge from voice therapy. Step 2: The top five most important criteria for discharge readiness were that the patient had to be able to (1) independently use a better voice (transfer), (2) function with his or her new voice production in activities of daily living (transfer), (3) differentiate between good and bad voice, (4) take responsibility for voice, and (5) sound better from baseline. Novice and experienced clinicians agreed between 94% and 97% concerning what was deemed "very important." SLPs agree that a patient's ability to use voice techniques in conversation and real-life situations outside of the therapy room are the most important determinants for voice therapy discharge. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  19. Multi-thread Parallel Speech Recognition for Mobile Applications

    Directory of Open Access Journals (Sweden)

    LOJKA Martin

    2014-05-01

    Full Text Available In this paper, the server based solution of the multi-thread large vocabulary automatic speech recognition engine is described along with the Android OS and HTML5 practical application examples. The basic idea was to bring speech recognition available for full variety of applications for computers and especially for mobile devices. The speech recognition engine should be independent of commercial products and services (where the dictionary could not be modified. Using of third-party services could be also a security and privacy problem in specific applications, when the unsecured audio data could not be sent to uncontrolled environments (voice data transferred to servers around the globe. Using our experience with speech recognition applications, we have been able to construct a multi-thread speech recognition serverbased solution designed for simple applications interface (API to speech recognition engine modified to specific needs of particular application.

  20. The development of the Spanish verb ir into auxiliary of voice

    DEFF Research Database (Denmark)

    Vinther, Thora

    2005-01-01

    spanish, syntax, grammaticalisation, past participle, passive voice, middle voice, language development......spanish, syntax, grammaticalisation, past participle, passive voice, middle voice, language development...

  1. Playful Interaction with Voice Sensing Modular Robots

    DEFF Research Database (Denmark)

    Heesche, Bjarke; MacDonald, Ewen; Fogh, Rune

    2013-01-01

    This paper describes a voice sensor, suitable for modular robotic systems, which estimates the energy and fundamental frequency, F0, of the user’s voice. Through a number of example applications and tests with children, we observe how the voice sensor facilitates playful interaction between...... children and two different robot configurations. In future work, we will investigate if such a system can motivate children to improve voice control and explore how to extend the sensor to detect emotions in the user’s voice....

  2. Recognition and Exteriority: Towards a Recognition-Theoretic Account of Globalization

    Directory of Open Access Journals (Sweden)

    Sebastian Purcell

    2011-06-01

    Full Text Available This essay aims to extend Paul Ricœur’s account of recognition to address some of the concerns of globalization, especially those voiced by Enrique Dussel. The extension is accomplished in two parts.  First, Dussel’s account of spatial existence as dwelling is reviewed as it is pertinent to the concerns of globalization.  Next, it is demonstrated that each of the aspects of Ricœur’s account of recognition may be given a spatial re-articulation.  The results thus establish an outline of how recognition theory might address some of the concerns of globalization.  The essay concludes with several consequences for the modification of recognition politics as one finds it in the late work of Ricœur and in Axel Honneth’s ongoing inquiries. 

  3. Application Of t-Cherry Junction Trees in Pattern Recognition

    Directory of Open Access Journals (Sweden)

    Edith Kovacs

    2010-06-01

    Full Text Available Pattern recognition aims to classify data (patterns based ei-
    ther on a priori knowledge or on statistical information extracted from the data. In this paper we will concentrate on statistical pattern recognition using a new probabilistic approach which makes possible to select the so called 'informative' features. We develop a pattern recognition algorithm which is based on the conditional independence structure underlying the statistical data. Our method was succesfully applied on a real problem of recognizing Parkinson's disease on the basis of voice disorders.

  4. VOICE QUALITY BEFORE AND AFTER THYROIDECTOMY

    Directory of Open Access Journals (Sweden)

    Dora CVELBAR

    2016-04-01

    Full Text Available Introduction: Voice disorders are a well-known complication which is often associated with thyroid gland diseases and because voice is still the basic mean of communication it is very important to maintain its quality healthy. Objectives: The aim of this study referred to questions whether there is a statistically significant difference between results of voice self-assessment, perceptual voice assessment and acoustic voice analysis before and after thyroidectomy and whether there are statistically significant correlations between variables of voice self-assessment, perceptual assessment and acoustic analysis before and after thyroidectomy. Methods: This scientific research included 12 participants aged between 41 and 76. Voice self-assessment was conducted with the help of Croatian version of Voice Handicap Index (VHI. Recorded reading samples were used for perceptual assessment and later evaluated by two clinical speech and language therapists. Recorded samples of phonation were used for acoustic analysis which was conducted with the help of acoustic program Praat. All of the data was processed through descriptive statistics and nonparametric statistical methods. Results: Results showed that there are statistically significant differences between results of voice self-assessments and results of acoustic analysis before and after thyroidectomy. Statistically significant correlations were found between variables of perceptual assessment and acoustic analysis. Conclusion: Obtained results indicate the importance of multidimensional, preoperative and postoperative assessment. This kind of assessment allows the clinician to describe all of the voice features and provides appropriate recommendation for further rehabilitation to the patient in order to optimize voice outcomes.

  5. Nonlinear dynamic analysis of voice: A normative study in the Indian population

    Directory of Open Access Journals (Sweden)

    Jacqueline B. Fernandes, Radish Kumar Balasubramanium, Arivudai Nambi Pitchaimuthu, Jayashree S. Bhat

    2014-03-01

    Full Text Available Background: The aim of this study was to establish normative data for the Indian population using Nonlinear dynamic analysis. In this study, correlation dimension, a measure of nonlinear dynamic analysis was performed for normophonic young, middle aged and elderly voices. Materials and Methods: For this purpose, normophonic young, middle aged and elderly individuals were selected without a history of voice/respiratory problems and vocal abuse/ misuse. 60 participants were selected in each group. All of these individuals had a normal voice as evaluated through GRBAS scale. Sound Recorder, on a computer desktop was used for voice recording and “convert” code in MATLAB as well as D2.ini.writer software based on TISEAN package (Hegger, Kantz & Schreiber, 1999 was used for the calculation of Correlation dimension (D2. Correlation dimension measures were obtained for each participant, for both steady vowel phonations (/a/, /i/, /u/ as well as narration samples. Results: The correlation dimension measures across the group revealed a significant main effect of the groups indicating correlation dimension increases with increase in age. Conclusions: The application of nonlinear dynamic measures in the assessment of voice is a novel venture and thus this study provides normative data for correlation dimensions in the Indian population for future comparisons against the disordered voice samples. Further studies are warranted to investigate the same in the clinical population. Also other nonlinear dynamic analysis methods need to be investigated to obtain the normative data in the Indian population.

  6. Beyond Insularity: Releasing the Voices.

    Science.gov (United States)

    Greene, Maxine

    1993-01-01

    Aspects of English-as-a-Second-Language are discussed from the standpoint of a teacher-educator with a particular interest in philosophy, the arts, and humanities and what they signify for the schools. The idea of giving voice to all viewpoints and sociocultural circumstances is considered for content learning and heterogeneous grouping. (Contains…

  7. A voice and nothing more

    DEFF Research Database (Denmark)

    Mebus, Andreas Nozic Lindgren

    2012-01-01

    Andreas Mebus fokuserer herefter på et helt konkret aspekt af talen, nemlig ”stemmen” i sin artikel ”A voice and nothing more – en filosofisk udredning af stemmen”. Gennem Mladen Dolars teori om stemmen, redegør Mebus for de forskellige aspekter ved stemmen; som bærer af mening, som æstetisk...

  8. Voice, Citizenship, and Civic Action

    DEFF Research Database (Denmark)

    Tufte, Thomas

    2014-01-01

    In recent years the world has experienced a resurgence in practices of bottom-up communication for social change, a plethora of agency in which claims for voice and citizenship through massive civic action have conquered center stage in the public debate. This resurgence has sparked a series...

  9. The Performing Voice of Radio

    DEFF Research Database (Denmark)

    Lawaetz, Anna

    The ongoing international development of opening media archives for researchers as well as for broader audiences calls for a closer discussion of the mediated voice and how to analyse it. Which parameters can be analysed and which parameters are not covered by the analysis? Furthermore, how do we...

  10. Voice and choice by delegation

    NARCIS (Netherlands)

    van de Bovenkamp, H.; Vollaard, H.; Trappenburg, M.; Grit, K

    2013-01-01

    In many Western countries, options for citizens to influence public services are increased to improve the quality of services and democratize decision making. Possibilities to influence are often cast into Albert Hirschman's taxonomy of exit (choice), voice, and loyalty. In this article we identify

  11. Work-related voice disorder

    Directory of Open Access Journals (Sweden)

    Paulo Eduardo Przysiezny

    2015-04-01

    Full Text Available INTRODUCTION: Dysphonia is the main symptom of the disorders of oral communication. However, voice disorders also present with other symptoms such as difficulty in maintaining the voice (asthenia, vocal fatigue, variation in habitual vocal fundamental frequency, hoarseness, lack of vocal volume and projection, loss of vocal efficiency, and weakness when speaking. There are several proposals for the etiologic classification of dysphonia: functional, organofunctional, organic, and work-related voice disorder (WRVD.OBJECTIVE: To conduct a literature review on WRVD and on the current Brazilian labor legislation.METHODS: This was a review article with bibliographical research conducted on the PubMed and Bireme databases, using the terms "work-related voice disorder", "occupational dysphonia", "dysphonia and labor legislation", and a review of labor and social security relevant laws.CONCLUSION: WRVD is a situation that frequently is listed as a reason for work absenteeism, functional rehabilitation, or for prolonged absence from work. Currently, forensic physicians have no comparative parameters to help with the analysis of vocal disorders. In certain situations WRVD may cause, work disability. This disorder may be labor-related, or be an adjuvant factor to work-related diseases.

  12. The Performing Voice of Radio

    DEFF Research Database (Denmark)

    Lawaetz, Anna

    The ongoing international development of opening media archives for researchers as well as for broader audiences calls for a closer discussion of the mediated voice and how to analyse it. Which parameters can be analysed and which parameters are not covered by the analysis? Furthermore, how do we...

  13. Adolescent Leadership: The Female Voice

    Science.gov (United States)

    Archard, Nicole

    2013-01-01

    This research investigated the female adolescent view of leadership by giving voice to student leaders through focus group discussions. The questions: What is leadership? Where/how was leadership taught?, and How was leadership practised? were explored within the context of girls' schools located in Australia, with one school located in South…

  14. Voicing children's critique and utopias

    DEFF Research Database (Denmark)

    Husted, Mia; Lind, Unni

    2016-01-01

    , designed to accommodate children's participation through graphic illustrations of young children's critique and utopias. The study is informed by a commitment to democratic participation and processes (Reason and Bradbury 2001, Gunnarsson et al. 2016). Ethical guidelines implied dialogues and discussions......, children's voice, critique and utopias, pedagogical work...

  15. Women's Voices in Experiential Education.

    Science.gov (United States)

    Warren, Karen, Ed.

    This book is a collection of feminist analyses of various topics in experiential education, particularly as it applies to outdoors and adventure education, as well as practical examples of how women's experiences can contribute to the field as a whole. Following an introduction, "The Quilt of Women's Voices" (Maya Angelou), the 25…

  16. MIAWARE Software

    DEFF Research Database (Denmark)

    Wilkowski, Bartlomiej; Pereira, Oscar N. M.; Dias, Paulo

    2008-01-01

    This article presents MIAWARE, a software for Medical Image Analysis With Automated Reporting Engine, which was designed and developed for doctor/radiologist assistance. It allows to analyze an image stack from computed axial tomography scan of lungs (thorax) and, at the same time, to mark all...... pathologies on images and report their characteristics. The reporting process is normalized - radiologists cannot describe pathological changes with their own words, but can only use some terms from a specific vocabulary set provided by the software. Consequently, a normalized radiological report...... is automatically generated. Furthermore, MIAWARE software is accompanied with an intelligent search engine for medical reports, based on the relations between parts of the lungs. A logical structure of the lungs is introduced to the search algorithm through the specially developed ontology. As a result...

  17. Software engineering

    CERN Document Server

    Thorin, Marc

    1985-01-01

    Software Engineering describes the conceptual bases as well as the main methods and rules on computer programming. This book presents software engineering as a coherent and logically built synthesis and makes it possible to properly carry out an application of small or medium difficulty that can later be developed and adapted to more complex cases. This text is comprised of six chapters and begins by introducing the reader to the fundamental notions of entities, actions, and programming. The next two chapters elaborate on the concepts of information and consistency domains and show that a proc

  18. Vowel recognition by fuzzy inference and application to recognition of continuous Korean speech. Fuzzy suiron ni yoru boin ninshiki to kankokugo renzoku onsei eno oyo

    Energy Technology Data Exchange (ETDEWEB)

    Choi, W.K.; Akizuki, K. (Waseda Univ., Tokyo (Japan)); Lee, H.H. (Fukuoka Inst. of Tech., Fukuoka (Japan))

    1991-05-20

    The target of voice recognition is to recognize continuous speech which is effective for speech recognition of unspecified persons. As a new matching method, the variations of feature parameters of speakers are represented as fuzzy variables to express the variation by membership functions. It is a new pattern matching method of fuzzy inference using feature parameters, fuzzy relation and synthesis of each formant, and the fuzzy rule. It is a recognition method for the inference of best formant which matches the fact by providing each characteristic quantity and fuzzy rule for composite calculation. For consonant recognition, pitch, logarithmic energies, zero crossing rates, etc. are used which represent features of each formant. KOSRES 2, recognition system for continuous Korean speech, was structured using this method which was subjected to recognition experiments on continuous Korean speech, and the recognition method by fuzzy inference is found to be effective for speech recognition of unspecified persons. 8 refs., 9 figs., 3 tabs.

  19. Efficacy of voice therapy in patient with arytenoid dislocation as a complication of tracheal intubation

    Directory of Open Access Journals (Sweden)

    Md Noorain Alam

    2015-01-01

    Full Text Available Introduction: Tracheal intubation is the placement of a flexible plastic tube into the trachea to maintain an open airway or to serve as a conduit through which certain drugs are administered. Arytenoid dislocation is one of the rare reported complications of tracheal intubation. Decreased volume and breathiness are the most common voice symptoms. Need for the Study: There is lack of prospective and systemic study of the incidence of arytenoid dislocation and efficacy of voice therapy in such cases. Aim of the Study: Present study was carried out to find out the efficacy of voice therapy as independent management option in persons with arytenoid dislocation as a consequence of tracheal intubation. Materials and Methods: The study was based on a case study of a 37 year old male patient who reported to C U Shah Medical College and Hospital, Surendranagar with complaint of breathy and soft voice quality post laparotomy as a consequence of intubation. ENT examination revealed both vocal cord bowing with ? subclinical sublaxtition of arytenoid. Pre and post voice therapy assessment was done using GRBAS scale (for perceptual analysis, Praat software (for acoustical analysis and VHI scale (for measuring the effect of voice disorder on the quality of life. Voice therapy was given for two months and pre and post findings were compared. Results: Significant improvement was observed on all the measures. Conclusion: Although arytenoid cartilage dislocation following the use of intubation is a rare event, it is important to be aware of its occurrence and to conduct diagnostic tests as early as possible in case of persistent hoarseness. Voice therapy may be used as an adjunct to phonosurgery or independently an effective intervention.

  20. It Is Not Too Late for Software Assurance!

    Science.gov (United States)

    2013-12-01

    aided by Software Assurance practices regardless of where an application is in its lifecycle. In 2012, Yahoo suffered from an attack on an...application initially developed by another company and had Yahoo executed Software Assurance techniques on the mature application, they could have prevented...houses the vast majority of our critical informa- tion; thereby hardening the current infrastructure. Case Study: Yahoo Voices In July of 2012, a

  1. [Software version and medical device software supervision].

    Science.gov (United States)

    Peng, Liang; Liu, Xiaoyan

    2015-01-01

    The importance of software version in the medical device software supervision does not cause enough attention at present. First of all, the effect of software version in the medical device software supervision is discussed, and then the necessity of software version in the medical device software supervision is analyzed based on the discussion of the misunderstanding of software version. Finally the concrete suggestions on software version naming rules, software version supervision for the software in medical devices, and software version supervision scheme are proposed.

  2. Robotic Software Integration Using MARIE

    Directory of Open Access Journals (Sweden)

    Dominic Letourneau Clement Raievsky

    2008-11-01

    Full Text Available This paper presents MARIE, a middleware framework oriented towards developing and integrating new and existing software for robotic systems. By using a generic communication framework, MARIE aims to create a flexible distributed component system that allows robotics developers to share software programs and algorithms, and design prototypes rapidly based on their own integration needs. The use of MARIE is illustrated with the design of a socially interactive autonomous mobile robot platform capable of map building, localization, navigation, tasks scheduling, sound source localization, tracking and separation, speech recognition and generation, visual tracking, message reading and graphical interaction using a touch screen interface.

  3. Robotic Software Integration Using MARIE

    Directory of Open Access Journals (Sweden)

    Carle Côté

    2006-03-01

    Full Text Available This paper presents MARIE, a middleware framework oriented towards developing and integrating new and existing software for robotic systems. By using a generic communication framework, MARIE aims to create a flexible distributed component system that allows robotics developers to share software programs and algorithms, and design prototypes rapidly based on their own integration needs. The use of MARIE is illustrated with the design of a socially interactive autonomous mobile robot platform capable of map building, localization, navigation, tasks scheduling, sound source localization, tracking and separation, speech recognition and generation, visual tracking, message reading and graphical interaction using a touch screen interface.

  4. Asymmetries in the exploitation of phonetic features for word recognition.

    Science.gov (United States)

    Martin, Alexander; Peperkamp, Sharon

    2015-04-01

    French listeners' reliance on voicing, manner, and place was tested in a mispronunciation detection task. Mispronounced words were more likely to be recognized when the mispronunciation concerned voicing rather than manner or place. This indicates that listeners rely less on the former than on the latter for the purposes of word recognition. Further, the role of visual cues to phonetic features was explored by the task being conducted in both an audio-only and an audiovisual version, but no effect of modality was found. Discussion focuses on crosslinguistic comparisons and lexical factors that might influence the weight of individual features.

  5. Educational Software.

    Science.gov (United States)

    Northwest Regional Educational Lab., Portland, OR.

    The third session of IT@EDU98 consisted of five papers on educational software and was chaired by Tran Van Hao (University of Education, Ho Chi Minh City, Vietnam). "Courseware Engineering" (Nguyen Thanh Son, Ngo Ngoc Bao Tran, Quan Thanh Tho, Nguyen Hong Lam) briefly describes the use of courseware. "Machine Discovery Theorems in Geometry: A…

  6. Software Patents.

    Science.gov (United States)

    Burke, Edmund B.

    1994-01-01

    Outlines basic patent law information that pertains to computer software programs. Topics addressed include protection in other countries; how to obtain patents; kinds of patents; duration; classes of patentable subject matter, including machines and processes; patentability searches; experimental use prior to obtaining a patent; and patent…

  7. Software Systems

    Institute of Scientific and Technical Information of China (English)

    崔涛; 周淼

    1996-01-01

    The information used with computers is known as software and includesprograms and data. Programs are sets of instructions telling the computerwhat operations have to be carried out and in what order they should be done. Specialised programs which enable the computer to be used for particularpurposes are called applications programs. A collection of these programs kept

  8. Software Reviews.

    Science.gov (United States)

    Science and Children, 1990

    1990-01-01

    Reviewed are seven computer software packages for IBM and/or Apple Computers. Included are "Windows on Science: Volume 1--Physical Science"; "Science Probe--Physical Science"; "Wildlife Adventures--Grizzly Bears"; "Science Skills--Development Programs"; "The Clean Machine"; "Rock Doctor"; and "Geology Search." Cost, quality, hardware, and…

  9. Software Review.

    Science.gov (United States)

    McGrath, Diane, Ed.

    1989-01-01

    Reviewed is a computer software package entitled "Audubon Wildlife Adventures: Grizzly Bears" for Apple II and IBM microcomputers. Included are availability, hardware requirements, cost, and a description of the program. The murder-mystery flavor of the program is stressed in this program that focuses on illegal hunting and game management. (CW)

  10. The Role of Occupational Voice Demand and Patient-Rated Impairment in Predicting Voice Therapy Adherence.

    Science.gov (United States)

    Ebersole, Barbara; Soni, Resha S; Moran, Kathleen; Lango, Miriam; Devarajan, Karthik; Jamal, Nausheen

    2017-07-11

    Examine the relationship among the severity of patient-perceived voice impairment, perceptual dysphonia severity, occupational voice demand, and voice therapy adherence. Identify clinical predictors of increased risk for therapy nonadherence. A retrospective cohort study of patients presenting with a chief complaint of persistent dysphonia at an interdisciplinary voice center was done. The Voice Handicap Index-10 (VHI-10) and the Voice-Related Quality of Life (V-RQOL) survey scores, clinician rating of dysphonia severity using the Grade score from the Grade, Roughness Breathiness, Asthenia, and Strain scale, occupational voice demand, and patient demographics were tested for associations with therapy adherence, defined as completion of the treatment plan. Classification and Regression Tree (CART) analysis was performed to establish thresholds for nonadherence risk. Of 166 patients evaluated, 111 were recommended for voice therapy. The therapy nonadherence rate was 56%. Occupational voice demand category, VHI-10, and V-RQOL scores were the only factors significantly correlated with therapy adherence (P occupational voice demand are significantly more likely to be nonadherent with therapy than those with high occupational voice demand (P 40 is a significant cutoff point for predicting therapy nonadherence (P Occupational voice demand and patient perception of impairment are significantly and independently correlated with therapy adherence. A VHI-10 score of ≤9 or a V-RQOL score of >40 is a significant cutoff point for predicting nonadherence risk. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  11. Facial Recognition

    Directory of Open Access Journals (Sweden)

    Mihalache Sergiu

    2014-05-01

    Full Text Available During their lifetime, people learn to recognize thousands of faces that they interact with. Face perception refers to an individual's understanding and interpretation of the face, particularly the human face, especially in relation to the associated information processing in the brain. The proportions and expressions of the human face are important to identify origin, emotional tendencies, health qualities, and some social information. From birth, faces are important in the individual's social interaction. Face perceptions are very complex as the recognition of facial expressions involves extensive and diverse areas in the brain. Our main goal is to put emphasis on presenting human faces specialized studies, and also to highlight the importance of attractiviness in their retention. We will see that there are many factors that influence face recognition.

  12. VOICE ACTIVATED MULTIPROCESSOR EMBEDDED SYSTEM TO IMPROVE THE CONTROL OF A MOTORIZED WHEELCHAIR

    Directory of Open Access Journals (Sweden)

    SANGMESHWAR S. KENDRE,

    2010-11-01

    Full Text Available The main idea of this work is to process analog voice signal. The theme is implemented for controlling the wheelchair by voice through speech processing using Hawkboard (OMAP processor. The adopted model is based on grouping a ARM and a DSP processor for speech enhancement with a voice recognition module for isolated word and speaker dependent. The Texas Instruments OMAP-L138 is integrated in order to enhance the quality of speech signal by reducing noise and connected with the wheelchair for processing of the voice signal. The Hawkboard denoises speech signal and HMC2007 recognizes the commnads. It also generates different desired signals according to the spoken words which further used to control the movement of wheelchair, a vector of information on the context given by a set of sensors for security actions. Six words are recognized which are start, forward, reverse, left, right, stop. In order to gain in time design, experiments have shown that the best way is to choose a speech recognition kit and to adapt it to the application. The result at the end show the efficiency of the system.

  13. Design and implementation of a user-oriented speech recognition interface: the synergy of technology and human factors

    NARCIS (Netherlands)

    Kloosterman, Sietse H.

    1994-01-01

    The design and implementation of a user-oriented speech recognition interface are described. The interface enables the use of speech recognition in so-called interactive voice response systems which can be accessed via a telephone connection. In the design of the interface a synergy of technology

  14. DESIGN AND IMPLEMENTATION OF A USER-ORIENTED SPEECH RECOGNITION INTERFACE - THE SYNERGY OF TECHNOLOGY AND HUMAN-FACTORS

    NARCIS (Netherlands)

    KLOOSTERMAN, SH

    The design and implementation of a user-oriented speech recognition interface are described. The interface enables the use of speech recognition in so-called interactive voice response systems which can be accessed via a telephone connection. In the design of the interface a synergy of technology

  15. Penguins use the two-voice system to recognize each other.

    Science.gov (United States)

    Aubin, T; Jouventin, P; Hildebrand, C

    2000-06-07

    The sound-producing structure in birds is the syrinx, which is usually a two-part organ located at the junction of the bronchi. As each branch of the syrinx produces sound independently, many birds have two acoustic sources. Thirty years ago, we had anatomical, physiological and acoustical evidence of this two-voice phenomenon but no function was known. In songbirds, often these two voices with their respective harmonics are not activated simultaneously but they are obvious in large penguins and generate a beat pattern which varies between individuals. The emperor penguin breeds during the Antarctic winter, incubating and carrying its egg on its feet. Without the topographical cue of a nest, birds identify each other only by vocal means when switching duties during incubation or chick rearing. To test whether the two-voice system contains the identity code, we played back the modified call of their mate to both adults and also the modified call of their parents to chicks. Both the adults and the chicks replied to controls (two voices) but not to modified signals (one voice being experimentally suppressed). Our experiments demonstrate that the beat generated by the interaction of these two fundamental frequencies conveys information about individual identity and also propagates well through obstacles, being robust to sound degradation through the medium of bodies in a penguin colony. The two-voice structure is also clear in the call of other birds such as the king penguin, another non-nesting species, but not in the 14 other nesting penguins. We concluded that the two-voice phenomenon functions as an individual recognition system in species using few if any landmarks to meet. In penguins, this coding process, increasing the call complexity and resisting sound degradation, has evolved in parallel with the loss of territoriality.

  16. Classifying human voices by using hybrid SFX time-series preprocessing and ensemble feature selection.

    Science.gov (United States)

    Fong, Simon; Lan, Kun; Wong, Raymond

    2013-01-01

    Voice biometrics is one kind of physiological characteristics whose voice is different for each individual person. Due to this uniqueness, voice classification has found useful applications in classifying speakers' gender, mother tongue or ethnicity (accent), emotion states, identity verification, verbal command control, and so forth. In this paper, we adopt a new preprocessing method named Statistical Feature Extraction (SFX) for extracting important features in training a classification model, based on piecewise transformation treating an audio waveform as a time-series. Using SFX we can faithfully remodel statistical characteristics of the time-series; together with spectral analysis, a substantial amount of features are extracted in combination. An ensemble is utilized in selecting only the influential features to be used in classification model induction. We focus on the comparison of effects of various popular data mining algorithms on multiple datasets. Our experiment consists of classification tests over four typical categories of human voice data, namely, Female and Male, Emotional Speech, Speaker Identification, and Language Recognition. The experiments yield encouraging results supporting the fact that heuristically choosing significant features from both time and frequency domains indeed produces better performance in voice classification than traditional signal processing techniques alone, like wavelets and LPC-to-CC.

  17. Familiar Speaker Recognition

    Science.gov (United States)

    2012-05-01

    linguistic information (such as a laughter or stutter ) may allow the listener to quickly identify the speaker. For speakers that are less familiar to...EXPERIMENTAL RESULTS Figure 2 shows the results for Sessions 1-6. Remember that there are 25 voices and for every session, each voice is presented twice...Thus, a 90% correct means that out of 50 voice presentations , the listener identified the correct speaker 45 times. It was always a forced choice

  18. Perturbation Measures of Voice: A Comparative Study between Multi-Dimensional Voice Program and Praat

    National Research Council Canada - National Science Library

    Maryn, Youri; Corthals, Paul; De Bodt, Marc; Van Cauwenberge, Paul; Deliyski, Dimitar

    2009-01-01

    .... In the present study, perturbation measures provided by two computer systems (a purpose-built professional voice analysis apparatus and a personal computer-based system for acoustic voice assessment...

  19. [The smokers voice self assessment based on Voice Handicap Index (VHI)].

    Science.gov (United States)

    Wiskirska-Woźnica, Bozena; Wojnowski, Waldemar

    2009-01-01

    Complex voice assessment due to European Laryngeal Society proposals (2000) contains voice self estimation based on the Polish version of the Voice Handicap Index (VHI). This study focuses on the relation between voice handicap and smoking in dysphonic patients, who are using voice professionally. Thirty outpatient (25 female and 5 male, aged 40 to 55 years) voice department attendees suffering from professional dysphonia took part in this study. All patients after phoniatric examination completed the Polish version of the Voice Handicap Index (VHI). The questions concern functional, emotional and physical complains due to dysphonia. Most of smokers did not complain of dysphonia related problems comparing to non smokers. Even the scores of functional and emotional scales of VHI in smokers shown better results (less handicap) than in nonsmokers. Smoking does not affect patients handicap due to dysphonia measured in the Voice Handicap Index.

  20. Qos and Voice Over IP

    Directory of Open Access Journals (Sweden)

    Adrian GHENCEA

    Full Text Available As Voice over Internet Protocol (VoIP technology matures, companies are increasingly adopting it to cut costs, improve efficiency and enhance customer service. Using the Internet as an existing network for integrating data and telecom systems through intelligent VoIP, a range of benefits results: lower long distance costs, cost cuts in cabling processes and more flexible telephony management. However, as voice over IP services grow in popularity, major threats arise: this rapid growth leads to traffic congestion, security is jeopardizedand the poor quality of calls affects communication. The objective of this article is to present all the elements that can affect voicequality in a VoIP network and to provide methods for solving them. A detailed analysis to minimize the impact of implementation of QoS will be made, and at the end solutions to management strategies will be proposed.

  1. Effect of Religiosity on Voice.

    Science.gov (United States)

    Hamdan, Abdul-Latif; Khneisser, Gebran; Dowli, Alex; Ziade, Georges; Tamim, Hani

    2017-04-01

    The aim of this study was to examine the relationship between religiosity and phonatory behavior. A total of 186 participants participated in a survey that included four sections: demographic data, extent of religiosity, history of dysphonia, phonatory behavior and laryngeal manipulation, in addition to the Voice Handicap Index (VHI-10). There was no significant association between the prevalence of phonatory symptoms and any of the religiosity questions. There was no significant association between phonatory behavior, history of laryngeal manipulation and any of the religiosity questions. There was also no significant association between the score of the Voice Handicap Index and any of the five religiosity questions. There is no association between religiosity and prevalence of phonatory disturbances, phonotraumatic behavior and/or history of laryngeal manipulation.

  2. Feature based sliding window technique for face recognition

    Science.gov (United States)

    Javed, Muhammad Younus; Mohsin, Syed Maajid; Anjum, Muhammad Almas

    2010-02-01

    Human beings are commonly identified by biometric schemes which are concerned with identifying individuals by their unique physical characteristics. The use of passwords and personal identification numbers for detecting humans are being used for years now. Disadvantages of these schemes are that someone else may use them or can easily be forgotten. Keeping in view of these problems, biometrics approaches such as face recognition, fingerprint, iris/retina and voice recognition have been developed which provide a far better solution when identifying individuals. A number of methods have been developed for face recognition. This paper illustrates employment of Gabor filters for extracting facial features by constructing a sliding window frame. Classification is done by assigning class label to the unknown image that has maximum features similar to the image stored in the database of that class. The proposed system gives a recognition rate of 96% which is better than many of the similar techniques being used for face recognition.

  3. Multimodal approaches for emotion recognition: a survey

    Science.gov (United States)

    Sebe, Nicu; Cohen, Ira; Gevers, Theo; Huang, Thomas S.

    2005-01-01

    Recent technological advances have enabled human users to interact with computers in ways previously unimaginable. Beyond the confines of the keyboard and mouse, new modalities for human-computer interaction such as voice, gesture, and force-feedback are emerging. Despite important advances, one necessary ingredient for natural interaction is still missing-emotions. Emotions play an important role in human-to-human communication and interaction, allowing people to express themselves beyond the verbal domain. The ability to understand human emotions is desirable for the computer in several applications. This paper explores new ways of human-computer interaction that enable the computer to be more aware of the user's emotional and attentional expressions. We present the basic research in the field and the recent advances into the emotion recognition from facial, voice, and physiological signals, where the different modalities are treated independently. We then describe the challenging problem of multimodal emotion recognition and we advocate the use of probabilistic graphical models when fusing the different modalities. We also discuss the difficult issues of obtaining reliable affective data, obtaining ground truth for emotion recognition, and the use of unlabeled data.

  4. Named Entity Recognition for IDEAL

    OpenAIRE

    Du, Qianzhou; Zhang, Xuan

    2015-01-01

    This project explored how to apply Named Entity Recognition to large Twitter and web page datasets to extract useful entities such as people, organization, location, and date. In addition, this NER utility has been scaled to the MapReduce framework on the Hadoop cluster. A schema and software allow this to be integrated with IDEAL. The term “Named Entity”, which was first introduced by Grishman and Sundheim, is widely used in Natural Language Processing (NLP). The researchers were focusing...

  5. Feature Recognition for Virtual Machining

    OpenAIRE

    Xú, Shixin; Anwer, Nabil; Qiao, Lihong

    2014-01-01

    International audience; Virtual machining uses software tools to simulate machining processes in virtual environments ahead of actual production. This paper proposes that feature recognition techniques can be applied in the course of virtual machining, such as identifying some process problems, and presenting corresponding correcting advices. By comparing with the original CAD model, form errors of the machining features can be found. And then corrections are suggested to process designers. T...

  6. Voice disorders in mucosal leishmaniasis.

    Directory of Open Access Journals (Sweden)

    Ana Cristina Nunes Ruas

    Full Text Available INTRODUCTION: Leishmaniasis is considered as one of the six most important infectious diseases because of its high detection coefficient and ability to produce deformities. In most cases, mucosal leishmaniasis (ML occurs as a consequence of cutaneous leishmaniasis. If left untreated, mucosal lesions can leave sequelae, interfering in the swallowing, breathing, voice and speech processes and requiring rehabilitation. OBJECTIVE: To describe the anatomical characteristics and voice quality of ML patients. MATERIALS AND METHODS: A descriptive transversal study was conducted in a cohort of ML patients treated at the Laboratory for Leishmaniasis Surveillance of the Evandro Chagas National Institute of Infectious Diseases-Fiocruz, between 2010 and 2013. The patients were submitted to otorhinolaryngologic clinical examination by endoscopy of the upper airways and digestive tract and to speech-language assessment through directed anamnesis, auditory perception, phonation times and vocal acoustic analysis. The variables of interest were epidemiologic (sex and age and clinic (lesion location, associated symptoms and voice quality. RESULTS: 26 patients under ML treatment and monitored by speech therapists were studied. 21 (81% were male and five (19% female, with ages ranging from 15 to 78 years (54.5+15.0 years. The lesions were distributed in the following structures 88.5% nasal, 38.5% oral, 34.6% pharyngeal and 19.2% laryngeal, with some patients presenting lesions in more than one anatomic site. The main complaint was nasal obstruction (73.1%, followed by dysphonia (38.5%, odynophagia (30.8% and dysphagia (26.9%. 23 patients (84.6% presented voice quality perturbations. Dysphonia was significantly associated to lesions in the larynx, pharynx and oral cavity. CONCLUSION: We observed that vocal quality perturbations are frequent in patients with mucosal leishmaniasis, even without laryngeal lesions; they are probably associated to disorders of some

  7. English Spoken Language & Voice Quality

    OpenAIRE

    Folsberg, Jens; Nielsen, Charlotte; Brusokaite, Giedre; Beinkamp, Line; Bach Jensen, Niels; Aalbæk Jensen, Sandra

    2013-01-01

    This project investigates the way language and accents are depicted in animated features and how linguistic stereotypes can be used in the process of character construction. In order to look into that, examples from four movies, produced by two studios, have been selected; the American Disney studios being represented by The Lion King (1994) and Up (2009), and the Japanese Studio Ghibli being represented by Ponyo (2008) and Howl's Moving Castle (2004). Voice qualities and specific accents ...

  8. Tracheostomy cannulas and voice prosthesis

    OpenAIRE

    2011-01-01

    Cannulas and voice prostheses are mechanical aids for patients who had to undergo tracheotomy or laryngectomy for different reasons. For better understanding of the function of those artificial devices, first the indications and particularities of the previous surgical intervention are described in the context of this review. Despite the established procedure of percutaneous dilatation tracheotomy e.g. in intensive care units, the application of epithelised tracheostomas has its own position,...

  9. The voice of the voiceless

    Directory of Open Access Journals (Sweden)

    Carlos Morales

    2014-12-01

    Full Text Available Women’s voice, these are stories of a group of women and their partners who lived part of a particular historical period agriculture during the twentieth century, the hacienda. The present work attempts to reflect their everyday life, reinterpreted from nowadays as a reflection of collective memory, in the big farms where they grew by putting in scene their roles in juxtaposition to the roles of male tradition.

  10. EPIQR software

    Energy Technology Data Exchange (ETDEWEB)

    Flourentzos, F. [Federal Institute of Technology, Lausanne (Switzerland); Droutsa, K. [National Observatory of Athens, Athens (Greece); Wittchen, K.B. [Danish Building Research Institute, Hoersholm (Denmark)

    1999-11-01

    The support of the EPIQR method is a multimedia computer program. Several modules help the users of the method to treat the data collected during a diagnosis survey, to set up refurbishment scenario and calculate their cost or energy performance, and finally to visualize the results in a comprehensive way and to prepare quality reports. This article presents the structure and the main features of the software. (au)

  11. EPIQR software

    Energy Technology Data Exchange (ETDEWEB)

    Flourentzos, F. [Federal Institute of Technology-Lausanne (EPFL), Solar Energy and Building Physics Laboratory (LESO-PB), Lausanne (Switzerland); Droutsa, K. [National Observatory of Athens, Institute of Meteorology and Physics of Atmospheric Environment, Group Energy Conservation, Athens (Greece); Wittchen, K.B. [Danish Building Research Institute, Division of Energy and Indoor Environment, Hoersholm, (Denmark)

    2000-07-01

    The support of the EPIQR method is a multimedia computer program. Several modules help the users of the method to treat the data collected during a diagnosis survey, to set up refurbishment scenarios and calculate their cost or energy performance, and finally to visualize the results in a comprehensive way and to prepare quality reports. This article presents the structure and the main features of the software. (author)

  12. Software preservation

    Directory of Open Access Journals (Sweden)

    Tadej Vodopivec

    2011-01-01

    Full Text Available Comtrade Ltd. covers a wide range of activities related to information and communication technologies; its deliverables include web applications, locally installed programs,system software, drivers, embedded software (used e.g. in medical devices, auto parts,communication switchboards. Also the extensive knowledge and practical experience about digital long-term preservation technologies have been acquired. This wide spectrum of activities puts us in the position to discuss the often overlooked aspect of the digital preservation - preservation of software programs. There are many resources dedicated to digital preservation of digital data, documents and multimedia records,but not so many about how to preserve the functionalities and features of computer programs. Exactly these functionalities - dynamic response to inputs - render the computer programs rich compared to documents or linear multimedia. The article opens the questions on the beginning of the way to the permanent digital preservation. The purpose is to find a way in the right direction, where all relevant aspects will be covered in proper balance. The following questions are asked: why at all to preserve computer programs permanently, who should do this and for whom, when we should think about permanent program preservation, what should be persevered (such as source code, screenshots, documentation, and social context of the program - e.g. media response to it ..., where and how? To illustrate the theoretic concepts given the idea of virtual national museum of electronic banking is also presented.

  13. Voices in (and around the Museum: Introduction

    Directory of Open Access Journals (Sweden)

    Ruth Holt

    2012-08-01

    Full Text Available The voice already plays an important role in contemporary art. This introductory paper summarises a series of four sessions in which speakers explored the place of the voice in the museum context. It became clear that the voice not only offered richness in interpretation of and response to other museum artefacts but was itself an artefact meriting conservation  and interpretation.

  14. Software Engineering to Professionalize Software Development

    Directory of Open Access Journals (Sweden)

    Juan Miguel Alonso

    2011-12-01

    Full Text Available The role, increasingly important, that plays the software in the systems with widespread effects presents new challenges for the formation of Software Engineers. Not only because social dependence software is increasing, but also because the character of software development is also changing and with it the demands for software developers certified. In this paper are propose some challenges and aspirations that guide the learning processes Software Engineering and help to identify the need to train professionals in software development.

  15. Speak Up! But don't strain your voice

    Science.gov (United States)

    ... Disorders Speak Up! But don't strain your voice Past Issues / Fall 2008 Table of Contents For ... Javascript on. A clinical trial at the NIDCD Voice Center gave Sherdina Jones tools to limit voice ...

  16. Speaking up in groups: a cross-level study of group voice climate and voice.

    Science.gov (United States)

    Morrison, Elizabeth Wolfe; Wheeler-Smith, Sara L; Kamdar, Dishan

    2011-01-01

    Despite a growing body of research on employee voice—defined as the discretionary communication of ideas, suggestions, or opinions intended to improve organizational or unit functioning—the effects of shared or collective-level cognitions have received scant attention. There has also been relatively little research on voice within work groups. Our goal in this study was to address these important gaps by focusing on the effects of group-level beliefs about voice (i.e., group voice climate) on individual voice behavior within work groups. We conducted a cross-level investigation of voice behavior within 42 groups of engineers from a large chemical company. Consistent with our hypotheses, group voice climate was highly predictive of voice and explained variance beyond the effects of individual-level identification and satisfaction, and procedural justice climate. Also consistent with predictions, the effect of identification on voice was stronger in groups with favorable voice climates. These findings provide evidence that voice is shaped not just by individual attitudes and perceptions of the work context, as past research has shown, but also by group-level beliefs. The results also highlight the importance of broadening our conceptual models of voice to include shared cognitions and of conducting additional cross-level research on voice.

  17. Alterations in tone of voice in patients with restrictive anorexia nervosa: A pilot study

    Directory of Open Access Journals (Sweden)

    Concepción García-Santana

    2016-06-01

    Full Text Available This article is about the tone of voice (acoustic perception in patients with restrictive anorexia nervosa (AN-R. Our goal was to study whether or not there is an alteration of the tone in restrictive anorexia nervosa when the disease has started in the puberty. The total sample consisted of 148 subjects divided in two groups: control (n=102 and AN-R (n=46. The voice´s fundamental frequencies (F0 were determined based on the repetition of two phonemes ("a" and "i" and measured by a microphone Plantonic 300 and a digital recorder. We analyzed the voice´s F0 using Praat software. We present the first data for the normal range of the F0 in Spanish healthy women from 9 to 17 years old who were Spanish native speakers. Finally, we show a comparison of data between AN-R patients and control group.

  18. Automatic Detection of Pathologies in The Voice by HOS Based Parameters

    Directory of Open Access Journals (Sweden)

    de Leon José

    2001-01-01

    Full Text Available In the current panorama the conclusive identification of a laryngeal pathology relies inevitably on the observation of the vocal folds by means of laryngoscopical techniques. This inspection technique is inconvenient for a number of reasons, such as its high cost, the duration of the inspection, and, above all, the fact that it is an invasive technique. This paper looks into the possibility of measuring the quality of a voice starting from an audio recording. The existing parameters in current literature ("classic parameters" which allow quantifying the quality of a voice have been studied, and the parameters that present better results have been selected. Also, seven new High Order Statistics (HOS based parameters are proposed to parametrize the voice signal. On the other hand, a software package has been developed which carries out the automatic detection of dysfunction in phonation. A success rate of % has been obtained by using both the classic and the HOS based proposed parameters.

  19. The electrolarynx: voice restoration after total laryngectomy.

    Science.gov (United States)

    Kaye, Rachel; Tang, Christopher G; Sinclair, Catherine F

    2017-01-01

    The ability to speak and communicate with one's voice is a unique human characteristic and is fundamental to many activities of daily living, such as talking on the phone and speaking to loved ones. When the larynx is removed during a total laryngectomy (TL), loss of voice can lead to a devastating decrease in a patient's quality of life, and precipitate significant frustration over their inability to communicate with others effectively. Over the past 50 years there have been many advances in techniques of voice restoration after TL. Currently, there are three main methods of voice restoration: the electrolarynx, esophageal speech, and tracheoesophageal speech through a tracheoesophageal puncture (TEP) with voice prosthesis. Although TEP voice is the current gold standard for vocal rehabilitation, a significant minority of patients cannot use or obtain TEP speech for various reasons. As such, the electrolarynx is a viable and useful alternative for these patients. This article will focus on voice restoration using an electrolarynx with the following objectives: 1) To provide an understanding of the importance of voice restoration after total laryngectomy. 2) To discuss how the electrolarynx may be used to restore voice following total laryngectomy. 3) To outline some of the current electrolarynx devices available, including their mechanism of action and limitations. 4) To compare pros and cons of electrolaryngeal speech to TEP and esophageal speech.

  20. The Performing Voice of the Audiobook

    DEFF Research Database (Denmark)

    Pedersen, Birgitte Stougaard; Have, Iben

    2014-01-01

    will be based on a reception aesthetic and phenomenological approach, the latter as presented by Don Ihde in his book Listening and Voice. Phenomenologies of Sound , and my analytical sketches will be related to theoretical statements concerning the understanding of voice and media (Cavarero, Dolar, La......Belle, Neumark). Finally, the article will discuss the specific artistic combination and our auditory experience of mediated human voices and sculpturally projected faces in an art museum context under the general conditions of the societal panophonia of disembodied and mediated voices, as promoted by Steven...

  1. Methods of Translating the English passive voice

    Institute of Scientific and Technical Information of China (English)

    张李丽

    2009-01-01

    @@ 一、Comparisons between Voices in English and Chinese In English as in many other languages,the passive voice is the form of a transitive verb whose grammatical sabjOct serves as the patient,receiving the action of the verb.The passive voice is typically contrasted with the active voice,which is the form of a transitive verb whose subject serves as the agent,performing the action of the verb.The subject of a verb in the passive voice corresponds to the object of the same verb in the active voice.English's passive voice is periphrastic;that is,it does not have a one-word form.Rather,it is formed using a form of the auxiliary verb be together with a verb's past participle.The passive voice is widely used in English when it is unnecessary,undesirable,or impossible to ilame the agent of an action,or when the passive voice is needed to link the text better.

  2. Prevalence of voice disorders among future teachers.

    Science.gov (United States)

    Simberg, S; Laine, A; Sala, E; Rönnemaa, A M

    2000-06-01

    An epidemiological study was conducted in order to find out the prevalence of voice disorders among students studying to be teachers. Vocal symptoms were inquired of 226 students. Their voices were assessed perceptually by a speech therapist and those who had abnormal voice quality or reported several vocal symptoms were referred to a clinical examination by a laryngologist. The results showed that 20% of this population reported two or more vocal symptoms during the previous year and that 19% had an organic voice disorder. This reinforces the need for clinical evaluation of students with vocal symptoms and more vocal training in the teacher education programs.

  3. Software"Visual Image"for classical women jackets and men suits reconstruction and recognition%经典男女正装的重建与识别的"视觉形象"软件

    Institute of Scientific and Technical Information of China (English)

    叶洪光; 陈明珍; 维克多·库兹米切夫

    2008-01-01

    "视觉形象"这种新软件其功能是帮助重建20世纪五十年代至21世纪初男女正装图片中地实际尺寸和其所属年代风格的对应.该软件的开发是在研究了大量地时尚杂志地原始图片及服装企业所使用地板型基础上建立地数据库.该程序可以作为独立地计算机软件或服装CAD系统其中一个应用模块.%New software named"Visual Image"allows to reconstruct real sizes of women jackets and men suits have been taken from their photos and to establish the possible time(1950th, 1960th, 1970th. 1980th, 1990th,2000th)when the style was created.Software is consisted the original data bases obtained after exploration a lot of photos from fashion magazines and pattern block for cutting used by industrial enterprises.The program can be used as independent software or as an integrated module in CAD.

  4. V2S: Voice to Sign Language Translation System for Malaysian Deaf People

    Science.gov (United States)

    Mean Foong, Oi; Low, Tang Jung; La, Wai Wan

    The process of learning and understand the sign language may be cumbersome to some, and therefore, this paper proposes a solution to this problem by providing a voice (English Language) to sign language translation system using Speech and Image processing technique. Speech processing which includes Speech Recognition is the study of recognizing the words being spoken, regardless of whom the speaker is. This project uses template-based recognition as the main approach in which the V2S system first needs to be trained with speech pattern based on some generic spectral parameter set. These spectral parameter set will then be stored as template in a database. The system will perform the recognition process through matching the parameter set of the input speech with the stored templates to finally display the sign language in video format. Empirical results show that the system has 80.3% recognition rate.

  5. The voice conveys emotion in ten globalized cultures and one remote village in Bhutan.

    Science.gov (United States)

    Cordaro, Daniel T; Keltner, Dacher; Tshering, Sumjay; Wangchuk, Dorji; Flynn, Lisa M

    2016-02-01

    With data from 10 different globalized cultures and 1 remote, isolated village in Bhutan, we examined universals and cultural variations in the recognition of 16 nonverbal emotional vocalizations. College students in 10 nations (Study 1) and villagers in remote Bhutan (Study 2) were asked to match emotional vocalizations to 1-sentence stories of the same valence. Guided by previous conceptualizations of recognition accuracy, across both studies, 7 of the 16 vocal burst stimuli were found to have strong or very strong recognition in all 11 cultures, 6 vocal bursts were found to have moderate recognition, and 4 were not universally recognized. All vocal burst stimuli varied significantly in terms of the degree to which they were recognized across the 11 cultures. Our discussion focuses on the implications of these results for current debates concerning the emotion conveyed in the voice.

  6. Portable EGG recording system based on a digital voice recorder.

    Science.gov (United States)

    Jang, J-K; Shieh, M-J; Kuo, T-S; Jaw, F-S

    2009-01-01

    Cutaneous electrogastrogram (EGG) recording offers the benefit of non-invasive gastrointestinal diagnosis. With long-term ambulatory recording of signals, researchers and clinicians could have more opportunities to investigate and analyse paroxysmal or acute symptoms. A portable EGG system based on a digital voice recorder (DVR) is designed for long-term recording of cutaneous EGG signals. The system consists of electrodes, an EGG amplifier, a modulator, and a DVR. Online monitoring and off-line acquisition of EGG are handled by software. A special design employing an integrated timer circuit is used to modulate the EGG frequency to meet the input requirements of the DVR. This approach involves low supply voltage and low power consumption. Software demodulation is used to simplify the complexity of the system, and is helpful in reducing the size of the portable device. By using surface-mount devices (SMD) and a low-power design, the system is robust, compact, and suitable for long-term portable recording. As a result, researchers can record an ambulatory EGG signal by means of the proposed circuits in conjunction with an up-to-date voice-recording device.

  7. Influence of classroom acoustics on the voice levels of teachers with and without voice problems: a field study

    DEFF Research Database (Denmark)

    Pelegrin Garcia, David; Lyberg-Åhlander, Viveka; Rydell, Roland

    2010-01-01

    Many teachers suffer from voice problems and classroom acoustics has been considered as one of the potential hazards for this. The present study examines how classroom acoustics interacts with the voices of 14 teachers without voice problems and 13 teachers with voice problems. The assessment...... of Reverberation Time and Voice Support were measured in the 30 empty classrooms of the study. An empirical model shows that the measured voice levels depended on the activity noise levels and the voice support. Teachers with and without voice problems were differently affected by the voice support...... of the classroom. The results thus suggest that teachers with voice problems are more aware of classroom acoustic conditions than their healthy colleagues and make use of the more supportive rooms to lower their voice levels. This behavior may result from an adaptation process of the teachers with voice problems...

  8. Reasons for non-adherence to cardiometabolic medications, and acceptability of an interactive voice response intervention in patients with hypertension and type 2 diabetes in primary care: a qualitative study.

    Science.gov (United States)

    Kassavou, Aikaterini; Sutton, Stephen

    2017-08-11

    This study explored the reasons for patients' non-adherence to cardiometabolic medications, and tested the acceptability of the interactive voice response (IVR) as a way to address these reasons, and support patients, between primary care consultations. The study included face-to-face interviews with 19 patients with hypertension and/or type 2 diabetes mellitus, selected from primary care databases, and presumed to be non-adherent. Thirteen of these patients pretested elements of the IVR intervention few months later, using a think-aloud protocol. Five practice nurses were interviewed. Data were analysed using multiperspective, and longitudinalthematic analysis. Negative beliefs about taking medications, the complexity of prescribed medication regimens, and the limited ability to cope with the underlying affective state, within challenging contexts, were mentioned as important reasons for non-adherence. Nurses reported time constraints to address each patient's different reasons for non-adherence, and limited efficacy to support patients, between primary care consultations. Patients gave positive experiential feedback about the IVR messages as a way to support them take their medicines, and provided recommendations for intervention content and delivery mode. Specifically, they liked the voice delivering the messages and the voice recognition software. For intervention content, they preferred messages that were tailored, and included messages with 'information about health consequences', 'action plans', or simple reminders for performing the behaviour. Patients with hypertension and/or type 2 diabetes, and practice nurses, suggested messages tailored to each patient's reasons for non-adherence. Participants recommended IVR as an acceptable platform to support adherence to cardiometabolic medications between primary care consultations. Future studies could usefully test the acceptability, and feasibility, of tailored IVR interventions to support medication adherence

  9. The recognition of work

    OpenAIRE

    Nierling, Linda

    2007-01-01

    The following article argues that recognition structures in work relations differ significantly in the sphere of paid work in contrast to unpaid work in private spheres. According to the systematic approach on recognition of Axel Honneth three different levels of recognition are identified: the interpersonal recognition, organisational recognition and societal recognition. Based on this framework it can be stated that recognition structures in the sphere of paid work and in private spheres di...

  10. Space Software

    Science.gov (United States)

    1990-01-01

    Xontech, Inc.'s software package, XonVu, simulates the missions of Voyager 1 at Jupiter and Saturn, Voyager 2 at Jupiter, Saturn, Uranus and Neptune, and Giotto in close encounter with Comet Halley. With the program, the user can generate scenes of the planets, moons, stars or Halley's nucleus and tail as seen by Giotto, all graphically reproduced with high accuracy in wireframe representation. Program can be used on a wide range of computers, including PCs. User friendly and interactive, with many options, XonVu can be used by a space novice or a professional astronomer. With a companion user's manual, it sells for $79.

  11. Software architecture

    CERN Document Server

    Vogel, Oliver; Chughtai, Arif

    2011-01-01

    As a software architect you work in a wide-ranging and dynamic environment. You have to understand the needs of your customer, design architectures that satisfy both functional and non-functional requirements, and lead development teams in implementing the architecture. And it is an environment that is constantly changing: trends such as cloud computing, service orientation, and model-driven procedures open up new architectural possibilities. This book will help you to develop a holistic architectural awareness and knowledge base that extends beyond concrete methods, techniques, and technologi

  12. Effect of adenoid hypertrophy on the voice and laryngeal mucosa in children.

    Science.gov (United States)

    Gomaa, Mohammed A; Mohammed, Haitham M; Abdalla, Adel A; Nasr, Dalia M

    2013-12-01

    The adenoids, or pharyngeal tonsils, are lymphatic tissue localized at the mucous layer of the roof and posterior wall of nasopharynx. Dysphonia defined as perceptual audible change of a patient's habitual voice as self judged or judged by his or her listeners. The diagnosis of dysphonia relies on clinical judgment based on phoniatric symptoms, auditory perceptual assessment of voice (APA) and full laryngeal examination. Our study was conducted to evaluate the effect of adenoid hypertrophy on voice and laryngeal mucosa. The study sample composed of sixty children, forty of them had adenoid hypertrophy (patient's group) and twenty healthy children (control group). Patient's group composed of 17 boys (42.5%) and 23 girls (57.5%), while control group consists of 8 males (40%) and 12 females (60%). All patients and control group subjected to history taking, clinical examination, lateral soft tissue X-ray on the nasopharynx, APA based on the modified GRBAS scale and full laryngeal examination. The data are collected and analyzed statistically by using software SPSS. Our results showed that there is a significant association between adenoid hypertrophy and, degree of dysphonia, leaky voice, pitch of voice and laryngeal lesion. Adenoid hypertrophy did not associate with loudness of voice, as well as character (irregular, breathy and strained). Laryngeal lesions were detected in thirteen children from patient group (32.5%): nodules (n = 6), thickening (n = 5), congestion (n = 2), while one child only out of 20 children of the control group had congestion (5.0%). Our results showed the importance of the assessment of voice and laryngeal examination in patients with adenoid hypertrophy, also treating the minimal mucosal lesions that results from adenoid hypertrophy should be taken in consideration. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

  13. Prosody recognition in adults with high-functioning autism spectrum disorders: from psychoacoustics to cognition.

    Science.gov (United States)

    Globerson, Eitan; Amir, Noam; Kishon-Rabin, Liat; Golan, Ofer

    2015-04-01

    Prosody is an important tool of human communication, carrying both affective and pragmatic messages in speech. Prosody recognition relies on processing of acoustic cues, such as the fundamental frequency of the voice signal, and their interpretation according to acquired socioemotional scripts. Individuals with autism spectrum disorders (ASD) show deficiencies in affective prosody recognition. These deficiencies have been mostly associated with general difficulties in emotion recognition. The current study explored an additional association between affective prosody recognition in ASD and auditory perceptual abilities. Twenty high-functioning male adults with ASD and 32 typically developing male adults, matched on age and verbal abilities undertook a battery of auditory tasks. These included affective and pragmatic prosody recognition tasks, two psychoacoustic tasks (pitch direction recognition and pitch discrimination), and a facial emotion recognition task, representing nonvocal emotion recognition. Compared with controls, the ASD group demonstrated poorer performance on both vocal and facial emotion recognition, but not on pragmatic prosody recognition or on any of the psychoacoustic tasks. Both groups showed strong associations between psychoacoustic abilities and prosody recognition, both affective and pragmatic, although these were more pronounced in the ASD group. Facial emotion recognition predicted vocal emotion recognition in the ASD group only. These findings suggest that auditory perceptual abilities, alongside general emotion recognition abilities, play a significant role in affective prosody recognition in ASD.

  14. The design of a digital voice data compression technique for orbiter voice channels

    Science.gov (United States)

    1975-01-01

    Voice bandwidth compression techniques were investigated to anticipate link margin difficulties in the shuttle S-band communication system. It was felt that by reducing the data rate on each voice channel from the baseline 24 (or 32) Kbps to 8 Kbps, additional margin could be obtained. The feasibility of such an alternate voice transmission system was studied. Several factors of prime importance that were addressed are: (1) achieving high quality voice at 8 Kbps; (2) performance in the presence of the anticipated shuttle cabin environmental noise; (3) performance in the presence of the anticipated channel error statistics; and (4) minimal increase in size, weight, and power over the current baseline voice processor.

  15. A case study of employee voice and organizational performance in the context of residential aged care

    Directory of Open Access Journals (Sweden)

    Azam Bazooband

    2016-06-01

    Full Text Available As the population of old people is on the great increase in countries, the necessity of aged-care centers is not deniable. Likewise the other health care services, the quality of services provided by care givers for old people in residential aged care centers is a significant and vital factor. The main objective of this paper is to conduct a descriptive research of correlation type on the impact of employee voice on organizational performance in four non-for-profit aged care centers, in Shiraz, Iran. Sample size was estimated 156 based on Morgan table. Data collection tool was employee voice standard questionnaire with 12 items at three dimensions and organizational performance questionnaire with 22 items at seven dimensions. Collected data was analyzed by SPSS statistical software version 22 and the correlation coefficient test (p = <0.05. The significance level was set at P < 0.05 and P < 0.001. Findings showed that there are positive and significant correlations between self-efficacy, encouragement and safety with organizational performance. Also there are positive and significant correlations between organizational performance dimensions including (reliability, attitude, job quality, innovation, co-operation, job quantity, personal learning and employee voice. Indeed, results showed that, when encouragement taken in to consideration employees may perceive a positive voice climate in an organization, and as a result, the other components of voice climate (safety and self-efficacy would be developed and the organizational performance will be increased.

  16. Acute effects of radioiodine therapy on the voice and larynx of basedow-Graves patients

    Energy Technology Data Exchange (ETDEWEB)

    Isolan-Cury, Roberta Werlang; Cury, Adriano Namo [Sao Paulo Santa Casa de Misericordia, SP (Brazil). Medical Science School (FCMSCSP); Monte, Osmar [Sao Paulo Santa Casa de Misericordia, SP (Brazil). Physiology Department; Silva, Marta Assumpcao de Andrada e [Sao Paulo Santa Casa de Misericordia, SP (Brazil). Medical Science School (FCMSCSP). Speech Therapy School; Duprat, Andre [Sao Paulo Santa Casa de Misericordia, SP (Brazil). Medical Science School (FCMSCSP). Otorhinolaryngology Department; Marone, Marilia [Nuclimagem - Irmanity of the Sao Paulo Santa Casa de Misericordia, SP (Brazil). Nuclear Medicine Unit; Almeida, Renata de; Iglesias, Alexandre [Sao Paulo Santa Casa de Misericordia, SP (Brazil). Medical Science School (FCMSCSP). Otorhinolaryngology Department. Endocrinology and Metabology Unit

    2008-07-01

    Graves's disease is the most common cause of hyperthyroidism. There are three current therapeutic options: anti-thyroid medication, surgery, and radioactive iodine (I 131). There are few data in the literature regarding the effects of radioiodine therapy on the larynx and voice. The aim of this study was: to assess the effect of radioiodine therapy on the voice of Basedow-Graves patients. Material and method: A prospective study was done. Following the diagnosis of Grave's disease, patients underwent investigation of their voice, measurement of maximum phonatory time (/a/) and the s/z ratio, fundamental frequency analysis (Praat software), laryngoscopy and (perceptive-auditory) analysis in three different conditions: pre-treatment, 4 days, and 20 days post-radioiodine therapy. Conditions are based on the inflammatory pattern of thyroid tissue (Jones et al. 1999). Results: No statistically significant differences were found in voice characteristics in these three conditions. Conclusion: Radioiodine therapy does not affect voice quality. (author)

  17. Impact of fetal-neonatal iron deficiency on recognition memory at two months of age

    Science.gov (United States)

    Geng, Fengji; Mai, Xiaoqin; Zhan, Jianying; Xu, Lin; Zhao, Zhengyan; Georgieff, Michael; Shao, Jie; Lozoff, Betsy

    2015-01-01

    Objective To assess the effects of fetal-neonatal iron deficiency on recognition memory in early infancy. Perinatal iron deficiency delays or disrupts hippocampal development in animal models and thus may impair related neural functions in human infants, such as recognition memory. Study design Event-related potentials were used in an auditory recognition memory task to compare 2-month-old Chinese infants with iron sufficiency or deficiency at birth. Fetal- neonatal iron deficiency was defined two ways: high zinc protoporphyrin/heme ratio (ZPP/H > 118 μmol/mol) or low serum ferritin (iron deficiency as defined by high cord ZPP/H but not low ferritin. Comparing 35 infants with iron deficiency (ZPP/H > 118 μmol/mol) to 92 with lower ZPP/H (iron-sufficient), only infants with iron sufficiency showed larger LSW amplitude for stranger’s voice than mother’s voice in frontal-central and parietal-occipital locations, indicating the recognition of mother’s voice. Conclusions Infants with iron sufficiency showed electrophysiological evidence of recognizing their mother’s voice, whereas infants with fetal-neonatal iron deficiency did not. Their poorer auditory recognition memory at two months of age is consistent with effects of fetal-neonatal iron deficiency on the developing hippocampus. PMID:26382625

  18. Innovations in Computing Sciences and Software Engineering

    CERN Document Server

    Sobh, Tarek

    2010-01-01

    "Innovations in Computing Sciences and Software Engineering" includes a set of rigorously reviewed world-class manuscripts addressing and detailing state-of-the-art research projects in the areas of Computer Science, Software Engineering, Computer Engineering, and Systems Engineering and Sciences. The topics covered include: Image and Pattern Recognition: Compression, Image processing, Signal Processing Architectures, Signal Processing for Communication, Signal Processing Implementation, Speech Compression, and Video Coding Architectures; Languages and Systems: Algorithms, Databases,

  19. Global Software Engineering: A Software Process Approach

    Science.gov (United States)

    Richardson, Ita; Casey, Valentine; Burton, John; McCaffery, Fergal

    Our research has shown that many companies are struggling with the successful implementation of global software engineering, due to temporal, cultural and geographical distance, which causes a range of factors to come into play. For example, cultural, project managementproject management and communication difficulties continually cause problems for software engineers and project managers. While the implementation of efficient software processes can be used to improve the quality of the software product, published software process models do not cater explicitly for the recent growth in global software engineering. Our thesis is that global software engineering factors should be included in software process models to ensure their continued usefulness in global organisations. Based on extensive global software engineering research, we have developed a software process, Global Teaming, which includes specific practices and sub-practices. The purpose is to ensure that requirements for successful global software engineering are stipulated so that organisations can ensure successful implementation of global software engineering.

  20. Muscular tension and body posture in relation to voice handicap and voice quality in teachers with persistent voice complaints.

    Science.gov (United States)

    Kooijman, P G C; de Jong, F I C R S; Oudes, M J; Huinck, W; van Acht, H; Graamans, K

    2005-01-01

    The aim of this study was to investigate the relationship between extrinsic laryngeal muscular hypertonicity and deviant body posture on the one hand and voice handicap and voice quality on the other hand in teachers with persistent voice complaints and a history of voice-related absenteeism. The study group consisted of 25 female teachers. A voice therapist assessed extrinsic laryngeal muscular tension and a physical therapist assessed body posture. The assessed parameters were clustered in categories. The parameters in the different categories represent the same function. Further a tension/posture index was created, which is the summation of the different parameters. The different parameters and the index were related to the Voice Handicap Index (VHI) and the Dysphonia Severity Index (DSI). The scores of the VHI and the individual parameters differ significantly except for the posterior weight bearing and tension of the sternocleidomastoid muscle. There was also a significant difference between the individual parameters and the DSI, except for tension of the cricothyroid muscle and posterior weight bearing. The score of the tension/posture index correlates significantly with both the VHI and the DSI. In a linear regression analysis, the combination of hypertonicity of the sternocleidomastoid, the geniohyoid muscles and posterior weight bearing is the most important predictor for a high voice handicap. The combination of hypertonicity of the geniohyoid muscle, posterior weight bearing, high position of the hyoid bone, hypertonicity of the cricothyroid muscle and anteroposition of the head is the most important predictor for a low DSI score. The results of this study show the higher the score of the index, the higher the score of the voice handicap and the worse the voice quality is. Moreover, the results are indicative for the importance of assessment of muscular tension and body posture in the diagnosis of voice disorders.

  1. The relationship between VHI scores and specific acoustic measures of mildly disordered voice production.

    Science.gov (United States)

    Wheeler, Karen M; Collins, Savita P; Sapienza, Christine M

    2006-06-01

    This study was designed to examine the relationship between the Voice Handicap Index (VHI) and acoustic measures of voice samples common in clinical practice. Fifty participants, 38 women and 12 men, ranging in age from 19 to 80 years, with a mean age of 49 years, served as participants. Of these 50 participants, 17 participants could be included in the acoustic analysis of voice based on measures of error calculated with the TF32 software. All participants completed the VHI and provided voice samples including three trials of the sustained vowel /A/ at a comfortable loudness level as well as a connected speech sample consisting of the Zoo Passage. Acoustic measures were made with TF32 and Cool Edit software and included fundamental frequency, jitter %, shimmer %, signal-to-noise ratio, mean root-mean-square intensity, fundamental frequency standard deviation, aphonic periods, and breath groups. Results indicate that these measures were not predictive of overall VHI score, and no cohesive or predictable pattern was identified when comparing individual measures with overall VHI or with each subscale item. Likely contributions to this lack of correlation and subsequent clinical implications are discussed, as well as the direction for further research.

  2. Using the Voice to Design Ceramics

    DEFF Research Database (Denmark)

    Hansen, Flemming Tvede; Jensen, Kristoffer

    2011-01-01

    SoundShaping, a system to create ceramics from the human voice. Based on a generic audio feature extraction system, and the principal component analysis to ensure that the pertinent information in the voice is used, a 3D shape is created using simple geometric rules. This shape is output to a 3D printer...

  3. The Voice of Conscience in Rousseau's Emile

    Science.gov (United States)

    Kodelja, Zdenko

    2015-01-01

    According to Rousseau, conscience and conscience alone can elevate human beings to a level above that of animals. It is conscience, understood as infallible judge of good and bad, which makes man like God. Conscience itself is, in this context, understood as divine, as an "immortal and celestial voice". Therefore, if the voice of…

  4. Why Is My Voice Changing? (For Teens)

    Science.gov (United States)

    ... enter puberty earlier or later than others. How Deep Will My Voice Get? How deep a guy's voice gets depends on his genes: ... of Use Notice of Nondiscrimination Visit the Nemours Web site. Note: All information on TeensHealth® is for ...

  5. Gender in voice perception in autism.

    NARCIS (Netherlands)

    Groen, W.B.; Orsouw, L van; Zwiers, M.; Swinkels, S.; Gaag, R.J. van der; Buitelaar, J.K.

    2008-01-01

    Deficits in the perception of social stimuli may contribute to the characteristic impairments in social interaction in high functioning autism (HFA). Although the cortical processing of voice is abnormal in HFA, it is unclear whether this gives rise to impairments in the perception of voice gender.

  6. Speaking with the voice of authority

    CERN Multimedia

    2002-01-01

    GPB Consulting has developed a scientific approach to voice coaching. A digital recording of the voice is sent to a lab in Switzerland and analyzed by a computer programme designed by a doctor of psychology and linguistics and a scientist at CERN (1 page).

  7. The Voice of Conscience in Rousseau's Emile

    Science.gov (United States)

    Kodelja, Zdenko

    2015-01-01

    According to Rousseau, conscience and conscience alone can elevate human beings to a level above that of animals. It is conscience, understood as infallible judge of good and bad, which makes man like God. Conscience itself is, in this context, understood as divine, as an "immortal and celestial voice". Therefore, if the voice of…

  8. Epidemiology of voice problems in Dutch teachers

    NARCIS (Netherlands)

    de Jong, F. I. C. R. S.; Kooijman, P. G. C.; Thomas, G.; Huinck, W. J.; Graamans, K.; Schutte, H. K.

    2006-01-01

    In order to assess voice complaints and absence from work due to voice problems among teachers of primary and secondary education, as well as among a control group, 2,117 questionnaires were analysed. The total group consisted of 1,878 teachers and 239 controls. Female teachers more frequently repor

  9. Voice Deviations and Coexisting Communication Disorders.

    Science.gov (United States)

    St. Louis, Kenneth O.; And Others

    1992-01-01

    This study examined the coexistence of other communicative disorders with voice disorders in about 3,400 children in grades 1-12 at 100 sites throughout the United States. The majority of voice-disordered children had coexisting articulation deviations and also differed from controls on two language measures and mean pure-tone hearing thresholds.…

  10. Using the Voice to Design Ceramics

    DEFF Research Database (Denmark)

    Hansen, Flemming Tvede; Jensen, Kristoffer

    2011-01-01

    SoundShaping, a system to create ceramics from the human voice. Based on a generic audio feature extraction system, and the principal component analysis to ensure that the pertinent information in the voice is used, a 3D shape is created using simple geometric rules. This shape is output to a 3D printer...

  11. Epidemiology of voice problems in Dutch teachers.

    NARCIS (Netherlands)

    Jong, F.I.C.R.S. de; Kooijman, P.G.C.; Thomas, G.; Huinck, W.J.; Graamans, K.; Schutte, H.K.

    2006-01-01

    In order to assess voice complaints and absence from work due to voice problems among teachers of primary and secondary education, as well as among a control group, 2,117 questionnaires were analysed. The total group consisted of 1,878 teachers and 239 controls. Female teachers more frequently repor

  12. Epidemiology of voice problems in Dutch teachers

    NARCIS (Netherlands)

    de Jong, F. I. C. R. S.; Kooijman, P. G. C.; Thomas, G.; Huinck, W. J.; Graamans, K.; Schutte, H. K.

    2006-01-01

    In order to assess voice complaints and absence from work due to voice problems among teachers of primary and secondary education, as well as among a control group, 2,117 questionnaires were analysed. The total group consisted of 1,878 teachers and 239 controls. Female teachers more frequently repor

  13. Predictors of Choral Directors' Voice Handicap

    Science.gov (United States)

    Schwartz, Sandra

    2013-01-01

    Vocal demands of teaching are considerable and these challenges are greater for choral directors who depend on the voice as a musical and instructive instrument. The purpose of this study was to (1) examine choral directors' vocal condition using a modified Voice Handicap Index (VHI), and (2) determine the extent to which the major variables…

  14. Gender in Voice Perception in Autism

    Science.gov (United States)

    Groen, Wouter B.; van Orsouw, Linda; Zwiers, Marcel; Swinkels, Sophie; van der Gaag, Rutger Jan; Buitelaar, Jan K.

    2008-01-01

    Deficits in the perception of social stimuli may contribute to the characteristic impairments in social interaction in high functioning autism (HFA). Although the cortical processing of voice is abnormal in HFA, it is unclear whether this gives rise to impairments in the perception of voice gender. About 20 children with HFA and 20 matched…

  15. Two Factors Related to Effective Voice Interpreting.

    Science.gov (United States)

    Hurwitz, T. Alan

    1986-01-01

    Thirty-two interpreters for the deaf were measured on accuracy and quality of voice interpreting of the same story in two different sign language types: Pidgin Signed English and American Sign Language. Results indicated that previous experience interpreting was significantly related to the effectiveness of voice interpreting both languages.…

  16. Student Voice and the Common Core

    Science.gov (United States)

    Yonezawa, Susan

    2015-01-01

    Common Core proponents and detractors debate its merits, but students have voiced their opinion for years. Using a decade's worth of data gathered through design-research on youth voice, this article discusses what high school students have long described as more ideal learning environments for themselves--and how remarkably similar the Common…

  17. Student Voice and the Common Core

    Science.gov (United States)

    Yonezawa, Susan

    2015-01-01

    Common Core proponents and detractors debate its merits, but students have voiced their opinion for years. Using a decade's worth of data gathered through design-research on youth voice, this article discusses what high school students have long described as more ideal learning environments for themselves--and how remarkably similar the Common…

  18. Objects Control through Speech Recognition Using LabVIEW

    Directory of Open Access Journals (Sweden)

    Ankush Sharma

    2013-01-01

    Full Text Available Speech is the natural form of human communication and the speech processing is the one of the most stimulating area of the signal processing. Speech recognition technology has made it possible for computer to follow the human voice command and understand the human languages. The objects (LED, Toggle switch etc. control through human speech is designed in this paper. By combine the virtual instrumentation technology and speech recognition techniques. And also provided password authentication. This can be done with the help of LabVIEW programming concepts. The microphone is using to take voice commands from Human. This microphone signals interface with LabVIEW code. The LabVIEW code will generate appropriate control signal to control the objects. The entire work done on the LabVIEW platform.

  19. Voices from Around the Globe

    Directory of Open Access Journals (Sweden)

    Birgit Schreiber

    2017-07-01

    Full Text Available JSAA has been seeking to provide an opportunity for Student Affairs professionals and higher education scholars from around the globe to share their research and experiences of student services and student affairs programmes from their respective regional and institutional contexts. This has been given a specific platform with the guest-edited issue “Voices from Around the Globe” which is the result of a collaboration with the International Association of Student Affairs and Services (IASAS, and particularly with the guest editors, Kathleen Callahan and Chinedu Mba.

  20. Improving Quality of Voice Conversion Systems

    Science.gov (United States)

    Farhid, M.; Tinati, M. A.

    New improvement scheme for voice conversion are proposed in this paper. We take Human factor cepstral coefficients (HFCC), a modification of MFCC that uses the known relationship between center frequency and critical bandwidth from human psychoacoustics to decouple filter bandwidth from filter spacing, as the basic feature. We propose U/V (Unvoiced/Voiced) decision rule such that two sets of codebooks are used to capture the difference between unvoiced and voiced segments of the source speaker. Moreover, we apply three schemes to refine the synthesized voice, including pitch refinement, energy equalization, and frame concatenation. The acceptable performance of the voice conversion system can be verified through ABX listening test and MOS grad.

  1. Romantic Voice in Three Contemporary Ghazals

    Directory of Open Access Journals (Sweden)

    فرّخ لطیف نژاد

    2014-01-01

    Full Text Available In linguistics, the relation between syntax and thought is expressed by verbs and their relationship with subjects, objects and predicates. This link, in turn, creates grammatical voices such as active, passive and so on. Grammatical voice indicates the writer's attitudes towards and viewpoints on a subject and reflects his/her mental and spiritual state. Grammatical voice can be employed to compare the moods of different poets. In this article, we seek to examine and compare the grammatical voice in three ghazals by Ebtehaj, Naderpour and Farrokhzad and relate it to the School Romanticism using statistical analysis. These ghazals are an imitation of one of Sa’di's poems. Results indicate that in the chosen poem the active voice is used more because of the conversation taking place between the lover and the beloved. Farrokhzad has used newer language strategies in her poem as compared with the other two.

  2. Voice Quality Estimation in Wireless Networks

    Directory of Open Access Journals (Sweden)

    Petr Zach

    2015-01-01

    Full Text Available This article deals with the impact of Wireless (Wi-Fi networks on the perceived quality of voice services. The Quality of Service (QoS metrics must be monitored in the computer network during the voice data transmission to ensure proper voice service quality the end-user has paid for, especially in the wireless networks. In addition to the QoS, research area called Quality of Experience (QoE provides metrics and methods for quality evaluation from the end-user’s perspective. This article focuses on a QoE estimation of Voice over IP (VoIP calls in the wireless networks using network simulator. Results contribute to voice quality estimation based on characteristics of the wireless network and location of a wireless client.

  3. Voice pedagogy-what do we need?

    Science.gov (United States)

    Gill, Brian P; Herbst, Christian T

    2016-12-01

    The final keynote panel of the 10th Pan-European Voice Conference (PEVOC) was concerned with the topic 'Voice pedagogy-what do we need?' In this communication the panel discussion is summarized, and the authors provide a deepening discussion on one of the key questions, addressing the roles and tasks of people working with voice students. In particular, a distinction is made between (1) voice building (derived from the German term 'Stimmbildung'), primarily comprising the functional and physiological aspects of singing; (2) coaching, mostly concerned with performance skills; and (3) singing voice rehabilitation. Both public and private educators are encouraged to apply this distinction to their curricula, in order to arrive at more efficient singing teaching and to reduce the risk of vocal injury to the singers concerned.

  4. Recognition of handprinted characters for automated cartography A progress report

    Science.gov (United States)

    Lybanon, M.; Brown, R. M.; Gronmeyer, L. K.

    1980-01-01

    A research program for developing handwritten character recognition techniques is reported. The generation of cartographic/hydrographic manuscripts is overviewed. The performance of hardware/software systems is discussed, along with future research problem areas and planned approaches.

  5. Reliable jitter and shimmer measurements in voice clinics: the relevance of vowel, gender, vocal intensity, and fundamental frequency effects in a typical clinical task.

    Science.gov (United States)

    Brockmann, Meike; Drinnan, Michael J; Storck, Claudio; Carding, Paul N

    2011-01-01

    The aims of this study were to examine vowel and gender effects on jitter and shimmer in a typical clinical voice task while correcting for the confounding effects of voice sound pressure level (SPL) and fundamental frequency (F(0)). Furthermore the relative effect sizes of vowel, gender, voice SPL, and F(0) were assessed, and recommendations for clinical measurements were derived. With this cross-sectional single cohort study, 57 healthy adults (28 women, 29 men) aged 20-40 years were investigated. Three phonations of /a/, /o/, and /i/ at "normal" voice loudness were analyzed using Praat (software). The effects of vowel, gender, voice SPL, and F(0) on jitter and shimmer were assessed using descriptive and inferential (analysis of covariance) statistics. The effect sizes were determined with the eta-squared statistic. Vowels, gender, voice SPL, and F(0), each had significant effects either on jitter or on shimmer, or both. Voice SPL was the most important factor, whereas vowel, gender, and F(0) effects were comparatively small. Because men had systematically higher voice SPL, the gender effects on jitter and shimmer were smaller when correcting for SPL and F(0). Surprisingly, in clinical assessments, voice SPL has the single biggest impact on jitter and shimmer. Vowel and gender effects were clinically important, whereas fundamental frequency had a relatively small influence. Phonations at a predefined voice SPL (80 dB minimum) and vowel (/a/) would enhance measurement reliability. Furthermore, gender-specific thresholds applying these guidelines should be established. However, the efficiency of these measures should be verified and tested with patients. Copyright © 2011 The Voice Foundation. All rights reserved.

  6. Automatic stereoscopic system for person recognition

    Science.gov (United States)

    Murynin, Alexander B.; Matveev, Ivan A.; Kuznetsov, Victor D.

    1999-06-01

    A biometric access control system based on identification of human face is presented. The system developed performs remote measurements of the necessary face features. Two different scenarios of the system behavior are implemented. The first one assumes the verification of personal data entered by visitor from console using keyboard or card reader. The system functions as an automatic checkpoint, that strictly controls access of different visitors. The other scenario makes it possible to identify visitors without any person identifier or pass. Only person biometrics are used to identify the visitor. The recognition system automatically finds necessary identification information preliminary stored in the database. Two laboratory models of recognition system were developed. The models are designed to use different information types and sources. In addition to stereoscopic images inputted to computer from cameras the models can use voice data and some person physical characteristics such as person's height, measured by imaging system.

  7. Social power and recognition of emotional prosody: High power is associated with lower recognition accuracy than low power.

    Science.gov (United States)

    Uskul, Ayse K; Paulmann, Silke; Weick, Mario

    2016-02-01

    Listeners have to pay close attention to a speaker's tone of voice (prosody) during daily conversations. This is particularly important when trying to infer the emotional state of the speaker. Although a growing body of research has explored how emotions are processed from speech in general, little is known about how psychosocial factors such as social power can shape the perception of vocal emotional attributes. Thus, the present studies explored how social power affects emotional prosody recognition. In a correlational study (Study 1) and an experimental study (Study 2), we show that high power is associated with lower accuracy in emotional prosody recognition than low power. These results, for the first time, suggest that individuals experiencing high or low power perceive emotional tone of voice differently.

  8. New Public Management, Care and Struggles about Recognition

    DEFF Research Database (Denmark)

    Dahl, Hanne Marlene

    2009-01-01

    New Public Management (NPM) is usually perceived as a homogeneous discourse. However, when we examine it by looking at micro-politics in municipalities and understand its consequences drawing on the voices of home helpers, the picture is more complex and ambiguous. NPM is seen as disciplining paid...... of resistance. The analysis applies feminist theories of recognition and care, and its findings are based on focus group interviews and feminist discourse analysis...

  9. Free Software for Disorders of Human Communication

    Directory of Open Access Journals (Sweden)

    William Ricardo Rodríguez Dueñas

    2015-05-01

    Full Text Available Introduction: New technologies are increasingly used by the health sector for its implementation in therapeutic interventions. However, in the case of speech therapists, there are many unknown free software-based tools which could support their daily work. This paper summarizes fourteen free software-based tools that can support interventions in early stimulation, assessment and control of voice and speech, several resources for augmentative and alternative communication and tools that facilitate access to the computer. Materials and methods: The information presented here is the result of a general review of software-based tools designed to treat human communication disorders. Criteria for inclusion and exclusion were established to select tools and these were installed and tested. Results: 22 tools were found and 14 were selected and classified in these categories: Early stimulation and capture attention, acoustic signal processing of voice, speech processing, Augmentative and Alternative Communication and Other; the latter includes tools for access to the computer without the need for advanced computer skills. Discussion: The set of tools discussed in this paper provides free computer-based tools to therapists in order to help their interventions, additionally, promotes the improvement of computer skills so necessary in today’s society of professionals.

  10. SOFTWARE METRICS VALIDATION METHODOLOGIES IN SOFTWARE ENGINEERING

    Directory of Open Access Journals (Sweden)

    K.P. Srinivasan

    2014-12-01

    Full Text Available In the software measurement validations, assessing the validation of software metrics in software engineering is a very difficult task due to lack of theoretical methodology and empirical methodology [41, 44, 45]. During recent years, there have been a number of researchers addressing the issue of validating software metrics. At present, software metrics are validated theoretically using properties of measures. Further, software measurement plays an important role in understanding and controlling software development practices and products. The major requirement in software measurement is that the measures must represent accurately those attributes they purport to quantify and validation is critical to the success of software measurement. Normally, validation is a collection of analysis and testing activities across the full life cycle and complements the efforts of other quality engineering functions and validation is a critical task in any engineering project. Further, validation objective is to discover defects in a system and assess whether or not the system is useful and usable in operational situation. In the case of software engineering, validation is one of the software engineering disciplines that help build quality into software. The major objective of software validation process is to determine that the software performs its intended functions correctly and provides information about its quality and reliability. This paper discusses the validation methodology, techniques and different properties of measures that are used for software metrics validation. In most cases, theoretical and empirical validations are conducted for software metrics validations in software engineering [1-50].

  11. Text-Independent Speaker Recognition for Low SNR Environments with Encryption

    CERN Document Server

    Chadha, Aman; Roja, M Mani; 10.5120/3864-5394

    2011-01-01

    Recognition systems are commonly designed to authenticate users at the access control levels of a system. A number of voice recognition methods have been developed using a pitch estimation process which are very vulnerable in low Signal to Noise Ratio (SNR) environments thus, these programs fail to provide the desired level of accuracy and robustness. Also, most text independent speaker recognition programs are incapable of coping with unauthorized attempts to gain access by tampering with the samples or reference database. The proposed text-independent voice recognition system makes use of multilevel cryptography to preserve data integrity while in transit or storage. Encryption and decryption follow a transform based approach layered with pseudorandom noise addition whereas for pitch detection, a modified version of the autocorrelation pitch extraction algorithm is used. The experimental results show that the proposed algorithm can decrypt the signal under test with exponentially reducing Mean Square Error ...

  12. Voice activity detection based on deep neural networks and Viterbi

    Science.gov (United States)

    Bai, Liang; Zhang, Zhen; Hu, Jun

    2017-09-01

    Voice Activity Detection (VAD) is important in speech processing. In the applications, the systems usually need to separate speech/non-speech parts, so that only the speech part can be dealt with. How to improve the performances of VAD in different noisy environments is an important issue in speech processing. Deep Neural network, which proves its efficiency in speech recognition, has been widely used in recent years. This paper studies the present typical VAD algorithms, and presents a new VAD algorithm based on deep neural networks and Viterbi algorithm. The result demonstrates the effectiveness of the deep neural network with Viterbi used in VAD. In addition, it shows the flexibility and the real-time performance of the algorithms.

  13. Auditory gist: recognition of very short sounds from timbre cues.

    Science.gov (United States)

    Suied, Clara; Agus, Trevor R; Thorpe, Simon J; Mesgarani, Nima; Pressnitzer, Daniel

    2014-03-01

    Sounds such as the voice or musical instruments can be recognized on the basis of timbre alone. Here, sound recognition was investigated with severely reduced timbre cues. Short snippets of naturally recorded sounds were extracted from a large corpus. Listeners were asked to report a target category (e.g., sung voices) among other sounds (e.g., musical instruments). All sound categories covered the same pitch range, so the task had to be solved on timbre cues alone. The minimum duration for which performance was above chance was found to be short, on the order of a few milliseconds, with the best performance for voice targets. Performance was independent of pitch and was maintained when stimuli contained less than a full waveform cycle. Recognition was not generally better when the sound snippets were time-aligned with the sound onset compared to when they were extracted with a random starting time. Finally, performance did not depend on feedback or training, suggesting that the cues used by listeners in the artificial gating task were similar to those relevant for longer, more familiar sounds. The results show that timbre cues for sound recognition are available at a variety of time scales, including very short ones.

  14. Current trends in small vocabulary speech recognition for equipment control

    Science.gov (United States)

    Doukas, Nikolaos; Bardis, Nikolaos G.

    2017-09-01

    Speech recognition systems allow human - machine communication to acquire an intuitive nature that approaches the simplicity of inter - human communication. Small vocabulary speech recognition is a subset of the overall speech recognition problem, where only a small number of words need to be recognized. Speaker independent small vocabulary recognition can find significant applications in field equipment used by military personnel. Such equipment may typically be controlled by a small number of commands that need to be given quickly and accurately, under conditions where delicate manual operations are difficult to achieve. This type of application could hence significantly benefit by the use of robust voice operated control components, as they would facilitate the interaction with their users and render it much more reliable in times of crisis. This paper presents current challenges involved in attaining efficient and robust small vocabulary speech recognition. These challenges concern feature selection, classification techniques, speaker diversity and noise effects. A state machine approach is presented that facilitates the voice guidance of different equipment in a variety of situations.

  15. Sandia software guidelines: Software quality planning

    Energy Technology Data Exchange (ETDEWEB)

    1987-08-01

    This volume is one in a series of Sandia Software Guidelines intended for use in producing quality software within Sandia National Laboratories. In consonance with the IEEE Standard for Software Quality Assurance Plans, this volume identifies procedures to follow in producing a Software Quality Assurance Plan for an organization or a project, and provides an example project SQA plan. 2 figs., 4 tabs.

  16. Introduction: Cybersecurity and Software Assurance Minitrack

    Energy Technology Data Exchange (ETDEWEB)

    Burns, Luanne [JHU Applied Physics Laboratory; George, Richard [JHU Applied Physics Laboratory; Linger, Richard C [ORNL

    2015-01-01

    Modern society is dependent on software systems of remarkable scope and complexity. Yet methods for assuring their security and functionality have not kept pace. The result is persistent compromises and failures despite best efforts. Cybersecurity methods must work together for situational awareness, attack prevention and detection, threat attribution, minimization of consequences, and attack recovery. Because defective software cannot be secure, assurance technologies must play a central role in cybersecurity approaches. There is increasing recognition of the need for rigorous methods for cybersecurity and software assurance. The goal of this minitrack is to develop science foundations, technologies, and practices that can improve the security and dependability of complex systems.

  17. Sonorous Voice and Feminist Teaching: Lessons from Cavarero

    Science.gov (United States)

    Forrest, Michelle

    2015-01-01

    I claim that Adriana Cavarero's concept of sonorous voice is significant in feminist teaching because, as she argues, dominant concepts of voice refer to voice in semantic terms thereby discounting voice in sonorous terms. This process of "devocalization", spanning the history of Western philosophy, devalues the uniqueness embodied in…

  18. 14 CFR 23.1457 - Cockpit voice recorders.

    Science.gov (United States)

    2010-01-01

    ... 14 Aeronautics and Space 1 2010-01-01 2010-01-01 false Cockpit voice recorders. 23.1457 Section 23... Equipment § 23.1457 Cockpit voice recorders. (a) Each cockpit voice recorder required by the operating rules... cockpit-mounted area microphone, located in the best position for recording voice...

  19. Parent Trigger Laws and the Promise of Parental Voice

    Science.gov (United States)

    Smith, William C.; Rowland, Julie

    2014-01-01

    Parent trigger laws have gained momentum nationally under the premise that they will increase local authority by amplifying parental voice in the decision to turn around "failing" schools. Using Hirschman's exit, voice, and loyalty framework we create two conceptual models of voice and evaluate the promise of voice in California, home of…

  20. Sonorous Voice and Feminist Teaching: Lessons from Cavarero

    Science.gov (United States)

    Forrest, Michelle

    2015-01-01

    I claim that Adriana Cavarero's concept of sonorous voice is significant in feminist teaching because, as she argues, dominant concepts of voice refer to voice in semantic terms thereby discounting voice in sonorous terms. This process of "devocalization", spanning the history of Western philosophy, devalues the uniqueness embodied in…