WorldWideScience

Sample records for voice recognition vr

  1. Voice Recognition: A New Assessment Tool?

    Science.gov (United States)

    Jones, Darla

    2005-01-01

    This article presents the results of a study conducted in Anchorage, Alaska, that evaluated the accuracy and efficiency of using voice recognition (VR) technology to collect oral reading fluency data for classroom-based assessments. The primary research question was as follows: Is voice recognition technology a valid and reliable alternative to…

  2. Neural mechanisms for voice recognition

    NARCIS (Netherlands)

    Andics, A.V.; McQueen, J.M.; Petersson, K.M.; Gal, V.; Rudas, G.; Vidnyanszky, Z.

    2010-01-01

    We investigated neural mechanisms that support voice recognition in a training paradigm with fMRI. The same listeners were trained on different weeks to categorize the mid-regions of voice-morph continua as an individual's voice. Stimuli implicitly defined a voice-acoustics space, and training expli

  3. FILTWAM and Voice Emotion Recognition

    NARCIS (Netherlands)

    Bahreini, Kiavash; Nadolski, Rob; Westera, Wim

    2014-01-01

    This paper introduces the voice emotion recognition part of our framework for improving learning through webcams and microphones (FILTWAM). This framework enables multimodal emotion recognition of learners during game-based learning. The main goal of this study is to validate the use of microphone d

  4. FILTWAM and Voice Emotion Recognition

    NARCIS (Netherlands)

    Bahreini, Kiavash; Nadolski, Rob; Westera, Wim

    2014-01-01

    This paper introduces the voice emotion recognition part of our framework for improving learning through webcams and microphones (FILTWAM). This framework enables multimodal emotion recognition of learners during game-based learning. The main goal of this study is to validate the use of microphone

  5. Voice congruency facilitates word recognition.

    Directory of Open Access Journals (Sweden)

    Sandra Campeanu

    Full Text Available Behavioral studies of spoken word memory have shown that context congruency facilitates both word and source recognition, though the level at which context exerts its influence remains equivocal. We measured event-related potentials (ERPs while participants performed both types of recognition task with words spoken in four voices. Two voice parameters (i.e., gender and accent varied between speakers, with the possibility that none, one or two of these parameters was congruent between study and test. Results indicated that reinstating the study voice at test facilitated both word and source recognition, compared to similar or no context congruency at test. Behavioral effects were paralleled by two ERP modulations. First, in the word recognition test, the left parietal old/new effect showed a positive deflection reflective of context congruency between study and test words. Namely, the same speaker condition provided the most positive deflection of all correctly identified old words. In the source recognition test, a right frontal positivity was found for the same speaker condition compared to the different speaker conditions, regardless of response success. Taken together, the results of this study suggest that the benefit of context congruency is reflected behaviorally and in ERP modulations traditionally associated with recognition memory.

  6. WHEEL CHAIR USING VOICE RECOGNITION

    OpenAIRE

    Manish Kumar Yadav*; Rajat Kumar; Santosh Yadav; Ravindra Prajapati; Prof. Kshirsagar

    2016-01-01

    The wide spread prevalence of lost limbs and sensing system is of major concern in present day due to wars, accident, age and health problems. This Omni-directional wheelchair was designed for the less able elderly to move more flexibly in narrow spaces, such as elevators or small aisle. The wheelchair is developed to help disabled patients by using speech recognition system to control the movement of wheelchair in different directions by using voice commands and also the simple movement of t...

  7. Human voice recognition depends on language ability.

    Science.gov (United States)

    Perrachione, Tyler K; Del Tufo, Stephanie N; Gabrieli, John D E

    2011-07-29

    The ability to recognize people by their voice is an important social behavior. Individuals differ in how they pronounce words, and listeners may take advantage of language-specific knowledge of speech phonology to facilitate recognizing voices. Impaired phonological processing is characteristic of dyslexia and thought to be a basis for difficulty in learning to read. We tested voice-recognition abilities of dyslexic and control listeners for voices speaking listeners' native language or an unfamiliar language. Individuals with dyslexia exhibited impaired voice-recognition abilities compared with controls only for voices speaking their native language. These results demonstrate the importance of linguistic representations for voice recognition. Humans appear to identify voices by making comparisons between talkers' pronunciations of words and listeners' stored abstract representations of the sounds in those words.

  8. Voice Recognition in Face-Blind Patients.

    Science.gov (United States)

    Liu, Ran R; Pancaroglu, Raika; Hills, Charlotte S; Duchaine, Brad; Barton, Jason J S

    2016-04-01

    Right or bilateral anterior temporal damage can impair face recognition, but whether this is an associative variant of prosopagnosia or part of a multimodal disorder of person recognition is an unsettled question, with implications for cognitive and neuroanatomic models of person recognition. We assessed voice perception and short-term recognition of recently heard voices in 10 subjects with impaired face recognition acquired after cerebral lesions. All 4 subjects with apperceptive prosopagnosia due to lesions limited to fusiform cortex had intact voice discrimination and recognition. One subject with bilateral fusiform and anterior temporal lesions had a combined apperceptive prosopagnosia and apperceptive phonagnosia, the first such described case. Deficits indicating a multimodal syndrome of person recognition were found only in 2 subjects with bilateral anterior temporal lesions. All 3 subjects with right anterior temporal lesions had normal voice perception and recognition, 2 of whom performed normally on perceptual discrimination of faces. This confirms that such lesions can cause a modality-specific associative prosopagnosia.

  9. Enhancing nursing practice by utilizing voice recognition for direct documentation.

    Science.gov (United States)

    Fratzke, Jason; Tucker, Sharon; Shedenhelm, Heidi; Arnold, Jackie; Belda, Tom; Petera, Michael

    2014-02-01

    Innovative strategies that preserve nursing time for direct patient care activities are needed. This study examined the utility, feasibility, and acceptability of voice recognition (VR) software to document nursing care and patient outcomes in an electronic health record in a simulated nursing care environment. A phase 1 trial included 5 iterative experiments with observations and nurse participant feedback to allow enhancements to the speech detection capabilities and refinement of the technology, software, and processes. Utility ratings improved over time; however, interference on nursing care remained a concern throughout. Nurse participants favored keyboard entry electronic health record, largely due to software and technical issues, but also relative to the culture shift the new technology brings to nursing practice. Successful adoption of VR technology by nursing will be dependent on receptiveness of the nurses and perceived benefits, timely access to education and training, and minimization of barriers to using the software.

  10. Voice recognition software for clinical use.

    Science.gov (United States)

    Korn, K

    1998-11-01

    The current generation voice recognition products truly offer the promise of voice recognition systems, that are financially and operationally acceptable for use in a health care facility. Although the initial capital outlay for the purchase of such equipment may be substantial, the long-term benefit is felt to outweigh the expense. The ability to utilize computer equipment for educational purposes and information management alone helps to rationalize the cost. In addition, it is important to remember that the Internet has become a substantial source of information which provides another functional use for this equipment. Although one can readily see the implication for such a program in clinical practice, other uses for the program should not be overlooked. Uses far beyond the writing of clinic notes and correspondence can be easily envisioned. Utilization of voice recognition software offers clinical practices the ability to produce quality printed records in a timely and cost-effective manner. After learning procedures for the selected product and appropriately formatting word processing software and printers, printed progress notes should be able to be produced in less time than traditional dictation and transcription methods. Although certain procedures and practices may need to be altered, or may preclude optimal utilization of this type of system, many advantages are apparent. It is recommended that facilities consider utilization of Voice Recognition products such as Dragon Systems Naturally Speaking Software, or at least consider a trial of this method with one of the limited-feature products, if current dictation practices are unsatisfactory or excessively costly. Free downloadable trial software or single user software can provide a reduced-cost method for trial evaluation of such products if a major commitment is not felt to be desired. A list of voice recognition software manufacturer web sites may be accessed through the following: http

  11. Implicit multisensory associations influence voice recognition.

    Directory of Open Access Journals (Sweden)

    Katharina von Kriegstein

    2006-10-01

    Full Text Available Natural objects provide partially redundant information to the brain through different sensory modalities. For example, voices and faces both give information about the speech content, age, and gender of a person. Thanks to this redundancy, multimodal recognition is fast, robust, and automatic. In unimodal perception, however, only part of the information about an object is available. Here, we addressed whether, even under conditions of unimodal sensory input, crossmodal neural circuits that have been shaped by previous associative learning become activated and underpin a performance benefit. We measured brain activity with functional magnetic resonance imaging before, while, and after participants learned to associate either sensory redundant stimuli, i.e. voices and faces, or arbitrary multimodal combinations, i.e. voices and written names, ring tones, and cell phones or brand names of these cell phones. After learning, participants were better at recognizing unimodal auditory voices that had been paired with faces than those paired with written names, and association of voices with faces resulted in an increased functional coupling between voice and face areas. No such effects were observed for ring tones that had been paired with cell phones or names. These findings demonstrate that brief exposure to ecologically valid and sensory redundant stimulus pairs, such as voices and faces, induces specific multisensory associations. Consistent with predictive coding theories, associative representations become thereafter available for unimodal perception and facilitate object recognition. These data suggest that for natural objects effective predictive signals can be generated across sensory systems and proceed by optimization of functional connectivity between specialized cortical sensory modules.

  12. Machine Recognition vs Human Recognition of Voices

    Science.gov (United States)

    2012-05-01

    recognized. The accuracy of speaker recognition for disyllables was 87%. For monosyllables, it was 81%, consonant- vowel excerpts were 63%, and... vowel excerpts were 56%. Thus, they demonstrated that the identification performance decreased as the number of phonemes decreased. In [2], the...will still sound natural and the performance of listeners could be tied directly to the degradation of particular frequencies. If the performance

  13. Improving Speaker Recognition by Biometric Voice Deconstruction

    Directory of Open Access Journals (Sweden)

    Luis Miguel eMazaira-Fernández

    2015-09-01

    Full Text Available Person identification, especially in critical environments, has always been a subject of great interest. However, it has gained a new dimension in a world threatened by a new kind of terrorism that uses social networks (e.g. YouTube to broadcast its message. In this new scenario, classical identification methods (such fingerprints or face recognition have been forcedly replaced by alternative biometric characteristics such as voice, as sometimes this is the only feature available. Through the present paper, a new methodology to characterize speakers will be shown. This methodology is benefiting from the advances achieved during the last years in understanding and modelling voice production. The paper hypothesizes that a gender dependent characterization of speakers combined with the use of a new set of biometric parameters extracted from the components resulting from the deconstruction of the voice into its glottal source and vocal tract estimates, will enhance recognition rates when compared to classical approaches. A general description about the main hypothesis and the methodology followed to extract gender-dependent extended biometric parameters are given. Experimental validation is carried out both on a highly controlled acoustic condition database, and on a mobile phone network recorded under non-controlled acoustic conditions.

  14. Improving Speaker Recognition by Biometric Voice Deconstruction.

    Science.gov (United States)

    Mazaira-Fernandez, Luis Miguel; Álvarez-Marquina, Agustín; Gómez-Vilda, Pedro

    2015-01-01

    Person identification, especially in critical environments, has always been a subject of great interest. However, it has gained a new dimension in a world threatened by a new kind of terrorism that uses social networks (e.g., YouTube) to broadcast its message. In this new scenario, classical identification methods (such as fingerprints or face recognition) have been forcedly replaced by alternative biometric characteristics such as voice, as sometimes this is the only feature available. The present study benefits from the advances achieved during last years in understanding and modeling voice production. The paper hypothesizes that a gender-dependent characterization of speakers combined with the use of a set of features derived from the components, resulting from the deconstruction of the voice into its glottal source and vocal tract estimates, will enhance recognition rates when compared to classical approaches. A general description about the main hypothesis and the methodology followed to extract the gender-dependent extended biometric parameters is given. Experimental validation is carried out both on a highly controlled acoustic condition database, and on a mobile phone network recorded under non-controlled acoustic conditions.

  15. Voice Recognition Technology: Has It Come of Age?

    Directory of Open Access Journals (Sweden)

    Joseph R. Zumalt

    2005-12-01

    Full Text Available Voice recognition software allows computer users to bypass their keyboards and use their voices to enter text. While the library literature is somewhat silent about voice recognition technology, the medical and legal communities have reported some success using it. Voice recognition software was tested for dictation accuracy and usability within an agriculture library at the University of Illinois. Dragon NaturallySpeaking 8.0 was found to be more accurate than speech recognition within Microsoft Office 2003. Helpful Web sites and a short history regarding this breakthrough technology are included.

  16. Temporal voice areas exist in autism spectrum disorder but are dysfunctional for voice identity recognition

    Science.gov (United States)

    Borowiak, Kamila; von Kriegstein, Katharina

    2016-01-01

    The ability to recognise the identity of others is a key requirement for successful communication. Brain regions that respond selectively to voices exist in humans from early infancy on. Currently, it is unclear whether dysfunction of these voice-sensitive regions can explain voice identity recognition impairments. Here, we used two independent functional magnetic resonance imaging studies to investigate voice processing in a population that has been reported to have no voice-sensitive regions: autism spectrum disorder (ASD). Our results refute the earlier report that individuals with ASD have no responses in voice-sensitive regions: Passive listening to vocal, compared to non-vocal, sounds elicited typical responses in voice-sensitive regions in the high-functioning ASD group and controls. In contrast, the ASD group had a dysfunction in voice-sensitive regions during voice identity but not speech recognition in the right posterior superior temporal sulcus/gyrus (STS/STG)—a region implicated in processing complex spectrotemporal voice features and unfamiliar voices. The right anterior STS/STG correlated with voice identity recognition performance in controls but not in the ASD group. The findings suggest that right STS/STG dysfunction is critical for explaining voice recognition impairments in high-functioning ASD and show that ASD is not characterised by a general lack of voice-sensitive responses. PMID:27369067

  17. Building Domain Specific Languages for Voice Recognition Applications

    Directory of Open Access Journals (Sweden)

    Cristian IONITA

    2008-01-01

    Full Text Available This paper presents a method of implementing the voice recognition for the control of software applications. The solutions proposed are based on transforming a subset of the natural language in commands recognized by the application using a formal language defined by the means of a context free grammar. At the end of the paper is presented the modality of integration of voice recognition and of voice synthesis for the Romanian language in Windows applications.

  18. A meta-analysis of in-vehicle and nomadic voice-recognition system interaction and driving performance.

    Science.gov (United States)

    Simmons, Sarah M; Caird, Jeff K; Steel, Piers

    2017-09-01

    Driver distraction is a growing and pervasive issue that requires multiple solutions. Voice-recognition (V-R) systems may decrease the visual-manual (V-M) demands of a wide range of in-vehicle system and smartphone interactions. However, the degree that V-R systems integrated into vehicles or available in mobile phone applications affect driver distraction is incompletely understood. A comprehensive meta-analysis of experimental studies was conducted to address this knowledge gap. To meet study inclusion criteria, drivers had to interact with a V-R system while driving and doing everyday V-R tasks such as dialing, initiating a call, texting, emailing, destination entry or music selection. Coded dependent variables included detection, reaction time, lateral position, speed and headway. Comparisons of V-R systems with baseline driving and/or a V-M condition were also coded. Of 817 identified citations, 43 studies involving 2000 drivers and 183 effect sizes (r) were analyzed in the meta-analysis. Compared to baseline, driving while interacting with a V-R system is associated with increases in reaction time and lane positioning, and decreases in detection. When V-M systems were compared to V-R systems, drivers had slightly better performance with the latter system on reaction time, lane positioning and headway. Although V-R systems have some driving performance advantages over V-M systems, they have a distraction cost relative to driving without any system at all. The pattern of results indicates that V-R systems impose moderate distraction costs on driving. In addition, drivers minimally engage in compensatory performance adjustments such as reducing speed and increasing headway while using V-R systems. Implications of the results for theory, design guidelines and future research are discussed. Copyright © 2017 Elsevier Ltd. All rights reserved.

  19. Familiar Person Recognition: Is Autonoetic Consciousness More Likely to Accompany Face Recognition Than Voice Recognition?

    Science.gov (United States)

    Barsics, Catherine; Brédart, Serge

    2010-11-01

    Autonoetic consciousness is a fundamental property of human memory, enabling us to experience mental time travel, to recollect past events with a feeling of self-involvement, and to project ourselves in the future. Autonoetic consciousness is a characteristic of episodic memory. By contrast, awareness of the past associated with a mere feeling of familiarity or knowing relies on noetic consciousness, depending on semantic memory integrity. Present research was aimed at evaluating whether conscious recollection of episodic memories is more likely to occur following the recognition of a familiar face than following the recognition of a familiar voice. Recall of semantic information (biographical information) was also assessed. Previous studies that investigated the recall of biographical information following person recognition used faces and voices of famous people as stimuli. In this study, the participants were presented with personally familiar people's voices and faces, thus avoiding the presence of identity cues in the spoken extracts and allowing a stricter control of frequency exposure with both types of stimuli (voices and faces). In the present study, the rate of retrieved episodic memories, associated with autonoetic awareness, was significantly higher from familiar faces than familiar voices even though the level of overall recognition was similar for both these stimuli domains. The same pattern was observed regarding semantic information retrieval. These results and their implications for current Interactive Activation and Competition person recognition models are discussed.

  20. Electrolarynx Voice Recognition Utilizing Pulse Coupled Neural Network

    Directory of Open Access Journals (Sweden)

    Fatchul Arifin

    2010-08-01

    Full Text Available The laryngectomies patient has no ability to speak normally because their vocal chords have been removed. The easiest option for the patient to speak again is by using electrolarynx speech. This tool is placed on the lower chin. Vibration of the neck while speaking is used to produce sound. Meanwhile, the technology of "voice recognition" has been growing very rapidly. It is expected that the technology of "voice recognition" can also be used by laryngectomies patients who use electrolarynx.This paper describes a system for electrolarynx speech recognition. Two main parts of the system are feature extraction and pattern recognition. The Pulse Coupled Neural Network – PCNN is used to extract the feature and characteristic of electrolarynx speech. Varying of β (one of PCNN parameter also was conducted. Multi layer perceptron is used to recognize the sound patterns. There are two kinds of recognition conducted in this paper: speech recognition and speaker recognition. The speech recognition recognizes specific speech from every people. Meanwhile, speaker recognition recognizes specific speech from specific person. The system ran well. The "electrolarynx speech recognition" has been tested by recognizing of “A” and "not A" voice. The results showed that the system had 94.4% validation. Meanwhile, the electrolarynx speaker recognition has been tested by recognizing of “saya” voice from some different speakers. The results showed that the system had 92.2% validation. Meanwhile, the best β parameter of PCNN for electrolarynx recognition is 3.

  1. An interactive VR system based on full-body tracking and gesture recognition

    Science.gov (United States)

    Zeng, Xia; Sang, Xinzhu; Chen, Duo; Wang, Peng; Guo, Nan; Yan, Binbin; Wang, Kuiru

    2016-10-01

    Most current virtual reality (VR) interactions are realized with the hand-held input device which leads to a low degree of presence. There is other solutions using sensors like Leap Motion to recognize the gestures of users in order to interact in a more natural way, but the navigation in these systems is still a problem, because they fail to map the actual walking to virtual walking only with a partial body of the user represented in the synthetic environment. Therefore, we propose a system in which users can walk around in the virtual environment as a humanoid model, selecting menu items and manipulating with the virtual objects using natural hand gestures. With a Kinect depth camera, the system tracks the joints of the user, mapping them to a full virtual body which follows the move of the tracked user. The movements of the feet can be detected to determine whether the user is in walking state, so that the walking of model in the virtual world can be activated and stopped by means of animation control in Unity engine. This method frees the hands of users comparing to traditional navigation way using hand-held device. We use the point cloud data getting from Kinect depth camera to recognize the gestures of users, such as swiping, pressing and manipulating virtual objects. Combining the full body tracking and gestures recognition using Kinect, we achieve our interactive VR system in Unity engine with a high degree of presence.

  2. Noise Robust Speech Recognition Applied to Voice-Driven Wheelchair

    Science.gov (United States)

    Sasou, Akira; Kojima, Hiroaki

    2009-12-01

    Conventional voice-driven wheelchairs usually employ headset microphones that are capable of achieving sufficient recognition accuracy, even in the presence of surrounding noise. However, such interfaces require users to wear sensors such as a headset microphone, which can be an impediment, especially for the hand disabled. Conversely, it is also well known that the speech recognition accuracy drastically degrades when the microphone is placed far from the user. In this paper, we develop a noise robust speech recognition system for a voice-driven wheelchair. This system can achieve almost the same recognition accuracy as the headset microphone without wearing sensors. We verified the effectiveness of our system in experiments in different environments, and confirmed that our system can achieve almost the same recognition accuracy as the headset microphone without wearing sensors.

  3. When the face fits: recognition of celebrities from matching and mismatching faces and voices.

    Science.gov (United States)

    Stevenage, Sarah V; Neil, Greg J; Hamlin, Iain

    2014-01-01

    The results of two experiments are presented in which participants engaged in a face-recognition or a voice-recognition task. The stimuli were face-voice pairs in which the face and voice were co-presented and were either "matched" (same person), "related" (two highly associated people), or "mismatched" (two unrelated people). Analysis in both experiments confirmed that accuracy and confidence in face recognition was consistently high regardless of the identity of the accompanying voice. However accuracy of voice recognition was increasingly affected as the relationship between voice and accompanying face declined. Moreover, when considering self-reported confidence in voice recognition, confidence remained high for correct responses despite the proportion of these responses declining across conditions. These results converged with existing evidence indicating the vulnerability of voice recognition as a relatively weak signaller of identity, and results are discussed in the context of a person-recognition framework.

  4. The Neuropsychology of Familiar Person Recognition from Face and Voice

    OpenAIRE

    2014-01-01

    Prosopagnosia has been considered for a long period of time as the most important and almost exclusive disorder in the recognition of familiar people. In recent years, however, this conviction has been undermined by the description of patients showing a concomitant defect in the recognition of familiar faces and voices as a consequence of lesions encroaching upon the right anterior temporal lobe (ATL). These new data have obliged researchers to reconsider on one hand the construct of ‘associa...

  5. Emotional Recognition in Autism Spectrum Conditions from Voices and Faces

    Science.gov (United States)

    Stewart, Mary E.; McAdam, Clair; Ota, Mitsuhiko; Peppe, Sue; Cleland, Joanne

    2013-01-01

    The present study reports on a new vocal emotion recognition task and assesses whether people with autism spectrum conditions (ASC) perform differently from typically developed individuals on tests of emotional identification from both the face and the voice. The new test of vocal emotion contained trials in which the vocal emotion of the sentence…

  6. Voice recognition software can be used for scientific articles

    DEFF Research Database (Denmark)

    Pommergaard, Hans-Christian; Huang, Chenxi; Burcharth, Jacob;

    2015-01-01

    INTRODUCTION: Dictation of scientific articles has been recognised as an efficient method for producing high-quality, first article drafts. However, standardised transcription service by a secretary may not be available for all researchers and voice recognition software (VRS) may therefore...

  7. Pegembangan Game dengan Menggunakan Teknologi Voice Recognition Berbasis Android

    Directory of Open Access Journals (Sweden)

    Franky Hadinata Marpaung

    2014-06-01

    Full Text Available The purpose of this research is to create a new kind of game by using technology that rarely used in current games. It is developed as an entertainment media and also a social media in which the users can play the games together via multiplayer mode. This research uses Scrum development method since it supports small scaled developer and it supports software increment along the development. Using this game application, the users can play and watch interesting animations by controlling it with their voice, listen the character imitating the users’ voice, play various mini games both in single player or multiplayer mode via Bluetooth connection. The conclusion is that game application of My Name is Dug use voice recognition and inter-devices connection as its main features. It also has various mini games that support both single player and multiplayer.

  8. Acoustic cues for the recognition of self-voice and other-voice

    Directory of Open Access Journals (Sweden)

    Mingdi eXu

    2013-10-01

    Full Text Available Self-recognition, being indispensable for successful social communication, has become a major focus in current social neuroscience. The physical aspects of the self are most typically manifested in the face and voice. Compared with the wealth of studies on self-face recognition, self-voice recognition (SVR has not gained much attention. Converging evidence has suggested that the fundamental frequency (F0 and formant structures serve as the key acoustic cues for other-voice recognition (OVR. However, little is known about which, and how, acoustic cues are utilized for SVR as opposed to OVR. To address this question, we independently manipulated the F0 and formant information of recorded voices and investigated their contributions to SVR and OVR. Japanese participants were presented with recorded vocal stimuli and were asked to identify the speaker—either themselves or one of their peers. Six groups of 5 peers of the same sex participated in the study. Under conditions where the formant information was fully preserved and where only the frequencies lower than the third formant (F3 were retained, accuracies of SVR deteriorated significantly with the modulation of the F0, and the results were comparable for OVR. By contrast, under a condition where only the frequencies higher than F3 were retained, the accuracy of SVR was significantly higher than that of OVR throughout the range of F0 modulations, and the F0 scarcely affected the accuracies of SVR and OVR. Our results indicate that while both F0 and formant information are involved in SVR, as well as in OVR, the advantage of SVR is manifested only when major formant information for speech intelligibility is absent. These findings imply the robustness of self-voice representation, possibly by virtue of auditory familiarity and other factors such as its association with motor/articulatory representation.

  9. Secure Recognition of Voice-Less Commands Using Videos

    Science.gov (United States)

    Yau, Wai Chee; Kumar, Dinesh Kant; Weghorn, Hans

    Interest in voice recognition technologies for internet applications is growing due to the flexibility of speech-based communication. The major drawback with the use of sound for internet access with computers is that the commands will be audible to other people in the vicinity. This paper examines a secure and voice-less method for recognition of speech-based commands using video without evaluating sound signals. The proposed approach represents mouth movements in the video data using 2D spatio-temporal templates (STT). Zernike moments (ZM) are computed from STT and fed into support vector machines (SVM) to be classified into one of the utterances. The experimental results demonstrate that the proposed technique produces a high accuracy of 98% in a phoneme classification task. The proposed technique is demonstrated to be invariant to global variations of illumination level. Such a system is useful for securely interpreting user commands for internet applications on mobile devices.

  10. The Neuropsychology of Familiar Person Recognition from Face and Voice

    Directory of Open Access Journals (Sweden)

    Guido Gainotti

    2014-05-01

    Full Text Available Prosopagnosia has been considered for a long period of time as the most important and almost exclusive disorder in the recognition of familiar people. In recent years, however, this conviction has been undermined by the description of patients showing a concomitant defect in the recognition of familiar faces and voices as a consequence of lesions encroaching upon the right anterior temporal lobe (ATL. These new data have obliged researchers to reconsider on one hand the construct of ‘associative prosopagnosia’ and on the other hand current models of people recognition. A systematic review of the patterns of familiar people recognition disorders observed in patients with right and left ATL lesions has shown that in patients with right ATL lesions face familiarity feelings and the retrieval of person-specific semantic information from faces are selectively affected, whereas in patients with left ATL lesions the defect selectively concerns famous people naming. Furthermore, some patients with right ATL lesions and intact face familiarity feelings show a defect in the retrieval of person-specific semantic knowledge greater from face than from name. These data are at variance with current models assuming: (a that familiarity feelings are generated at the level of person identity nodes (PINs where information processed by various sensory modalities converge, and (b that PINs provide a modality-free gateway to a single semantic system, where information about people is stored in an amodal format. They suggest, on the contrary: (a that familiarity feelings are generated at the level of modality-specific recognition units; (b that face and voice recognition units are represented more in the right than in the left ATLs; (c that in the right ATL are mainly stored person-specific information based on a convergence of perceptual information, whereas in the left ATLs are represented verbally-mediated person-specific information.

  11. Voice recognition software can be used for scientific articles

    DEFF Research Database (Denmark)

    Pommergaard, Hans-Christian; Huang, Chenxi; Burcharth, Jacob

    2015-01-01

    INTRODUCTION: Dictation of scientific articles has been recognised as an efficient method for producing high-quality, first article drafts. However, standardised transcription service by a secretary may not be available for all researchers and voice recognition software (VRS) may therefore...... be an alternative. The purpose of this study was to evaluate the out-of-the-box accuracy of VRS. METHODS: Eleven young researchers without dictation experience dictated the first draft of their own scientific article after thorough preparation according to a pre-defined schedule. The dictate transcribed by VRS...

  12. Voice recognition software can be used for scientific articles

    DEFF Research Database (Denmark)

    Pommergaard, Hans-Christian; Huang, Chenxi; Burcharth, Jacob;

    2015-01-01

    INTRODUCTION: Dictation of scientific articles has been recognised as an efficient method for producing high-quality, first article drafts. However, standardised transcription service by a secretary may not be available for all researchers and voice recognition software (VRS) may therefore...... be an alternative. The purpose of this study was to evaluate the out-of-the-box accuracy of VRS. METHODS: Eleven young researchers without dictation experience dictated the first draft of their own scientific article after thorough preparation according to a pre-defined schedule. The dictate transcribed by VRS...... was compared with the same dictate transcribed by an experienced research secretary, and the effect of adding words to the vocabulary of the VRS was investigated. The number of errors per hundred words was used as outcome. Furthermore, three experienced researchers assessed the subjective readability using...

  13. Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques

    CERN Document Server

    Muda, Lindasalwa; Elamvazuthi, I

    2010-01-01

    Digital processing of speech signal and voice recognition algorithm is very important for fast and accurate automatic voice recognition technology. The voice is a signal of infinite information. A direct analysis and synthesizing the complex voice signal is due to too much information contained in the signal. Therefore the digital signal processes such as Feature Extraction and Feature Matching are introduced to represent the voice signal. Several methods such as Liner Predictive Predictive Coding (LPC), Hidden Markov Model (HMM), Artificial Neural Network (ANN) and etc are evaluated with a view to identify a straight forward and effective method for voice signal. The extraction and matching process is implemented right after the Pre Processing or filtering signal is performed. The non-parametric method for modelling the human auditory perception system, Mel Frequency Cepstral Coefficients (MFCCs) are utilize as extraction techniques. The non linear sequence alignment known as Dynamic Time Warping (DTW) intro...

  14. Embodied Transcription: A Creative Method for Using Voice-Recognition Software

    Science.gov (United States)

    Brooks, Christine

    2010-01-01

    Voice-recognition software is designed to be used by one user (voice) at a time, requiring a researcher to speak all of the words of a recorded interview to achieve transcription. Thus, the researcher becomes a conduit through which interview material is inscribed as written word. Embodied Transcription acknowledges performative and interpretative…

  15. Impact of PACS and Voice-Recognition Reporting on the Education of Radiology Residents

    OpenAIRE

    Gutierrez, Antonio J.; Mullins, Mark E.; Robert A. Novelline

    2005-01-01

    Rationale and Objectives: The introduction of picture archiving and communication system (PACS) has decreased the time needed to interpret radiology examinations resulting in an increased workflow. Because of concerns that the increase in exam throughput and the use of voice recognition may have a negative impact upon radiology resident education, a survey was conducted to assess the impact of PACS and voice recognition. Materials and Methods: Residents at four diagnostic radiology training p...

  16. Superior voice recognition in a patient with acquired prosopagnosia and object agnosia.

    Science.gov (United States)

    Hoover, Adria E N; Démonet, Jean-François; Steeves, Jennifer K E

    2010-11-01

    Anecdotally, it has been reported that individuals with acquired prosopagnosia compensate for their inability to recognize faces by using other person identity cues such as hair, gait or the voice. Are they therefore superior at the use of non-face cues, specifically voices, to person identity? Here, we empirically measure person and object identity recognition in a patient with acquired prosopagnosia and object agnosia. We quantify person identity (face and voice) and object identity (car and horn) recognition for visual, auditory, and bimodal (visual and auditory) stimuli. The patient is unable to recognize faces or cars, consistent with his prosopagnosia and object agnosia, respectively. He is perfectly able to recognize people's voices and car horns and bimodal stimuli. These data show a reverse shift in the typical weighting of visual over auditory information for audiovisual stimuli in a compromised visual recognition system. Moreover, the patient shows selectively superior voice recognition compared to the controls revealing that two different stimulus domains, persons and objects, may not be equally affected by sensory adaptation effects. This also implies that person and object identity recognition are processed in separate pathways. These data demonstrate that an individual with acquired prosopagnosia and object agnosia can compensate for the visual impairment and become quite skilled at using spared aspects of sensory processing. In the case of acquired prosopagnosia it is advantageous to develop a superior use of voices for person identity recognition in everyday life. Copyright © 2010 Elsevier Ltd. All rights reserved.

  17. Voice recognition software can be used for scientific articles.

    Science.gov (United States)

    Pommergaard, Hans-Christian; Huang, Chenxi; Burcharth, Jacob; Rosenberg, Jacob

    2015-02-01

    Dictation of scientific articles has been recognised as an efficient method for producing high-quality, first article drafts. However, standardised transcription service by a secretary may not be available for all researchers and voice recognition software (VRS) may therefore be an alternative. The purpose of this study was to evaluate the out-of-the-box accuracy of VRS. Eleven young researchers without dictation experience dictated the first draft of their own scientific article after thorough preparation according to a pre-defined schedule. The dictate transcribed by VRS was compared with the same dictate transcribed by an experienced research secretary, and the effect of adding words to the vocabulary of the VRS was investigated. The number of errors per hundred words was used as outcome. Furthermore, three experienced researchers assessed the subjective readability using a Likert scale (0-10). Dragon Nuance Premium version 12.5 was used as VRS. The median number of errors per hundred words was 18 (range: 8.5-24.3), which improved when 15,000 words were added to the vocabulary. Subjective readability assessment showed that the texts were understandable with a median score of five (range: 3-9), which was improved with the addition of 5,000 words. The out-of-the-box performance of VRS was acceptable and improved after additional words were added. Further studies are needed to investigate the effect of additional software accuracy training.

  18. Who gets credit for input? Demographic and structural status cues in voice recognition.

    Science.gov (United States)

    Howell, Taeya M; Harrison, David A; Burris, Ethan R; Detert, James R

    2015-11-01

    The authors investigate the employee features that, alongside overall voice expression, affect supervisors' voice recognition. Drawing primarily from status characteristics and network position theories, the authors propose and find in a study of 693 employees from 89 different credit union units that supervisors are more likely to credit those reporting the same amount of voice if the employees have higher ascribed or assigned (by the organization) status--cued by demographic variables such as majority ethnicity and full-time work hours. Further, supervisors are more likely to recognize voice from employees who have higher achieved status--cued by their centrality in informal social structures. The authors also find that even when certain groups of lower status employees speak up more, they cannot compensate for the negative effect of their demographic membership on voice recognition by their boss. The authors underscore how recognition of employee voice by supervisors matters for employees. It carries (mediates) the effects of voice expression and status onto performance evaluations 1 year later, which means that demographic differences in the assignment of credit for voice can serve as an implicit pathway for discrimination.

  19. Voice identity recognition: functional division of the right STS and its behavioral relevance.

    Science.gov (United States)

    Schall, Sonja; Kiebel, Stefan J; Maess, Burkhard; von Kriegstein, Katharina

    2015-02-01

    The human voice is the primary carrier of speech but also a fingerprint for person identity. Previous neuroimaging studies have revealed that speech and identity recognition is accomplished by partially different neural pathways, despite the perceptual unity of the vocal sound. Importantly, the right STS has been implicated in voice processing, with different contributions of its posterior and anterior parts. However, the time point at which vocal and speech processing diverge is currently unknown. Also, the exact role of the right STS during voice processing is so far unclear because its behavioral relevance has not yet been established. Here, we used the high temporal resolution of magnetoencephalography and a speech task control to pinpoint transient behavioral correlates: we found, at 200 msec after stimulus onset, that activity in right anterior STS predicted behavioral voice recognition performance. At the same time point, the posterior right STS showed increased activity during voice identity recognition in contrast to speech recognition whereas the left mid STS showed the reverse pattern. In contrast to the highly speech-sensitive left STS, the current results highlight the right STS as a key area for voice identity recognition and show that its anatomical-functional division emerges around 200 msec after stimulus onset. We suggest that this time point marks the speech-independent processing of vocal sounds in the posterior STS and their successful mapping to vocal identities in the anterior STS.

  20. Ability for voice recognition is a marker for dyslexia in children.

    Science.gov (United States)

    Perea, Manuel; Jiménez, María; Suárez-Coalla, Paz; Fernández, Nohemí; Viña, Cecilia; Cuetos, Fernando

    2014-01-01

    A recent voice recognition experiment conducted by Perrachione, Del Tufo, and Gabrieli (2011) revealed that, in normal adult readers, the accuracy at identifying human voices was better in the participants' mother tongue than in an unfamiliar language, while this difference was absent in a group of adults with dyslexia. This pattern favored a view of dyslexia as due to "fundamentally impoverished native-language phonological representations." To further examine this issue, we conducted two voice recognition experiments, one with children with/without dyslexia, and the other with adults with/without dyslexia. Results revealed that children/adults with dyslexia were less accurate at identifying voices than normal readers and, importantly, this effect was independent of language. These data are more consistent with the assumption of dyslexia as due to a deficit in multisensory integration rather than a deficit based on impoverished native-language phonologically based representations.

  1. Literature Review of Voice Recognition and Generation Technology for Army Helicopter Applications.

    Science.gov (United States)

    1984-08-01

    support up this conclusion (Jay, 1981; Coler , 1983). Based upon the research presented, the following statements can be made: a. When flight control...dB) must be overcome by the voice recognizer ( Coler , 1983). 11 55i The effects of noise on voice recognition were the topic of a study performed at...noise when the subject was also required to perform a tracking task and enter data ( Coler , 1983). Performance was evaluated for three different

  2. Analysis of the influence of sound signal processing parameters on the quality voice command recognition

    Directory of Open Access Journals (Sweden)

    L. P. Dyuzhayev

    2014-04-01

    Full Text Available Introduction. For the task of voice control over different devices recognition of single (isolated voice commands is required. Typically, this control method requires high reliability (at least 95% accuracy voice recognition. It should be noted that voice commands are often pronounced in high noisiness. All presently known methods and algorithms of speech recognition do not allow to clearly determine which parameters of sound signal can provide the best results. The main part. On the first level of voice recognition is about preprocessing and extracting of acoustic features that have a number of useful features – they are easily calculated, providing a compact representation of the voice commands that are resistant to noise interference; On the next level given command is looked for in the reference dictionary. To get MFCC coefficients input file has to be divided into frames. Each frame is measured by a window function and processed by discrete Fourier transform. The resulting representation of signal in the frequency domain is divided into ranges using a set of triangular filters. The last step is to perform discrete cosine transform. Method of dynamic time warping allows to get a value that is an inverse of degree of similarity between given command and a reference. Conclusions. Research has shown that in the field of voice commands recognition optimum results in terms of quality / performance can be achieved using the following parameters of sound signal processing:8 kHz sample rate, frame duration 70–120 ms, Hamming weighting function of a window, number of Fourier samples is 512.

  3. Body expressions influence recognition of emotions in the face and voice.

    Science.gov (United States)

    Van den Stock, Jan; Righart, Ruthger; de Gelder, Beatrice

    2007-08-01

    The most familiar emotional signals consist of faces, voices, and whole-body expressions, but so far research on emotions expressed by the whole body is sparse. The authors investigated recognition of whole-body expressions of emotion in three experiments. In the first experiment, participants performed a body expression-matching task. Results indicate good recognition of all emotions, with fear being the hardest to recognize. In the second experiment, two alternative forced choice categorizations of the facial expression of a compound face-body stimulus were strongly influenced by the bodily expression. This effect was a function of the ambiguity of the facial expression. In the third experiment, recognition of emotional tone of voice was similarly influenced by task irrelevant emotional body expressions. Taken together, the findings illustrate the importance of emotional whole-body expressions in communication either when viewed on their own or, as is often the case in realistic circumstances, in combination with facial expressions and emotional voices.

  4. Effects of emotional and perceptual-motor stress on a voice recognition system's accuracy: An applied investigation

    Science.gov (United States)

    Poock, G. K.; Martin, B. J.

    1984-02-01

    This was an applied investigation examining the ability of a speech recognition system to recognize speakers' inputs when the speakers were under different stress levels. Subjects were asked to speak to a voice recognition system under three conditions: (1) normal office environment, (2) emotional stress, and (3) perceptual-motor stress. Results indicate a definite relationship between voice recognition system performance and the type of low stress reference patterns used to achieve recognition.

  5. Motorcycle Start-stop System based on Intelligent Biometric Voice Recognition

    Science.gov (United States)

    Winda, A.; E Byan, W. R.; Sofyan; Armansyah; Zariantin, D. L.; Josep, B. G.

    2017-03-01

    Current mechanical key in the motorcycle is prone to bulgary, being stolen or misplaced. Intelligent biometric voice recognition as means to replace this mechanism is proposed as an alternative. The proposed system will decide whether the voice is belong to the user or not and the word utter by the user is ‘On’ or ‘Off’. The decision voice will be sent to Arduino in order to start or stop the engine. The recorded voice is processed in order to get some features which later be used as input to the proposed system. The Mel-Frequency Ceptral Coefficient (MFCC) is adopted as a feature extraction technique. The extracted feature is the used as input to the SVM-based identifier. Experimental results confirm the effectiveness of the proposed intelligent voice recognition and word recognition system. It show that the proposed method produces a good training and testing accuracy, 99.31% and 99.43%, respectively. Moreover, the proposed system shows the performance of false rejection rate (FRR) and false acceptance rate (FAR) accuracy of 0.18% and 17.58%, respectively. In the intelligent word recognition shows that the training and testing accuracy are 100% and 96.3%, respectively.

  6. Recognition of voice commands using adaptation of foreign language speech recognizer via selection of phonetic transcriptions

    Science.gov (United States)

    Maskeliunas, Rytis; Rudzionis, Vytautas

    2011-06-01

    In recent years various commercial speech recognizers have become available. These recognizers provide the possibility to develop applications incorporating various speech recognition techniques easily and quickly. All of these commercial recognizers are typically targeted to widely spoken languages having large market potential; however, it may be possible to adapt available commercial recognizers for use in environments where less widely spoken languages are used. Since most commercial recognition engines are closed systems the single avenue for the adaptation is to try set ways for the selection of proper phonetic transcription methods between the two languages. This paper deals with the methods to find the phonetic transcriptions for Lithuanian voice commands to be recognized using English speech engines. The experimental evaluation showed that it is possible to find phonetic transcriptions that will enable the recognition of Lithuanian voice commands with recognition accuracy of over 90%.

  7. (Almost) Word for Word: As Voice Recognition Programs Improve, Students Reap the Benefits

    Science.gov (United States)

    Smith, Mark

    2006-01-01

    Voice recognition software is hardly new--attempts at capturing spoken words and turning them into written text have been available to consumers for about two decades. But what was once an expensive and highly unreliable tool has made great strides in recent years, perhaps most recognized in programs such as Nuance's Dragon NaturallySpeaking…

  8. Frequency and analysis of non-clinical errors made in radiology reports using the National Integrated Medical Imaging System voice recognition dictation software.

    Science.gov (United States)

    Motyer, R E; Liddy, S; Torreggiani, W C; Buckley, O

    2016-11-01

    Voice recognition (VR) dictation of radiology reports has become the mainstay of reporting in many institutions worldwide. Despite benefit, such software is not without limitations, and transcription errors have been widely reported. Evaluate the frequency and nature of non-clinical transcription error using VR dictation software. Retrospective audit of 378 finalised radiology reports. Errors were counted and categorised by significance, error type and sub-type. Data regarding imaging modality, report length and dictation time was collected. 67 (17.72 %) reports contained ≥1 errors, with 7 (1.85 %) containing 'significant' and 9 (2.38 %) containing 'very significant' errors. A total of 90 errors were identified from the 378 reports analysed, with 74 (82.22 %) classified as 'insignificant', 7 (7.78 %) as 'significant', 9 (10 %) as 'very significant'. 68 (75.56 %) errors were 'spelling and grammar', 20 (22.22 %) 'missense' and 2 (2.22 %) 'nonsense'. 'Punctuation' error was most common sub-type, accounting for 27 errors (30 %). Complex imaging modalities had higher error rates per report and sentence. Computed tomography contained 0.040 errors per sentence compared to plain film with 0.030. Longer reports had a higher error rate, with reports >25 sentences containing an average of 1.23 errors per report compared to 0-5 sentences containing 0.09. These findings highlight the limitations of VR dictation software. While most error was deemed insignificant, there were occurrences of error with potential to alter report interpretation and patient management. Longer reports and reports on more complex imaging had higher error rates and this should be taken into account by the reporting radiologist.

  9. Application of Voice Recognition Input to Decision Support Systems

    Science.gov (United States)

    1988-12-01

    namely, a Bark-scale frequency warping and the incorporation of suprasegmental energy information. All distortion measures and their modifications were...lowest score; (2) Whereas the addition of suprasegmental energy information helped the recognition performance, the use of gain and absolute loudness

  10. Voice recognition through phonetic features with Punjabi utterances

    Science.gov (United States)

    Kaur, Jasdeep; Juglan, K. C.; Sharma, Vishal; Upadhyay, R. K.

    2017-07-01

    This paper deals with perception and disorders of speech in view of Punjabi language. Visualizing the importance of voice identification, various parameters of speaker identification has been studied. The speech material was recorded with a tape recorder in their normal and disguised mode of utterances. Out of the recorded speech materials, the utterances free from noise, etc were selected for their auditory and acoustic spectrographic analysis. The comparison of normal and disguised speech of seven subjects is reported. The fundamental frequency (F0) at similar places, Plosive duration at certain phoneme, Amplitude ratio (A1:A2) etc. were compared in normal and disguised speech. It was found that the formant frequency of normal and disguised speech remains almost similar only if it is compared at the position of same vowel quality and quantity. If the vowel is more closed or more open in the disguised utterance the formant frequency will be changed in comparison to normal utterance. The ratio of the amplitude (A1: A2) is found to be speaker dependent. It remains unchanged in the disguised utterance. However, this value may shift in disguised utterance if cross sectioning is not done at the same location.

  11. Voice emotion recognition by cochlear-implanted children and their normally-hearing peers.

    Science.gov (United States)

    Chatterjee, Monita; Zion, Danielle J; Deroche, Mickael L; Burianek, Brooke A; Limb, Charles J; Goren, Alison P; Kulkarni, Aditya M; Christensen, Julie A

    2015-04-01

    Despite their remarkable success in bringing spoken language to hearing impaired listeners, the signal transmitted through cochlear implants (CIs) remains impoverished in spectro-temporal fine structure. As a consequence, pitch-dominant information such as voice emotion, is diminished. For young children, the ability to correctly identify the mood/intent of the speaker (which may not always be visible in their facial expression) is an important aspect of social and linguistic development. Previous work in the field has shown that children with cochlear implants (cCI) have significant deficits in voice emotion recognition relative to their normally hearing peers (cNH). Here, we report on voice emotion recognition by a cohort of 36 school-aged cCI. Additionally, we provide for the first time, a comparison of their performance to that of cNH and NH adults (aNH) listening to CI simulations of the same stimuli. We also provide comparisons to the performance of adult listeners with CIs (aCI), most of whom learned language primarily through normal acoustic hearing. Results indicate that, despite strong variability, on average, cCI perform similarly to their adult counterparts; that both groups' mean performance is similar to aNHs' performance with 8-channel noise-vocoded speech; that cNH achieve excellent scores in voice emotion recognition with full-spectrum speech, but on average, show significantly poorer scores than aNH with 8-channel noise-vocoded speech. A strong developmental effect was observed in the cNH with noise-vocoded speech in this task. These results point to the considerable benefit obtained by cochlear-implanted children from their devices, but also underscore the need for further research and development in this important and neglected area. This article is part of a Special Issue entitled .

  12. Recognition disorders for famous faces and voices: a review of the literature and normative data of a new test battery.

    Science.gov (United States)

    Quaranta, Davide; Piccininni, Chiara; Carlesimo, Giovanni Augusto; Luzzi, Simona; Marra, Camillo; Papagno, Costanza; Trojano, Luigi; Gainotti, Guido

    2016-03-01

    Several anatomo-clinical investigations have shown that familiar face recognition disorders not due to high level perceptual defects are often observed in patients with lesions of the right anterior temporal lobe (ATL). The meaning of these findings is, however, controversial, because some authors claim that these patients show pure instances of modality-specific 'associative prosopagnosia', whereas other authors maintain that in these patients voice recognition is also impaired and that these patients have a 'multimodal person recognition disorder'. To solve the problem of the nature of famous faces recognition disorders in patients affected by right ATL lesions, it is therefore very important to verify with formal tests if these patients are or are not able to recognize others by voice, but a direct comparison between the two modalities is hindered by the fact that voice recognition is more difficult than face recognition. To circumvent this difficulty, we constructed a test battery in which subjects were requested to recognize the same persons (well-known at the national level) through their faces and voices, evaluating familiarity and identification processes. The present paper describes the 'Famous People Recognition Battery' and reports the normative data necessary to clarify the nature of person recognition disorders observed in patients affected by right ATL lesions.

  13. Economic Evaluation of Voice Recognition (VR) for the Clinician’s Desktop at the Naval Hospital Roosevelt Roads

    Science.gov (United States)

    1997-09-01

    34 and "The League of Nations" in an electronic encyclopedia or online reference. [Ref. 16: p. 38] The reason we haven’t conversed with computers until...When did you first start working on this job? I have work here for 2 years. I have been a family physician since June 1994, I was a GMO from 1990 to

  14. Examining the effects of variation in emotional tone of voice on spoken word recognition.

    Science.gov (United States)

    Krestar, Maura L; McLennan, Conor T

    2013-09-01

    Emotional tone of voice (ETV) is essential for optimal verbal communication. Research has found that the impact of variation in nonlinguistic features of speech on spoken word recognition differs according to a time course. In the current study, we investigated whether intratalker variation in ETV follows the same time course in two long-term repetition priming experiments. We found that intratalker variability in ETVs affected reaction times to spoken words only when processing was relatively slow and difficult, not when processing was relatively fast and easy. These results provide evidence for the use of both abstract and episodic lexical representations for processing within-talker variability in ETV, depending on the time course of spoken word recognition.

  15. It doesn't matter what you say: FMRI correlates of voice learning and recognition independent of speech content.

    Science.gov (United States)

    Zäske, Romi; Awwad Shiekh Hasan, Bashar; Belin, Pascal

    2017-09-01

    Listeners can recognize newly learned voices from previously unheard utterances, suggesting the acquisition of high-level speech-invariant voice representations during learning. Using functional magnetic resonance imaging (fMRI) we investigated the anatomical basis underlying the acquisition of voice representations for unfamiliar speakers independent of speech, and their subsequent recognition among novel voices. Specifically, listeners studied voices of unfamiliar speakers uttering short sentences and subsequently classified studied and novel voices as "old" or "new" in a recognition test. To investigate "pure" voice learning, i.e., independent of sentence meaning, we presented German sentence stimuli to non-German speaking listeners. To disentangle stimulus-invariant and stimulus-dependent learning, during the test phase we contrasted a "same sentence" condition in which listeners heard speakers repeating the sentences from the preceding study phase, with a "different sentence" condition. Voice recognition performance was above chance in both conditions although, as expected, performance was higher for same than for different sentences. During study phases activity in the left inferior frontal gyrus (IFG) was related to subsequent voice recognition performance and same versus different sentence condition, suggesting an involvement of the left IFG in the interactive processing of speaker and speech information during learning. Importantly, at test reduced activation for voices correctly classified as "old" compared to "new" emerged in a network of brain areas including temporal voice areas (TVAs) of the right posterior superior temporal gyrus (pSTG), as well as the right inferior/middle frontal gyrus (IFG/MFG), the right medial frontal gyrus, and the left caudate. This effect of voice novelty did not interact with sentence condition, suggesting a role of temporal voice-selective areas and extra-temporal areas in the explicit recognition of learned voice identity

  16. Educational Pedagogy Explored: Attachment, Voice, and Students’ Limited Recognition of the Purpose of Writing

    Directory of Open Access Journals (Sweden)

    Rebecca A. Fairchild

    2013-07-01

    Full Text Available The following teacher research case-study involved an exploration of educational pedagogy by working with a freshman composition student at a college university. All data collected for the study was gathered during the 2013 spring semester. The study was driven by an inquiry based approach where the researcher determined the center of focus that arose from an exploration of the student as a writer through a survey, a classroom observation, multiple one-on-one meetings, and email conversations. The focus area that arose was the student’s limited recognition that writing was done solely for school purposes. Related puzzlements stemming from this focus area included the student’s lack of attachment and lack of voice in her writing. The conclusive data provided insights for how to educate students in future classrooms regarding how vital it is for students to be able to attach themselves to their work.

  17. Voice Activity Detector of Wake-Up-Word Speech Recognition System Design on FPGA

    Directory of Open Access Journals (Sweden)

    Veton Z. Këpuska

    2014-12-01

    Full Text Available A typical speech recognition system is push-to-talk operated that requires activation. However for those who use hands-busy applications, movement may by restricted or impossible. One alternative is to use Speech-Only Interface. The proposed method that is called Wake-Up-Word Speech Recognition (WUW-SR that utilizes speech only interface. A WUW-SR system would allow the user to activate systems (Cell phone, Computer, etc. with only speech commands instead of manual activation. The trend in WUW-SR hardware design is towards implementing a complete system on a single chip intended for various applications. This paper presents an experimental FPGA design and implementation of a novel architecture of a real time feature extraction processor that includes: Voice Activity Detector (VAD, and features extraction, MFCC, LPC, and ENH_MFCC. In the WUW-SR system, the recognizer front-end with VAD is located at the terminal which is typically connected over a data network(e.g., serverfor remote back-end recognition. VAD is responsible for segmenting the signal into speech-like and non-speech-like segments. For any given frame VAD reports one of two possible states: VAD_ON or VAD_OFF. The back-end is then responsible to score the features that are being segmented during VAD_ON stage. The most important characteristic of the presented design is that it should guarantee virtually 100% correct rejection for non-WUW (out of vocabulary words - OOV while maintaining correct acceptance rate of 99.9% or higher (in vocabulary words - INV. This requirement sets apart WUW-SR from other speech recognition tasks because no existing system can guarantee 100% reliability by any measure.

  18. Automatic Speech Recognition Systems for the Evaluation of Voice and Speech Disorders in Head and Neck Cancer

    Directory of Open Access Journals (Sweden)

    Andreas Maier

    2010-01-01

    Full Text Available In patients suffering from head and neck cancer, speech intelligibility is often restricted. For assessment and outcome measurements, automatic speech recognition systems have previously been shown to be appropriate for objective and quick evaluation of intelligibility. In this study we investigate the applicability of the method to speech disorders caused by head and neck cancer. Intelligibility was quantified by speech recognition on recordings of a standard text read by 41 German laryngectomized patients with cancer of the larynx or hypopharynx and 49 German patients who had suffered from oral cancer. The speech recognition provides the percentage of correctly recognized words of a sequence, that is, the word recognition rate. Automatic evaluation was compared to perceptual ratings by a panel of experts and to an age-matched control group. Both patient groups showed significantly lower word recognition rates than the control group. Automatic speech recognition yielded word recognition rates which complied with experts' evaluation of intelligibility on a significant level. Automatic speech recognition serves as a good means with low effort to objectify and quantify the most important aspect of pathologic speech—the intelligibility. The system was successfully applied to voice and speech disorders.

  19. 病态嗓音的识别与研究%study and recognition of pathological voice

    Institute of Scientific and Technical Information of China (English)

    陈承义; 高俊芬

    2013-01-01

      通过分析嗓音的发音机理,提取正常与病态嗓音的传统声学参数:基频、共振峰、Mel 倒谱系数(MFCC),以及非线性特征参数:计盒维数与截距,作为病态嗓音识别的特征矢量集.应用高斯混合模型(GMM)对156例正常嗓音与146例病态嗓音进行建模与识别.结果表明:非线性特征参数计盒维数与截距能很好地区分正常与病态嗓音,它们与传统声学参数基频和共振峰的组合,能够取得92.60%的识别率.%By analyzing the mechanism of pronunciation, normal and pathological voice of traditional acoustic parameters:fun-damental frequency, formant, Mel Frequency Cepstrum Coefficient(MFCC), and non-linear feature parameters:box-counting dimension and intercept, are extracted as feature vectors of recognition of pathological voice. 156 normal voice samples and 146 pathological voice samples are recognized based on Gaussian Mixture Model(GMM). The results show that the nonlinear fea-ture parameters of box-counting dimension and intercept can well distinguish between normal and pathological voice. The com-bination of box-counting dimension, intercept and the traditional acoustic parameters-fundamental frequency and formant can achieve a better recognition rate of 92.60%.

  20. A self-teaching image processing and voice-recognition-based, intelligent and interactive system to educate visually impaired children

    Science.gov (United States)

    Iqbal, Asim; Farooq, Umar; Mahmood, Hassan; Asad, Muhammad Usman; Khan, Akrama; Atiq, Hafiz Muhammad

    2010-02-01

    A self teaching image processing and voice recognition based system is developed to educate visually impaired children, chiefly in their primary education. System comprises of a computer, a vision camera, an ear speaker and a microphone. Camera, attached with the computer system is mounted on the ceiling opposite (on the required angle) to the desk on which the book is placed. Sample images and voices in the form of instructions and commands of English, Urdu alphabets, Numeric Digits, Operators and Shapes are already stored in the database. A blind child first reads the embossed character (object) with the help of fingers than he speaks the answer, name of the character, shape etc into the microphone. With the voice command of a blind child received by the microphone, image is taken by the camera which is processed by MATLAB® program developed with the help of Image Acquisition and Image processing toolbox and generates a response or required set of instructions to child via ear speaker, resulting in self education of a visually impaired child. Speech recognition program is also developed in MATLAB® with the help of Data Acquisition and Signal Processing toolbox which records and process the command of the blind child.

  1. Subjects taught in VR

    NARCIS (Netherlands)

    van der Sluis, Frans; van den Broek, Egon; Stam, Liesbeth M.; Abrahamse, E.L.; Luursema, J.M.

    2007-01-01

    This deliverable serves to reinstate a broad view on Virtual Reality (VR), capturing all its constituting disciplines. The core target of this report is to establish a foundation for an educational program where all disciplines subordinate to VR technology will converge. Over the past decade(s) the

  2. Subjects taught in VR

    NARCIS (Netherlands)

    Sluis, van der Frans; Broek, van den Egon L.; Stam, Liesbeth M.; Abrahamse, Elger L.

    2006-01-01

    This deliverable serves to reinstate a broad view on Virtual Reality (VR), capturing all its constituting disciplines. The core target of this report is to establish a foundation for an educational program where all disciplines subordinate to VR technology will converge. Over the past decade(s) the

  3. Speech Recognition System For Robotic Control And Movement

    Directory of Open Access Journals (Sweden)

    Biraja Nalini Rout

    2015-08-01

    Full Text Available Abstract In a current scenario voice and data recognition is one of the most sought after field in the area of artificial intelligence and robotic 1 engineering. The idea specializes on deriving a voice to voice intelligent system which operates purely on audiovoice instructions using a specialized voice recognition module a micro controller a set of wheels and a movable arm to operate. The working involves real time voice inputs feeded to the VR module which equivalently processes the audio signals and produces the output in audio format. It consists an IDE for both Windows and UNIX based operating system for manipulating and processing instructions both at software and hardware levels. The system also can perform a basic set of manual operations decides through the expert system. The VR module processes the data using multilayer perceptron to generate the required result. Movable arm operates to pick and place objects as per the given voice instructions. Its usability involves substituting manual work at both personal and professional levels.

  4. The Cambridge Mindreading Face-Voice Battery for Children (CAM-C): complex emotion recognition in children with and without autism spectrum conditions.

    Science.gov (United States)

    Golan, Ofer; Sinai-Gavrilov, Yana; Baron-Cohen, Simon

    2015-01-01

    Difficulties in recognizing emotions and mental states are central characteristics of autism spectrum conditions (ASC). However, emotion recognition (ER) studies have focused mostly on recognition of the six 'basic' emotions, usually using still pictures of faces. This study describes a new battery of tasks for testing recognition of nine complex emotions and mental states from video clips of faces and from voice recordings taken from the Mindreading DVD. This battery (the Cambridge Mindreading Face-Voice Battery for Children or CAM-C) was given to 30 high-functioning children with ASC, aged 8 to 11, and to 25 matched controls. The ASC group scored significantly lower than controls on complex ER from faces and voices. In particular, participants with ASC had difficulty with six out of nine complex emotions. Age was positively correlated with all task scores, and verbal IQ was correlated with scores in the voice task. CAM-C scores were negatively correlated with parent-reported level of autism spectrum symptoms. Children with ASC show deficits in recognition of complex emotions and mental states from both facial and vocal expressions. The CAM-C may be a useful test for endophenotypic studies of ASC and is one of the first to use dynamic stimuli as an assay to reveal the ER profile in ASC. It complements the adult version of the CAM Face-Voice Battery, thus providing opportunities for developmental assessment of social cognition in autism.

  5. 语音情感识别研究现状综述%A General Summary of the Research Status Que about the Voice Emotion Recognition

    Institute of Scientific and Technical Information of China (English)

    何秉羲

    2015-01-01

    This article starts from the concept and process of voice emotion recognition, the phased research situation about the process of voice emotion recognition has carried on the comprehensive elaboration in recent years, and the fu-ture research and its development are prospected.%本文从语音情感识别的概念以及流程入手,对近些年来关于语音情感识别过程情况的阶段性研究成果进行了综合阐述,并对其未来研究及其发展进行了展望。

  6. Using voice input and audio feedback to enhance the reality of a virtual experience

    Energy Technology Data Exchange (ETDEWEB)

    Miner, N.E.

    1994-04-01

    Virtual Reality (VR) is a rapidly emerging technology which allows participants to experience a virtual environment through stimulation of the participant`s senses. Intuitive and natural interactions with the virtual world help to create a realistic experience. Typically, a participant is immersed in a virtual environment through the use of a 3-D viewer. Realistic, computer-generated environment models and accurate tracking of a participant`s view are important factors for adding realism to a virtual experience. Stimulating a participant`s sense of sound and providing a natural form of communication for interacting with the virtual world are equally important. This paper discusses the advantages and importance of incorporating voice recognition and audio feedback capabilities into a virtual world experience. Various approaches and levels of complexity are discussed. Examples of the use of voice and sound are presented through the description of a research application developed in the VR laboratory at Sandia National Laboratories.

  7. VR for Mars Pathfinder

    Science.gov (United States)

    Blackmon, Theodore

    1998-01-01

    Virtual reality (VR) technology has played an integral role for Mars Pathfinder mission, operations Using an automated machine vision algorithm, the 3d topography of the Martian surface was rapidly recovered fro -a the stereo images captured. by the Tender camera to produce photo-realistic 3d models, An advanced, interface was developed for visualization and interaction with. the virtual environment of the Pathfinder landing site for mission scientists at the Space Flight Operations Facility of the Jet Propulsion Laboratory. The VR aspect of the display allowed mission scientists to navigate on Mars in Bud while remaining here on Earth, thus improving their spatial awareness of the rock field that surrounds the lenders Measurements of positions, distances and angles could be easily extracted from the topographic models, providing valuable information for science analysis and mission. planning. Moreover, the VR map of Mars has also been used to assist with the archiving and planning of activities for the Sojourner rover.

  8. Revisiting vocal perception in non-human animals: a review of vowel discrimination, speaker voice recognition, and speaker normalization

    Directory of Open Access Journals (Sweden)

    Buddhamas eKriengwatana

    2015-01-01

    Full Text Available The extent to which human speech perception evolved by taking advantage of predispositions and pre-existing features of vertebrate auditory and cognitive systems remains a central question in the evolution of speech. This paper reviews asymmetries in vowel perception, speaker voice recognition, and speaker normalization in non-human animals – topics that have not been thoroughly discussed in relation to the abilities of non-human animals, but are nonetheless important aspects of vocal perception. Throughout this paper we demonstrate that addressing these issues in non-human animals is relevant and worthwhile because many non-human animals must deal with similar issues in their natural environment. That is, they must also discriminate between similar-sounding vocalizations, determine signaler identity from vocalizations, and resolve signaler-dependent variation in vocalizations from conspecifics. Overall, we find that, although plausible, the current evidence is insufficiently strong to conclude that directional asymmetries in vowel perception are specific to humans, or that non-human animals can use voice characteristics to recognize human individuals. However, we do find some indication that non-human animals can normalize speaker differences. Accordingly, we identify avenues for future research that would greatly improve and advance our understanding of these topics.

  9. Automatic speech recognition (ASR) and its use as a tool for assessment or therapy of voice, speech, and language disorders.

    Science.gov (United States)

    Kitzing, Peter; Maier, Andreas; Ahlander, Viveka Lyberg

    2009-01-01

    In general opinion computerized automatic speech recognition (ASR) seems to be regarded as a method only to accomplish transcriptions from spoken language to written text and as such quite insecure and rather cumbersome. However, due to great advances in computer technology and informatics methodology ASR has nowadays become quite dependable and easier to handle, and the number of applications has increased considerably. After some introductory background information on ASR a number of applications of great interest for professionals in voice, speech, and language therapy are pointed out. In the foreseeable future, the keyboard and mouse will by means of ASR technology be replaced in many functions by a microphone as the human-computer interface, and the computer will talk back via its loud-speaker. It seems important that professionals engaged in the care of oral communication disorders take part in this development so their clients may get the optimal benefit from this new technology.

  10. Design and Implementation of Monophones and Triphones-Based Speech Recognition Systems for Voice Activated Telephony

    Directory of Open Access Journals (Sweden)

    Rupayan Das

    2013-07-01

    Full Text Available Speech recognition is the ability of a machine or program to convert spoken words into its equivalent text form. Nowadays, most recognition systems use Hidden Markov Models for modeling the spoken utterances. In this paper we have implemented two speaker independent speech recognition systems which include all the words required for dialing a phone. The systems contain 42 words including digits from zero to nine and also include names of 20 persons. A total of 16,800 utterances have been used for training each system. The two systems are able to recognize continuous speech and it is implemented with the help of monophones and triphones using HTK. Experimental results show an accuracy of 74.11% for monophones based models and 93.77% for triphones based models.

  11. A Real-Time Face Motion Based Approach towards Modeling Socially Assistive Wireless Robot Control with Voice Recognition

    Directory of Open Access Journals (Sweden)

    Abhinaba Bhattacharjee

    2015-10-01

    Full Text Available The robotics domain has a couple of specific general design requirements which requires the close integration of planning, sensing, control and modeling and for sure the robot must take into account the interactions between itself, its task and its environment surrounding it. Thus considering the fundamental configurations, the main motive is to design a system with user-friendly interfaces that possess the ability to control embedded robotic systems by natural means. While earlier works have focused primarily on issues such as manipulation and navigation only, this proposal presents a conceptual and intuitive approach towards man-machine interaction in order to provide a secured live biometric logical authorization to the user access, while making an intelligent interaction with the control station to navigate advanced gesture controlled wireless Robotic prototypes or mobile surveillance systems along desired directions through required displacements. The intuitions are based on tracking real-time 3-Dimensional Face Motions using skin tone segmentation and maximum area considerations of segmented face-like blobs, Or directing the system with voice commands using real-time speech recognition. The system implementation requires designing a user interface to communicate between the Control station and prototypes wirelessly, either by accessing the internet over an encrypted Wi-Fi Protected Access (WPA via a HTML web page for communicating with face motions or with the help of natural voice commands like “Trace 5 squares”, “Trace 10 triangles”, “Move 10 meters”, etc. evaluated on an iRobot Create over Bluetooth connectivity using a Bluetooth Access Module (BAM. Such an implementation can prove to be highly effective for designing systems of elderly aid and maneuvering the physically challenged.

  12. DLMS Voice Data Entry.

    Science.gov (United States)

    1980-06-01

    between operator and computer displayed on ADM-3A 20c A-I Possible Hardware Configuration for a Multistation Cartographic VDES ...this program a Voice Recognition System (VRS) which can be used to explore the use of voice data entry ( VDE ) in the DIMS or other cartographic data...Multi-Station Cartographic Voice Data Entry System An engineering development model voice data entry system ( VDES ) could be most efficiently

  13. A basic study on application of voice recognition input to an electronic nursing record system -evaluation of the function as an input interface-.

    Science.gov (United States)

    Marukami, Terutaka; Tani, Shoko; Matsuda, Atsuko; Takemoto, Keiko; Shindo, Akiko; Inada, Hiroshi

    2012-06-01

    As computerization in the nursing field has been recently progressing, an electronic nursing record system is gradually introduced in the medical institution in Japan. Although it is expected for the electronic nursing record system to reduce the load of nursing work, the conventional keyboard operation is used for information input of the present electronic nursing record system and it has some problems concerning the input time and the operationability for common nurses who are unfamiliar with the computer operation. In the present study, we conducted a basic study on application of voice recognition input to an electronic nursing record system. The voice input is recently introduced to an electronic medical record system in a few clinics. However, so far the entered information cannot be processed because the information of the medical record must be entered as a free sentence. Therefore, we contrived a template for an electronic nursing record system and introduced it to the system for simple information entry and easy processing of the entered information in this study. Furthermore, an input experiment for evaluation of the voice input with the template was carried out by voluntary subjects for evaluation of the function as an input interface of an electronic nursing record system. The results of the experiment revealed that the input time by the voice input is obviously fast compared with that by the keyboard input and operationability of the voice input was superior to the keyboard input although all subjects had inexperience of the voice input. As a result, it was suggested our method, the voice input using the template made by us, might be useful for an input interface of an electronic nursing record system.

  14. Evaluation of MPEG-7-Based Audio Descriptors for Animal Voice Recognition over Wireless Acoustic Sensor Networks.

    Science.gov (United States)

    Luque, Joaquín; Larios, Diego F; Personal, Enrique; Barbancho, Julio; León, Carlos

    2016-05-18

    Environmental audio monitoring is a huge area of interest for biologists all over the world. This is why some audio monitoring system have been proposed in the literature, which can be classified into two different approaches: acquirement and compression of all audio patterns in order to send them as raw data to a main server; or specific recognition systems based on audio patterns. The first approach presents the drawback of a high amount of information to be stored in a main server. Moreover, this information requires a considerable amount of effort to be analyzed. The second approach has the drawback of its lack of scalability when new patterns need to be detected. To overcome these limitations, this paper proposes an environmental Wireless Acoustic Sensor Network architecture focused on use of generic descriptors based on an MPEG-7 standard. These descriptors demonstrate it to be suitable to be used in the recognition of different patterns, allowing a high scalability. The proposed parameters have been tested to recognize different behaviors of two anuran species that live in Spanish natural parks; the Epidalea calamita and the Alytes obstetricans toads, demonstrating to have a high classification performance.

  15. Virtual reality in mining: VR worlds, VR simulators, VR team training; Virtuelle Realitaet im Bergbau: VR Welten, VR Simulatoren und VR Teamtraining

    Energy Technology Data Exchange (ETDEWEB)

    Schmid, M. [DMT GmbH, Essen (Germany). Geschaeftsbereich Bergbau Service

    2005-11-01

    VR is being applied increasingly in mining as many projects and indented projects show. Interactive and simulator training of staff and better information supply via 3D programs will contribute to making mining more profitable. (orig.)

  16. Voice integrated systems

    Science.gov (United States)

    Curran, P. Mike

    1977-01-01

    The program at Naval Air Development Center was initiated to determine the desirability of interactive voice systems for use in airborne weapon systems crew stations. A voice recognition and synthesis system (VRAS) was developed and incorporated into a human centrifuge. The speech recognition aspect of VRAS was developed using a voice command system (VCS) developed by Scope Electronics. The speech synthesis capability was supplied by a Votrax, VS-5, speech synthesis unit built by Vocal Interface. The effects of simulated flight on automatic speech recognition were determined by repeated trials in the VRAS-equipped centrifuge. The relationship of vibration, G, O2 mask, mission duration, and cockpit temperature and voice quality was determined. The results showed that: (1) voice quality degrades after 0.5 hours with an O2 mask; (2) voice quality degrades under high vibration; and (3) voice quality degrades under high levels of G. The voice quality studies are summarized. These results were obtained with a baseline of 80 percent recognition accuracy with VCS.

  17. The development of VR technology for nuclear industry applications

    Energy Technology Data Exchange (ETDEWEB)

    Lee, Yong Bum; Cho, Jai Wan; Lee, Nam Ho; Choi, Young Soo; Park, Soon Yong

    1998-01-01

    By searching the present condition of virtual reality technology of which researches were carried out not only abroad but also the country in nuclear power industry, we confirm the possibility of practical usage of VR in it. And as a fundamental research for applications of VR in nuclear power industry, gesture recognition for remote working and VR training system for severe working were performed. 1. A study on gesture recognition for remote working : The hand gesture recognition technology using visual signal and tactile magnetic sensor as a basic study for the introduction of task command and communication were performed. 2. A study on an construction of the virtual environment training system for the task in a severe condition: A construction of virtual reality training system for the tasks in a severe working condition was implemented. This system was intended to enhance the efficiency of actual tasks through advanced practicing the motion procedures those should be performed in a severe working condition where it is difficult to access for personnel. The motion information which is came from the sensors attached on trainers body was used for construction of the virtual environment through the computer graphic procedures. The VR training system has many merits relative to the conservative training method that was performed with mock-up which was made as the same size and shape as real component in nuclear power plant. (author). 27 refs., 21 tabs., 51 figs

  18. Voice, Schooling, Inequality, and Scale

    Science.gov (United States)

    Collins, James

    2013-01-01

    The rich studies in this collection show that the investigation of voice requires analysis of "recognition" across layered spatial-temporal and sociolinguistic scales. I argue that the concepts of voice, recognition, and scale provide insight into contemporary educational inequality and that their study benefits, in turn, from paying attention to…

  19. Voice Matching Using Genetic Algorithm

    Directory of Open Access Journals (Sweden)

    Abhishek Bal

    2014-03-01

    Full Text Available In this paper, the use of Genetic Algorithm (GA for voice recognition is described. The practical application of Genetic Algorithm (GA to the solution of engineering problem is a rapidly emerging approach in the field of control engineering and signal processing. Genetic algorithms are useful for searching a space in multi-directional way from large spaces and poorly defined space. Voice is a signal of infinite information. Digital processing of voice signal is very important for automatic voice recognition technology. Nowadays, voice processing is very much important in security mechanism due to mimicry characteristic. So studying the voice feature extraction in voice processing is very necessary in military, hospital, telephone system, investigation bureau and etc. In order to extract valuable information from the voice signal, make decisions on the process, and obtain results, the data needs to be manipulated and analyzed. In this paper, if the instant voice is not matched with same person’s reference voices in the database, then Genetic Algorithm (GA is applied between two randomly chosen reference voices. Again the instant voice is compared with the result of Genetic Algorithm (GA which is used, including its three main steps: selection, crossover and mutation. We illustrate our approach with different sample of voices from human in our institution.

  20. Face and Voice Recognition Algorithms of Sign-in System for Underground Coalmine%人脸与声音结合的矿井人员签到识别

    Institute of Scientific and Technical Information of China (English)

    王君; 李成武; 杨茜; 刘世森

    2012-01-01

    矿井时有安全事故发生,签到管理系统可及时、准确掌握人员出入人员状况,保障矿井安全生产,方便及时救援.针对传统签到管理系统用于矿井,遇到光线昏暗、人脸易附着粉尘、干扰噪音等因素影响,签到识别方法检测率低,提出了—种根据KL变换(Karhunen-Loeve Transform)和TAN分类(Tree-Augmented Naive Bayesian network)相结合的人脸识别,并辅以声音识别的方法.通过形态学滤波变换快速去掉大部分无用背景,使处理更快速,特征点更突出;自动根据具体环境选择图像识别或声音识别,使识别准确率更高.仿真结果表明:结合声音的系统识别方法既减小了计算复杂度,又提高了人员识别率,还增强了适应性.%Coalmine accidents happen sometimes. It is significant to know the accurate statement of the miners in coalmine or outside, which is convenient for rescue. When the traditional Sign—in Management System was used in coal mine, the system meets new problems, such as black, hazy face, etc. Aiming at this issue, this paper put forward a face recognition algorithm based on the combination of Karhunen—Loeve Transform and Tree—Augmented Naive Bayesian network classifier, which uses the morphological filtering to remove most of useless transform background quickly. In addition, the voice recognition method was addede to that algorithm which makes feature point more outstanding and identification more accuracy, according to the specific environment automatic selection of face recognition or voice recognition. The simulation shows that this algorithm not only reduces the computational complexity and improves the human face recognition rate, but also enhances the adaptability.

  1. Validity of jitter measures in non-quasi-periodic voices. Part I: perceptual and computer performances in cycle pattern recognition.

    Science.gov (United States)

    Dejonckere, Philippe; Schoentgen, Jean; Giordano, Andrea; Fraj, Samia; Bocchi, Leonardo; Manfredi, Claudia

    2011-07-01

    The limit of about 5% for reliable quantification of jitter in sustained vowels of dysphonic voices-a widely accepted guideline-deserves critical analysis. The present study pertains to the effect of experience and training on the perceptual (visual) capability of correctly identifying periods in (highly) perturbed signals, and to a comparison of the performance of several programs for voice analysis. Synthesized realistic vowels (/a:/) with exactly known jitter (2.7%-31.5%) are used as material. After selection and training, experienced raters demonstrate excellent agreement in correctly identifying periods up to high values of jitter put in. Perceptual rating outperforms all computer programs in accuracy. Most remain reliable up to 10% jitter; one of them correctly measures up to the highest level.

  2. Evaluation of an Intelligent Assistive Technology for Voice Navigation of Spreadsheets

    CERN Document Server

    Flood, Derek; Caffery, Fergal Mc; Bishop, Brian

    2008-01-01

    An integral part of spreadsheet auditing is navigation. For sufferers of Repetitive Strain Injury who need to use voice recognition technology this navigation can be highly problematic. To counter this the authors have developed an intelligent voice navigation system, iVoice, which replicates common spreadsheet auditing behaviours through simple voice commands. This paper outlines the iVoice system and summarizes the results of a study to evaluate iVoice when compared to a leading voice recognition technology.

  3. VR closure rates for two vocational models.

    Science.gov (United States)

    Fraser, Virginia V; Jones, Amanda M; Frounfelker, Rochelle; Harding, Brian; Hardin, Teresa; Bond, Gary R

    2008-01-01

    The Individual Placement and Support (IPS) model of supported employment is an evidence-based practice for individuals with psychiatric disabilities. To be financially viable, IPS programs require funding from the state-federal vocational rehabilitation (VR) system. However, some observers have questioned the compatibility of IPS and the VR system. Using a randomized controlled trial comparing IPS to a well-established vocational program called the Diversified Placement Approach (DPA), we examined rates of VR sponsorship and successful VR closures. We also describe the establishment of an active collaboration between a psychiatric rehabilitation agency and the state VR system to facilitate rapid VR sponsorship for IPS clients. Both IPS and DPA achieved a 44% rate of VR Status 26 closure when considering all clients entering the study. IPS and DPA averaged similar amount of time to achieve VR sponsorship. Time from vocational program entry to Status 26 was 51 days longer on average for IPS. Even though several IPS principles seem to run counter to VR practices, such as zero exclusion and rapid job search, we found IPS closure rates comparable to those for DPA, a vocational model that screens for readiness, provides prevocational preparation, and extensively uses agency-run businesses.

  4. Practical applications of interactive voice technologies: Some accomplishments and prospects

    Science.gov (United States)

    Grady, Michael W.; Hicklin, M. B.; Porter, J. E.

    1977-01-01

    A technology assessment of the application of computers and electronics to complex systems is presented. Three existing systems which utilize voice technology (speech recognition and speech generation) are described. Future directions in voice technology are also described.

  5. The expression and recognition of emotions in the voice across five nations: A lens model analysis based on acoustic features.

    Science.gov (United States)

    Laukka, Petri; Elfenbein, Hillary Anger; Thingujam, Nutankumar S; Rockstuhl, Thomas; Iraki, Frederick K; Chui, Wanda; Althoff, Jean

    2016-11-01

    This study extends previous work on emotion communication across cultures with a large-scale investigation of the physical expression cues in vocal tone. In doing so, it provides the first direct test of a key proposition of dialect theory, namely that greater accuracy of detecting emotions from one's own cultural group-known as in-group advantage-results from a match between culturally specific schemas in emotional expression style and culturally specific schemas in emotion recognition. Study 1 used stimuli from 100 professional actors from five English-speaking nations vocally conveying 11 emotional states (anger, contempt, fear, happiness, interest, lust, neutral, pride, relief, sadness, and shame) using standard-content sentences. Detailed acoustic analyses showed many similarities across groups, and yet also systematic group differences. This provides evidence for cultural accents in expressive style at the level of acoustic cues. In Study 2, listeners evaluated these expressions in a 5 × 5 design balanced across groups. Cross-cultural accuracy was greater than expected by chance. However, there was also in-group advantage, which varied across emotions. A lens model analysis of fundamental acoustic properties examined patterns in emotional expression and perception within and across groups. Acoustic cues were used relatively similarly across groups both to produce and judge emotions, and yet there were also subtle cultural differences. Speakers appear to have a culturally nuanced schema for enacting vocal tones via acoustic cues, and perceivers have a culturally nuanced schema in judging them. Consistent with dialect theory's prediction, in-group judgments showed a greater match between these schemas used for emotional expression and perception. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

  6. Designing (for) experiences in photorealistic VR environments

    Science.gov (United States)

    Carroll, Fiona

    2010-04-01

    This paper investigates the role of aesthetics in the design of "intended" experiences in photorealistic virtual reality (VR) environments. It is motivated by the very notion that the aesthetic potential of photorealistic VR content is, and continues to be, underestimated whilst the emphasis on the development of newer and more efficient visualisation technologies to create new and exciting VR experiences increases. Challenging this, the paper looks beyond the technological (and the more traditional human computer interaction approaches that have primarily focused on the performance and efficiency issues of the technology) in order to explore more human values and the experiential side of VR. It focuses on the design of an "engaged interaction" and in doing so, implements a comparative study to explore how the strategic patterning of the aesthetic elements (particularly colour) within a photorealistic VR environment can allow for the design of a certain experience. In conclusion, the paper demonstrates that aesthetics and the "engaged interaction" can play an important role in getting to the heart of the photorealistic VR "user" experience. It highlights how we might design for (i.e. suggest, coax and guide) an "intended" VR experience.

  7. Voice Disorders

    Science.gov (United States)

    Voice is the sound made by air passing from your lungs through your larynx, or voice box. In your larynx are your vocal cords, ... to make sound. For most of us, our voices play a big part in who we are, ...

  8. Every Voice

    Science.gov (United States)

    Patrick, Penny

    2008-01-01

    This article discusses how the author develops an approach that allows her students, who are part of the marginalized population, to learn the power of their own voices--not just their writing voices, but their oral voices as well. The author calls it "TWIST": Thoughts, Writing folder, Inquiring mind, Supplies, and Teamwork. It is where…

  9. Every Voice

    Science.gov (United States)

    Patrick, Penny

    2008-01-01

    This article discusses how the author develops an approach that allows her students, who are part of the marginalized population, to learn the power of their own voices--not just their writing voices, but their oral voices as well. The author calls it "TWIST": Thoughts, Writing folder, Inquiring mind, Supplies, and Teamwork. It is where…

  10. Voice restoration

    NARCIS (Netherlands)

    Hilgers, F.J.M.; Balm, A.J.M.; van den Brekel, M.W.M.; Tan, I.B.; Remacle, M.; Eckel, H.E.

    2010-01-01

    Surgical prosthetic voice restoration is the best possible option for patients to regain oral communication after total laryngectomy. It is considered to be the present "gold standard" for voice rehabilitation of laryngectomized individuals. Surgical prosthetic voice restoration, in essence, is alwa

  11. An Immersive VR System for Sports Education

    Science.gov (United States)

    Song, Peng; Xu, Shuhong; Fong, Wee Teck; Chin, Ching Ling; Chua, Gim Guan; Huang, Zhiyong

    The development of new technologies has undoubtedly promoted the advances of modern education, among which Virtual Reality (VR) technologies have made the education more visually accessible for students. However, classroom education has been the focus of VR applications whereas not much research has been done in promoting sports education using VR technologies. In this paper, an immersive VR system is designed and implemented to create a more intuitive and visual way of teaching tennis. A scalable system architecture is proposed in addition to the hardware setup layout, which can be used for various immersive interactive applications such as architecture walkthroughs, military training simulations, other sports game simulations, interactive theaters, and telepresent exhibitions. Realistic interaction experience is achieved through accurate and robust hybrid tracking technology, while the virtual human opponent is animated in real time using shader-based skin deformation. Potential future extensions are also discussed to improve the teaching/learning experience.

  12. Speech Recognition: How Do We Teach It?

    Science.gov (United States)

    Barksdale, Karl

    2002-01-01

    States that growing use of speech recognition software has made voice writing an essential computer skill. Describes how to present the topic, develop basic speech recognition skills, and teach speech recognition outlining, writing, proofreading, and editing. (Contains 14 references.) (SK)

  13. Critical illness VR rehabilitation device (X-VR-D): evaluation of the potential use for early clinical rehabilitation.

    NARCIS (Netherlands)

    Meent, H. van de; Baken, B.C.M.; Opstal, S Van; Hogendoorn, P.

    2008-01-01

    We present a new critical illness VR rehabilitation device (X-VR-D) that enables diversified self-training and is applicable early in the rehabilitation of severely injured or ill patients. The X-VR-D consists of a VR program delivering a virtual scene on a flat screen and simultaneously processing

  14. Scientific sketching for collaborative VR visualization design.

    Science.gov (United States)

    Keefe, Daniel F; Acevedo, Daniel; Miles, Jadrian; Drury, Fritz; Swartz, Sharon M; Laidlaw, David H

    2008-01-01

    We present four studies investigating tools and methodologies for artist-scientist-technologist collaboration in designing multivariate, virtual reality (VR) visualizations. Design study 1 identifies the promise of 3D drawing-style interfaces for VR design and also establishes limitations of these tools with respect to precision and support for animation. Design study 2 explores animating artist-created visualization designs with scientific 3D fluid flow data. While results captured an accurate sense of flow that was advantageous as compared to the results of study 1, the potential for visual exploration using the design tools tested was limited. Design study 3 reveals the importance of a new 3D interface that overcomes the precision limitation found in study 1 while remaining accessible to artist collaborators. Drawing upon previous results, design study 4 engages collaborative teams in a design process that begins with traditional paper sketching and moves to animated, interactive, VR prototypes "sketched" by designers in VR using interactive 3D tools. Conclusions from these four studies identify important characteristics of effective artist-accessible VR visualization design tools and lead to a proposed formalized methodology for successful collaborative design that we expect to be useful in guiding future collaborations. We call this proposed methodology Scientific Sketching.

  15. VR Medical Gamification for Training and Education.

    Science.gov (United States)

    Nicola, Stelian; Virag, Ioan; Stoicu-Tivadar, Lăcrămioara

    2017-01-01

    The new virtual reality based medical applications is providing a better understanding of healthcare related subjects for both medical students and physicians. The work presented in this paper underlines gamification as a concept and uses VR as a new modality to study the human skeleton. The team proposes a mobile Android platform application based on Unity 5.4 editor and Google VR SDK. The results confirmed that the approach provides a more intuitive user experience during the learning process, concluding that the gamification of classical medical software provides an increased interactivity level for medical students during the study of the human skeleton.

  16. Design of a Virtual Reality System for Affect Analysis in Facial Expressions (VR-SAAFE); application to schizophrenia.

    Science.gov (United States)

    Bekele, Esubalew; Bian, Dayi; Peterman, Joel; Park, Sohee; Sarkar, Nilanjan

    2016-07-14

    Schizophrenia is a life-long, debilitating psychotic disorder with poor outcome that affects about 1% of the population. Although pharmacotherapy can alleviate some of the acute psychotic symptoms, residual social impairments present a significant barrier that prevents successful rehabilitation. With limited resources and access to social skills training opportunities, innovative technology has emerged as a potentially powerful tool for intervention. In this paper, we present a novel virtual reality (VR)-based system for understanding facial emotion processing impairments that may lead to poor social outcome in schizophrenia. We henceforth call it a VR System for Affect Analysis in Facial Expressions (VR-SAAFE). This system integrates a VR-based task presentation platform that can minutely control facial expressions of an avatar with or without accompanying verbal interaction, with an eye-tracker to quantitatively measure a participants real-time gaze and a set of physiological sensors to infer his/her affective states to allow in-depth understanding of the emotion recognition mechanism of patients with schizophrenia based on quantitative metrics. A usability study with 12 patients with schizophrenia and 12 healthy controls was conducted to examine processing of the emotional faces. Preliminary results indicated that there were significant differences in the way patients with schizophrenia processed and responded towards the emotional faces presented in the VR environment compared with healthy control participants. The preliminary results underscore the utility of such a VR-based system that enables precise and quantitative assessment of social skill deficits in patients with schizophrenia.

  17. Immersive STEM: From Fulldome to VR Technologies

    Science.gov (United States)

    Wyatt, R. J.

    2015-12-01

    For more than 15 years, fulldome video technology has transformed planetariums worldwide, using data-driven visualizations to support science storytelling. Fulldome video shares significant technical infrastructure with emerging VR headset technologies, and these personalized VR experiences allow for new audiences and new experiences of an existing library of context—as well as affording new opportunities for fulldome producers to explore. At the California Academy of Sciences, we are translating assets for our planetarium shows into immersive experiences for a variety of HR headsets. We have adapted scenes from our four award-wining features—Fragile Planet (2008), Life: A Cosmic Story (2010), Earthquake: Evidence of a Restless Planet (2012), and Habitat Earth (2015)—to place viewers inside a virtual planetarium viewing the shows. Similarly, we have released two creative-commons mini-shows on various VR outlets. This presentation will also highlight content the Academy will make available from our upcoming 2016 planetarium show about asteroids, comets, and solar system origins, some of which has been formatted for a full four-pi-steradian perspective. The shared immersive environment of digital planetariums offers significant opportunities for education and affective engagement of STEM-hungry audiences—including students, families, and adults. With the advent of VR technologies, we can leverage the experience of fulldome producers and planetarium professionals to create personalized home experiences that allow new ways to experience their content.

  18. Marketing: A How-to Book for VR.

    Science.gov (United States)

    Spann, Jinny

    This guide, written for vocational rehabilitation (VR) agency policymakers and staff alike, deals with the concept of marketing from a VR perspective. Covered in the individual chapters of the guide are the meaning of the term marketing; a conceptual framework for marketing in a VR agency (product definition, target group definition, differential…

  19. Familiarity and Voice Representation: From Acoustic-Based Representation to Voice Averages

    Directory of Open Access Journals (Sweden)

    Maureen Fontaine

    2017-07-01

    Full Text Available The ability to recognize an individual from their voice is a widespread ability with a long evolutionary history. Yet, the perceptual representation of familiar voices is ill-defined. In two experiments, we explored the neuropsychological processes involved in the perception of voice identity. We specifically explored the hypothesis that familiar voices (trained-to-familiar (Experiment 1, and famous voices (Experiment 2 are represented as a whole complex pattern, well approximated by the average of multiple utterances produced by a single speaker. In experiment 1, participants learned three voices over several sessions, and performed a three-alternative forced-choice identification task on original voice samples and several “speaker averages,” created by morphing across varying numbers of different vowels (e.g., [a] and [i] produced by the same speaker. In experiment 2, the same participants performed the same task on voice samples produced by familiar speakers. The two experiments showed that for famous voices, but not for trained-to-familiar voices, identification performance increased and response times decreased as a function of the number of utterances in the averages. This study sheds light on the perceptual representation of familiar voices, and demonstrates the power of average in recognizing familiar voices. The speaker average captures the unique characteristics of a speaker, and thus retains the information essential for recognition; it acts as a prototype of the speaker.

  20. Keeping Your Voice Healthy

    Science.gov (United States)

    ... Find an ENT Doctor Near You Keeping Your Voice Healthy Keeping Your Voice Healthy Patient Health Information ... heavily voice-related. Key Steps for Keeping Your Voice Healthy Drink plenty of water. Moisture is good ...

  1. Multimodal recognition of emotions

    NARCIS (Netherlands)

    Datcu, D.

    2009-01-01

    This thesis proposes algorithms and techniques to be used for automatic recognition of six prototypic emotion categories by computer programs, based on the recognition of facial expressions and emotion patterns in voice. Considering the applicability in real-life conditions, the research is carried

  2. Costume style from Vrčin

    Directory of Open Access Journals (Sweden)

    Radojičić Dragana

    2006-01-01

    Full Text Available Suburbs in general are very much affected by the distance from urban centers and its internal changes. Belgrade, for example, has, in the past two centuries, undergone major changes in forms of territorial expansion and population growth. Vrčin so became a suburb of Belgrade and an important crossroad. This paper analyses the changes which transformed Vrčin from rural settlement into suburb, using variables such as industry, migrations and accompanying influences to its daily life, including dress habits of the Vrčin population. A general tendency to transform from rural to urban is present, along with an effort to attain a bio-rhythm of the bigger city. The scope of this paper is limited to only a few sources, which, nevertheless picture this general tendency. The process of abandoning a traditional dress has begun long before 1970’s and has reached its final phase at present. The rural population gradually abandoned the traditional costume, and adopted urban way of dressing, most of the time mixing the two. It is evident that only elderly individuals have kept parts of the traditional dress and jewelry, while so called middle and young generations dress in the latest urban fashion style. This is especially evident in women’s style of dress brought about with a new role they entered during the years after WW II employment outside home and the city’s influences. In conclusion, dress mode in Vrčin was determined by general social, economic and historical events that took place.

  3. Using VR in Learning and Teaching

    OpenAIRE

    Cocking, Chris

    2011-01-01

    This paper investigates the use of a Virtual Reality (VR) simulation (The Underground Station Evacuation Simulator- USES) that was initially developed to explore mass emergency behaviour experimentally (see Drury & Cocking, 2007). However, since January 2009, I have been using the USES to teach Research Methods to 1st year Undergraduate Psychology Students, and realised that it can also be useful in teaching students about theories relating to the psychology of crowd behaviour. Therefore, I i...

  4. Neutron imaging on the VR-1 reactor

    Science.gov (United States)

    Crha, J.; Sklenka, L.; Soltes, J.

    2016-09-01

    Training reactor VR-1 is a low power research reactor with maximal thermal power of 1 kW. The reactor is operated by the Faculty of Nuclear Science and Physical Engineering of the Czech Technical University in Prague. Due to its low power it suits as a tool for education of university students and training of professionals. In 2015, as part of student research project, neutron imaging was introduced as another type of reactor utilization. The low available neutron flux and the limiting spatial and construction capabilities of the reactor's radial channel led to the development of a special filter/collimator insertion inside the channel and choosing a nonstandard approach by placing a neutron imaging plate inside the channel. The paper describes preliminary experiments carried out on the VR-1 reactor which led to first radiographic images. It seems, that due to the reactor construction and low reactor power, the neutron imaging technique on the VR-1 reactor is feasible mainly for demonstration or educational and training purposes.

  5. A Unique Wavelet Steganography Based Voice Biometric Protection Scheme

    Directory of Open Access Journals (Sweden)

    Sanjaypande M. B

    2013-03-01

    Full Text Available Voice biometric is an easy and cost effective biometric technique which requires minimalistic hardware and software complexity. General voice biometric needs a voice phrase by user which is processed with Mel Filter and Vector Quantized features are extracted. Vector quantization reduces the codebook size but decreases the accuracy of recognition. Therefore we propose a voice biometric system where voice file's non quantized code books are matched with spoken phrase. In order to ensure security to such direct voice sample we embed the voice file in a randomly selected image using DWT technique. Imposters are exposed to only images and are unaware of the voice files. We show that the technique produces better efficiency in comparison to VQ based technique.

  6. Voice-related quality of life in the pediatric population: validation of the Brazilian version of the Pediatric Voice-Related Quality-of Life survey.

    Science.gov (United States)

    Ribeiro, Lívia Lima; Paula, Kely Maria Pereira de; Behlau, Mara

    2014-01-01

    To measure the voice-related quality of life in children/adolescents with vocal complaints through the validation of the Brazilian Pediatric Voice-Related Quality-of-Life survey - VR-QOL-P (Qualidade de Vida em Voz Pediátrico - QVV-P), to verify whether the presence of vocal complaints interfere with the quality of life of children/adolescents, and to determine the relationship between the vocal assessment carried out by parents and the VR-QOL-P scores. The participants included 246 parents of children/adolescents of both sexes, aged between 2 years and 18 years (divided into preschoolers, schoolers and adolescents), with and without vocal complaints. All participants signed the informed consent form. Translation, linguistic and cultural adaptation, assessment of cultural equivalence, implementation of the protocol in its final version, voice assessment by parents, demographic and clinical descriptive statistical analysis of the population, individual analysis of the items, verification of psychometric measures of validation, reliability, reproducibility and responsiveness of the instrument to treatment, were carried out. Low scores, especially in the physical domain, were found in subjects with vocal complaints. Among those, adolescents suffered the greatest impact. The social-emotional domain was not sensitive in preschoolers. There was a correlation among the overall, social-emotional and physical scores, and the vocal assessment performed by parents. The VR-QOL-P was reliable, reproducible and responsive to voice problems. Voice change interferes with the quality of life of children/adolescents, and there is a relationship between the assessment of voice quality and VR-QOL-P scores - the older the individual, the worse the quality of life in aspects related to voice, especially in the physical domain, and the better the vocal quality, as perceived by the parents.

  7. Scientific Bases of Human-Machine Communication by Voice

    Science.gov (United States)

    Schafer, Ronald W.

    1995-10-01

    The scientific bases for human-machine communication by voice are in the fields of psychology, linguistics, acoustics, signal processing, computer science, and integrated circuit technology. The purpose of this paper is to highlight the basic scientific and technological issues in human-machine communication by voice and to point out areas of future research opportunity. The discussion is organized around the following major issues in implementing human-machine voice communication systems: (i) hardware/software implementation of the system, (ii) speech synthesis for voice output, (iii) speech recognition and understanding for voice input, and (iv) usability factors related to how humans interact with machines.

  8. Development of VR simulator for nurse training

    Science.gov (United States)

    Nakagawa, Y.; Tsetserukou, D.; Terashima, K.

    2014-02-01

    Our research focuses on the development of the VR simulator NurseSim to train nurse and hospital aides how to carry unconscious or injured person. The motivation behind this project is the fact that nurses consider patient lifting, transfer, and turning as most physically demanding. The user experiences the 3D environment in which they hold the subject. The task is to maintain such posture so that to prevent further injuries of the patient and distribute the weight of the patient over both hands evenly. Nurses are taught to mitigate and manage fatigue while at work.

  9. 基于BP和ARM的发动机声音识别系统%Voice recognition engine based on BP's system in realization of ARM

    Institute of Scientific and Technical Information of China (English)

    姜愉

    2012-01-01

    Aimed at addressing automatic fee charging of highway toll stations and large-scale re- chargeable parking lots, this paper introduces the design of a embedded speech recognition system based on ARM9 and embedded Linux system of the engine sound by analyzing the BP neural network recognition theory. The design consisting of S3C2410 microprocessors and Linux operating systems involves trans- planting the C language of speech recognition program to the embedded Linux operating system's file system when cross-compiled. The paper describes the system s hardware and software framework, and offers the experiments results produced by real-time recognition of the car type by the engine sound. The results prove its accuracy, real-time and validity.%为解决高速公路收费站及大型停车收费场自动收费问题,依据BP神经网络识别理论,设计了一个基于ARM9及嵌入式Linux系统的发动机声音识别系统。选用S3C2410微处理器和嵌入式Linux操作系统,把交叉编译后的发动机声音识别C语言程序移植到操作系统的文件中,实现了发动机声音实时识别功能,给出了系统整体软硬件结构框架以及实时输入发动机声音判别汽车类型的识别结果。现场实验证实了该系统的准确性、实时性和有效性。

  10. Improved sensitivity of wearable nanogenerators made of electrospun Eu3+ doped P(VDF-HFP)/graphene composite nanofibers for self-powered voice recognition

    Science.gov (United States)

    Adhikary, Prakriti; Biswas, Anirban; Mandal, Dipankar

    2016-12-01

    Composite nanofibers of Eu3+ doped poly(vinylidene fluoride-co-hexafluoropropylene) (P(VDF-HFP))/graphene are prepared by the electrospinning technique for the fabrication of ultrasensitive wearable piezoelectric nanogenerators (WPNGs) where the post-poling technique is not necessary. It is found that the complete conversion of the piezoelectric β-phase and the improvement of the degree of crystallinity is governed by the incorporation of Eu3+ and graphene sheets into P(VDF-HFP) nanofibers. The flexible nanocomposite fibers are associated with a hypersensitive electronic transition that results in an intense red light emission, and WPNGs also have the capability of detecting external pressure as low as ~23 Pa with a higher degree of acoustic sensitivity, ~11 V Pa-1, than has ever been previously reported. This means that ultrasensitive WPNGs can be utilized to recognize human voices, which suggests they could be a potential tool in the biomedical and national security sectors. The capacitor’s ability to charge from abundant environmental vibrations, such as music, wind, body motion, etc, drives WPNGs as a power source for portable electronics. This fact may open up the prospect of using the Eu3+ doped P(VDF-HFP)/graphene composite electrospun nanofibers, with their multifunctional properties such as vibration sensitivity, wearability, red light emission capability and piezoelectric energy harvesting, for various promising applications in portable electronics, health care monitoring, noise detection and security monitoring.

  11. Speaker Recognition

    DEFF Research Database (Denmark)

    Mølgaard, Lasse Lohilahti; Jørgensen, Kasper Winther

    2005-01-01

    Speaker recognition is basically divided into speaker identification and speaker verification. Verification is the task of automatically determining if a person really is the person he or she claims to be. This technology can be used as a biometric feature for verifying the identity of a person...... in applications like banking by telephone and voice mail. The focus of this project is speaker identification, which consists of mapping a speech signal from an unknown speaker to a database of known speakers, i.e. the system has been trained with a number of speakers which the system can recognize....

  12. 20 CFR 411.385 - What does a State VR agency do if a beneficiary who is eligible for VR services has a ticket that...

    Science.gov (United States)

    2010-04-01

    ... 20 Employees' Benefits 2 2010-04-01 2010-04-01 false What does a State VR agency do if a beneficiary who is eligible for VR services has a ticket that is available for assignment or reassignment? 411... a State VR agency do if a beneficiary who is eligible for VR services has a ticket that is available...

  13. Mobile VR in Education: From the Fringe to the Mainstream

    Science.gov (United States)

    Cochrane, Thomas

    2016-01-01

    This paper explores the development of virtual reality (VR) use in education and the emergence of mobile VR based content creation and sharing as a platform for enabling learner-generated content and learner-generated contexts. The author argues that an ecology of resources that maps the user content creation and sharing affordances of mobile…

  14. New-Level VR Marine Training System

    Institute of Scientific and Technical Information of China (English)

    LIU; Xiu-wen; JIV; Yi-cheng; YIN; Yong; LI; Zhi-hua

    2007-01-01

    Marine simulator is a valuable and effective medium for maritime education and training and is becoming widespread to foster the qualified and competent personnel for working in the maritime industry. However, many problems with most marine simulation systems today limit their pervasive use: the limited accessibility, immobility, the high expenditure, the long development cycle, and the difficult maintenance etc. A web-based maritime training environment is proposed in this paper as a new marine training level, which fully combining web technology, VR, high fidelity simulation, and e-learning tools to create a more costeffective and flexible training environment still with very realistic and high-interactive simulation characteristics, the system will provide a consistent and more flexible training medium for improving maritime education and training.

  15. The Glasgow Voice Memory Test: Assessing the ability to memorize and recognize unfamiliar voices.

    Science.gov (United States)

    Aglieri, Virginia; Watson, Rebecca; Pernet, Cyril; Latinus, Marianne; Garrido, Lúcia; Belin, Pascal

    2017-02-01

    One thousand one hundred and twenty subjects as well as a developmental phonagnosic subject (KH) along with age-matched controls performed the Glasgow Voice Memory Test, which assesses the ability to encode and immediately recognize, through an old/new judgment, both unfamiliar voices (delivered as vowels, making language requirements minimal) and bell sounds. The inclusion of non-vocal stimuli allows the detection of significant dissociations between the two categories (vocal vs. non-vocal stimuli). The distributions of accuracy and sensitivity scores (d') reflected a wide range of individual differences in voice recognition performance in the population. As expected, KH showed a dissociation between the recognition of voices and bell sounds, her performance being significantly poorer than matched controls for voices but not for bells. By providing normative data of a large sample and by testing a developmental phonagnosic subject, we demonstrated that the Glasgow Voice Memory Test, available online and accessible from all over the world, can be a valid screening tool (~5 min) for a preliminary detection of potential cases of phonagnosia and of "super recognizers" for voices.

  16. Speaking and Nonspeaking Voice Professionals: Who Has the Better Voice?

    Science.gov (United States)

    Chitguppi, Chandala; Raj, Anoop; Meher, Ravi; Rathore, P K

    2017-04-18

    Voice professionals can be classified into two major subgroups: the primarily speaking and the primarily nonspeaking voice professionals. Nonspeaking voice professionals mainly include singers, whereas speaking voice professionals include the rest of the voice professionals. Although both of these groups have high vocal demands, it is currently unknown whether both groups show similar voice changes after their daily voice use. Comparison of these two subgroups of voice professionals has never been done before. This study aimed to compare the speaking voice of speaking and nonspeaking voice professionals with no obvious vocal fold pathology or voice-related complaints on the day of assessment. After obtaining relevant voice-related history, voice analysis and videostroboscopy were performed in 50 speaking and 50 nonspeaking voice professionals. Speaking voice professionals showed significantly higher incidence of voice-related complaints as compared with nonspeaking voice professionals. Voice analysis revealed that most acoustic parameters including fundamental frequency, jitter percent, and harmonic-to-noise ratio were significantly higher in speaking voice professionals, whereas videostroboscopy did not show any significant difference between the two groups. This is the first study of its kind to analyze the effect of daily voice use in the two subgroups of voice professionals with no obvious vocal fold pathology. We conclude that voice professionals should not be considered as a homogeneous group. The detrimental effects of excessive voice use were observed to occur more significantly in speaking voice professionals than in nonspeaking voice professionals. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  17. Leveraging voice

    DEFF Research Database (Denmark)

    Frølunde, Lisbeth

    2017-01-01

    researchers improve our practices and how could digital online video help offer more positive stories about research and higher education? How can academics in higher education be better to tell about our research, thereby reclaiming and leveraging our voice in a post-factual era? As higher education......This paper speculates on how researchers share research without diluting our credibility and how to make strategies for the future. It also calls for consideration of new traditions and practices for communicating knowledge to a wider audience across multiple media platforms. How might we...... continues to engage with digital and networked technologies it becomes increasingly relevant to question why and how academics could (re) position research knowledge in the digital and online media landscape of today and the future. The paper highlights methodological issues that arise in relation...

  18. Feeling voices.

    Directory of Open Access Journals (Sweden)

    Paolo Ammirante

    Full Text Available Two experiments investigated deaf individuals' ability to discriminate between same-sex talkers based on vibrotactile stimulation alone. Nineteen participants made same/different judgments on pairs of utterances presented to the lower back through voice coils embedded in a conforming chair. Discrimination of stimuli matched for F0, duration, and perceived magnitude was successful for pairs of spoken sentences in Experiment 1 (median percent correct = 83% and pairs of vowel utterances in Experiment 2 (median percent correct = 75%. Greater difference in spectral tilt between "different" pairs strongly predicted their discriminability in both experiments. The current findings support the hypothesis that discrimination of complex vibrotactile stimuli involves the cortical integration of spectral information filtered through frequency-tuned skin receptors.

  19. Comparison of post menopausal voice changes across professional and non-professional users of the voice

    Directory of Open Access Journals (Sweden)

    Pallavi Vishwas Sovani

    2010-12-01

    Full Text Available Menopause effects a permanent change in certain body functions, one of them being voice. Moreover, if the voice is used continuously as a part of one’s occupation, this may further impact postmenopausal voice changes. The present study investigated the impact of menopause and professional voice use, and their interaction effect, on the voice. 92 women were classified into reproductive (52 and postmenopausal (40. Each group was divided into Level II (teachers and Level IV (clerks of Koufman and Isaacson’s (1991 classification. Acoustic parameters were analyzed using the VisiPitch III software. Aerodynamic parameters were manually calculated. The VHI (Voice Handicap Index was also included to improve the face validity of the study. Results suggest that Fo, SFo and MPT reduce post menopause while NHR and VTI increase. Some changes are accelerated in teachers as compared to clerks while some are decelerated. VHI scores of teachers are significantly greater than clerks, though not significantly different across menopause. Thus the presence or absence of voice use in one’s profession differentially affects postmenopausal changes. The study has implications in improving the condition of teachers in India, developing norms for menopausal changes and modifying allowable limits for voice recognition systems in future.

  20. Integrating Virtual Reality (VR) into traditional instructional design ...

    African Journals Online (AJOL)

    PROF. OLIVER OSUAGWA

    2015-12-01

    Dec 1, 2015 ... this technology and presented how to integrat VR with traditional instructional technology. Key words: ... forced companies to turn to new and emerging technologies in the area ... real space and virtual space. This implies the.

  1. 20 CFR 411.395 - Is a State VR agency required to provide periodic reports?

    Science.gov (United States)

    2010-04-01

    ... 20 Employees' Benefits 2 2010-04-01 2010-04-01 false Is a State VR agency required to provide... § 411.395 Is a State VR agency required to provide periodic reports? (a) For cases where a State VR agency provided services functioning as an EN, the State VR agency will be required to prepare periodic...

  2. Human Emotional State and its Relevance for Military VR Training

    Science.gov (United States)

    2005-07-01

    experience of military VR simulation training: human emotional state. Anxiety is a common emotional state in military operating environments. Real world...anecdotally be seen when repeated exposure to stressful military training leads to a gradual decline in anxiety responses as the trainee learns to “manage...Ample evidence for such a habituation process can be seen in the fledgling VR/mental health field whereby phobic patients are able to effectively face

  3. The illusion of presence influences VR distraction: effects on cold-pressor pain.

    Science.gov (United States)

    Gutierrez-Martinez, Olga; Gutierrez-Maldonado, Jose; Cabas-Hoyos, Kattia; Loreto, Desirée

    2010-01-01

    This study investigated whether VR presence influences how effectively VR distraction reduces pain intensity during a cold-pressor experience. Thirty-seven healthy students underwent a cold pressor task while interacting with a VR distraction world. After the VR cold-pressor experience, each subject provided VAS ratings of the most intense pain experienced during the hand immersion and rated their illusion of having been inside the virtual world. Results showed that the amount of VR presence reported correlated significantly and negatively with ratings of pain intensity. The importance of using an appropriately designed VR to achieve effective VR analgesia is highlighted.

  4. Significance of lead aVR in acute coronary syndrome

    Institute of Scientific and Technical Information of China (English)

    Akira; Tamura

    2014-01-01

    The 12-lead electrocardiogram(ECG)is a crucial tool in the diagnosis and risk stratification of acute coronary syndrome(ACS).Unlike other 11 leads,lead aVR has been long neglected until recent years.However,recent investigations have shown that an analysis of ST-segment shift in lead aVR provides useful information on the coronary angiographic anatomy and risk stratification in ACS.ST-segment elevation in lead aVR can be caused by(1)transmural ischemia in the basal part of the interventricular septum caused by impaired coronary blood flow of the first major branch originating from the left anterior descending coronary artery;(2)transmural ischemia in the right ventricular outflow tract caused by impaired coronary blood flow of the large conal branch originating from the right coronary artery;and(3)reciprocal changes opposite to ischemic or non-ischemic ST-segment depression in the lateral limb and precordial leads.On the other hand,ST-segment depression in lead aVR can be caused by transmural ischemia in the inferolateral and apical regions.It has been recently shown that an analysis of T wave in lead aVR also provides useful prognostic information in the general population and patients with prior myocardial infarction.Cardiologists should pay more attention to the tracing of lead aVR when interpreting the12-lead ECG in clinical practice.

  5. Fractal Dimension of Voice-Signal Waveforms

    Institute of Scientific and Technical Information of China (English)

    2002-01-01

    The fractal dimension is one important parameter that characterizes waveforms. In this paper, we derive a new method to calculate fractal dimension of digital voice-signal waveforms. We show that fractal dimension is an efficient tool for speaker recognition or speech recognition. It can be used to identify different speakers or distinguish speech. We apply our results to Chinese speaker recognition and numerical experiment shows that fractal dimension is an efficient parameter to characterize individual Chinese speakers. We have developed a semiautomatic voiceprint analysis system based on the theory of this paper and former researches.

  6. Perceiving a stranger's voice as being one's own: a 'rubber voice' illusion?

    Directory of Open Access Journals (Sweden)

    Zane Z Zheng

    Full Text Available We describe an illusion in which a stranger's voice, when presented as the auditory concomitant of a participant's own speech, is perceived as a modified version of their own voice. When the congruence between utterance and feedback breaks down, the illusion is also broken. Compared to a baseline condition in which participants heard their own voice as feedback, hearing a stranger's voice induced robust changes in the fundamental frequency (F0 of their production. Moreover, the shift in F0 appears to be feedback dependent, since shift patterns depended reliably on the relationship between the participant's own F0 and the stranger-voice F0. The shift in F0 was evident both when the illusion was present and after it was broken, suggesting that auditory feedback from production may be used separately for self-recognition and for vocal motor control. Our findings indicate that self-recognition of voices, like other body attributes, is malleable and context dependent.

  7. Dimensionality in voice quality.

    Science.gov (United States)

    Bele, Irene Velsvik

    2007-05-01

    This study concerns speaking voice quality in a group of male teachers (n = 35) and male actors (n = 36), as the purpose was to investigate normal and supranormal voices. The goal was the development of a method of valid perceptual evaluation for normal to supranormal and resonant voices. The voices (text reading at two loudness levels) had been evaluated by 10 listeners, for 15 vocal characteristics using VA scales. In this investigation, the results of an exploratory factor analysis of the vocal characteristics used in this method are presented, reflecting four dimensions of major importance for normal and supranormal voices. Special emphasis is placed on the effects on voice quality of a change in the loudness variable, as two loudness levels are studied. Furthermore, the vocal characteristics Sonority and Ringing voice quality are paid special attention, as the essence of the term "resonant voice" was a basic issue throughout a doctoral dissertation where this study was included.

  8. Voice box (image)

    Science.gov (United States)

    The larynx, or voice box, is located in the neck and performs several important functions in the body. The larynx is involved in swallowing, breathing, and voice production. Sound is produced when the air which ...

  9. Voice and Aging

    Science.gov (United States)

    ... dramatic voice changes are those during childhood and adolescence. The larynx (or voice box) and vocal cord tissues do not fully mature until late teenage years. Hormone-related changes during adolescence are ...

  10. 20 CFR 411.380 - What does a State VR agency do if the State VR agency wants to determine whether a person seeking...

    Science.gov (United States)

    2010-04-01

    ... 20 Employees' Benefits 2 2010-04-01 2010-04-01 false What does a State VR agency do if the State VR agency wants to determine whether a person seeking services has a ticket? 411.380 Section 411.380... Vocational Rehabilitation Agencies' Participation Ticket Status § 411.380 What does a State VR agency do if...

  11. Voice and endocrinology

    OpenAIRE

    KVS Hari Kumar; Anurag Garg; Ajai Chandra, N. S.; Singh, S. P.; Rakesh Datta

    2016-01-01

    Voice is one of the advanced features of natural evolution that differentiates human beings from other primates. The human voice is capable of conveying the thoughts into spoken words along with a subtle emotion to the tone. This extraordinary character of the voice in expressing multiple emotions is the gift of God to the human beings and helps in effective interpersonal communication. Voice generation involves close interaction between cerebral signals and the peripheral apparatus consistin...

  12. Voice-Controlled Artificial Handspeak System

    Directory of Open Access Journals (Sweden)

    Carlo Fonda

    2014-01-01

    Full Text Available A man-machine interaction project is described which aims to establish an automated voice to sign language translator for communication with the deaf using integrated open technologies. The first prototype consists of a robotic hand designed with OpenSCAD and manufactured with a low-cost 3D printer ─which smoothly reproduces the alphabet of the sign language controlled by voice only. The core automation comprises an Arduino UNO controller used to activate a set of servo motors that follow instructions from a Raspberry Pi mini-computer having installed the open source speech recognition engine Julius. We discuss its features, limitations and possible future developments.

  13. Writing with Voice

    Science.gov (United States)

    Kesler, Ted

    2012-01-01

    In this Teaching Tips article, the author argues for a dialogic conception of voice, based in the work of Mikhail Bakhtin. He demonstrates a dialogic view of voice in action, using two writing examples about the same topic from his daughter, a fifth-grade student. He then provides five practical tips for teaching a dialogic conception of voice in…

  14. Tips for Healthy Voices

    Science.gov (United States)

    ... social interaction as well as for most people’s occupation. Proper care and use of your voice will give you the best chance for having a healthy voice for your entire lifetime. Hoarseness or roughness in your voice is often ...

  15. Promotion of emotional wellbeing in oncology inpatients using VR.

    Science.gov (United States)

    Espinoza, Macarena; Baños, Rosa M; García-Palacios, Azucena; Cervera, José M; Esquerdo, Gaspar; Barrajón, Enrique; Botella, Cristina

    2012-01-01

    In Psycho-oncology, VR has been utilized mainly to manage pain and distress associated to medical procedures and chemotherapy, with very few applications aimed at promotion of wellbeing in hospitalized patients. Considering this, it was implemented a psychological intervention that uses VR to induce positive emotions on adult oncology inpatients with the purpose of evaluating its utility to improve emotional wellbeing in this population. Sample was composed of 33 patients (69.7% men, aged from 41 to 85 years old; X=62.1; SD=10.77). Intervention lasted 4 sessions of 30 minutes, along one week. In these sessions, two virtual environments designed to induce joy or relaxation were used. Symptoms of depression and anxiety (Hospital Anxiety and Depression Scale, HADS) and level of happiness (Fordyce Scale) were assessed before and after the VR intervention. Also, Visual Analogue Scales (VAS) were used to assess emotional state and physical discomfort before and after each session. There were significant improvements in distress and level of happiness after the VR intervention. Also, it was detected an increment in positive emotions and a decrease in negative emotions after sessions. Results emphasize the potential of VR as a positive technology that can be used to promote wellbeing during hospitalization, especially considering the shortness of the intervention and the advanced state of disease of the participants. Despite the encouraging of these results, it is necessary to confirm them in studies with larger samples and control groups.

  16. Voice Recognition Accuracy: What Is Acceptable?

    Science.gov (United States)

    1982-11-01

    127. Modesto Sumter 128. Worchester Catskills 129. Huntsville Janesville 130. Waterville Osage Beach 131. Baton Rouge Phoenix 132. Marquette Billings...133. New Orleans Antingua 134. Walla Walla Modesto 135. Tupelo Augusta 136. Astoria Greinville 137. Catskills Bermuda 138. Atlanta Huntsville 139...78 Sort Sos 79 Type Up 80 Debug 81 Papa Alpha 82 Quebec Stack 83 Romeo Tango 84 Sierra Alpha 85 Tango Romeo 86 Uniform Nine 87 Victor

  17. Controlling An Electric Car Starter System Through Voice

    Directory of Open Access Journals (Sweden)

    A.B. Muhammad Firdaus

    2015-04-01

    Full Text Available Abstract These days automotive has turned into a stand out amongst the most well-known modes of transportation on the grounds that a large number of Malaysians could bear to have an auto. There are numerous decisions of innovations in auto that have in the market. One of the engineering is voice controlled framework. Voice Recognition is the procedure of consequently perceiving a certain statement talked by a specific speaker focused around individual data included in discourse waves. This paper is to make an car controlled by voice of human. An essential pre-processing venture in Voice Recognition systems is to recognize the vicinity of noise. Sensitivity to speech variability lacking recognition precision and helplessness to mimic are among the principle specialized obstacles that keep the far reaching selection of speech-based recognition systems. Voice recognition systems work sensibly well with a quiet conditions however inadequately under loud conditions or in twisted channels. The key focus of the project is to control an electric car starter system.

  18. Voice handicap in singers.

    Science.gov (United States)

    Murry, Thomas; Zschommler, Anne; Prokop, Jan

    2009-05-01

    The study aimed to determine the differences in responses to the Voice Handicap Index (VHI-10) between singers and nonsingers and to evaluate the ranked order differences of the VHI-10 statements for both groups. The VHI-10 was modified to include statements related to the singing voice for comparison to the original VHI-10. Thirty-five nonsingers with documented voice disorders responded to the VHI-10. A second group, consisting of 35 singers with voice complaints, responded to the VHI-10 with three statements added specifically addressing the singing voice. Data from both groups were analyzed in terms of overall subject self-rating of voice handicap and the rank order of statements from least to most important. The difference between the mean VHI-10 for the singers and nonsingers was not statistically significant, thus, supporting the validity of the VHI-10. However, the 10 statements were ranked differently in terms of their importance by both groups. In addition, when three statements related specifically to the singing voice were substituted in the original VHI-10, the singers judged their voice problem to be more severe than when using the original VHI-10. The type of statements used to assess self-perception of voice handicap may be related to the subject population. Singers with voice problems do not rate their voices to be more handicapped than nonsingers unless statements related specifically to singing are included.

  19. LABORATORY VOICE DATA ENTRY SYSTEM.

    Energy Technology Data Exchange (ETDEWEB)

    PRAISSMAN,J.L.SUTHERLAND,J.C.

    2003-04-01

    We have assembled a system using a personal computer workstation equipped with standard office software, an audio system, speech recognition software and an inexpensive radio-based wireless microphone that permits laboratory workers to enter or modify data while performing other work. Speech recognition permits users to enter data while their hands are holding equipment or they are otherwise unable to operate a keyboard. The wireless microphone allows unencumbered movement around the laboratory without a ''tether'' that might interfere with equipment or experimental procedures. To evaluate the potential of voice data entry in a laboratory environment, we developed a prototype relational database that records the disposal of radionuclides and/or hazardous chemicals Current regulations in our laboratory require that each such item being discarded must be inventoried and documents must be prepared that summarize the contents of each container used for disposal. Using voice commands, the user enters items into the database as each is discarded. Subsequently, the program prepares the required documentation.

  20. METHODS FOR QUALITY ENHANCEMENT OF USER VOICE SIGNAL IN VOICE AUTHENTICATION SYSTEMS

    Directory of Open Access Journals (Sweden)

    O. N. Faizulaieva

    2014-03-01

    Full Text Available The reasonability for the usage of computer systems user voice in the authentication process is proved. The scientific task for improving the signal/noise ratio of the user voice signal in the authentication system is considered. The object of study is the process of input and output of the voice signal of authentication system user in computer systems and networks. Methods and means for input and extraction of voice signal against external interference signals are researched. Methods for quality enhancement of user voice signal in voice authentication systems are suggested. As modern computer facilities, including mobile ones, have two-channel audio card, the usage of two microphones is proposed in the voice signal input system of authentication system. Meanwhile, the task of forming a lobe of microphone array in a desired area of voice signal registration (100 Hz to 8 kHz is solved. The usage of directional properties of the proposed microphone array gives the possibility to have the influence of external interference signals two or three times less in the frequency range from 4 to 8 kHz. The possibilities for implementation of space-time processing of the recorded signals using constant and adaptive weighting factors are investigated. The simulation results of the proposed system for input and extraction of signals during digital processing of narrowband signals are presented. The proposed solutions make it possible to improve the value of the signal/noise ratio of the useful signals recorded up to 10, ..., 20 dB under the influence of external interference signals in the frequency range from 4 to 8 kHz. The results may be useful to specialists working in the field of voice recognition and speaker’s discrimination.

  1. Equipment for neutron measurements at VR-1 Sparrow training reactor.

    Science.gov (United States)

    Kolros, Antonin; Huml, Ondrej; Kríz, Martin; Kos, Josef

    2010-01-01

    The VR-1 sparrow reactor is an experimental nuclear facility for training, student education and teaching purposes. The sparrow reactor is an educational platform for the basic experiments at the reactor physic and dosimetry. The aim of this article is to describe the new experimental equipment EMK310 features and possibilities for neutron detection by different gas filled detectors at VR-1 reactor. Among the EMK310 equipment typical attributes belong precise set-up, simple control, resistance to electromagnetic interference, high throughput (counting rate), versatility and remote controllability. The methods for non-linearity correction of pulse neutron detection system and reactimeter application are presented.

  2. Ageing Voices: The Effect of Changes in Voice Parameters on ASR Performance

    Directory of Open Access Journals (Sweden)

    Ravichander Vipperla

    2010-01-01

    Full Text Available With ageing, human voices undergo several changes which are typically characterized by increased hoarseness and changes in articulation patterns. In this study, we have examined the effect on Automatic Speech Recognition (ASR and found that the Word Error Rates (WER on older voices is 10% absolute higher compared to those of adult voices. Subsequently, we compared several voice source parameters including fundamental frequency, jitter, shimmer, harmonicity, and cepstral peak prominence of adult and older males. Several of these parameters show statistically significant difference for the two groups. However, artificially increasing jitter and shimmer measures do not effect the ASR accuracies significantly. Artificially lowering the fundamental frequency degrades the ASR performance marginally but this drop in performance can be overcome to some extent using Vocal Tract Length Normalisation (VTLN. Overall, we observe that the changes in the voice source parameters do not have a significant impact on ASR performance. Comparison of the likelihood scores of all the phonemes for the two age groups show that there is a systematic mismatch in the acoustic space of the two age groups. Comparison of the phoneme recognition rates show that mid vowels, nasals, and phonemes that depend on the ability to create constrictions with tongue tip for articulation are more affected by ageing than other phonemes.

  3. A Voice Operated Tour Planning System for Autonomous Mobile Robots

    Directory of Open Access Journals (Sweden)

    Charles V. Smith Iii

    2010-06-01

    Full Text Available Control systems driven by voice recognition software have been implemented before but lacked the context driven approach to generate relevant responses and actions. A partially voice activated control system for mobile robotics is presented that allows an autonomous robot to interact with people and the environment in a meaningful way, while dynamically creating customized tours. Many existing control systems also require substantial training for voice application. The system proposed requires little to no training and is adaptable to chaotic environments. The traversable area is mapped once and from that map a fully customized route is generated to the user

  4. FUNDAMENTALS OF SPEAKER RECOGNITION

    OpenAIRE

    ERTAŞ, Figen

    2000-01-01

    The explosive growth of information technology in the last decade has made a considerable impact on the design and construction of systems for human-machine communication, which is becoming increasingly important in many aspects of life. Amongst other speech processing tasks, a great deal of attention has been devoted to developing procedures that identify people from their voices, and the design and construction of speaker recognition systems has been a fascinating enterprise pursued over ma...

  5. Using the verona coding definitions of emotional sequences (VR-CoDES) and health provider responses (VR-CoDES-P) in the dental context.

    Science.gov (United States)

    Wright, Alice; Humphris, Gerry; Wanyonyi, Kristina L; Freeman, Ruth

    2012-10-01

    To show if cues, concerns and provider responses (defined in VR-CoDES and VR-CoDES-P manuals) are present, can be reliably coded and require additional advice for adoption in a dental context. Thirteen patients in a dental practice setting were videoed with either their dentist or hygienist and dental nurse present in routine treatment sessions. All utterances were coded using the Verona systems: VR-CoDES and the VR-CoDES-P. Rates of cue, concerns and provider responses described and reliability tested. The VR-CoDES and VR-CoDES-P were successfully applied in the dental context. The intra-rater ICCs for the detection of cues and concerns and provider response were acceptable and above 0.75. A similar satisfactory result was found for the inter-rater reliability. The VR-CoDES and the VR-CoDES-P are applicable in the dental setting with minor supporting guidelines and show evidence of reliable coding. The VR-CoDES and the VR-CoDES-P may be helpful tools for analysing patient cues and concerns and the dental professionals' responses in the dental context. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.

  6. Singing voice outcomes following singing voice therapy.

    Science.gov (United States)

    Dastolfo-Hromack, Christina; Thomas, Tracey L; Rosen, Clark A; Gartner-Schmidt, Jackie

    2016-11-01

    The objectives of this study were to describe singing voice therapy (SVT), describe referred patient characteristics, and document the outcomes of SVT. Retrospective. Records of patients receiving SVT between June 2008 and June 2013 were reviewed (n = 51). All diagnoses were included. Demographic information, number of SVT sessions, and symptom severity were retrieved from the medical record. Symptom severity was measured via the 10-item Singing Voice Handicap Index (SVHI-10). Treatment outcome was analyzed by diagnosis, history of previous training, and SVHI-10. SVHI-10 scores decreased following SVT (mean change = 11, 40% decrease) (P singing lessons (n = 10) also completed an average of three SVT sessions. Primary muscle tension dysphonia (MTD1) and benign vocal fold lesion (lesion) were the most common diagnoses. Most patients (60%) had previous vocal training. SVHI-10 decrease was not significantly different between MTD and lesion. This is the first outcome-based study of SVT in a disordered population. Diagnosis of MTD or lesion did not influence treatment outcomes. Duration of SVT was short (approximately three sessions). Voice care providers are encouraged to partner with a singing voice therapist to provide optimal care for the singing voice. This study supports the use of SVT as a tool for the treatment of singing voice disorders. 4 Laryngoscope, 126:2546-2551, 2016. © 2016 The American Laryngological, Rhinological and Otological Society, Inc.

  7. Multimodal user input to supervisory control systems - Voice-augmented keyboard

    Science.gov (United States)

    Mitchell, Christine M.; Forren, Michelle G.

    1987-01-01

    The use of a voice-augmented keyboard input modality is evaluated in a supervisory control application. An implementation of voice recognition technology in supervisory control is proposed: voice is used to request display pages, while the keyboard is used to input system reconfiguration commands. Twenty participants controlled GT-MSOCC, a high-fidelity simulation of the operator interface to a NASA ground control system, via a workstation equipped with either a single keyboard or a voice-augmented keyboard. Experimental results showed that in all cases where significant performance differences occurred, performance with the voice-augmented keyboard modality was inferior to and had greater variance than the keyboard-only modality. These results suggest that current moderately priced voice recognition systems are an inappropriate human-computer interaction technology in supervisory control systems.

  8. FUNDAMENTALS OF SPEAKER RECOGNITION

    Directory of Open Access Journals (Sweden)

    Figen ERTAŞ

    2000-02-01

    Full Text Available The explosive growth of information technology in the last decade has made a considerable impact on the design and construction of systems for human-machine communication, which is becoming increasingly important in many aspects of life. Amongst other speech processing tasks, a great deal of attention has been devoted to developing procedures that identify people from their voices, and the design and construction of speaker recognition systems has been a fascinating enterprise pursued over many decades. This paper introduces speaker recognition in general and discusses its relevant parameters in relation to system performance.

  9. Experiential Fidelity: Leveraging the Mind to Improve the VR Experience

    Science.gov (United States)

    Beckhaus, Steffi; Lindeman, Robert W.

    Much of Virtual Reality (VR) is about creating environments that are believable. But though the visual and audio experiences we provide today are already of a rather high sensory fidelity, there is still something lacking; something hinders us from fully buying into the worlds we experience through VR technology. We introduce the notion of Experiential Fidelity, which is an attempt to create a deeper sense of presence by carefully designing the user experience. We suggest to guide the users' frame of mind in a way that their expectations, attitude, and attention are aligned with the actual VR experience, and that the user's own imagination is stimulated to complete the experience. This work was inspired by a collection of personal magic moments and factors that were named by leading researchers in VR. We present those magic moments and some thoughts on how we can tap into experiential fidelity. We propose to do this not through technological means, but rather through the careful use of suggestion and allusion. By priming the user's mind prior to exposure to our virtual worlds, we can assist her in entering a mental state that is more willing to believe, even using the limited actual fidelity available today.

  10. Vocational Rehabilitation (VR) Approaches to Job Development. Research Report

    Science.gov (United States)

    Ipsen, Catherine

    2012-01-01

    Vocational Rehabilitation (VR) agencies provide a range of services to help people with disabilities become employed. How services are delivered, however, depends on several factors including client interests and abilities as well as economic opportunities within the local community. For better or worse, rural and urban clients face vastly…

  11. Immersive VR Visualizations by VFIVE. Part 1: Development

    CERN Document Server

    Kageyama, Akira

    2013-01-01

    We have been developing a visualization application for CAVE-type virtual reality (VR) systems for more than a decade. This application, VFIVE, is currently used in several CAVE systems in Japan for routine visualizations. It is also used as a base system of further developments of advanced visualizations. The development of VFIVE is summarized.

  12. Nicotine craving: ERPs correlates after VR exposure to smoking cues.

    Science.gov (United States)

    Gamito, Pedro; Oliveira, Jorge; Baptista, André; Morais, Diogo; Lopes, Paulo; Saraiva, Tomaz; Rosa, Pedro; Santos, Nuno; Soares, Fábio; Sottomayor, Catarina; Barata, Filipa

    2012-01-01

    Even though it is diminishing in Europe, smoking is still a serious health problem. The craving of Nicotine is one of the hardest behaviours to tackle when a smoking cessation programme is implemented. Following on previous work [1], which aimed at evaluating the possibility of inducing smoking craving in smokers using a VR platform, the present study was devised to assess the role of craving in cognitive processing through event related potentials (ERP). From an initial sample of 89 university students (smokers and non-smokers), which was randomly exposed to VR smoking cues and VR non-smoking cue scenarios, a subsample of 13 smokers and non-smokers was drawn. This subsample (M = 23.08; SD = 4.39), which had previously been immersed in the VR smoking cues environment, was presented to a rapid (1 sec) serial of smoking and neutral images. Data on brain activity was recorded through an EEG during this task to further estimate ERPs. When compared to non-smokers, smokers showed higher frontal activation when watching smoking related images.

  13. Training in surgical oncology - the role of VR simulation.

    Science.gov (United States)

    Lewis, T M; Aggarwal, R; Rajaretnam, N; Grantcharov, T P; Darzi, A

    2011-09-01

    There have been dramatic changes in surgical training over the past two decades which have resulted in a number of concerns for the development of future surgeons. Changes in the structure of cancer services, working hour restrictions and a commitment to patient safety has led to a reduction in training opportunities that are available to the surgeon in training. Simulation and in particular virtual reality (VR) simulation has been heralded as an effective adjunct to surgical training. Advances in VR simulation has allowed trainees to practice realistic full length procedures in a safe and controlled environment, where mistakes are permitted and can be used as learning points. There is considerable evidence to demonstrate that the VR simulation can be used to enhance technical skills and improve operating room performance. Future work should focus on the cost effectiveness and predictive validity of VR simulation, which in turn would increase the uptake of simulation and enhance surgical training. Copyright © 2011 Elsevier Ltd. All rights reserved.

  14. Design of a digital voice data compression technique for orbiter voice channels

    Science.gov (United States)

    1975-01-01

    Candidate techniques were investigated for digital voice compression to a transmission rate of 8 kbps. Good voice quality, speaker recognition, and robustness in the presence of error bursts were considered. The technique of delayed-decision adaptive predictive coding is described and compared with conventional adaptive predictive coding. Results include a set of experimental simulations recorded on analog tape. The two FM broadcast segments produced show the delayed-decision technique to be virtually undegraded or minimally degraded at .001 and .01 Viterbi decoder bit error rates. Preliminary estimates of the hardware complexity of this technique indicate potential for implementation in space shuttle orbiters.

  15. Clinical Voices - an update

    DEFF Research Database (Denmark)

    Fusaroli, Riccardo; Weed, Ethan

    Anomalous aspects of speech and voice, including pitch, fluency, and voice quality, are reported to characterise many mental disorders. However, it has proven difficult to quantify and explain this oddness of speech by employing traditional statistical methods. In this talk we will show how the t...... the temporal dynamics of the voice in Asperger's patients enable us to automatically reconstruct the diagnosis, and assess the Autism quotient score. We then generalise the findings to Danish and American children with autism....

  16. Effects of Medications on Voice

    Science.gov (United States)

    ... ENT Doctor Near You Effects of Medications on Voice Effects of Medications on Voice Patient Health Information ... entnet.org . Could Your Medication Be Affecting Your Voice? Some medications including prescription, over-the-counter, and ...

  17. Voice-based assessments of trustworthiness, competence, and warmth in blind and sighted adults

    OpenAIRE

    Oleszkiewicz, Anna; Pisanski, Katarzyna; Lachowicz-Tabaczek, Kinga; Sorokowska, Agnieszka

    2016-01-01

    The study of voice perception in congenitally blind individuals allows researchers rare insight into how a lifetime of visual deprivation affects the development of voice perception. Previous studies have suggested that blind adults outperform their sighted counterparts in low-level auditory tasks testing spatial localization and pitch discrimination, as well as in verbal speech processing; however, blind persons generally show no advantage in nonverbal voice recognition or discrimination tas...

  18. Who Is Going to College? Predicting Education Training from Pre-VR Consumer Characteristics

    Science.gov (United States)

    Boutin, Daniel L.; Wilson, Keith B.

    2012-01-01

    The relationship of receiving college and university training within the state vocational rehabilitation (VR) program to pre-VR consumer characteristics was investigated with a multiple direct logistic regression technique. A model containing 11 pre-VR characteristics predict the reception of college and university training for a multidisability…

  19. Access to, Use of, and Attitudes toward Telecommunication among Rural VR Clients

    Science.gov (United States)

    Ipsen, Catherine; Rigles, Bethany; Arnold, Nancy; Seekins, Tom

    2013-01-01

    Telecommunication offers rural vocational rehabilitation (VR) clients a method of communicating with their VR counselor between face-to-face visits. Unfortunately, certain telecommunication methods may not be available to many rural VR clients or may pose barriers in the rehabilitation process. This article describes findings from an exploratory…

  20. Access to, Use of, and Attitudes toward Telecommunication among Rural VR Clients

    Science.gov (United States)

    Ipsen, Catherine; Rigles, Bethany; Arnold, Nancy; Seekins, Tom

    2013-01-01

    Telecommunication offers rural vocational rehabilitation (VR) clients a method of communicating with their VR counselor between face-to-face visits. Unfortunately, certain telecommunication methods may not be available to many rural VR clients or may pose barriers in the rehabilitation process. This article describes findings from an exploratory…

  1. 20 CFR 416.2204 - Participation by State VR agencies or alternate participants.

    Science.gov (United States)

    2010-04-01

    ... 20 Employees' Benefits 2 2010-04-01 2010-04-01 false Participation by State VR agencies or... General Provisions § 416.2204 Participation by State VR agencies or alternate participants. (a) General. In order to participate in the payment program under this subpart through its VR agency(ies), a State...

  2. Who Is Going to College? Predicting Education Training from Pre-VR Consumer Characteristics

    Science.gov (United States)

    Boutin, Daniel L.; Wilson, Keith B.

    2012-01-01

    The relationship of receiving college and university training within the state vocational rehabilitation (VR) program to pre-VR consumer characteristics was investigated with a multiple direct logistic regression technique. A model containing 11 pre-VR characteristics predict the reception of college and university training for a multidisability…

  3. 20 CFR 411.355 - What payment options does a State VR agency have?

    Science.gov (United States)

    2010-04-01

    ... 20 Employees' Benefits 2 2010-04-01 2010-04-01 false What payment options does a State VR agency... Ticket to Work Program § 411.355 What payment options does a State VR agency have? (a) The Ticket to Work program provides different payment options that are available to a State VR agency for providing services...

  4. 20 CFR 404.2104 - Participation by State VR agencies or alternate participants.

    Science.gov (United States)

    2010-04-01

    ... 20 Employees' Benefits 2 2010-04-01 2010-04-01 false Participation by State VR agencies or... General Provisions § 404.2104 Participation by State VR agencies or alternate participants. (a) General. In order to participate in the payment program under this subpart through its VR agency(ies), a State...

  5. Onset and Maturation of Fetal Heart Rate Response to the Mother's Voice over Late Gestation

    Science.gov (United States)

    Kisilevsky, Barbara S.; Hains, Sylvia M. J.

    2011-01-01

    Background: Term fetuses discriminate their mother's voice from a female stranger's, suggesting recognition/learning of some property of her voice. Identification of the onset and maturation of the response would increase our understanding of the influence of environmental sounds on the development of sensory abilities and identify the period when…

  6. Method of Display Entity Information of VR-forces on MGS Platform%基于MGS平台的VR-Forces实体信息显示方法

    Institute of Scientific and Technical Information of China (English)

    韩晓光; 赵志军; 蔡郁知

    2015-01-01

    在同一模拟系统中同时使用MGS与VR-Forces平台,存在实体信息格式转换问题.为实现在MGS平台中显示VR-Forces中实体信息,分析了VR-Forces平台中实体信息的组织方式,基于XML技术构建了VR-Forces平台与MGS平台间的实体ID编码映射关系,实现了VR-Forces平台中实体经纬度、首向角、敌我关系等属性信息的获取方法,解决了基于MGS开发的模拟仿真训练程序与基于VR-Forces开发的模拟训练程序共存的问题,为后续具有类似功能需求的模拟系统开发提供了技术支持.%To use MGS and VR-Forces in the same simulation system, there is an entity information transform problem. To display VR-Forces entity information on MGS, the entity in heritance structure of VR-Forces is analyzed, building a Mapping Table of entity ID between VR-Forces and MGS platforms on XML technology. To achieve the methods to get latitude,heading,the enemy relationship from VR-Forces, the coexistence problem of MGS and VR-Forces are solved. The technical support for the development of simulation system with the similar function is provided.

  7. EXPERIMENTAL STUDY OF FIRMWARE FOR INPUT AND EXTRACTION OF USER’S VOICE SIGNAL IN VOICE AUTHENTICATION SYSTEMS

    Directory of Open Access Journals (Sweden)

    O. N. Faizulaieva

    2014-09-01

    Full Text Available Scientific task for improving the signal-to-noise ratio for user’s voice signal in computer systems and networks during the process of user’s voice authentication is considered. The object of study is the process of input and extraction of the voice signal of authentication system user in computer systems and networks. Methods and means for input and extraction of the voice signal on the background of external interference signals are investigated. Ways for quality improving of the user’s voice signal in systems of voice authentication are investigated experimentally. Firmware means for experimental unit of input and extraction of the user’s voice signal against external interference influence are considered. As modern computer means, including mobile, have two-channel audio card, two microphones are used in the voice signal input. The distance between sonic-wave sensors is 20 mm and it provides forming one direction pattern lobe of microphone array in a desired area of voice signal registration (from 100 Hz to 8 kHz. According to the results of experimental studies, the usage of directional properties of the proposed microphone array and space-time processing of the recorded signals with implementation of constant and adaptive weighting factors has made it possible to reduce considerably the influence of interference signals. The results of firmware experimental studies for input and extraction of the user’s voice signal against external interference influence are shown. The proposed solutions will give the possibility to improve the value of the signal/noise ratio of the useful signals recorded up to 20 dB under the influence of external interference signals in the frequency range from 4 to 8 kHz. The results may be useful to specialists working in the field of voice recognition and speaker discrimination.

  8. Critical illness VR rehabilitation device (X-VR-D): evaluation of the potential use for early clinical rehabilitation.

    Science.gov (United States)

    Van de Meent, H; Baken, B C M; Van Opstal, S; Hogendoorn, P

    2008-06-01

    We present a new critical illness VR rehabilitation device (X-VR-D) that enables diversified self-training and is applicable early in the rehabilitation of severely injured or ill patients. The X-VR-D consists of a VR program delivering a virtual scene on a flat screen and simultaneously processing commands to a moving chair mounted on a motion system. Sitting in the moving chair and exposed to a virtual reality environment the device evokes anticipatory and reactive muscle contractions in trunk and extremities for postural control. In this study we tested the device in 10 healthy subjects to evaluate whether the enforced perturbations indeed evoke sufficient and reproducible EMG muscle activations. We found that particular fast roll and pitch movements evoke adequate trunk and leg muscle activity. Higher angular velocities and higher angles of inclination elicited broader EMG bursts and larger amplitudes. The muscle activation pattern was highly consistent between different subjects and although we found some habituation of EMG responses in consecutive training sessions, the general pattern was maintained and was predictable for specific movements. The habituation was characterized by more efficient muscle contractions and better muscle relaxation during the rest positions of the device. Furthermore we found that the addition of a virtual environment to the training session evoked more preparatory and anticipatory muscle activation than sessions without a virtual environment. We conclude that the X-VR-D is safe and effective to elicit consistent and reproducible muscle activity in trunk and leg muscles in healthy subjects and thus can be used as a training method.

  9. Interface Everywhere: Further Development of a Gesture and Voice Commanding Interface Project

    Data.gov (United States)

    National Aeronautics and Space Administration — Natural User Interface (NUI) is a term used to describe a number of technologies such as speech recognition, multi-touch, and kinetic interfaces. Gesture and voice...

  10. CalVR: an advanced open source virtual reality software framework

    Science.gov (United States)

    Schulze, Jürgen P.; Prudhomme, Andrew; Weber, Philip; DeFanti, Thomas A.

    2013-03-01

    We developed CalVR because none of the existing virtual reality software frameworks offered everything we needed, such as cluster-awareness, multi-GPU capability, Linux compatibility, multi-user support, collaborative session support, or custom menu widgets. CalVR combines features from multiple existing VR frameworks into an open-source system, which we use in our laboratory on a daily basis, and for which dozens of VR applications have already been written at UCSD but also other research laboratories world-wide. In this paper, we describe the philosophy behind CalVR, its standard and unique features and functions, its programming interface, and its inner workings.

  11. Voiced Reading and Rhythm

    Institute of Scientific and Technical Information of China (English)

    詹艳萍

    2007-01-01

    Since voiced reading is an important way in learning English,rhythm is the most critical factor that enables to read beautifully.This article illustrates the relationship between rhythm and voiced reading,the importance of rhythm,and the methods to develop the sense of rhythm.

  12. Clinical Voices - an update

    DEFF Research Database (Denmark)

    Fusaroli, Riccardo; Weed, Ethan

    Anomalous aspects of speech and voice, including pitch, fluency, and voice quality, are reported to characterise many mental disorders. However, it has proven difficult to quantify and explain this oddness of speech by employing traditional statistical methods. In this talk we will show how...

  13. Borderline Space for Voice

    Science.gov (United States)

    Batchelor, Denise

    2012-01-01

    Being on the borderline as a student in higher education is not always negative, to do with marginalisation, exclusion and having a voice that is vulnerable. Paradoxically, being on the edge also has positive connections with integration, inclusion and having a voice that is strong. Alternative understandings of the concept of borderline space can…

  14. Voice and endocrinology

    Directory of Open Access Journals (Sweden)

    KVS Hari Kumar

    2016-01-01

    Full Text Available Voice is one of the advanced features of natural evolution that differentiates human beings from other primates. The human voice is capable of conveying the thoughts into spoken words along with a subtle emotion to the tone. This extraordinary character of the voice in expressing multiple emotions is the gift of God to the human beings and helps in effective interpersonal communication. Voice generation involves close interaction between cerebral signals and the peripheral apparatus consisting of the larynx, vocal cords, and trachea. The human voice is susceptible to the hormonal changes throughout life right from the puberty until senescence. Thyroid, gonadal and growth hormones have tremendous impact on the structure and function of the vocal apparatus. The alteration of voice is observed even in physiological states such as puberty and menstruation. Astute clinical observers make out the changes in the voice and refer the patients for endocrine evaluation. In this review, we shall discuss the hormonal influence on the voice apparatus in normal and endocrine disorders.

  15. Face the voice

    DEFF Research Database (Denmark)

    Lønstrup, Ansa

    2014-01-01

    will be based on a reception aesthetic and phenomenological approach, the latter as presented by Don Ihde in his book Listening and Voice. Phenomenologies of Sound , and my analytical sketches will be related to theoretical statements concerning the understanding of voice and media (Cavarero, Dolar, La...

  16. Ontario's Student Voice Initiative

    Science.gov (United States)

    Courtney, Jean

    2014-01-01

    This article describes in some detail aspects of the Student Voice initiative funded and championed by Ontario's Ministry of Education since 2008. The project enables thousands of students to make their voices heard in meaningful ways and to participate in student-led research. Some students from grades 7 to 12 become members of the Student…

  17. EasyVoice: Integrating voice synthesis with Skype

    CERN Document Server

    Condado, Paulo A

    2007-01-01

    This paper presents EasyVoice, a system that integrates voice synthesis with Skype. EasyVoice allows a person with voice disabilities to talk with another person located anywhere in the world, removing an important obstacle that affect these people during a phone or VoIP-based conversation.

  18. Research on multi - channel interactive virtual assembly system for power equipment under the “VR+” era

    Science.gov (United States)

    Ren, Yilong; Duan, Xitong; Wu, Lei; He, Jin; Xu, Wu

    2017-06-01

    With the development of the “VR+” era, the traditional virtual assembly system of power equipment has been unable to satisfy our growing needs. In this paper, based on the analysis of the traditional virtual assembly system of electric power equipment and the application of VR technology in the virtual assembly system of electric power equipment in our country, this paper puts forward the scheme of establishing the virtual assembly system of power equipment: At first, we should obtain the information of power equipment, then we should using OpenGL and multi texture technology to build 3D solid graphics library. After the completion of three-dimensional modeling, we can use the dynamic link library DLL package three-dimensional solid graphics generation program to realize the modularization of power equipment model library and power equipment model library generated hidden algorithm. After the establishment of 3D power equipment model database, we set up the virtual assembly system of 3D power equipment to separate the assembly operation of the power equipment from the space. At the same time, aiming at the deficiency of the traditional gesture recognition algorithm, we propose a gesture recognition algorithm based on improved PSO algorithm for BP neural network data glove. Finally, the virtual assembly system of power equipment can really achieve multi-channel interaction function.

  19. Immersive VR Visualizations by VFIVE. Part 2: Applications

    CERN Document Server

    Kageyama, Akira; Kawahara, Shintaro; Kashiyama, Kazuo; Ohtani, Hiroaki

    2013-01-01

    VFIVE is a scientific visualization application for CAVE-type immersive virtual reality systems. The source codes are freely available. VFIVE is used as a research tool in various VR systems. It also lays the groundwork for developments of new visualization software for CAVEs. In this paper, we pick up five CAVE systems in four different institutions in Japan. Applications of VFIVE in each CAVE system are summarized. Special emphases will be placed on scientific and technical achievements made possible by VFIVE.

  20. Voice Savers for Music Teachers

    Science.gov (United States)

    Cookman, Starr

    2012-01-01

    Music teachers are in a class all their own when it comes to voice use. These elite vocal athletes require stamina, strength, and flexibility from their voices day in, day out for hours at a time. Voice rehabilitation clinics and research show that music education ranks high among the professionals most commonly affected by voice problems.…

  1. 20 CFR 416.2211 - Criteria for determining when VR services will be considered to have contributed to a continuous...

    Science.gov (United States)

    2010-04-01

    ... 20 Employees' Benefits 2 2010-04-01 2010-04-01 false Criteria for determining when VR services... determining when VR services will be considered to have contributed to a continuous period of 9 months. The State VR agency or alternate participant may be paid for VR services if such services contribute to the...

  2. 20 CFR 404.2111 - Criteria for determining when VR services will be considered to have contributed to a continuous...

    Science.gov (United States)

    2010-04-01

    ... 20 Employees' Benefits 2 2010-04-01 2010-04-01 false Criteria for determining when VR services... VR services will be considered to have contributed to a continuous period of 9 months. The State VR agency or alternate participant may be paid for VR services if such services contribute to the individual...

  3. 20 CFR 411.510 - How is the State VR agency paid under the Ticket to Work program?

    Science.gov (United States)

    2010-04-01

    ... 20 Employees' Benefits 2 2010-04-01 2010-04-01 false How is the State VR agency paid under the... VR agency paid under the Ticket to Work program? (a) The State VR agency's payment choices are described in § 411.355. (b) The State VR agency's decision to serve the beneficiary must be communicated to...

  4. 20 CFR 411.410 - Does each referral from an EN to a State VR agency require its own agreement?

    Science.gov (United States)

    2010-04-01

    ... 20 Employees' Benefits 2 2010-04-01 2010-04-01 false Does each referral from an EN to a State VR...' Participation Agreements Between Employment Networks and State Vr Agencies § 411.410 Does each referral from an EN to a State VR agency require its own agreement? No. The agreements between ENs and State VR...

  5. 20 CFR 411.435 - How will disputes arising under the agreements between ENs and State VR agencies be resolved?

    Science.gov (United States)

    2010-04-01

    ... agreements between ENs and State VR agencies be resolved? 411.435 Section 411.435 Employees' Benefits SOCIAL... Vr Agencies § 411.435 How will disputes arising under the agreements between ENs and State VR agencies be resolved? Disputes arising under agreements between ENs and State VR agencies must be resolved...

  6. 20 CFR 411.220 - What if my ticket is no longer assigned to an EN or State VR agency?

    Science.gov (United States)

    2010-04-01

    ... EN or State VR agency? 411.220 Section 411.220 Employees' Benefits SOCIAL SECURITY ADMINISTRATION THE... or State VR agency? (a) If your ticket was once assigned to an EN or State VR agency acting as an EN... § 411.145(a)) or because you relocated to an area not served by your previous EN or State VR agency; or...

  7. Mandarin recognition over the telephone

    Science.gov (United States)

    Kao, Yuhung

    1996-06-01

    Mandarin Chinese is the official language in China and Taiwan, it is the native language of a quarter of the world population. As the services enabled by speech recognition technology (e.g. telephone voice dialing, information query) become more popular in English, we would like to extend this capability to other languages. Mandarin is one of the major languages under research in our laboratory. This paper describes how we extend our work in English speech recognition into Mandarin. We will described the corpus: Voice Across Taiwan, the training of a complete set of Mandarin syllable models, preliminary performance results and error analysis. A fast prototyping system was built, where a user can write any context free grammar with no restriction of vocabulary, then the grammar can be compiled into recognition models. It enables user to quickly test the performance of a new vocabulary.

  8. An industrial approach to design compelling VR and AR experience

    Science.gov (United States)

    Richir, Simon; Fuchs, Philippe; Lourdeaux, Domitile; Buche, Cédric; Querrec, Ronan

    2013-03-01

    The convergence of technologies currently observed in the field of VR, AR, robotics and consumer electronic reinforces the trend of new applications appearing every day. But when transferring knowledge acquired from research to businesses, research laboratories are often at a loss because of a lack of knowledge of the design and integration processes in creating an industrial scale product. In fact, the innovation approaches that take a good idea from the laboratory to a successful industrial product are often little known to researchers. The objective of this paper is to present the results of the work of several research teams that have finalized a working method for researchers and manufacturers that allow them to design virtual or augmented reality systems and enable their users to enjoy "a compelling VR experience". That approach, called "the I2I method", present 11 phases from "Establishing technological and competitive intelligence and industrial property" to "Improvements" through the "Definition of the Behavioral Interface, Virtual Environment and Behavioral Software Assistance". As a result of the experience gained by various research teams, this design approach benefits from contributions from current VR and AR research. Our objective is to validate and continuously move such multidisciplinary design team methods forward.

  9. Work Integrated Learning (WIL In Virtual Reality (VR

    Directory of Open Access Journals (Sweden)

    Waleed Abdullah Al Shehri

    2012-09-01

    Full Text Available The ultimate goal of industrialized, developed economies is to increase economic growth and to have a highly-skilled, flexible and responsive workforce in order to remain competitive in an increasingly global economic context. Therefore, there is increasing focus on exploring the various options for optimizing worker training. Apprenticeships, traineeships and internships are some examples of the applications of Work Integrated Learning (WIL with various aims such as to enhance learner skills, provide learners with real life facilities to experience the work environment in which they have to perform in the near future and to enhance their capabilities to handle real time problems, even before they are exposed to them in the workplace. However, technology is increasingly replacing the concept of traditional learning, particularly through innovative IT approaches such as Virtual Reality (VR. Integrating both WIL and VR offers the opportunity to apply WIL in a virtual rather than real work environment as Virtual Work Integrated Learning (VWIL. The focus of this report is to initially discuss the concepts WIL and VR, their main characteristics and current applications. Moreover, the pros and cons of VWIL are also analyzed. Finally, the report presents some recommendation including further researches into areas where VWIL has potential to be successful in the future.

  10. [The VR, the Russian version of the nerve agent VX].

    Science.gov (United States)

    Cuquel, A-C; Dorandeu, F; Ceppa, F; Renard, C; Burnat, P

    2015-05-01

    A product of the arms race during the Cold War, the Russian VX, or VR, is an organophosphorus compound that is a structural isomer of the western VX compound (or A4), with which it shares a very high toxicity. It is much less studied and known than VX because the knowledge of its existence is relatively recent. A very low volatility and high resistance in the environment make it a persistent agent. Poisoning occurs mainly following penetration through skin and mucosa but vapour inhalation is a credible risk in some circumstances. The clinical presentation may be differed by several hours and despite the absence of signs and symptoms, the casualty should not be considered as contamination or intoxication-free. This agent has a long residence time in blood, a characteristics that clearly differentiates it from other compounds such as sarin. The protocols for antidote administration may thus have to be changed accordingly. The fact that VR poisoned individuals will less respond to the current oxime therapy used in France, the 2-PAM and that VR represents a higher threat than VX, being probably possessed by some proliferating states, justify the interest for this toxic product.

  11. Acquisition of stereo panoramas for display in VR environments

    KAUST Repository

    Ainsworth, Richard A.

    2011-01-23

    Virtual reality systems are an excellent environment for stereo panorama displays. The acquisition and display methods described here combine high-resolution photography with surround vision and full stereo view in an immersive environment. This combination provides photographic stereo-panoramas for a variety of VR displays, including the StarCAVE, NexCAVE, and CORNEA. The zero parallax point used in conventional panorama photography is also the center of horizontal and vertical rotation when creating photographs for stereo panoramas. The two photographically created images are displayed on a cylinder or a sphere. The radius from the viewer to the image is set at approximately 20 feet, or at the object of major interest. A full stereo view is presented in all directions. The interocular distance, as seen from the viewer\\'s perspective, displaces the two spherical images horizontally. This presents correct stereo separation in whatever direction the viewer is looking, even up and down. Objects at infinity will move with the viewer, contributing to an immersive experience. Stereo panoramas created with this acquisition and display technique can be applied without modification to a large array of VR devices having different screen arrangements and different VR libraries.

  12. Speech Rate Control for Improving Elderly Speech Recognition of Smart Devices

    Directory of Open Access Journals (Sweden)

    SON, G.

    2017-05-01

    Full Text Available Although smart devices have become a widely-adopted tool for communication in modern society, it still requires a steep learning curve among the elderly. By introducing a voice-based interface for smart devices using voice recognition technology, smart devices can become more user-friendly and useful to the elderly. However, the voice recognition technology used in current devices is attuned to the voice patterns of the young. Therefore, speech recognition falters when an elderly user speaks into the device. This paper has identified that the elderly's improper speech rate by each syllable contributes to the failure in the voice recognition system. Thus, upon modifying the speech rate by each syllable, the voice recognition rate saw an increase of 12.3%. This paper demonstrates that by simply modifying the speech rate by each syllable, which is one of the factors that causes errors in voice recognition, the recognition rate can be substantially increased. Such improvements in voice recognition technology can make it easier for the elderly to operate smart devices that will allow them to be more socially connected in a mobile world and access information at their fingertips. It may also be helpful in bridging the communication divide between generations.

  13. Voice-to-Phoneme Conversion Algorithms for Voice-Tag Applications in Embedded Platforms

    Directory of Open Access Journals (Sweden)

    Yan Ming Cheng

    2008-08-01

    Full Text Available We describe two voice-to-phoneme conversion algorithms for speaker-independent voice-tag creation specifically targeted at applications on embedded platforms. These algorithms (batch mode and sequential are compared in speech recognition experiments where they are first applied in a same-language context in which both acoustic model training and voice-tag creation and application are performed on the same language. Then, their performance is tested in a cross-language setting where the acoustic models are trained on a particular source language while the voice-tags are created and applied on a different target language. In the same-language environment, both algorithms either perform comparably to or significantly better than the baseline where utterances are manually transcribed by a phonetician. In the cross-language context, the voice-tag performances vary depending on the source-target language pair, with the variation reflecting predicted phonological similarity between the source and target languages. Among the most similar languages, performance nears that of the native-trained models and surpasses the native reference baseline.

  14. Exploring expressivity and emotion with artificial voice and speech technologies.

    Science.gov (United States)

    Pauletto, Sandra; Balentine, Bruce; Pidcock, Chris; Jones, Kevin; Bottaci, Leonardo; Aretoulaki, Maria; Wells, Jez; Mundy, Darren P; Balentine, James

    2013-10-01

    Emotion in audio-voice signals, as synthesized by text-to-speech (TTS) technologies, was investigated to formulate a theory of expression for user interface design. Emotional parameters were specified with markup tags, and the resulting audio was further modulated with post-processing techniques. Software was then developed to link a selected TTS synthesizer with an automatic speech recognition (ASR) engine, producing a chatbot that could speak and listen. Using these two artificial voice subsystems, investigators explored both artistic and psychological implications of artificial speech emotion. Goals of the investigation were interdisciplinary, with interest in musical composition, augmentative and alternative communication (AAC), commercial voice announcement applications, human-computer interaction (HCI), and artificial intelligence (AI). The work-in-progress points towards an emerging interdisciplinary ontology for artificial voices. As one study output, HCI tools are proposed for future collaboration.

  15. Additive attacks on speaker recognition

    Science.gov (United States)

    Farrokh Baroughi, Alireza; Craver, Scott

    2014-02-01

    Speaker recognition is used to identify a speaker's voice from among a group of known speakers. A common method of speaker recognition is a classification based on cepstral coefficients of the speaker's voice, using a Gaussian mixture model (GMM) to model each speaker. In this paper we try to fool a speaker recognition system using additive noise such that an intruder is recognized as a target user. Our attack uses a mixture selected from a target user's GMM model, inverting the cepstral transformation to produce noise samples. In our 5 speaker data base, we achieve an attack success rate of 50% with a noise signal at 10dB SNR, and 95% by increasing noise power to 0dB SNR. The importance of this attack is its simplicity and flexibility: it can be employed in real time with no processing of an attacker's voice, and little computation is needed at the moment of detection, allowing the attack to be performed by a small portable device. For any target user, knowing that user's model or voice sample is sufficient to compute the attack signal, and it is enough that the intruder plays it while he/she is uttering to be classiffed as the victim.

  16. Evolution of the Varrier autostereoscopic VR display: 2001-2007

    Science.gov (United States)

    Peterka, Tom; Kooima, Robert L.; Girado, Javier I.; Ge, Jinghua; Sandin, Daniel J.; DeFanti, Thomas A.

    2007-02-01

    Autostereoscopy (AS) is an increasingly valuable virtual reality (VR) display technology; indeed, the IS&T / SPIE Electronic Imaging Conference has seen rapid growth in the number and scope of AS papers in recent years. The first Varrier paper appeared at SPIE in 2001, and much has changed since then. What began as a single-panel prototype has grown to a full scale VR autostereo display system, with a variety of form factors, features, and options. Varrier is a barrier strip AS display system that qualifies as a true VR display, offering a head-tracked ortho-stereo first person interactive VR experience without the need for glasses or other gear to be worn by the user. Since Varrier's inception, new algorithmic and systemic developments have produced performance and quality improvements. Visual acuity has increased by a factor of 1.4X with new fine-resolution barrier strip linescreens and computational algorithms that support variable sub-pixel resolutions. Performance has improved by a factor of 3X using a new GPU shader-based sub-pixel algorithm that accomplishes in one pass what previously required three passes. The Varrier modulation algorithm that began as a computationally expensive task is now no more costly than conventional stereoscopic rendering. Interactive rendering rates of 60 Hz are now possible in Varrier for complex scene geometry on the order of 100K vertices, and performance is GPU bound, hence it is expected to continue improving with graphics card enhancements. Head tracking is accomplished with a neural network camera-based tracking system developed at EVL for Varrier. Multiple cameras capture subjects at 120 Hz and the neural network recognizes known faces from a database and tracks them in 3D space. New faces are trained and added to the database in a matter of minutes, and accuracy is comparable to commercially available tracking systems. Varrier supports a variety of VR applications, including visualization of polygonal, ray traced, and

  17. Dominant Voice in Hamlet

    Institute of Scientific and Technical Information of China (English)

    李丹

    2015-01-01

    <正>The Tragedy of Hamlet dramatizes the revenge Prince Hamlet exacts on his uncle Claudius for murdering King Hamlet,Claudius’s brother and Prince Hamlet’s father,and then succeeding to the throne and taking as his wife Gertrude,the old king’s widow and Prince Hamlet’s mother.This paper will discuss something about dominant voice in the play.Dominant voice is the major voice in the country,the society,or the whole world.Those people who have the power or

  18. Chemical flooding in a virtual environment - a survivor`s guide to VR development

    Energy Technology Data Exchange (ETDEWEB)

    Bethel, W.

    1994-03-01

    Building something which could be called {open_quotes}virtual reality{close_quotes} (VR) is something of a challenge, particularly when nobody really seems to agree on a definition of VR. The author wanted to combine scientific visualization with VR, resulting in an environment useful for assisting scientific research. He demonstrates the combination of VR and scientific visualization in a prototype application. The VR application constructed consists of a dataflow based system for performing scientific visualization (AVS), extensions to the system to support VR input devices and a numerical simulation ported into the dataflow environment. The VR system includes two inexpensive, off-the-shelf VR devices and some custom code. A working system was assembled with about two man-months of effort. The system allows the user to specify parameters for a chemical flooding simulation as well as some viewing parameters using VR input devices, as well as view the output using VR output devices. In chemical flooding, there is a subsurface region that contains chemicals which are to be removed. Secondary oil recovery and environmental remediation are typical applications of chemical flooding. The process assumes one or more injection wells, and one or more production wells. Chemicals or water are pumped into the ground, mobilizing and displacing hydrocarbons or contaminants. The placement of the production and injection wells, and other parameters of the wells, are the most important variables in the simulation.

  19. Molecular Analysis of Asymptomatic Bacteriuria Escherichia coli Strain VR50 Reveals Adaptation to the Urinary Tract by Gene Acquisition

    DEFF Research Database (Denmark)

    Beatson, Scott A.; Ben Zakour, Nouri L.; Totsika, Makrina;

    2015-01-01

    to adhere to human bladder epithelial cells. In the mouse model of UTI, VR50afa and VR50afaE displayed reduced bladder colonization compared to wild-type VR50, similar to the colonization level of the GI-VR50-pheV mutant. Our study suggests that E. coli VR50 is a commensal-like strain that has acquired...

  20. Ethnographic Voice Memo Narratives

    DEFF Research Database (Denmark)

    Rasmussen, Mette Apollo; Conradsen, Maria Bosse

    1800-01-01

    -based technique which actively involves actors in producing ethnography-based data concerning their everyday practice. With the help from smartphone technology it is possible to complement ethnography-based research methods when involving the actors and having them create small voice memo narratives. The voice...... memos create insights of actors‟ everyday practice, without the direct presence of a researcher and could be considered a step towards meeting the dilemmas of research in complex fieldwork settings....

  1. Voice-Controlled Artificial Handspeak System

    Directory of Open Access Journals (Sweden)

    Jonathan Gatti

    2014-04-01

    Full Text Available A man-machine interaction project is described whic h aims to establish an automated voice to sign language translator for communication with the deaf using integrated open technologies. The first prototype consists of a robotic hand designed with OpenSCAD and manufactured with a low-cost 3D printer which smoothly reproduces the alphabet of the sign language controlled by voice only. The core automation comprises an Arduino UNO controller used to activate a set of servo motors that follow instructions from a Raspberry Pi mini-computer havi ng installed the open source speech recognition eng ine Julius. We discuss its features, limitations and po ssible future developmen

  2. SPEECH EMOTION RECOGNITION USING MODIFIED QUADRATIC DISCRIMINATION FUNCTION

    Institute of Scientific and Technical Information of China (English)

    2008-01-01

    Quadratic Discrimination Function(QDF)is commonly used in speech emotion recognition,which proceeds on the premise that the input data is normal distribution.In this Paper,we propose a transformation to normalize the emotional features,then derivate a Modified QDF(MQDF) to speech emotion recognition.Features based on prosody and voice quality are extracted and Principal Component Analysis Neural Network (PCANN) is used to reduce dimension of the feature vectors.The results show that voice quality features are effective supplement for recognition.and the method in this paper could improve the recognition ratio effectively.

  3. Voice disorders without organic diseases of the larynx. A 10-year review of 62 patients.

    Science.gov (United States)

    Watanabe, Y; Miura, M; Shoji, H

    1983-01-01

    We reviewed the clinical records of 62 patients with voice disorders without organic diseases of the larynx who were examined in the Department of Otolaryngology, Kurume University Hospital during the 10 years from 1971 to 1980. There were 9 patients with psychogenic dysphonia, 24 with vocal abuse, 4 with spastic dysphonia, 9 with mutational voice disturbance, 8 with virilization of voice, 7 with dysphonia attributed to diseases of other organs than the larynx, and 1 with senile change of voice. The clinical service to the patients with these kinds of dysphonia has been poor in Japan for lack of trained voice pathologists. This has been also the case with our department. Recognition for the necessity of voice pathologists is strongly demanded.

  4. Web-Based Interactive VR-CAD System for Conceptual Design and Analysis

    Institute of Scientific and Technical Information of China (English)

    2007-01-01

    To overcome the bottleneck of the one-way translation from VR (virtual reality) to CAD in VR-based CAD system, this paper proposes a Web-based VR-CAD system WVCD ( Web-based virtual conceptual designer) to support both customers'and design team members'involvement in the conceptual design and analysis activity over the Interbet. It provides users with both parametric and freehand methods to create and edit 3D conceptual models with different levels of complexity in a VR envixonment intuitively without the need of switching between the VR environment and CAD tools. As a simple and low-cost VR-CAD system, WVCD can help to popularize and improve the Web-based collaborative conceptual design.

  5. KinImmerse: Macromolecular VR for NMR ensembles

    Directory of Open Access Journals (Sweden)

    Vinson E Claire

    2009-02-01

    Full Text Available Abstract Background In molecular applications, virtual reality (VR and immersive virtual environments have generally been used and valued for the visual and interactive experience – to enhance intuition and communicate excitement – rather than as part of the actual research process. In contrast, this work develops a software infrastructure for research use and illustrates such use on a specific case. Methods The Syzygy open-source toolkit for VR software was used to write the KinImmerse program, which translates the molecular capabilities of the kinemage graphics format into software for display and manipulation in the DiVE (Duke immersive Virtual Environment or other VR system. KinImmerse is supported by the flexible display construction and editing features in the KiNG kinemage viewer and it implements new forms of user interaction in the DiVE. Results In addition to molecular visualizations and navigation, KinImmerse provides a set of research tools for manipulation, identification, co-centering of multiple models, free-form 3D annotation, and output of results. The molecular research test case analyzes the local neighborhood around an individual atom within an ensemble of nuclear magnetic resonance (NMR models, enabling immersive visual comparison of the local conformation with the local NMR experimental data, including target curves for residual dipolar couplings (RDCs. Conclusion The promise of KinImmerse for production-level molecular research in the DiVE is shown by the locally co-centered RDC visualization developed there, which gave new insights now being pursued in wider data analysis.

  6. Motion control simulation based on VR for humanoid robot

    Science.gov (United States)

    He, Huaiqing; Tang, Haoxuan

    2004-03-01

    This paper describes the motion control simulation based on VR for humanoid robot aiming at walking and running. To insure that the motion rhythm of humanoid robot conforms to the motion laws of humans, the body geometrical model based on skeleton and its kinematics models based on the graph of time sequences are presented firstly. Then a control algorithm based on Jacobian matrix is proposed to generate the periodical walking and running. Finally, computer simulation experiments demonstrate the feasibility of the models and the algorithm. The simulation system developed makes us interactively regulate the motion direction and velocity for humanoid robot.

  7. GyroVR: Simulating Inertia in Virtual Reality using Head Worn Flywheels

    DEFF Research Database (Denmark)

    Gugenheimer, Jan; Wolf, Dennis; Eiríksson, Eyþór Rúnar

    2016-01-01

    We present GyroVR, head worn flywheels designed to render inertia in Virtual Reality (VR. Motions such as flying, diving or floating in outer space generate kinesthetic forces onto our body which impede movement and are currently not represented in VR. We simulate those kinesthetic forces...... by attaching flywheels to the users head, leveraging the gyroscopic effect of resistance when changing the spinning axis of rotation. GyroVR is an ungrounded, wireless and self contained device allowing the user to freely move inside the virtual environment. The generic shape allows to attach it to different...

  8. Arabic Speech Recognition System using CMU-Sphinx4

    CERN Document Server

    Satori, H; Chenfour, N

    2007-01-01

    In this paper we present the creation of an Arabic version of Automated Speech Recognition System (ASR). This system is based on the open source Sphinx-4, from the Carnegie Mellon University. Which is a speech recognition system based on discrete hidden Markov models (HMMs). We investigate the changes that must be made to the model to adapt Arabic voice recognition. Keywords: Speech recognition, Acoustic model, Arabic language, HMMs, CMUSphinx-4, Artificial intelligence.

  9. Advances in Speech Recognition

    CERN Document Server

    Neustein, Amy

    2010-01-01

    This volume is comprised of contributions from eminent leaders in the speech industry, and presents a comprehensive and in depth analysis of the progress of speech technology in the topical areas of mobile settings, healthcare and call centers. The material addresses the technical aspects of voice technology within the framework of societal needs, such as the use of speech recognition software to produce up-to-date electronic health records, not withstanding patients making changes to health plans and physicians. Included will be discussion of speech engineering, linguistics, human factors ana

  10. AN APPLICATION OF SPEAKER RECOGNITION USING ARTIFICIAL NEURAL NETWORKS

    Directory of Open Access Journals (Sweden)

    Murat CANER

    2006-02-01

    Full Text Available In this study an artificial neural network (ANN is implemented, which has been used frequently as an implementation model in recent years, to recognize speaker identification. Generally, recognition is consist of three stages that, processing of signal, obtaining attributes and comparing them. Speech samples are transformed into digital data according to voice card of PC. In the analysis of voice stage, recurrent periods and white noise of voice data are trimmed by hamming window method and voice attribute part of the digital data is obtained. For obtaining attribute of voice data LPC (linear predictive coding and DFT (discrete fourier transform methods are used. Of those 28 coefficents, that is used for speaker recognition, 16 were obtained by the analysis of DFT and 12 were obtained by the analysis of LPC. The parameters that represent speaker voice, is used for training and test of ANN. Multilayer perceptron model is used as an architecture of ANN and backpropagation algorithm is used for training method. Voices of "a" is taken from 7 different person and their attributes are found. ANN is trained with these features to find the speaker who is the owner of the sample voice. And then using the test data that is not used for training part, recognition achievement of ANN is tested. As a result, good results were obtained with low failure rate.

  11. Voice Therapy Practices and Techniques: A Survey of Voice Clinicians.

    Science.gov (United States)

    Mueller, Peter B.; Larson, George W.

    1992-01-01

    Eighty-three voice disorder therapists' ratings of statements regarding voice therapy practices indicated that vocal nodules are the most frequent disorder treated; vocal abuse and hard glottal attack elimination, counseling, and relaxation were preferred treatment approaches; and voice therapy is more effective with adults than with children.…

  12. Voice in early glottic cancer compared to benign voice pathology

    NARCIS (Netherlands)

    Van Gogh, C. D. L.; Mahieu, H. F.; Kuik, D. J.; Rinkel, R. N. P. M.; Langendijk, J. A.; Verdonck-de Leeuw, I. M.

    2007-01-01

    The purpose of this study is to compare (Dutch) Voice Handicap Index (VHIvumc) scores from a selected group of patients with voice problems after treatment for early glottic cancer with patients with benign voice disorders and subjects from the normal population. The study included a group of 35 pat

  13. The inner voice

    Directory of Open Access Journals (Sweden)

    Anthony James Ridgway

    2009-12-01

    Full Text Available The inner voice- we all know what it is because we all have it and use it when we are thinking or reading, for example. Little work has been done on it in our field, with the notable exception of Brian Tomlinson, but presumably it must be a cognitive phenomenon which is of great importance in thinking, language learning, and reading in a foreign language. The inner voice will be discussed as a cognitive psychological phenomenon associated with short-term memory, and distinguished from the inner ear. The process of speech recoding will be examined (the process of converting written language into the inner voice and the importance of developing the inner voice, as a means of both facilitating the production of a new language and enhancing the comprehension of a text in a foreign language, will be emphasized. Finally, ways of developing the inner voice in beginning and intermediate readers of a foreign language will be explored and recommended.

  14. Virtual Reality: Developing a VR space for Academic activities

    Science.gov (United States)

    Kaimaris, D.; Stylianidis, E.; Karanikolas, N.

    2014-05-01

    Virtual reality (VR) is extensively used in various applications; in industry, in academia, in business, and is becoming more and more affordable for end users from the financial point of view. At the same time, in academia and higher education more and more applications are developed, like in medicine, engineering, etc. and students are inquiring to be well-prepared for their professional life after their educational life cycle. Moreover, VR is providing the benefits having the possibility to improve skills but also to understand space as well. This paper presents the methodology used during a course, namely "Geoinformatics applications" at the School of Spatial Planning and Development (Eng.), Aristotle University of Thessaloniki, to create a virtual School space. The course design focuses on the methods and techniques to be used in order to develop the virtual environment. In addition the project aspires to become more and more effective for the students and provide a real virtual environment with useful information not only for the students but also for any citizen interested in the academic life at the School.

  15. Democratizing rendering for multiple viewers in surround VR systems

    KAUST Repository

    Schulze, Jürgen P.

    2012-03-01

    We present a new approach for how multiple users\\' views can be rendered in a surround virtual environment without using special multi-view hardware. It is based on the idea that different parts of the screen are often viewed by different users, so that they can be rendered from their own view point, or at least from a point closer to their view point than traditionally expected. The vast majority of 3D virtual reality systems are designed for one head-tracked user, and a number of passive viewers. Only the head tracked user gets to see the correct view of the scene, everybody else sees a distorted image. We reduce this problem by algorithmically democratizing the rendering view point among all tracked users. Researchers have proposed solutions for multiple tracked users, but most of them require major changes to the display hardware of the VR system, such as additional projectors or custom VR glasses. Our approach does not require additional hardware, except the ability to track each participating user. We propose three versions of our multi-viewer algorithm. Each of them balances image distortion and frame rate in different ways, making them more or less suitable for certain application scenarios. Our most sophisticated algorithm renders each pixel from its own, optimized camera perspective, which depends on all tracked users\\' head positions and orientations. © 2012 IEEE.

  16. Smartphone App for Voice Disorders

    Science.gov (United States)

    ... on. Feature: Taste, Smell, Hearing, Language, Voice, Balance Smartphone App for Voice Disorders Past Issues / Fall 2013 ... developed a mobile monitoring device that relies on smartphone technology to gather a week's worth of talking, ...

  17. The effects of thematic context and presentation mode on memory for sentence voice.

    Science.gov (United States)

    Kerr, N H; Butler, S F; Maykuth, P L; Delis, D

    1982-05-01

    A sentence in discourse may appear in the passive voice to emphasize the logical object rather than the logical subject when it is thematically more important. Two experiments are reported that explore the impact of this textual function of voice on sentence memory. The first experiment required subjects to listen to prose passages and then recall them. Sentences were recalled predominantly in the active voice regardless of voice or thematic focus in the prose passage, showing that the English-language bias for the active voice was a more important determinant of sentence reconstruction than was the experiment manipulation of thematic context. The second experiment required subjects to listen to or read either prose passages or lists of unrelated sentences and then to try to recognize "key" sentences that were either unchanged or changed lexically, semantically, or in voice. Recognition, both overall and specifically for voice, was better for sentences that were read than for those that were heard, and recognition for semantic change was consistently higher than for any other. Only when passages were read was there evidence in support of a thematic textual influence on memory for sentence voice.

  18. Voice-based assessments of trustworthiness, competence, and warmth in blind and sighted adults.

    Science.gov (United States)

    Oleszkiewicz, Anna; Pisanski, Katarzyna; Lachowicz-Tabaczek, Kinga; Sorokowska, Agnieszka

    2017-06-01

    The study of voice perception in congenitally blind individuals allows researchers rare insight into how a lifetime of visual deprivation affects the development of voice perception. Previous studies have suggested that blind adults outperform their sighted counterparts in low-level auditory tasks testing spatial localization and pitch discrimination, as well as in verbal speech processing; however, blind persons generally show no advantage in nonverbal voice recognition or discrimination tasks. The present study is the first to examine whether visual experience influences the development of social stereotypes that are formed on the basis of nonverbal vocal characteristics (i.e., voice pitch). Groups of 27 congenitally or early-blind adults and 23 sighted controls assessed the trustworthiness, competence, and warmth of men and women speaking a series of vowels, whose voice pitches had been experimentally raised or lowered. Blind and sighted listeners judged both men's and women's voices with lowered pitch as being more competent and trustworthy than voices with raised pitch. In contrast, raised-pitch voices were judged as being warmer than were lowered-pitch voices, but only for women's voices. Crucially, blind and sighted persons did not differ in their voice-based assessments of competence or warmth, or in their certainty of these assessments, whereas the association between low pitch and trustworthiness in women's voices was weaker among blind than sighted participants. This latter result suggests that blind persons may rely less heavily on nonverbal cues to trustworthiness compared to sighted persons. Ultimately, our findings suggest that robust perceptual associations that systematically link voice pitch to the social and personal dimensions of a speaker can develop without visual input.

  19. Sustainable Consumer Voices

    DEFF Research Database (Denmark)

    Klitmøller, Anders; Rask, Morten; Jensen, Nevena

    2011-01-01

    Aiming to explore how user driven innovation can inform high level design strategies, an in-depth empirical study was carried out, based on data from 50 observations of private vehicle users. This paper reports the resulting 5 consumer voices: Technology Enthusiast, Environmentalist, Design Lover......, Pragmatist and Status Seeker. Expedient use of the voices in creating design strategies is discussed, thus contributing directly to the practice of high level design managers. The main academic contribution of this paper is demonstrating how applied anthropology can be used to generate insights...... into disruptive emergence of product service systems, where quantitative user analyses rely on historical continuation....

  20. The quest to apply VR technology to rehabilitation: tribulations and treasures.

    Science.gov (United States)

    Keshner, Emily A; Fung, Joyce

    2017-01-01

    The papers that follow stem from a symposium presented at the International Society for Posture and Gait Research (ISPGR) in Seville, Spain, in July 2015. Four speakers were charged with presenting their methods of applying virtual reality (VR) technology to obtain meaningful rehabilitation outcomes. The symposium aims to explore characteristics of VR that modify mechanisms supporting motor relearning. Common impairments in posture and gait that can be modulated within virtual environments by employing motor learning concepts, including sensory augmentation and repetition, were examined. Critical overviews of VR applications that address different therapeutic objectives for improving posture and gait in individuals with neurological insult or injury were presented. A further goal was to identify approaches and efforts to bridge the gap between knowledge generation from research and knowledge uptake in clinical practice. Specific objectives of this symposium were that participants be able to: 1) identify benefits and limitations of selecting VR as an intervention tool; 2) discuss how VR relates to principles for motor relearning following neurological insult or injury; and 3) identify areas and methods for future translation of VR technology in clinical and home-based settings. Our symposium concluded that the application of VR technology in assessment, treatment, and research has yielded promising results in transferring learned cognitive and motor skills to more natural environments. VR permits the user to interact with a multidimensional and multisensory environment in real time, and offers the opportunity to provide both standardized and individualized interventions while monitoring behavior.

  1. Virtual Reality (VR) as a Source for Self-Efficacy in Teacher Training

    Science.gov (United States)

    Nissim, Yonit; Weissblueth, Eyal

    2017-01-01

    The current study sought to explore the experiences of pre-service student teachers in a teaching unit in VR within a special course framework which was intended to enhance student-teacher's 21st century skills and growth processes. In particular, how their experiences working with VR affected their self-efficacy. The research population comprised…

  2. Verona Coding Definitions of Emotional Sequences (VR-CoDES): Conceptual framework and future directions.

    Science.gov (United States)

    Piccolo, Lidia Del; Finset, Arnstein; Mellblom, Anneli V; Figueiredo-Braga, Margarida; Korsvold, Live; Zhou, Yuefang; Zimmermann, Christa; Humphris, Gerald

    2017-06-21

    To discuss the theoretical and empirical framework of VR-CoDES and potential future direction in research based on the coding system. The paper is based on selective review of papers relevant to the construction and application of VR-CoDES. VR-CoDES system is rooted in patient-centered and biopsychosocial model of healthcare consultations and on a functional approach to emotion theory. According to the VR-CoDES, emotional interaction is studied in terms of sequences consisting of an eliciting event, an emotional expression by the patient and the immediate response by the clinician. The rationale for the emphasis on sequences, on detailed classification of cues and concerns, and on the choices of explicit vs. non-explicit responses and providing vs. reducing room for further disclosure, as basic categories of the clinician responses, is described. Results from research on VR-CoDES may help raise awareness of emotional sequences. Future directions in applying VR-CoDES in research may include studies on predicting patient and clinician behavior within the consultation, qualitative analyses of longer sequences including several VR-CoDES triads, and studies of effects of emotional communication on health outcomes. VR-CoDES may be applied to develop interventions to promote good handling of patients' emotions in healthcare encounters. Copyright © 2017 Elsevier B.V. All rights reserved.

  3. Empowering Students to Create Better Virtual Reality Applications: A Longitudinal Study of a VR Capstone Course

    Science.gov (United States)

    Takala, Tuukka M.; Malmi, Lauri; Pugliese, Roberto; Takala, Tapio

    2016-01-01

    In this paper we present our experiences of teaching an annually organized virtual reality (VR) capstone course. We review three iterations of the course, during which a total of 45 students completed the course and 16 VR applications were implemented. Our comparative analysis describes the students' evaluation of the course, the applications…

  4. VoiceRelay: voice key operation using visual basic.

    Science.gov (United States)

    Abrams, Lise; Jennings, David T

    2004-11-01

    Using a voice key is a popular method for recording vocal response times in a variety of language production tasks. This article describes a class module called VoiceRelay that can be easily utilized in Visual Basic programs for voice key operation. This software-based voice key offers the precision of traditional voice keys (although accuracy is system dependent), as well as the flexibility of volume and sensitivity control. However, VoiceRelay is a considerably less expensive alternative for recording vocal response times because it operates with existing PC hardware and does not require the purchase of external response boxes or additional experiment-generation software. A sample project demonstrating implementation of the VoiceRelay class module may be downloaded from the Psychonomic Society Web archive, www.psychonomic.org/archive.

  5. Web-based VR training simulator for percutaneous rhizotomy.

    Science.gov (United States)

    Li, Y; Brodlie, K; Phillips, N

    2000-01-01

    Virtual Reality offers great potential for surgical training--yet is typically limited by the dedicated and expensive equipment required. Web-based VR has the potential to offer a much cheaper alternative, in which simulations of fundamental techniques are downloaded from a server to run within a web browser. The equipment requirement is modest--an Internet-connected PC or small workstation--and the simulation can be accessed worldwide. In a collaboration between computer scientists and neurosurgeons, we have studied the use of web-based VR to train neurosurgeons in Percutaneous Rhizotomy--a treatment for the intractable facial pain which occurs in trigeminal neuralgia. This involves the insertion of a needle so as to puncture the foramen ovale, and lesion the nerve. Our simulation uses VRML to provide a 3D visualization environment, but the work immediately exposes a key limitation of VRML for surgical simulation. VRML does not support collision detection between objects--only between viewpoint and object. Thus collision between needle and skull cannot be detected and fed back to the trainee. We have developed a novel solution in which the training simulation has linked views: a normal view, plus a view as seen from the tip of the needle. Collision detection is captured in the needle view, and fed back to the viewer. A happy consequence of this approach has been the chance to aid the trainee with this additional view from needle tip, which helps locate the foramen ovale. The technology to achieve this is Java software communicating with the VRML worlds through the External Authoring Interface (EAI). The training simulator is available on the Web, with accompanying tutorial on its use. A major advantage of web-based VR is that the techniques generalize to a whole range of surgical simulations. Thus we have been able to use exactly the same approach as described above for neurosurgery, to develop a shoulder arthroscopy simulator--where again collision detection, and

  6. An Interactive, 3D Fault Editor for VR Environments

    Science.gov (United States)

    van Aalsburg, J.; Yikilmaz, M. B.; Kreylos, O.; Kellogg, L. H.; Rundle, J. B.

    2008-12-01

    Digitial Fault Models (DFM) play a vital role in the study of earthquake dynamics, fault-earthquake interactions, and seismicity. DFMs serve as input for finite-element method (FEM) or other earthquake simulations such as Virtual California. Generally, digital fault models are generated by importing a digitized and georeferenced (2D) fault map and/or a hillshade image of the study area into a geographical information system (GIS) application, where individual fault lines are traced by the user. Data assimilation and creation of a DFM, or updating an existing DFM based on new observations, is a tedious and time-consuming process. In order to facilitate the creation process, we are developing an immersive virtual reality (VR) application to visualize and edit fault models. This program is designed to run in immersive environments such as a CAVE (walk-in VR environment), but also works in a wide range of other environments, including desktop systems and GeoWalls. It is being developed at the UC Davis W.M. Keck Center for Active Visualization in the Earth Sciences (KeckCAVES, http://www.keckcaves.org). Our program allows users to create new models or modify existing ones; for instance by repositioning individual fault-segments, by changing the dip angle, or by modifying (or assigning) the value of a property associated with a particular fault segment (i.e. slip rate). With the addition of high resolution Digital Elevation Models (DEM) , georeferenced active tectonic fault maps and earthquake hypocenters, the user can accurately add new segments to an existing model or create a fault model entirely from scratch. Interactively created or modified models can be written to XML files at any time; from there the data may easily be converted into various formats required by the analysis software or simulation. We believe that the ease of interaction provided by VR technology is ideally suited to the problem of creating and editing digital fault models. Our software provides

  7. Voice application development for Android

    CERN Document Server

    McTear, Michael

    2013-01-01

    This book will give beginners an introduction to building voice-based applications on Android. It will begin by covering the basic concepts and will build up to creating a voice-based personal assistant. By the end of this book, you should be in a position to create your own voice-based applications on Android from scratch in next to no time.Voice Application Development for Android is for all those who are interested in speech technology and for those who, as owners of Android devices, are keen to experiment with developing voice apps for their devices. It will also be useful as a starting po

  8. Dissociating the cortical basis of memory for voices, words and tones.

    Science.gov (United States)

    Stevens, Alexander A

    2004-01-01

    Human speech carries both linguistic content and information about the speaker's identity and affect. While neuroimaging has been used extensively to study verbal memory, there has been little attention to the neural basis of memory for voices. Evidence from studies of aphasia and auditory agnosia suggests that voice memory may rely on anatomically distinct areas in the right temporal and parietal lobes regions, but there is little data on the broader neural systems involved in voice memory. The present study tested the hypothesis that the neural systems involved in voice memory are functionally distinct from the systems involved in word recognition and are primarily located in the right cerebral hemisphere. Subjects performed two-back tasks in which they were required to alternately remember the voices speaking (Voice condition), and the words they produced (Word condition). A tone memory condition was also included, as a non-speech comparison. The contrast between the Voice and Word conditions revealed greater Voice-related effects in left temporal, right frontal and right medial parietal areas, while the Word-related effects appeared in left frontal and bilateral parietal areas. These findings map out a partially right-lateralized fronto-parietal network associated with voice memory, which can be distinguished from predominantly left-hemisphere regions associated with verbal working memory. These results provide further evidence that distinct neural systems are associated with the carrier waves of speech and word identity.

  9. Sustainable Consumer Voices

    DEFF Research Database (Denmark)

    Klitmøller, Anders; Rask, Morten; Jensen, Nevena

    2011-01-01

    Aiming to explore how user driven innovation can inform high level design strategies, an in-depth empirical study was carried out, based on data from 50 observations of private vehicle users. This paper reports the resulting 5 consumer voices: Technology Enthusiast, Environmentalist, Design Lover...

  10. Voices of courage

    Directory of Open Access Journals (Sweden)

    Noraida Abdullah Karim

    2007-07-01

    Full Text Available In May 2007 the Women’s Commission for Refugee Women and Children1 presented its annual Voices of Courage awards to three displaced people who have dedicated their lives to promoting economic opportunities for refugee and displaced women and youth. These are their (edited testimonies.

  11. Listen to a voice

    DEFF Research Database (Denmark)

    Hølge-Hazelton, Bibi

    2001-01-01

    Listen to the voice of a young girl Lonnie, who was diagnosed with Type 1 diabetes at 16. Imagine that she is deeply involved in the social security system. She lives with her mother and two siblings in a working class part of a small town. She is at a special school for problematic youth, and he...

  12. Political animal voices

    NARCIS (Netherlands)

    Meijer, E.R.

    2017-01-01

    In this thesis, I develop a theory of political animal voices. The first part of the thesis focuses on non-human animal languages and forming interspecies worlds. I first investigate the relation between viewing language as exclusively human and seeing humans as categorically different from other

  13. Finding a Voice

    Science.gov (United States)

    Stuart, Shannon

    2012-01-01

    Schools have struggled for decades to provide expensive augmentative and alternative communication (AAC) resources for autistic students with communication challenges. Clunky voice output devices, often included in students' individualized education plans, cost about $8,000, a difficult expense to cover in hard times. However, mobile technology is…

  14. the Voice of Tomorrow

    Institute of Scientific and Technical Information of China (English)

    AlanBurdick

    2003-01-01

    Have you heard Mide? Coule be.Mike is a professional reader,and he's everywhere these days. On MapQuest, the Web-based map service,he'll read aloud whatever directions you ask for. If you like to have AOL or Yahoo! e-mail read aloud to you over the phone, that's Mike's voice you 're hearing. Soon

  15. What the voice reveals.

    NARCIS (Netherlands)

    Ko, Sei Jin

    2007-01-01

    Given that the voice is our main form of communication, we know surprisingly little about how it impacts judgment and behavior. Furthermore, the modern advancement in telecommunication systems, such as cellular phones, has meant that a large proportion of our everyday interactions are conducted voca

  16. The Inner Voice

    Science.gov (United States)

    Ridgway, Anthony James

    2009-01-01

    The inner voice- we all know what it is because we all have it and use it when we are thinking or reading, for example. Little work has been done on it in our field, with the notable exception of Brian Tomlinson, but presumably it must be a cognitive phenomenon which is of great importance in thinking, language learning, and reading in a foreign…

  17. Moving beyond Youth Voice

    Science.gov (United States)

    Serido, Joyce; Borden, Lynne M.; Perkins, Daniel F.

    2011-01-01

    This study combines research documenting the benefits of positive relationships between youth and caring adults on a young person's positive development with studies on youth voice to examine the mechanisms through which participation in youth programs contributes to positive developmental outcomes. Specifically, the study explores whether youth's…

  18. Bodies and Voices

    DEFF Research Database (Denmark)

    A wide-ranging collection of essays centred on readings of the body in contemporary literary and socio-anthropological discourse, from slavery and rape to female genital mutilation, from clothing, ocular pornography, voice, deformation and transmutation to the imprisoned, dismembered, remembered...

  19. Voices for Careers.

    Science.gov (United States)

    York, Edwin G.; Kapadia, Madhu

    Listed in this annotated bibliography are 502 cassette tapes of value to career exploration for Grade 7 through the adult level, whether as individualized instruction, small group study, or total class activity. Available to New Jersey educators at no charge, this Voices for Careers System is also available for duplication on request from the New…

  20. What the voice reveals

    NARCIS (Netherlands)

    Ko, Sei Jin

    2007-01-01

    Given that the voice is our main form of communication, we know surprisingly little about how it impacts judgment and behavior. Furthermore, the modern advancement in telecommunication systems, such as cellular phones, has meant that a large proportion of our everyday interactions are conducted voca

  1. Bodies and Voices

    DEFF Research Database (Denmark)

    A wide-ranging collection of essays centred on readings of the body in contemporary literary and socio-anthropological discourse, from slavery and rape to female genital mutilation, from clothing, ocular pornography, voice, deformation and transmutation to the imprisoned, dismembered, remembered...

  2. Immersive BCI with SSVEP in VR head-mounted display.

    Science.gov (United States)

    Bonkon Koo; Hwan-Gon Lee; Yunjun Nam; Seungjin Choi

    2015-08-01

    In this paper we present an immersive brain computer interface (BCI) where we use a virtual reality head-mounted display (VRHMD) to invoke SSVEP responses. Compared to visual stimuli in monitor display, we demonstrate that visual stimuli in VRHMD indeed improve the user engagement for BCI. To this end, we validate our method with experiments on a VR maze game, the goal of which is to guide a ball into the destination in a 2D grid map in a 3D space, successively choosing one of four neighboring cells using SSVEP evoked by visual stimuli on neighboring cells. Experiments indicate that the averaged information transfer rate is improved by 10% for VRHMD, compared to the case in monitor display and the users feel easier to play the game with the proposed system.

  3. 当电视新闻遇见VR

    Institute of Scientific and Technical Information of China (English)

    张晓抒

    2016-01-01

    2016年被业界称之为VR元年,虚拟技术的出现,受到了世界科技巨头们的纷纷追捧.VR沉浸式的视觉体验颠覆了传统的观感体验.而正处在低谷之中的电视媒体,在VR身上,嗅到了创新变革的味道.VR的恰时出现,电视媒体该如何应对呢?在本文中,笔者将对电视新闻制作的VR创新思路,做一次探讨式的阐述.笔者相信一点,当电视新闻遇见VR,这将是一条非常不错的改革创新之路.

  4. Visualization tool. 3DAVS and polarization-type VR system

    Energy Technology Data Exchange (ETDEWEB)

    Takeda, Yasuhiro [Fujitsu Tokushima Systems Engineering Limited, Tokushima (Japan); Ueshima, Yutaka [Japan Atomic Energy Research Inst., Kizu, Kyoto (Japan)

    2003-01-01

    In the visualization work of simulation data in every advanced research field, what is used most in the report or the presentation as a research result has still remained in the stages of the still picture or the 2-dimensional animation, in spite of recent abundance of various visualization software. With the recent progress of computational environment, however, more complicated phenomena can be so easily computed that the results are more needed to be comprehensible as well as intelligible. Therefore, it inevitably requires an animation rather than a still picture, or 3-dimensional display (virtual reality) rather than 2-dimensional one. In this report, two visualization tools, 3DAVS and Polarization-Type VR system are described as the data expression method after visualization processing. (author)

  5. VEEVVIE: Visual Explorer for Empirical Visualization, VR and Interaction Experiments.

    Science.gov (United States)

    Papadopoulos, C; Gutenko, I; Kaufman, A E

    2016-01-01

    Empirical, hypothesis-driven, experimentation is at the heart of the scientific discovery process and has become commonplace in human-factors related fields. To enable the integration of visual analytics in such experiments, we introduce VEEVVIE, the Visual Explorer for Empirical Visualization, VR and Interaction Experiments. VEEVVIE is comprised of a back-end ontology which can model several experimental designs encountered in these fields. This formalization allows VEEVVIE to capture experimental data in a query-able form and makes it accessible through a front-end interface. This front-end offers several multi-dimensional visualization widgets with built-in filtering and highlighting functionality. VEEVVIE is also expandable to support custom experimental measurements and data types through a plug-in visualization widget architecture. We demonstrate VEEVVIE through several case studies of visual analysis, performed on the design and data collected during an experiment on the scalability of high-resolution, immersive, tiled-display walls.

  6. Training Implications of Airborne Applications of Automated Speech Recognition Technology.

    Science.gov (United States)

    1980-10-01

    Coler , C. R. Automated speech recognition and man- computer interaction research at NASA Ames Research Center. In S. Harris (Ed.), Proceedings: Voice...Sons, Inc., 1964. 56 NAVTRAEQUIPCEN 80-D-0009-0155-1 Coler , C. R. Automated speech recognition and man- computer interaction research at NASA Ames

  7. Double Fourier analysis for Emotion Identification in Voiced Speech

    Science.gov (United States)

    Sierra-Sosa, D.; Bastidas, M.; Ortiz P., D.; Quintero, O. L.

    2016-04-01

    We propose a novel analysis alternative, based on two Fourier Transforms for emotion recognition from speech. Fourier analysis allows for display and synthesizes different signals, in terms of power spectral density distributions. A spectrogram of the voice signal is obtained performing a short time Fourier Transform with Gaussian windows, this spectrogram portraits frequency related features, such as vocal tract resonances and quasi-periodic excitations during voiced sounds. Emotions induce such characteristics in speech, which become apparent in spectrogram time-frequency distributions. Later, the signal time-frequency representation from spectrogram is considered an image, and processed through a 2-dimensional Fourier Transform in order to perform the spatial Fourier analysis from it. Finally features related with emotions in voiced speech are extracted and presented.

  8. Speech Recognition: A World of Opportunities

    Science.gov (United States)

    PACER Center, 2004

    2004-01-01

    Speech recognition technology helps people with disabilities interact with computers more easily. People with motor limitations, who cannot use a standard keyboard and mouse, can use their voices to navigate the computer and create documents. The technology is also useful to people with learning disabilities who experience difficulty with spelling…

  9. 20 CFR 411.415 - Who will verify the establishment of agreements between ENs and State VR agencies?

    Science.gov (United States)

    2010-04-01

    ... agreements between ENs and State VR agencies? 411.415 Section 411.415 Employees' Benefits SOCIAL SECURITY...' Participation Agreements Between Employment Networks and State Vr Agencies § 411.415 Who will verify the establishment of agreements between ENs and State VR agencies? The PM will verify the establishment of these...

  10. 20 CFR 404.2112 - Payment for VR services in a case where an individual continues to receive disability payments...

    Science.gov (United States)

    2010-04-01

    ... 20 Employees' Benefits 2 2010-04-01 2010-04-01 false Payment for VR services in a case where an individual continues to receive disability payments based on participation in an approved VR program. 404... Payment for VR services in a case where an individual continues to receive disability payments based on...

  11. 20 CFR 411.350 - Must a State VR agency participate in the Ticket to Work program?

    Science.gov (United States)

    2010-04-01

    ... 20 Employees' Benefits 2 2010-04-01 2010-04-01 false Must a State VR agency participate in the... Participation in the Ticket to Work Program § 411.350 Must a State VR agency participate in the Ticket to Work program? A State VR agency may elect, but is not required, to participate in the Ticket to Work program as...

  12. 20 CFR 404.2116 - When claims for payment for VR services must be made (filing deadlines).

    Science.gov (United States)

    2010-04-01

    ... 20 Employees' Benefits 2 2010-04-01 2010-04-01 false When claims for payment for VR services must... Rehabilitation Services Payment Provisions § 404.2116 When claims for payment for VR services must be made (filing deadlines). The State VR agency or alternate participant must file a claim for payment in each...

  13. 20 CFR 416.2216 - When claims for payment for VR services must be made (filing deadlines).

    Science.gov (United States)

    2010-04-01

    ... 20 Employees' Benefits 2 2010-04-01 2010-04-01 false When claims for payment for VR services must... Rehabilitation Services Payment Provisions § 416.2216 When claims for payment for VR services must be made (filing deadlines). The State VR agency or alternate participant must file a claim for payment in each...

  14. 20 CFR 416.2212 - Payment for VR services in a case where an individual continues to receive disability or...

    Science.gov (United States)

    2010-04-01

    ... 20 Employees' Benefits 2 2010-04-01 2010-04-01 false Payment for VR services in a case where an individual continues to receive disability or blindness benefits based on participation in an approved VR... Provisions § 416.2212 Payment for VR services in a case where an individual continues to receive disability...

  15. Forensic Speaker Recognition Law Enforcement and Counter-Terrorism

    CERN Document Server

    Patil, Hemant

    2012-01-01

    Forensic Speaker Recognition: Law Enforcement and Counter-Terrorism is an anthology of the research findings of 35 speaker recognition experts from around the world. The volume provides a multidimensional view of the complex science involved in determining whether a suspect’s voice truly matches forensic speech samples, collected by law enforcement and counter-terrorism agencies, that are associated with the commission of a terrorist act or other crimes. While addressing such topics as the challenges of forensic case work, handling speech signal degradation, analyzing features of speaker recognition to optimize voice verification system performance, and designing voice applications that meet the practical needs of law enforcement and counter-terrorism agencies, this material all sounds a common theme: how the rigors of forensic utility are demanding new levels of excellence in all aspects of speaker recognition. The contributors are among the most eminent scientists in speech engineering and signal process...

  16. 基于VR-Forces的水面舰艇防空作战仿真%Surface Antiaircraft Recovery Simulation Based on VR-Forces

    Institute of Scientific and Technical Information of China (English)

    王勃; 张立民; 徐建志

    2011-01-01

    作战模拟是水面舰艇对空防御研究的有效手段之一,VR-Forces为此提供了有力的支持.首先建立了基于VR-Forces的舰空对抗仿真系统,分析了基于VR-Forces组件式作战兵力实体结构特点,提出了基于功能组件的实体结构元模型;在电子对抗模型的基础上,以雷达告警仿真为例建立了基于VR-Forces的水面舰艇对空防御仿真组件开发过程,对组件及组件间的数据流程进行详细说明.所提出的开发方法已经成功运用于多型作战模拟训练系统中,实现了电磁环境下的舰空对抗训练.%Combat simulation is one of the most valid approaches for research on surface antiaircraft recovery , for which VR-Forces provide powerful support. Firstly , surface antiaircraft simulation system is built. Then, the features of force entity structure based on component-style of VR-Forces are analyzed and the entity meta model based-on function component is put forward. Based on models of electric measurement and taking radar warning simulation as an example , the development process of simulation components of surfaces based on VR-Forces is designed and the relationship of inter-component and components and data flow within the component are described. The approach has been applied in several types of simulation training systems and implemented the confront training between surfaces and aircraft under electronic measurement conditions successfully.

  17. April 16th : The World Voice Day

    NARCIS (Netherlands)

    Svec, Jan G.; Behlau, Mara

    2007-01-01

    Although the voice is used as an everyday basis of speech, most people realize its importance only when a voice problem arises. Increasing public awareness of the importance of the voice and alertness to voice problems are the main goals of the World Voice Day, which is celebrated yearly on April 16

  18. Risk factors for voice problems in teachers

    NARCIS (Netherlands)

    Kooijman, P. G. C.; de Jong, F. I. C. R. S.; Thomas, G.; Huinck, W.; Donders, R.; Graamans, K.; Schutte, H. K.

    2006-01-01

    In order to identify factors that are associated with voice problems and voice-related absenteeism in teachers, 1,878 questionnaires were analysed. The questionnaires inquired about personal data, voice complaints, voice-related absenteeism from work and conditions that may lead to voice complaints

  19. You're a What? Voice Actor

    Science.gov (United States)

    Liming, Drew

    2009-01-01

    This article talks about voice actors and features Tony Oliver, a professional voice actor. Voice actors help to bring one's favorite cartoon and video game characters to life. They also do voice-overs for radio and television commercials and movie trailers. These actors use the sound of their voice to sell a character's emotions--or an advertised…

  20. Risk factors for voice problems in teachers

    NARCIS (Netherlands)

    Kooijman, P. G. C.; de Jong, F. I. C. R. S.; Thomas, G.; Huinck, W.; Donders, R.; Graamans, K.; Schutte, H. K.

    2006-01-01

    In order to identify factors that are associated with voice problems and voice-related absenteeism in teachers, 1,878 questionnaires were analysed. The questionnaires inquired about personal data, voice complaints, voice-related absenteeism from work and conditions that may lead to voice complaints

  1. Risk factors for voice problems in teachers.

    NARCIS (Netherlands)

    Kooijman, P.G.C.; Jong, F.I.C.R.S. de; Thomas, G.; Huinck, W.J.; Donders, A.R.T.; Graamans, K.; Schutte, H.K.

    2006-01-01

    In order to identify factors that are associated with voice problems and voice-related absenteeism in teachers, 1,878 questionnaires were analysed. The questionnaires inquired about personal data, voice complaints, voice-related absenteeism from work and conditions that may lead to voice complaints

  2. New Dimensions of GIS Data: Exploring Virtual Reality (VR) Technology for Earth Science

    Science.gov (United States)

    Skolnik, S.; Ramirez-Linan, R.

    2016-12-01

    NASA's Science Mission Directorate (SMD) Earth Science Division (ESD) Earth Science Technology Office (ESTO) and Navteca are exploring virtual reality (VR) technology as an approach and technique related to the next generation of Earth science technology information systems. Having demonstrated the value of VR in viewing pre-visualized science data encapsulated in a movie representation of a time series, further investigation has led to the additional capability of permitting the observer to interact with the data, make selections, and view volumetric data in an innovative way. The primary objective of this project has been to investigate the use of commercially available VR hardware, the Oculus Rift and the Samsung Gear VR, for scientific analysis through an interface to ArcGIS to enable the end user to order and view data from the NASA Discover-AQ mission. A virtual console is presented through the VR interface that allows the user to select various layers of data from the server in both 2D, 3D, and full 4pi steradian views. By demonstrating the utility of VR in interacting with Discover-AQ flight mission measurements, and building on previous work done at the Atmospheric Science Data Center (ASDC) at NASA Langley supporting analysis of sources of CO2 during the Discover-AQ mission, the investigation team has shown the potential for VR as a science tool beyond simple visualization.

  3. T wave amplitude in lead aVR as a novel diagnostic marker for cardiac sarcoidosis.

    Science.gov (United States)

    Tanaka, Yoshihiro; Konno, Tetsuo; Yoshida, Shohei; Tsuda, Toyonobu; Sakata, Kenji; Furusho, Hiroshi; Takamura, Masayuki; Yoshimura, Kenichi; Yamagishi, Masakazu; Hayashi, Kenshi

    2017-03-01

    It is vital to identify cardiac involvement (CI) in patients with sarcoidosis as the condition could initially lead to sudden cardiac death. Although the T wave amplitude in lead aVR (TWAaVR) is reportedly associated with adverse cardiac events in various cardiovascular diseases, only scarce data are available concerning the utility of lead aVR in identifying CI in patients with sarcoidosis. We retrospectively investigated the diagnostic values of TWAaVR in patients with sarcoidosis in comparison with conventional electrocardiography parameters such as bundle branch block (BBB). From January 2006 to December 2014, 93 consecutive patients with sarcoidosis were enrolled (mean age, 55.7 ± 15.7 years; male, 31 %; cardiac involvement, n = 26). TWAaVR showed the greatest sensitivity (39 %) and specificity (92 %) in distinguishing between sarcoidosis patients with and without CI, at a cutoff value of -0.08 mV. The diagnostic value of BBB for cardiac involvement was significantly improved when combined with TWAaVR (sensitivity: 61-94 %, specificity: 97-89 %, area under the curve: 0.79-0.92, p = 0.018). Multivariate logistic regression analysis indicated that TWAaVR and BBB were independent electrocardiography parameters associated with CI. In summary, we observed that sarcoidosis patients exhibiting a high TWAaVR were likely to have CI. Thus, the application of a combination of BBB with TWAaVR may be useful when screening for CI in sarcoidosis patients.

  4. Voice and GPS Based Navigation System For Visually Impaired

    Directory of Open Access Journals (Sweden)

    Harsha Gawari

    2014-04-01

    Full Text Available The paper represents the architecture and implementation of a system that will help to navigate the visually impaired people. The system designed uses GPS and voice recognition along with obstacle avoidance for the purpose of guiding visually impaired. The visually impaired person issues the command and receives the direction response using audio signals. The latitude and longitude values are received continuously from the GPS receiver. The directions are given to the user with the help of audio signals. An obstacle detector is used to help the user to avoid obstacles by sending an audio message.GPS receivers use NMEA standard. With the advancement in voice recognition it becomes easier to issue commands regarding directions to the visually impaired.

  5. Speech recognition with amplitude and frequency modulations

    Science.gov (United States)

    Zeng, Fan-Gang; Nie, Kaibao; Stickney, Ginger S.; Kong, Ying-Yee; Vongphoe, Michael; Bhargave, Ashish; Wei, Chaogang; Cao, Keli

    2005-02-01

    Amplitude modulation (AM) and frequency modulation (FM) are commonly used in communication, but their relative contributions to speech recognition have not been fully explored. To bridge this gap, we derived slowly varying AM and FM from speech sounds and conducted listening tests using stimuli with different modulations in normal-hearing and cochlear-implant subjects. We found that although AM from a limited number of spectral bands may be sufficient for speech recognition in quiet, FM significantly enhances speech recognition in noise, as well as speaker and tone recognition. Additional speech reception threshold measures revealed that FM is particularly critical for speech recognition with a competing voice and is independent of spectral resolution and similarity. These results suggest that AM and FM provide independent yet complementary contributions to support robust speech recognition under realistic listening situations. Encoding FM may improve auditory scene analysis, cochlear-implant, and audiocoding performance. auditory analysis | cochlear implant | neural code | phase | scene analysis

  6. Keyboard With Voice Output

    Science.gov (United States)

    Huber, W. C.

    1986-01-01

    Voice synthesizer tells what key is about to be depressed. Verbal feedback useful for blind operators or where dim light prevents sighted operator from seeing keyboard. Also used where operator is busy observing other things while keying data into control system. Used as training aid for touch typing, and to train blind operators to use both standard and braille keyboards. Concept adapted to such equipment as typewriters, computers, calculators, telephones, cash registers, and on/off controls.

  7. Why Is My Voice Changing? (For Teens)

    Science.gov (United States)

    ... Week of Healthy Breakfasts Shyness Why Is My Voice Changing? KidsHealth > For Teens > Why Is My Voice ... deeper than a girl's, though. What Causes My Voice to Change? At puberty, guys' bodies begin producing ...

  8. Common Problems That Can Affect Your Voice

    Science.gov (United States)

    ... near you Common Problems That Can Affect Your Voice Common Problems That Can Affect Your Voice Patient ... that traditionally accompany gastro esophageal reflux disease (GERD). Voice Misuse and Overuse Speaking is a physical task ...

  9. Pattern recognition

    CERN Document Server

    Theodoridis, Sergios

    2003-01-01

    Pattern recognition is a scientific discipline that is becoming increasingly important in the age of automation and information handling and retrieval. Patter Recognition, 2e covers the entire spectrum of pattern recognition applications, from image analysis to speech recognition and communications. This book presents cutting-edge material on neural networks, - a set of linked microprocessors that can form associations and uses pattern recognition to ""learn"" -and enhances student motivation by approaching pattern recognition from the designer's point of view. A direct result of more than 10

  10. Voice and silence in organizations

    Directory of Open Access Journals (Sweden)

    Moaşa, H.

    2011-01-01

    Full Text Available Unlike previous research on voice and silence, this article breaksthe distance between the two and declines to treat them as opposites. Voice and silence are interrelated and intertwined strategic forms ofcommunication which presuppose each other in such a way that the absence of one would minimize completely the other’s presence. Social actors are not voice, or silence. Social actors can have voice or silence, they can do both because they operate at multiple levels and deal with multiple issues at different moments in time.

  11. VOICE REHABILITATION FOLLOWING TOTAL LARYNGECTOMY

    Directory of Open Access Journals (Sweden)

    Balasubramanian Thiagarajan

    2015-03-01

    Full Text Available Despite continuing advances in surgical management of laryngeal malignancy, total laryngectomy is still the treatment of choice in advanced laryngeal malignancies. Considering the longevity of the patient following total laryngectomy, various measures have been adopted in order to provide voice function to the patient. Significant advancements have taken place in voice rehabilitation of post laryngectomy patients. Advancements in oncological surgical techniques and irradiation techniques have literally cured laryngeal malignancies. Among the various voice rehabilitation techniques available TEP (Tracheo oesophageal puncture is considered to be the gold standard. This article attempts to explore the various voice rehabilitation technique available with primary focus on TEP.

  12. Speech emotion recognition based on statistical pitch model

    Institute of Scientific and Technical Information of China (English)

    WANG Zhiping; ZHAO Li; ZOU Cairong

    2006-01-01

    A modified Parzen-window method, which keep high resolution in low frequencies and keep smoothness in high frequencies, is proposed to obtain statistical model. Then, a gender classification method utilizing the statistical model is proposed, which have a 98% accuracy of gender classification while long sentence is dealt with. By separation the male voice and female voice, the mean and standard deviation of speech training samples with different emotion are used to create the corresponding emotion models. Then the Bhattacharyya distance between the test sample and statistical models of pitch, are utilized for emotion recognition in speech.The normalization of pitch for the male voice and female voice are also considered, in order to illustrate them into a uniform space. Finally, the speech emotion recognition experiment based on K Nearest Neighbor shows that, the correct rate of 81% is achieved, where it is only 73.85%if the traditional parameters are utilized.

  13. The impact of voice on speech realization

    OpenAIRE

    Jelka Breznik

    2014-01-01

    The study discusses spoken literary language and the impact of voice on speech realization. The voice consists of a sound made by a human being using the vocal folds for talking, singing, laughing, crying, screaming… The human voice is specifically the part of human sound production in which the vocal folds (vocal cords) are the primary sound source. Our voice is our instrument and identity card. How does the voice (voice tone) affect others and how do they respond, positively or negatively? ...

  14. Assessing the Utility of a Virtual Environment for Enhancing Facial Affect Recognition in Adolescents with Autism

    Science.gov (United States)

    Bekele, Esubalew; Crittendon, Julie; Zheng, Zhi; Swanson, Amy; Weitlauf, Amy; Warren, Zachary; Sarkar, Nilanjan

    2014-01-01

    Teenagers with autism spectrum disorder (ASD) and age-matched controls participated in a dynamic facial affect recognition task within a virtual reality (VR) environment. Participants identified the emotion of a facial expression displayed at varied levels of intensity by a computer generated avatar. The system assessed performance (i.e.,…

  15. Assessing the Utility of a Virtual Environment for Enhancing Facial Affect Recognition in Adolescents with Autism

    Science.gov (United States)

    Bekele, Esubalew; Crittendon, Julie; Zheng, Zhi; Swanson, Amy; Weitlauf, Amy; Warren, Zachary; Sarkar, Nilanjan

    2014-01-01

    Teenagers with autism spectrum disorder (ASD) and age-matched controls participated in a dynamic facial affect recognition task within a virtual reality (VR) environment. Participants identified the emotion of a facial expression displayed at varied levels of intensity by a computer generated avatar. The system assessed performance (i.e.,…

  16. Female artists and the VR crucible: expanding the aesthetic vocabulary

    Science.gov (United States)

    Morie, Jacquelyn Ford

    2012-03-01

    Virtual Reality was a technological wonder in its early days, and it was widely held to be a domain where men were the main practitioners. However, a survey done in 2007 of VR Artworks (Immersive Virtual Environments or VEs) showed that women have actually created the majority of artistic immersive works. This argues against the popular idea that the field has been totally dominated by men. While men have made great contributions in advancing the field, especially technologically, it appears most artistic works emerge from a decidedly feminine approach. Such an approach seems well suited to immersive environments as it incorporates aspects of inclusion, wholeness, and a blending of the body and the spirit. Female attention to holistic concerns fits the gestalt approach needed to create in a fully functional yet open-ended virtual world, which focuses not so much on producing a finished object (like a text or a sculpture) but rather on creating a possibility for becoming, like bringing a child into the world. Immersive VEs are not objective works of art to be hung on a wall and critiqued. They are vehicles for experience, vessels to live within for a piece of time.

  17. Pick Cubes for Object Picking in VR World

    Directory of Open Access Journals (Sweden)

    M. Mohamed Sathik

    2010-02-01

    Full Text Available In this paper, we proposed an algorithm which performs object picking in Virtualreality world using cubes. As ‘Picking’ is an essential interaction technique in graphics applications,Object Picking is performed in VR world using a 3D mouse or an interactive glove to explore andinteract with any of the objects. In this paper, cube is used to represent the intersection point of the raycoming from the user input and the object to be picked. By intersecting the objects in the scene with theray with a pick cube at the intersection point, it is determined which one is picked. An object is selectedwhen this pick cube intersects the object’s bounding box. In this paper, object picking is typicallyperformed by bounding box checks, taking the position of the pick cube and the objects in account so thatthe algorithm is made simpler and the computation time of the algorithm is also reduced. Pick cube canbe activated to perform more actions during interactions than other shapes. So the efficiency and theperformance of the picking process will be increased which helps the user to play in the virtual worldwith full enthusiasm.

  18. Vanilloid Receptor–Related Osmotically Activated Channel (VR-OAC), a Candidate Vertebrate Osmoreceptor

    National Research Council Canada - National Science Library

    Liedtke, Wolfgang; Choe, Yong; Martí-Renom, Marc A; Bell, Andrea M; Denis, Charlotte S; AndrejŠali; Hudspeth, A.J; Friedman, Jeffrey M; Heller, Stefan

    2000-01-01

    .... By employing a candidate-gene approach based on genes encoding members of the TRP superfamily of ion channels, we cloned cDNAs encoding the vanilloid receptor-related osmotically activated channel (VR-OAC...

  19. 20 CFR 411.420 - What information should be included in an agreement between an EN and a State VR agency?

    Science.gov (United States)

    2010-04-01

    ... agreement between an EN and a State VR agency? 411.420 Section 411.420 Employees' Benefits SOCIAL SECURITY...' Participation Agreements Between Employment Networks and State Vr Agencies § 411.420 What information should be included in an agreement between an EN and a State VR agency? The agreement between an EN and a State VR...

  20. 20 CFR 411.585 - Can a State VR agency and an EN both receive payment for serving the same beneficiary?

    Science.gov (United States)

    2010-04-01

    ... 20 Employees' Benefits 2 2010-04-01 2010-04-01 false Can a State VR agency and an EN both receive... State VR agency and an EN both receive payment for serving the same beneficiary? Yes. A State VR agency... assigned to one EN, including a State VR agency acting as an EN, at a time. It also cannot be assigned to...

  1. 20 CFR 411.425 - What should a State VR agency do if it gets an attempted referral from an EN and no agreement has...

    Science.gov (United States)

    2010-04-01

    ... 20 Employees' Benefits 2 2010-04-01 2010-04-01 false What should a State VR agency do if it gets... VR agency? 411.425 Section 411.425 Employees' Benefits SOCIAL SECURITY ADMINISTRATION THE TICKET TO... Between Employment Networks and State Vr Agencies § 411.425 What should a State VR agency do if it gets an...

  2. The Voice Handicap Index with Post-Laryngectomy Male Voices

    Science.gov (United States)

    Evans, Eryl; Carding, Paul; Drinnan, Michael

    2009-01-01

    Background: Surgical treatment for advanced laryngeal cancer involves complete removal of the larynx ("laryngectomy") and initial total loss of voice. Post-laryngectomy rehabilitation involves implementation of different means of "voicing" for these patients wherever possible. There is little information about laryngectomees'…

  3. Pedagogic Voice: Student Voice in Teaching and Engagement Pedagogies

    Science.gov (United States)

    Baroutsis, Aspa; McGregor, Glenda; Mills, Martin

    2016-01-01

    In this paper, we are concerned with the notion of "pedagogic voice" as it relates to the presence of student "voice" in teaching, learning and curriculum matters at an alternative, or second chance, school in Australia. This school draws upon many of the principles of democratic schooling via its utilisation of student voice…

  4. ST segment elevation in lead aVR during exercise testing is associated with LAD stenosis

    Energy Technology Data Exchange (ETDEWEB)

    Neill, Johanne; Harbinson, Mark [Royal Victoria Hospital, Regional Medical Cardiology Centre, Belfast (United Kingdom); Queens University, Belfast (United Kingdom); Shannon, Heather J.; Morton, Amanda; Muir, Alison R.; Adgey, Jennifer A. [Royal Victoria Hospital, Regional Medical Cardiology Centre, Belfast (United Kingdom)

    2007-03-15

    To evaluate, in patients with chest pain, the diagnostic value of ST elevation (STE) in lead aVR during stress testing prior to {sup 99m} Tc-sestamibi scanning correlating ischaemic territory with angiographic findings. Consecutive patients attending for {sup 99m} Tc-sestamibi myocardial perfusion imaging (MPI) completed a treadmill protocol. Peak exercise ECGs were coded. STE {>=}0.05 mV in lead aVR was considered significant. Gated perfusion images and findings at angiography were assessed. STE in lead aVR occurred in 25% (138/557) of the patients. More patients with STE in aVR had a reversible defect on imaging compared with those who had no STE in aVR (41%, 56/138 vs 27%, 114/419, p=0.003). Defects indicating a left anterior descending artery (LAD) culprit lesion were more common in the STE in aVR group (20%, 27/138 vs 9%, 39/419, p=0.001). There was a trend towards coronary artery stenosis (>70%) in a double vessel distribution involving the LAD in those patients who had STE in aVR compared with those who did not (22%, 8/37 vs 5%, 4/77, p=0.06). Logistic regression analysis demonstrated that STE in aVR (OR 1.36, p=0.233) is not an independent predictor of inducible abnormality when adjusted for STD >0.1 mV (OR 1.69, p=0.026). However, using anterior wall defect as an end-point, STE in aVR (OR 2.77, p=0.008) was a predictor even after adjustment for STD (OR 1.43, p=0.281). STE in lead aVR during exercise does not diagnose more inducible abnormalities than STD alone. However, unlike STD, which is not predictive of a territory of ischaemia, STE in aVR may indicate an anterior wall defect. (orig.)

  5. Chord Recognition Based on Temporal Correlation Support Vector Machine

    OpenAIRE

    Zhongyang Rao; Xin Guan; Jianfu Teng

    2016-01-01

    In this paper, we propose a method called temporal correlation support vector machine (TCSVM) for automatic major-minor chord recognition in audio music. We first use robust principal component analysis to separate the singing voice from the music to reduce the influence of the singing voice and consider the temporal correlations of the chord features. Using robust principal component analysis, we expect the low-rank component of the spectrogram matrix to contain the musical accompaniment and...

  6. NeuroVR: an open source virtual reality platform for clinical psychology and behavioral neurosciences.

    Science.gov (United States)

    Riva, Giuseppe; Gaggioli, Andrea; Villani, Daniela; Preziosa, Alessandra; Morganti, Francesca; Corsi, Riccardo; Faletti, Gianluca; Vezzadini, Luca

    2007-01-01

    In the past decade, the use of virtual reality for clinical and research applications has become more widespread. However, the diffusion of this approach is still limited by three main issues: poor usability, lack of technical expertise among clinical professionals, and high costs. To address these challenges, we introduce NeuroVR (http://www.neurovr.org--http://www.neurotiv.org), a cost-free virtual reality platform based on open-source software, that allows non-expert users to adapt the content of a pre-designed virtual environment to meet the specific needs of the clinical or experimental setting. Using the NeuroVR Editor, the user can choose the appropriate psychological stimuli/stressors from a database of objects (both 2D and 3D) and videos, and easily place them into the virtual environment. The edited scene can then be visualized in the NeuroVR Player using either immersive or non-immersive displays. Currently, the NeuroVR library includes different virtual scenes (apartment, office, square, supermarket, park, classroom, etc.), covering two of the most studied clinical applications of VR: specific phobias and eating disorders. The NeuroVR Editor is based on Blender (http://www.blender.org), the open source, cross-platform suite of tools for 3D creation, and is available as a completely free resource. An interesting feature of the NeuroVR Editor is the possibility to add new objects to the database. This feature allows the therapist to enhance the patient's feeling of familiarity and intimacy with the virtual scene, i.e., by using photos or movies of objects/people that are part of the patient's daily life, thereby improving the efficacy of the exposure. The NeuroVR platform runs on standard personal computers with Microsoft Windows; the only requirement for the hardware is related to the graphics card, which must support OpenGL.

  7. Towards surgeon-authored VR training: the scene-development cycle.

    Science.gov (United States)

    Dindar, Saleh; Nguyen, Thien; Peters, Jörg

    2016-01-01

    Enabling surgeon-educators to themselves create virtual reality (VR) training units promises greater variety, specialization, and relevance of the units. This paper describes a software bridge that semi-automates the scene-generation cycle, a key bottleneck in authoring, modeling, and developing VR units. Augmenting an open source modeling environment with physical behavior attachment and collision specifications yields single-click testing of the full force-feedback enabled anatomical scene.

  8. Air bridge docking -- on voice command recognition and synthesis technology in ATC%陆空对接的桥梁--论空管中的语音指令识别与合成技术

    Institute of Scientific and Technical Information of China (English)

    马林南

    2015-01-01

    空中交通管制指令标准用语的训练是非常重要的、不可或缺的内容,在我国民航运输业不断发展的时代,空中交通流量急剧增长,陆空通话标准用语的训练是空中管制模拟训练中的主要技术。为了改变当前训练系统依赖于专人飞行员席位的设置状况,本文对空中管制中的语音指令识别与合成技术进行研究,针对其中的多个关键技术开展讨论,以实现空中控制模拟训练机中自动飞行员席位替代专人飞行员席位。%The training of air traffic control instructions standard terminology is very important and indispensable content,in the era of the continuous development of China's civil aviation transportation industry,the rapid growth of air traffic flow,radiotelephony communication standard language training is training in the main technology of air traffic control simulation.In order to change the current training system depends on the special pilot seats in,this paper of air traffic control in the speech command recognition and synthesis technology research, for which a number of key technology to launch the discussion,in order to achieve the air control simulation training machine automatic pilot seat replacement pilot special seats.

  9. VR-Cluster: Dynamic Migration for Resource Fragmentation Problem in Virtual Router Platform

    Directory of Open Access Journals (Sweden)

    Xianming Gao

    2016-01-01

    Full Text Available Network virtualization technology is regarded as one of gradual schemes to network architecture evolution. With the development of network functions virtualization, operators make lots of effort to achieve router virtualization by using general servers. In order to ensure high performance, virtual router platform usually adopts a cluster of general servers, which can be also regarded as a special cloud computing environment. However, due to frequent creation and deletion of router instances, it may generate lots of resource fragmentation to prevent platform from establishing new router instances. In order to solve “resource fragmentation problem,” we firstly propose VR-Cluster, which introduces two extra function planes including switching plane and resource management plane. Switching plane is mainly used to support seamless migration of router instances without packet loss; resource management plane can dynamically move router instances from one server to another server by using VR-mapping algorithms. Besides, three VR-mapping algorithms including first-fit mapping algorithm, best-fit mapping algorithm, and worst-fit mapping algorithm are proposed based on VR-Cluster. At last, we establish VR-Cluster protosystem by using general X86 servers, evaluate its migration time, and further analyze advantages and disadvantages of our proposed VR-mapping algorithms to solve resource fragmentation problem.

  10. Automation of Bioinformatics Workflows using CloVR, a Cloud Virtual Resource

    Science.gov (United States)

    Vangala, Mahesh

    2013-01-01

    Exponential growth of biological data, mainly due to revolutionary developments in NGS technologies in past couple of years, created a multitude of challenges in downstream data analysis using bioinformatics approaches. To handle such tsunami of data, bioinformatics analysis must be carried out in an automated and parallel fashion. A successful analysis often requires more than a few computational steps and bootstrapping these individual steps (scripts) into components and the components into pipelines certainly makes bioinformatics a reproducible and manageable segment of scientific research. CloVR (http://clovr.org) is one such flexible framework that facilitates the abstraction of bioinformatics workflows into executable pipelines. CloVR comes packaged with various built-in bioinformatics pipelines that can make use of multicore processing power when run on servers and/or cloud. CloVR is amenable to build custom pipelines based on individual laboratory requirements. CloVR is available as a single executable virtual image file that comes bundled with pre-installed and pre-configured bioinformatics tools and packages and thus circumvents the cumbersome installation difficulties. CloVR is highly portable and can be run on traditional desktop/laptop computers, central servers and cloud compute farms. In conclusion, CloVR provides built-in automated analysis pipelines for microbial genomics with a scope to develop and integrate custom-workflows that make use of parallel processing power when run on compute clusters, there by addressing the bioinformatics challenges with NGS data.

  11. AR Feels "Softer" than VR: Haptic Perception of Stiffness in Augmented versus Virtual Reality.

    Science.gov (United States)

    Gaffary, Yoren; Le Gouis, Benoit; Marchal, Maud; Argelaguet, Ferran; Arnaldi, Bruno; Lecuyer, Anatole

    2017-11-01

    Does it feel the same when you touch an object in Augmented Reality (AR) or in Virtual Reality (VR)? In this paper we study and compare the haptic perception of stiffness of a virtual object in two situations: (1) a purely virtual environment versus (2) a real and augmented environment. We have designed an experimental setup based on a Microsoft HoloLens and a haptic force-feedback device, enabling to press a virtual piston, and compare its stiffness successively in either Augmented Reality (the virtual piston is surrounded by several real objects all located inside a cardboard box) or in Virtual Reality (the same virtual piston is displayed in a fully virtual scene composed of the same other objects). We have conducted a psychophysical experiment with 12 participants. Our results show a surprising bias in perception between the two conditions. The virtual piston is on average perceived stiffer in the VR condition compared to the AR condition. For instance, when the piston had the same stiffness in AR and VR, participants would select the VR piston as the stiffer one in 60% of cases. This suggests a psychological effect as if objects in AR would feel "softer" than in pure VR. Taken together, our results open new perspectives on perception in AR versus VR, and pave the way to future studies aiming at characterizing potential perceptual biases.

  12. Voice and Speech after Laryngectomy

    Science.gov (United States)

    Stajner-Katusic, Smiljka; Horga, Damir; Musura, Maja; Globlek, Dubravka

    2006-01-01

    The aim of the investigation is to compare voice and speech quality in alaryngeal patients using esophageal speech (ESOP, eight subjects), electroacoustical speech aid (EACA, six subjects) and tracheoesophageal voice prosthesis (TEVP, three subjects). The subjects reading a short story were recorded in the sound-proof booth and the speech samples…

  13. Voice Quality of Psychological Origin

    Science.gov (United States)

    Teixeira, Antonio; Nunes, Ana; Coimbra, Rosa Lidia; Lima, Rosa; Moutinho, Lurdes

    2008-01-01

    Variations in voice quality are essentially related to modifications of the glottal source parameters, such as: F[subscript 0], jitter, and shimmer. Voice quality is affected by prosody, emotional state, and vocal pathologies. Psychogenic vocal pathology is particularly interesting. In the present case study, the speaker naturally presented a…

  14. Voice handicap index in Swedish.

    Science.gov (United States)

    Ohlsson, Ann-Christine; Dotevall, Hans

    2009-01-01

    The objective of this study was to evaluate a Swedish version of the voice handicap index questionnaire (Sw-VHI). A total of 57 adult, dysphonic patients and 15 healthy controls completed the Sw-VHI and rated the degree of vocal fatigue and hoarseness on visual analogue scales. A perceptual voice evaluation was also performed. Test-retest reliability was analyzed in 38 subjects without voice complaints. Sw-VHI distinguished between dysphonic subjects and controls (P 0.84) and test-retest reliability (intraclass correlation coefficient >0.75) were good. Only moderate or weak correlations were found between Sw-VHI and the subjective and perceptual voice ratings. The data indicate that a difference above 13 points for the total Sw-VHI score and above 6 points for the Sw-VHI subscales is significant for an individual when comparing two different occasions. In conclusion, the Sw-VHI appears to be a robust instrument for assessment of the psycho-social impact of a voice disorder. However, Sw-VHI seems to, at least partly, capture different aspects of voice function to the subjective voice ratings and the perceptual voice evaluation.

  15. Enhancing Author's Voice through Scripting

    Science.gov (United States)

    Young, Chase J.; Rasinski, Timothy V.

    2011-01-01

    The authors suggest using scripting as a strategy to mentor and enhance author's voice in writing. Through gradual release, students use authentic literature as a model for writing with voice. The authors also propose possible extensions for independent practice, integration across content areas, and tips for evaluation.

  16. Voices in History

    Directory of Open Access Journals (Sweden)

    Ivan Leudar

    2001-06-01

    Full Text Available Experiences of “hearing voices” nowadays usually count as verbal hallucinations and they indicate serious mental illness. Some are first rank symptoms of schizophrenia, and the mass media, at least in Britain, tend to present them as antecedents of impulsive violence. They are, however, also found in other psychiatric conditions and epidemiological surveys reveal that even individuals with no need of psychiatric help can hear voices, sometimes following bereavement or abuse, but sometimes for no discernible reason. So do these experiences necessarily mean insanity and violence, and must they be thought of as pathogenic hallucinations; or are there other ways to understand them and live with them, and with what consequences?One way to make our thinking more flexible is to turn to history. We find that hearing voices was always an enigmatic experience, and the people who had it were rare. The gallery of voice hearers is, though, distinguished and it includes Galilei, Bunyan and St Teresa. Socrates heard a daemon who guided his actions, but in his time this did not signify madness, nor was it described as a hallucination. Yet in 19th century French psychological medicine the daemon became a hallucination and Socrates was retrospectively diagnosed as mentally ill. This paper examines the controversies which surrounded the experience at different points in history as well as the practice of retrospective psychiatry. The conclusion reached on the basis of the historical materials is that the experience and the ontological status it is ascribed are not trans-cultural or trans-historic but situated both in history and in the contemporary conflicts.

  17. Facing Sound - Voicing Art

    DEFF Research Database (Denmark)

    Lønstrup, Ansa

    2013-01-01

    This article is based on examples of contemporary audiovisual art, with a special focus on the Tony Oursler exhibition Face to Face at Aarhus Art Museum ARoS in Denmark in March-July 2012. My investigation involves a combination of qualitative interviews with visitors, observations of the audienc......´s interactions with the exhibition and the artwork in the museum space and short analyses of individual works of art based on reception aesthetics and phenomenology and inspired by newer writings on sound, voice and listening....

  18. Voice over IP Security

    CERN Document Server

    Keromytis, Angelos D

    2011-01-01

    Voice over IP (VoIP) and Internet Multimedia Subsystem technologies (IMS) are rapidly being adopted by consumers, enterprises, governments and militaries. These technologies offer higher flexibility and more features than traditional telephony (PSTN) infrastructures, as well as the potential for lower cost through equipment consolidation and, for the consumer market, new business models. However, VoIP systems also represent a higher complexity in terms of architecture, protocols and implementation, with a corresponding increase in the potential for misuse. In this book, the authors examine the

  19. Effects on vocal range and voice quality of singing voice training: the classically trained female voice.

    Science.gov (United States)

    Pabon, Peter; Stallinga, Rob; Södersten, Maria; Ternström, Sten

    2014-01-01

    A longitudinal study was performed on the acoustical effects of singing voice training under a given study program, using the voice range profile (VRP). Pretraining and posttraining recordings were made of students who participated in a 3-year bachelor singing study program. A questionnaire that included questions on optimal range, register use, classification, vocal health and hygiene, mixing technique, and training goals was used to rate and categorize self-assessed voice changes. Based on the responses, a subgroup of 10 classically trained female voices was selected, which was homogeneous enough for effects of training to be identified. The VRP perimeter contour was analyzed for effects of voice training. Also, a mapping within the VRP of voice quality, as expressed by the crest factor, was used to indicate the register boundaries and to monitor the acoustical consequences of the newly learned vocal technique of "mixed voice." VRPs were averaged across subjects. Findings were compared with the self-assessed vocal changes. Pre/post comparison of the average VRPs showed, in the midrange, (1) a decrease in the VRP area that was associated with the loud chest voice, (2) a reduction of the crest factor values, and (3) a reduction of maximum sound pressure level values. The students' self-evaluations of the voice changes appeared in some cases to contradict the VRP findings. VRPs of individual voices were seen to change over the course of a singing education. These changes were manifest also in the average group. High-resolution computerized recording, complemented with an acoustic register marker, allows a meaningful assessment of some effects of training, on an individual basis and for groups that comprise singers of a specific genre. It is argued that this kind of investigation is possible only within a focused training program, given by a faculty who has agreed on the goals. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.

  20. A preference-based measure of health: the VR-6D derived from the veterans RAND 12-Item Health Survey.

    Science.gov (United States)

    Selim, Alfredo J; Rogers, William; Qian, Shirley X; Brazier, John; Kazis, Lewis E

    2011-10-01

    The Veterans RAND 12-Item Health Survey (VR-12) is currently the major endpoint used in the Medicare managed care outcomes measure in the Healthcare Effectiveness Data and Information Set (HEDIS(®)), referred to as the Health Outcomes Survey (HOS). The purpose of this study is to adapt the Brazier SF-6D utility measure to the VR-12 to generate a single utility index. We used the HOS cohorts 2 and 3 for SF-36 data and 9 for VR-12 data. We calculated SF-6D scores from the SF-36 using the algorithms developed by Brazier and colleagues. The values of the Brazier SF-6D were used to estimate utility scores from the VR-12 using a mapping approach based on a 2-stage mapping procedure, named as VR-6D. The VR-6D derived from the VR-12 has similar distributional properties as the SF-6D. The change in VR-6D showed significant variations across disease groups with different levels of morbidity and mortality. This study produced a utility measure for the VR-12 that is comparable to the SF-6D and responsive to change. The VR-6D can be used in evaluations of health care plans and cost-effectiveness analysis to compare the health gains that health care interventions can achieve.

  1. Questioning Photovoice Research: Whose Voice?

    Science.gov (United States)

    Evans-Agnew, Robin A; Rosemberg, Marie-Anne S

    2016-07-01

    Photovoice is an important participatory research tool for advancing health equity. Our purpose is to critically review how participant voice is promoted through the photovoice process of taking and discussing photos and adding text/captions. PubMed, Scopus, PsycINFO, and Web of Science databases were searched from the years 2008 to 2014 using the keywords photovoice, photonovella, photovoice and social justice, and photovoice and participatory action research. Research articles were reviewed for how participant voice was (a) analyzed, (b) exhibited in community forums, and (c) disseminated through published manuscripts. Of 21 studies, 13 described participant voice in the data analysis, 14 described participants' control over exhibiting photo-texts, seven manuscripts included a comprehensive set of photo-texts, and none described participant input on choice of manuscript photo-texts. Photovoice designs vary in the advancement of participant voice, with the least advancement occurring in manuscript publication. Future photovoice researchers should expand approaches to advancing participant voice.

  2. Voice quality of psychological origin.

    Science.gov (United States)

    Teixeira, Antonio; Nunes, Ana; Coimbra, Rosa Lídia; Lima, Rosa; Moutinho, Lurdes

    2008-01-01

    Variations in voice quality are essentially related to modifications of the glottal source parameters, such as: F0, jitter, and shimmer. Voice quality is affected by prosody, emotional state, and vocal pathologies. Psychogenic vocal pathology is particularly interesting. In the present case study, the speaker naturally presented a ventricular band voice whereas in a controlled production he was able to use a more normal phonation process. A small corpus was recorded which included sustained vowels and short sentences in both registers. A normal speaker was also recorded in similar tasks. Annotation and extraction of parameters were made using Praat's voice report function. Application of the Hoarseness Diagram to sustained productions situates this case in the pseudo-glottic phonation region. Analysis of several different parameters related to F0, jitter, shimmer, and harmonicity revealed that the speaker with psychogenic voice was capable of controlling certain parameters (e.g. F0 maximum) but was unable to correct others such as shimmer.

  3. Taking Immersive VR Leap in Training of Landing Signal Officers.

    Science.gov (United States)

    Greunke, Larry; Sadagic, Amela

    2016-04-01

    A major training device used to train all Landing Signal Officers (LSOs) for several decades has been the Landing Signal Officer Trainer, Device 2H111. This simulator, located in Oceana, VA, is contained within a two story tall room; it consists of several large screens and a physical rendition of the actual instruments used by LSOs in their operational environment. The young officers who serve in this specialty will typically encounter this system for only a short period of formal instruction (six one-hour long sessions), leaving multiple gaps in training. While experience with 2H111 is extremely valuable for all LSO officers, the amount of time they can spend using this training device is undeniably too short. The need to provide LSOs with an unlimited number of training opportunities unrestricted by location and time, married with recent advancements in commercial off the shelf (COTS) immersive technologies, provided an ideal platform to create a lightweight training solution that would fill those gaps and extend beyond the capabilities currently offered in the 2H111 simulator. This paper details our efforts on task analysis, surveying of user domain, mapping of 2H111 training capabilities to new prototype system to ensure its support of major training objectives of 2H111, design and development of prototype training system, and a feasibility study that included tests of technical system performance and informal testing with trainees at the LSO Schoolhouse. The results achieved in this effort indicate that the time for LSO training to make the leap to immersive VR has decidedly come.

  4. Muscular tension and body posture in relation to voice handicap and voice quality in teachers with persistent voice complaints.

    NARCIS (Netherlands)

    Kooijman, P.G.C.; Jong, F.I.C.R.S. de; Oudes, M.J.; Huinck, W.J.; Acht, H. van; Graamans, K.

    2005-01-01

    The aim of this study was to investigate the relationship between extrinsic laryngeal muscular hypertonicity and deviant body posture on the one hand and voice handicap and voice quality on the other hand in teachers with persistent voice complaints and a history of voice-related absenteeism. The st

  5. Objective voice and speech analysis of persons with chronic hoarseness by prosodic analysis of speech samples.

    Science.gov (United States)

    Haderlein, Tino; Döllinger, Michael; Matoušek, Václav; Nöth, Elmar

    2016-10-01

    Automatic voice assessment is often performed using sustained vowels. In contrast, speech analysis of read-out texts can be applied to voice and speech assessment. Automatic speech recognition and prosodic analysis were used to find regression formulae between automatic and perceptual assessment of four voice and four speech criteria. The regression was trained with 21 men and 62 women (average age 49.2 years) and tested with another set of 24 men and 49 women (48.3 years), all suffering from chronic hoarseness. They read the text 'Der Nordwind und die Sonne' ('The North Wind and the Sun'). Five voice and speech therapists evaluated the data on 5-point Likert scales. Ten prosodic and recognition accuracy measures (features) were identified which describe all the examined criteria. Inter-rater correlation within the expert group was between r = 0.63 for the criterion 'match of breath and sense units' and r = 0.87 for the overall voice quality. Human-machine correlation was between r = 0.40 for the match of breath and sense units and r = 0.82 for intelligibility. The perceptual ratings of different criteria were highly correlated with each other. Likewise, the feature sets modeling the criteria were very similar. The automatic method is suitable for assessing chronic hoarseness in general and for subgroups of functional and organic dysphonia. In its current version, it is almost as reliable as a randomly picked rater from a group of voice and speech therapists.

  6. Bodies, Spaces, Voices, Silences

    Directory of Open Access Journals (Sweden)

    Donatella Mazzoleni

    2013-07-01

    Full Text Available A good architecture should not only allow functional, formal and technical quality for urban spaces, but also let the voice of the city be perceived, listened, enjoyed. Every city has got its specific sound identity, or “ISO” (R. O. Benenzon, made up of a complex texture of background noises and fluctuation of sound figures emerging and disappearing in a game of continuous fadings. For instance, the ISO of Naples is characterized by a spread need of hearing the sound return of one’s/others voices, by a hate of silence. Cities may fall ill: illness from noise, within super-crowded neighbourhoods, or illness from silence, in the forced isolation of peripheries. The proposal of an urban music therapy denotes an unpublished and innovative enlarged interdisciplinary research path, where architecture, music, medicine, psychology, communication science may converge, in order to work for rebalancing spaces and relation life of the urban collectivity, through the care of body and sound dimensions.

  7. Crossing Cultures with Multi-Voiced Journals

    Science.gov (United States)

    Styslinger, Mary E.; Whisenant, Alison

    2004-01-01

    In this article, the authors discuss the benefits of using multi-voiced journals as a teaching strategy in reading instruction. Multi-voiced journals, an adaptation of dual-voiced journals, encourage responses to reading in varied, cultured voices of characters. It is similar to reading journals in that they prod students to connect to the lives…

  8. Pattern Recognition Methods and Features Selection for Speech Emotion Recognition System

    Science.gov (United States)

    Partila, Pavol; Voznak, Miroslav; Tovarek, Jaromir

    2015-01-01

    The impact of the classification method and features selection for the speech emotion recognition accuracy is discussed in this paper. Selecting the correct parameters in combination with the classifier is an important part of reducing the complexity of system computing. This step is necessary especially for systems that will be deployed in real-time applications. The reason for the development and improvement of speech emotion recognition systems is wide usability in nowadays automatic voice controlled systems. Berlin database of emotional recordings was used in this experiment. Classification accuracy of artificial neural networks, k-nearest neighbours, and Gaussian mixture model is measured considering the selection of prosodic, spectral, and voice quality features. The purpose was to find an optimal combination of methods and group of features for stress detection in human speech. The research contribution lies in the design of the speech emotion recognition system due to its accuracy and efficiency. PMID:26346654

  9. Pattern Recognition Methods and Features Selection for Speech Emotion Recognition System

    Directory of Open Access Journals (Sweden)

    Pavol Partila

    2015-01-01

    Full Text Available The impact of the classification method and features selection for the speech emotion recognition accuracy is discussed in this paper. Selecting the correct parameters in combination with the classifier is an important part of reducing the complexity of system computing. This step is necessary especially for systems that will be deployed in real-time applications. The reason for the development and improvement of speech emotion recognition systems is wide usability in nowadays automatic voice controlled systems. Berlin database of emotional recordings was used in this experiment. Classification accuracy of artificial neural networks, k-nearest neighbours, and Gaussian mixture model is measured considering the selection of prosodic, spectral, and voice quality features. The purpose was to find an optimal combination of methods and group of features for stress detection in human speech. The research contribution lies in the design of the speech emotion recognition system due to its accuracy and efficiency.

  10. Lexical frequency and voice assimilation.

    Science.gov (United States)

    Ernestus, Mirjam; Lahey, Mybeth; Verhees, Femke; Baayen, R Harald

    2006-08-01

    Acoustic duration and degree of vowel reduction are known to correlate with a word's frequency of occurrence. The present study broadens the research on the role of frequency in speech production to voice assimilation. The test case was regressive voice assimilation in Dutch. Clusters from a corpus of read speech were more often perceived as unassimilated in lower-frequency words and as either completely voiced (regressive assimilation) or, unexpectedly, as completely voiceless (progressive assimilation) in higher-frequency words. Frequency did not predict the voice classifications over and above important acoustic cues to voicing, suggesting that the frequency effects on the classifications were carried exclusively by the acoustic signal. The duration of the cluster and the period of glottal vibration during the cluster decreased while the duration of the release noises increased with frequency. This indicates that speakers reduce articulatory effort for higher-frequency words, with some acoustic cues signaling more voicing and others less voicing. A higher frequency leads not only to acoustic reduction but also to more assimilation.

  11. Facial, vocal and musical emotion recognition is altered in paranoid schizophrenic patients.

    Science.gov (United States)

    Weisgerber, Anne; Vermeulen, Nicolas; Peretz, Isabelle; Samson, Séverine; Philippot, Pierre; Maurage, Pierre; De Graeuwe D'Aoust, Catherine; De Jaegere, Aline; Delatte, Benoît; Gillain, Benoît; De Longueville, Xavier; Constant, Eric

    2015-09-30

    Disturbed processing of emotional faces and voices is typically observed in schizophrenia. This deficit leads to impaired social cognition and interactions. In this study, we investigated whether impaired processing of emotions also affects musical stimuli, which are widely present in daily life and known for their emotional impact. Thirty schizophrenic patients and 30 matched healthy controls evaluated the emotional content of musical, vocal and facial stimuli. Schizophrenic patients are less accurate than healthy controls in recognizing emotion in music, voices and faces. Our results confirm impaired recognition of emotion in voice and face stimuli in schizophrenic patients and extend this observation to the recognition of emotion in musical stimuli.

  12. Emotional cues during simultaneous face and voice processing: electrophysiological insights.

    Directory of Open Access Journals (Sweden)

    Taosheng Liu

    Full Text Available Both facial expression and tone of voice represent key signals of emotional communication but their brain processing correlates remain unclear. Accordingly, we constructed a novel implicit emotion recognition task consisting of simultaneously presented human faces and voices with neutral, happy, and angry valence, within the context of recognizing monkey faces and voices task. To investigate the temporal unfolding of the processing of affective information from human face-voice pairings, we recorded event-related potentials (ERPs to these audiovisual test stimuli in 18 normal healthy subjects; N100, P200, N250, P300 components were observed at electrodes in the frontal-central region, while P100, N170, P270 were observed at electrodes in the parietal-occipital region. Results indicated a significant audiovisual stimulus effect on the amplitudes and latencies of components in frontal-central (P200, P300, and N250 but not the parietal occipital region (P100, N170 and P270. Specifically, P200 and P300 amplitudes were more positive for emotional relative to neutral audiovisual stimuli, irrespective of valence, whereas N250 amplitude was more negative for neutral relative to emotional stimuli. No differentiation was observed between angry and happy conditions. The results suggest that the general effect of emotion on audiovisual processing can emerge as early as 200 msec (P200 peak latency post stimulus onset, in spite of implicit affective processing task demands, and that such effect is mainly distributed in the frontal-central region.

  13. Emotional cues during simultaneous face and voice processing: electrophysiological insights.

    Science.gov (United States)

    Liu, Taosheng; Pinheiro, Ana; Zhao, Zhongxin; Nestor, Paul G; McCarley, Robert W; Niznikiewicz, Margaret A

    2012-01-01

    Both facial expression and tone of voice represent key signals of emotional communication but their brain processing correlates remain unclear. Accordingly, we constructed a novel implicit emotion recognition task consisting of simultaneously presented human faces and voices with neutral, happy, and angry valence, within the context of recognizing monkey faces and voices task. To investigate the temporal unfolding of the processing of affective information from human face-voice pairings, we recorded event-related potentials (ERPs) to these audiovisual test stimuli in 18 normal healthy subjects; N100, P200, N250, P300 components were observed at electrodes in the frontal-central region, while P100, N170, P270 were observed at electrodes in the parietal-occipital region. Results indicated a significant audiovisual stimulus effect on the amplitudes and latencies of components in frontal-central (P200, P300, and N250) but not the parietal occipital region (P100, N170 and P270). Specifically, P200 and P300 amplitudes were more positive for emotional relative to neutral audiovisual stimuli, irrespective of valence, whereas N250 amplitude was more negative for neutral relative to emotional stimuli. No differentiation was observed between angry and happy conditions. The results suggest that the general effect of emotion on audiovisual processing can emerge as early as 200 msec (P200 peak latency) post stimulus onset, in spite of implicit affective processing task demands, and that such effect is mainly distributed in the frontal-central region.

  14. Memory for faces and voices varies as a function of sex and expressed emotion.

    Science.gov (United States)

    S Cortes, Diana; Laukka, Petri; Lindahl, Christina; Fischer, Håkan

    2017-01-01

    We investigated how memory for faces and voices (presented separately and in combination) varies as a function of sex and emotional expression (anger, disgust, fear, happiness, sadness, and neutral). At encoding, participants judged the expressed emotion of items in forced-choice tasks, followed by incidental Remember/Know recognition tasks. Results from 600 participants showed that accuracy (hits minus false alarms) was consistently higher for neutral compared to emotional items, whereas accuracy for specific emotions varied across the presentation modalities (i.e., faces, voices, and face-voice combinations). For the subjective sense of recollection ("remember" hits), neutral items received the highest hit rates only for faces, whereas for voices and face-voice combinations anger and fear expressions instead received the highest recollection rates. We also observed better accuracy for items by female expressers, and own-sex bias where female participants displayed memory advantage for female faces and face-voice combinations. Results further suggest that own-sex bias can be explained by recollection, rather than familiarity, rates. Overall, results show that memory for faces and voices may be influenced by the expressions that they carry, as well as by the sex of both items and participants. Emotion expressions may also enhance the subjective sense of recollection without enhancing memory accuracy.

  15. A Transliteration Algorithm for Adapting a Japanese Voice Controlled Browser to English

    Science.gov (United States)

    Saito, Kuniko; Shinohara, Akio; Nagata, Masaaki; Ohara, Hisashi

    We propose a novel algorithm to transliterate English to Japanese and its application to a voice controlled browser, which enable ordinary Japanese people to browse English Web site by voice. Speech recognition software designed for native English speakers do not work for most Japanese because Japanese can't pronounce English as native English speakers do. Therefore, we combined Japanese speech recognition software with English-to-Japanese transliteration software. The accuracy of our transliteration algorithm is 80% recall for the top candidate, and 92% recall for the top three candidates. The browser using this transliteration algorithm makes it possible for Japanese to navigate English Web pages almost as accurate as Japanese pages by voice commands.

  16. The voice of emotion across species: how do human listeners recognize animals' affective states?

    Directory of Open Access Journals (Sweden)

    Marina Scheumann

    Full Text Available Voice-induced cross-taxa emotional recognition is the ability to understand the emotional state of another species based on its voice. In the past, induced affective states, experience-dependent higher cognitive processes or cross-taxa universal acoustic coding and processing mechanisms have been discussed to underlie this ability in humans. The present study sets out to distinguish the influence of familiarity and phylogeny on voice-induced cross-taxa emotional perception in humans. For the first time, two perspectives are taken into account: the self- (i.e. emotional valence induced in the listener versus the others-perspective (i.e. correct recognition of the emotional valence of the recording context. Twenty-eight male participants listened to 192 vocalizations of four different species (human infant, dog, chimpanzee and tree shrew. Stimuli were recorded either in an agonistic (negative emotional valence or affiliative (positive emotional valence context. Participants rated the emotional valence of the stimuli adopting self- and others-perspective by using a 5-point version of the Self-Assessment Manikin (SAM. Familiarity was assessed based on subjective rating, objective labelling of the respective stimuli and interaction time with the respective species. Participants reliably recognized the emotional valence of human voices, whereas the results for animal voices were mixed. The correct classification of animal voices depended on the listener's familiarity with the species and the call type/recording context, whereas there was less influence of induced emotional states and phylogeny. Our results provide first evidence that explicit voice-induced cross-taxa emotional recognition in humans is shaped more by experience-dependent cognitive mechanisms than by induced affective states or cross-taxa universal acoustic coding and processing mechanisms.

  17. Voice Habits and Behaviors: Voice Care Among Flamenco Singers.

    Science.gov (United States)

    Garzón García, Marina; Muñoz López, Juana; Y Mendoza Lara, Elvira

    2017-03-01

    The purpose of this study is to analyze the vocal behavior of flamenco singers, as compared with classical music singers, to establish a differential vocal profile of voice habits and behaviors in flamenco music. Bibliographic review was conducted, and the Singer's Vocal Habits Questionnaire, an experimental tool designed by the authors to gather data regarding hygiene behavior, drinking and smoking habits, type of practice, voice care, and symptomatology perceived in both the singing and the speaking voice, was administered. We interviewed 94 singers, divided into two groups: the flamenco experimental group (FEG, n = 48) and the classical control group (CCG, n = 46). Frequency analysis, a Likert scale, and discriminant and exploratory factor analysis were used to obtain a differential profile for each group. The FEG scored higher than the CCG in speaking voice symptomatology. The FEG scored significantly higher than the CCG in use of "inadequate vocal technique" when singing. Regarding voice habits, the FEG scored higher in "lack of practice and warm-up" and "environmental habits." A total of 92.6% of the subjects classified themselves correctly in each group. The Singer's Vocal Habits Questionnaire has proven effective in differentiating flamenco and classical singers. Flamenco singers are exposed to numerous vocal risk factors that make them more prone to vocal fatigue, mucosa dehydration, phonotrauma, and muscle stiffness than classical singers. Further research is needed in voice training in flamenco music, as a means to strengthen the voice and enable it to meet the requirements of this musical genre. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  18. Lift-Off: Using Reference Imagery and Freehand Sketching to Create 3D Models in VR.

    Science.gov (United States)

    Jackson, Bret; Keefe, Daniel F

    2016-04-01

    Three-dimensional modeling has long been regarded as an ideal application for virtual reality (VR), but current VR-based 3D modeling tools suffer from two problems that limit creativity and applicability: (1) the lack of control for freehand modeling, and (2) the difficulty of starting from scratch. To address these challenges, we present Lift-Off, an immersive 3D interface for creating complex models with a controlled, handcrafted style. Artists start outside of VR with 2D sketches, which are then imported and positioned in VR. Then, using a VR interface built on top of image processing algorithms, 2D curves within the sketches are selected interactively and "lifted" into space to create a 3D scaffolding for the model. Finally, artists sweep surfaces along these curves to create 3D models. Evaluations are presented for both long-term users and for novices who each created a 3D sailboat model from the same starting sketch. Qualitative results are positive, with the visual style of the resulting models of animals and other organic subjects as well as architectural models matching what is possible with traditional fine art media. In addition, quantitative data from logging features built into the software are used to characterize typical tool use and suggest areas for further refinement of the interface.

  19. Postural Control Disturbances Produced By Exposure to HMD and Dome Vr Systems

    Science.gov (United States)

    Harm, D. L.; Taylor, L. C.

    2005-01-01

    Two critical and unresolved human factors issues in VR systems are: 1) potential "cybersickness", a form of motion sickness which is experienced in virtual worlds, and 2) maladaptive sensorimotor performance following exposure to VR systems. Interestingly, these aftereffects are often quite similar to adaptive sensorimotor responses observed in astronauts during and/or following space flight. Most astronauts and cosmonauts experience perceptual and sensorimotor disturbances during and following space flight. All astronauts exhibit decrements in postural control following space flight. It has been suggested that training in virtual reality (VR) may be an effective countermeasure for minimizing perceptual and/or sensorimotor disturbances. People adapt to consistent, sustained alterations of sensory input such as those produced by microgravity, and experimentally-produced stimulus rearrangements (e.g., reversing prisms, magnifying lenses, flight simulators, and VR systems). Adaptation is revealed by aftereffects including perceptual disturbances and sensorimotor control disturbances. The purpose of the current study was to compare disturbances in postural control produced by dome and head-mounted virtual environment displays. Individuals recovered from motion sickness and the detrimental effects of exposure to virtual reality on postural control within one hour. Sickness severity and initial decrements in postural equilibrium decreases over days, which suggests that subjects become dual-adapted over time. These findings provide some direction for developing training schedules for VR users that facilitate adaptation, and address safety concerns about aftereffects.

  20. Updated U.S. population standard for the Veterans RAND 12-item Health Survey (VR-12).

    Science.gov (United States)

    Selim, Alfredo J; Rogers, William; Fleishman, John A; Qian, Shirley X; Fincke, Benjamin G; Rothendler, James A; Kazis, Lewis E

    2009-02-01

    The purpose of this project was to develop an updated U.S. population standard for the Veterans RAND 12-item Health Survey (VR-12). We used a well-defined and nationally representative sample of the U.S. population from 52,425 responses to the Medical Expenditure Panel Survey (MEPS) collected between 2000 and 2002. We applied modified regression estimates to update the non-proprietary 1990 scoring algorithms. We applied the updated standard to the Medicare Health Outcomes Survey (HOS) to compute the VR-12 physical (PCS((MEPS standard))) and mental (MCS((MEPS standard))) component summaries based on the MEPS. We compared these scores to PCS and MCS based on the 1990 U.S. population standard. Using the updated U.S. population standard, the average VR-12 PCS((MEPS standard)) and MCS((MEPS standard)) scores in the Medicare HOS were 39.82 (standard deviation [SD] = 12.2) and 50.08 (SD = 11.4), respectively. For the same Medicare HOS, the average PCS and MCS scores based on the 1990 standard were 1.40 points higher and 0.99 points lower in comparison to VR-12 PCS and MCS, respectively. Changes in the U.S. population between 1990 and today make the old standard obsolete for the VR-12, so the updated standard developed here is widely available to serve as such a contemporary standard for future applications for health-related quality of life (HRQoL) assessments.

  1. Face and construct validation of a next generation virtual reality (Gen2-VR) surgical simulator.

    Science.gov (United States)

    Sankaranarayanan, Ganesh; Li, Baichun; Manser, Kelly; Jones, Stephanie B; Jones, Daniel B; Schwaitzberg, Steven; Cao, Caroline G L; De, Suvranu

    2016-03-01

    Surgical performance is affected by distractors and interruptions to surgical workflow that exist in the operating room. However, traditional surgical simulators are used to train surgeons in a skills laboratory that does not recreate these conditions. To overcome this limitation, we have developed a novel, immersive virtual reality (Gen2-VR) system to train surgeons in these environments. This study was to establish face and construct validity of our system. The study was a within-subjects design, with subjects repeating a virtual peg transfer task under three different conditions: Case I: traditional VR; Case II: Gen2-VR with no distractions and Case III: Gen2-VR with distractions and interruptions. In Case III, to simulate the effects of distractions and interruptions, music was played intermittently, the camera lens was fogged for 10 s and tools malfunctioned for 15 s at random points in time during the simulation. At the completion of the study subjects filled in a 5-point Likert scale feedback questionnaire. A total of sixteen subjects participated in this study. Friedman test showed significant difference in scores between the three conditions (p VR simulator has both face and construct validity and that it can accurately and realistically present distractions and interruptions in a simulated OR, in spite of limitations of the current HMD hardware technology.

  2. The use of VR distraction to decrease pain after laparoscopic bariatric surgery: a case study.

    Science.gov (United States)

    Hoyus, Kattia Cabas; Cardenas-López, Georgina; Maldonado, Jose Gutierrez; Ruiz-Esquivel, Ma Fernanda; Torres-Villalobos, Gonzalo

    2012-01-01

    One of the advantages of laparoscopic bariatric surgery is the reduced level of postoperative pain. In some cases, however, the pain level may be high. This is a challenge for specialists. This case study explores the use of VR distraction in an 18 year-old patient who had undergone laparoscopic bariatric surgery and who reported pain during the postoperative period. The study was conducted in a Level III Private Hospital in Mexico City where the patient was hospitalized. The patient was administered standard analgesic during VR distraction, which lasted a total of 40 minutes divided into two sessions. The scores of three visual analogue scales and catastrophism were the dependent variables of this study. The scales were administered before and after the VR distraction intervention. The patient reported lower pain levels after VR distraction and reductions in some components of catastrophism. This study proves that VR distraction can be effective not only in reducing the physical component of pain (a notion that is already well established) but also the cognitive/affective component. More controlled studies of the issue are required.

  3. An evaluation on CT image acquisition method for medical VR applications

    Science.gov (United States)

    Jang, Seong-wook; Ko, Junho; Yoo, Yon-sik; Kim, Yoonsang

    2017-02-01

    Recent medical virtual reality (VR) applications to minimize re-operations are being studied for improvements in surgical efficiency and reduction of operation error. The CT image acquisition method considering three-dimensional (3D) modeling for medical VR applications is important, because the realistic model is required for the actual human organ. However, the research for medical VR applications has focused on 3D modeling techniques and utilized 3D models. In addition, research on a CT image acquisition method considering 3D modeling has never been reported. The conventional CT image acquisition method involves scanning a limited area of the lesion for the diagnosis of doctors once or twice. However, the medical VR application is required to acquire the CT image considering patients' various postures and a wider area than the lesion. A wider area than the lesion is required because of the necessary process of comparing bilateral sides for dyskinesia diagnosis of the shoulder, pelvis, and leg. Moreover, patients' various postures are required due to the different effects on the musculoskeletal system. Therefore, in this paper, we perform a comparative experiment on the acquired CT images considering image area (unilateral/bilateral) and patients' postures (neutral/abducted). CT images are acquired from 10 patients for the experiments, and the acquired CT images are evaluated based on the length per pixel and the morphological deviation. Finally, by comparing the experiment results, we evaluate the CT image acquisition method for medical VR applications.

  4. CAD-VR geometry and meta data synchronization for design review applications

    Institute of Scientific and Technical Information of China (English)

    SCHILLING Arne; KIM Seokryul; WEISSMANN Daniel; TANG Ziying; CHOI Sangsu

    2006-01-01

    In this paper we present the MEMPHIS middleware framework for the integration of CAD geometries and assemblies with derived Virtual Reality (VR) models and its specific meta data and attributes. The goal of this work is to connect real time VR applications, especially for the Design Review, with enterprise software storing and managing CAD models (Product Data Management-PDM). The preparation of VR models requires expert knowledge, is time consuming, and includes selection of required CAD data, tessellation, healing of unwanted gaps, applying materials and textures, and special surface and light effects. During the Design Review process, decisions are made concerning the choice of materials and surface forms. While materials can be switched directly on the VR model, the modification of part geometries must be made on the CAD model. Our system synchronizes modifications of the original CAD geometries and of attributes that are relevant for the realistic rendering using the PLM Services standard. Thus, repeated work for the VR preparation can be avoided.

  5. Voices of the Unheard

    DEFF Research Database (Denmark)

    Matthiesen, Noomi Christine Linde

    2014-01-01

    . They were in two different classes at both schools, i.e. four classes in total. The families were followed for 18 months. Formal interviews were conducted with mothers and teachers, parent-teacher conferences were recorded, participant observations were conducted in classrooms and playgrounds, afterschool...... is that Somali diaspora parents (and with special focus on mothers as these where the parents who took most responsibility in the four cases of this research) have difficulty expressing their opinions as there are structural, historical and social dynamics that create conditions in which their voices...... are silenced, or at least restricted significantly, resulting in marginalizing consequences. The focus in each article is on here-and-now interactional dynamics but in order to understand these constitutive negotiations, it is argued that the analysis must be situated in a description of the constituted...

  6. Passing on power & voice

    DEFF Research Database (Denmark)

    Noer, Vibeke Røn; Nielsen, Cathrine Sand

    2014-01-01

    . The education lasts for 3,5 years and the landmark of the educational model is the continuously shifts between teaching in classroom and teaching in clinical practice. Clinical teaching takes place at approved clinical placement institutions in hospitals and in the social and health care services outside...... intention of gaining knowledge about other possible ways to perform the education. The class, named the E-class, followed what in the field was named ‘an experimental educational model based on experienced-based learning’ (Nielsen et al. 2011). The experiential educational model is argued as an experiment.......aspx Higher degree of student involvement in planning as well as teaching was in the field presented as a part of ‘the overall educational approach’. In the course ‘Acute, Critical Nursing & Terminal, Palliative Care’ this was transferred into an innovative pedagogy with intend to pass on power and voice...

  7. Voice stress analysis

    Science.gov (United States)

    Brenner, Malcolm; Shipp, Thomas

    1988-01-01

    In a study of the validity of eight candidate voice measures (fundamental frequency, amplitude, speech rate, frequency jitter, amplitude shimmer, Psychological Stress Evaluator scores, energy distribution, and the derived measure of the above measures) for determining psychological stress, 17 males age 21 to 35 were subjected to a tracking task on a microcomputer CRT while parameters of vocal production as well as heart rate were measured. Findings confirm those of earlier studies that increases in fundamental frequency, amplitude, and speech rate are found in speakers involved in extreme levels of stress. In addition, it was found that the same changes appear to occur in a regular fashion within a more subtle level of stress that may be characteristic, for example, of routine flying situations. None of the individual speech measures performed as robustly as did heart rate.

  8. Voice over IP

    OpenAIRE

    Mantula, Juha

    2006-01-01

    Tämä opinnäytetyö käsittelee Voice over Internet Protocol -tekniikkaa ja sen tuomia mahdollisuuksia yrityselämässä. Teoriaosa käsittelee VoIP:n kannalta tärkeitä pro-tokollia ja standardeja, VoIP:n ominaisuuksia sekä esittelee erilaisia puheohjelmia, jotka käyttävät VoIP-tekniikkaa hyväkseen. Empiirinen osuus tutkii Viestintä Ky Pitkärannan Skype-ohjelman käyttöä. Työn tarkoituksena on selvittää VoIP:n hyviä ja huonoja puolia ja sitä kuinka tek-niikkaa voidaan käyttää hyväksi päivittäisessä ...

  9. Dialogic Showcases Innovative Asian CT Solutions at Voice Asia '98

    Institute of Scientific and Technical Information of China (English)

    1998-01-01

    Dialogic Corporation showcases Computer Telephony (CT) solutions from some of Asia's leading CT developers at this Voice Asia'98 show. These vendors display the latest Asian solutions for IP Telephony,Speech Recognition, Telephone Company Enhanced Services Platform. Call Center and Unified Messaging, Open Switch and CT Servers.

  10. A Voice-Activated, Interactive Videodisc Case Study for Use in the Medical School Classroom.

    Science.gov (United States)

    Harless, William G.; And Others

    1986-01-01

    The Technological Innovations in Medical Education (TIME) Project of the Lister Hill National Center for Biomedical Communications is exploring the use of interactive videodisc, microcomputer, and voice recognition technology to create interactive case studies of simulated patients to train second-year medical students in the introduction to…

  11. A Voice-Activated, Interactive Videodisc Case Study for Use in the Medical School Classroom.

    Science.gov (United States)

    Harless, William G.; And Others

    1986-01-01

    The Technological Innovations in Medical Education (TIME) Project of the Lister Hill National Center for Biomedical Communications is exploring the use of interactive videodisc, microcomputer, and voice recognition technology to create interactive case studies of simulated patients to train second-year medical students in the introduction to…

  12. Introduction to Arabic Speech Recognition Using CMUSphinx System

    CERN Document Server

    Satori, H; Chenfour, N

    2007-01-01

    In this paper Arabic was investigated from the speech recognition problem point of view. We propose a novel approach to build an Arabic Automated Speech Recognition System (ASR). This system is based on the open source CMU Sphinx-4, from the Carnegie Mellon University. CMU Sphinx is a large-vocabulary; speaker-independent, continuous speech recognition system based on discrete Hidden Markov Models (HMMs). We build a model using utilities from the OpenSource CMU Sphinx. We will demonstrate the possible adaptability of this system to Arabic voice recognition.

  13. Voice and choice by delegation.

    Science.gov (United States)

    van de Bovenkamp, Hester; Vollaard, Hans; Trappenburg, Margo; Grit, Kor

    2013-02-01

    In many Western countries, options for citizens to influence public services are increased to improve the quality of services and democratize decision making. Possibilities to influence are often cast into Albert Hirschman's taxonomy of exit (choice), voice, and loyalty. In this article we identify delegation as an important addition to this framework. Delegation gives individuals the chance to practice exit/choice or voice without all the hard work that is usually involved in these options. Empirical research shows that not many people use their individual options of exit and voice, which could lead to inequality between users and nonusers. We identify delegation as a possible solution to this problem, using Dutch health care as a case study to explore this option. Notwithstanding various advantages, we show that voice and choice by delegation also entail problems of inequality and representativeness.

  14. The Christian voice in philosophy

    Directory of Open Access Journals (Sweden)

    Stuart Fowler

    1982-03-01

    Full Text Available In this paper the Rev. Stuart Fowler outlines a Christian voice in Philosophy and urges the Christian philosopher to investigate his position and his stance with integrity and honesty.

  15. Voice Force tulekul / Tõnu Ojala

    Index Scriptorium Estoniae

    Ojala, Tõnu, 1969-

    2005-01-01

    60. sünnipäeva tähistava Tallinna Tehnikaülikooli Akadeemilise Meeskoori juubelihooaja üritusest - a capella pop-gruppide festivalist Voice Force (kontserdid 12. nov. klubis Parlament ja 3. dets. Vene Kultuurikeskuses)

  16. Voice Force tulekul / Tõnu Ojala

    Index Scriptorium Estoniae

    Ojala, Tõnu, 1969-

    2005-01-01

    60. sünnipäeva tähistava Tallinna Tehnikaülikooli Akadeemilise Meeskoori juubelihooaja üritusest - a capella pop-gruppide festivalist Voice Force (kontserdid 12. nov. klubis Parlament ja 3. dets. Vene Kultuurikeskuses)

  17. Feature Extraction of Voice Segments Using Cepstral Analysis for Voice Regeneration

    OpenAIRE

    Banerjee, P. S.; Baisakhi Chakraborty; Jaya Banerjee

    2015-01-01

    Even though a lot of work has been done on areas of speech to text and vice versa or voice detection or similarity analysis of two voice samples but very less emphasis has be given to voice regeneration. General algorithms for distinct voice checking for two voice sources paved way for our endeavor in reconstructing the voice from the source voice samples provided. By utilizing these algorithms and putting further stress on the feature extraction part we tried to fabricate the source voice wi...

  18. Voice Simulation in Nursing Education.

    Science.gov (United States)

    Kepler, Britney B; Lee, Heeyoung; Kane, Irene; Mitchell, Ann M

    2016-01-01

    The goal of this study was to improve prelicensure nursing students' attitudes toward and self-efficacy related to delivering nursing care to patients with auditory hallucinations. Based on the Hearing Voices That Are Distressing curriculum, 87 participants were instructed to complete 3 tasks while wearing headphones delivering distressing voices. Comparing presimulation and postsimulation results, this study suggests that the simulation significantly improved attitudes toward patients with auditory hallucinations; however, self-efficacy related to caring for these patients remained largely unchanged.

  19. Work-related voice disorder

    OpenAIRE

    Paulo Eduardo Przysiezny; Luciana Tironi Sanson Przysiezny

    2015-01-01

    INTRODUCTION: Dysphonia is the main symptom of the disorders of oral communication. However, voice disorders also present with other symptoms such as difficulty in maintaining the voice (asthenia), vocal fatigue, variation in habitual vocal fundamental frequency, hoarseness, lack of vocal volume and projection, loss of vocal efficiency, and weakness when speaking. There are several proposals for the etiologic classification of dysphonia: functional, organofunctional, organic, and work-related...

  20. Tracheostomy cannulas and voice prosthesis.

    Science.gov (United States)

    Kramp, Burkhard; Dommerich, Steffen

    2009-01-01

    Cannulas and voice prostheses are mechanical aids for patients who had to undergo tracheotomy or laryngectomy for different reasons. For better understanding of the function of those artificial devices, first the indications and particularities of the previous surgical intervention are described in the context of this review. Despite the established procedure of percutaneous dilatation tracheotomy e.g. in intensive care units, the application of epithelised tracheostomas has its own position, especially when airway obstruction is persistent (e.g. caused by traumata, inflammations, or tumors) and a longer artificial ventilation or special care of the patient are required. In order to keep the airways open after tracheotomy, tracheostomy cannulas of different materials with different functions are available. For each patient the most appropriate type of cannula must be found. Voice prostheses are meanwhile the device of choice for rapid and efficient voice rehabilitation after laryngectomy. Individual sizes and materials allow adaptation of the voice prostheses to the individual anatomical situation of the patients. The combined application of voice prostheses with HME (Head and Moisture Exchanger) allows a good vocal as well as pulmonary rehabilitation. Precondition for efficient voice prosthesis is the observation of certain surgical principles during laryngectomy. The duration of the prosthesis mainly depends on material properties and biofilms, mostly consisting of funguses and bacteries. The quality of voice with valve prosthesis is clearly superior to esophagus prosthesis or electro-laryngeal voice. Whenever possible, tracheostoma valves for free-hand speech should be applied. Physicians taking care of patients with speech prostheses after laryngectomy should know exactly what to do in case the device fails or gets lost.

  1. Smart Homes with Voice Activated Systems for Disabled People

    Directory of Open Access Journals (Sweden)

    Bekir Busatlic

    2017-02-01

    Full Text Available Smart home refers to the application of various technologies to semi-unsupervised home control It refers to systems that control temperature, lighting, door locks, windows and many other appliances. The aim of this study was to design a system that will use existing technology to showcase how it can benefit people with disabilities. This work uses only off-the-shelf products (smart home devices and controllers, speech recognition technology, open-source code libraries. The Voice Activated Smart Home application was developed to demonstrate online grocery shopping and home control using voice comments and tested by measuring its effectiveness in performing tasks as well as its efficiency in recognizing user speech input.

  2. Challenging Institutional Conventions and Forming a Voice through Creativity

    DEFF Research Database (Denmark)

    Nielsen, Margit Saltofte

    2013-01-01

    This article explores and discusses examples of students’ everyday creativity that seem to be overlooked by teachers but are acknowledged by ‘peers’ in a 9th Grade (age 15–16) at a Danish free school. Creativity emerged as part of the everyday student interactions at school in ‘in-between’ social...... spaces, outside the formal teaching zones. Creative activities took place in the interstitial zones of time and space, where they gave voice to those students whose voice is not always heard in the formal teaching context. Creativity occurred also among students as a way to challenge institutional...... conditions and this practice gave them recognition by their peers. The argument is being made that students’ interactions in these zones draw on other forms of knowledge and ways of performing than those used in structured teaching zones. The creativity expressed in interstitial zones contributes to forming...

  3. A VR based therapy for the treatment of impotence and premature ejaculation.

    Science.gov (United States)

    Optale, G; Munari, A; Nasta, A; Pianon, C; Verde, J B; Viggiano, G

    1998-01-01

    The use of psycho-dynamic psychotherapy integrating virtual reality (VR) dealt with in this study on the treatment of erection dysfunctions and premature ejaculation started several years ago, after having seen the scarce results we obtained using exclusively a psycho-dynamic approach (accompanied by pre-recorded sound and music). Considering the particular way that full-immersion VR involves the subject who experiences it, we hypothesized that better results could be obtained during therapy for these sexual disorders and in particular regarding the nature of erection dysfunction, commonly referred to as impotence "a persistent or recurrent inability to attain, or to maintain until completion of the sexual activity, an adequate erection." The plan for therapy consisted of 12 hour-long sessions over a 25-week period, and the methods involved the use of a VR helmet, joystick and miniature television screens that projected specially-designed CD-ROM programs on psychological development.

  4. Utilidad clínica de la derivación aVR

    Directory of Open Access Journals (Sweden)

    Armel Hernández Reyes

    2012-11-01

    Full Text Available En épocas pasadas, la derivación aVR ha sido frecuentemente olvidada durante la interpretación electrocardiográfica. Con la presente revisión se pretende lograr que se realice una sistemática revisión de la derivación aVR, por lo que puede aportar a la utilidad diagnóstica y pronóstica del electrocardiograma. Se realizó una revisión de literatura actualizada y relacionada con la utilidad clínica de aVR. Se concluye que esta aporta información adicional, tanto pronóstica como diagnóstica, en múltiples condiciones cardiacas.

  5. Phoneme Recognition Using Acoustic Events

    CERN Document Server

    Huebener, K; Huebener, Kai; Carson-Berndsen, Julie

    1994-01-01

    This paper presents a new approach to phoneme recognition using nonsequential sub--phoneme units. These units are called acoustic events and are phonologically meaningful as well as recognizable from speech signals. Acoustic events form a phonologically incomplete representation as compared to distinctive features. This problem may partly be overcome by incorporating phonological constraints. Currently, 24 binary events describing manner and place of articulation, vowel quality and voicing are used to recognize all German phonemes. Phoneme recognition in this paradigm consists of two steps: After the acoustic events have been determined from the speech signal, a phonological parser is used to generate syllable and phoneme hypotheses from the event lattice. Results obtained on a speaker--dependent corpus are presented.

  6. Associations between facial emotion recognition, cognition and alexithymia in patients with schizophrenia: comparison of photographic and virtual reality presentations.

    Science.gov (United States)

    Gutiérrez-Maldonado, J; Rus-Calafell, M; Márquez-Rejón, S; Ribas-Sabaté, J

    2012-01-01

    Emotion recognition is known to be impaired in schizophrenia patients. Although cognitive deficits and symptomatology have been associated with this impairment there are other patient characteristics, such as alexithymia, which have not been widely explored. Emotion recognition is normally assessed by means of photographs, although they do not reproduce the dynamism of human expressions. Our group has designed and validated a virtual reality (VR) task to assess and subsequently train schizophrenia patients. The present study uses this VR task to evaluate the impaired recognition of facial affect in patients with schizophrenia and to examine its association with cognitive deficit and the patients' inability to express feelings. Thirty clinically stabilized outpatients with a well-established diagnosis of schizophrenia or schizoaffective disorder were assessed in neuropsychological, symptomatic and affective domains. They then performed the facial emotion recognition task. Statistical analyses revealed no significant differences between the two presentation conditions (photographs and VR) in terms of overall errors made. However, anger and fear were easier to recognize in VR than in photographs. Moreover, strong correlations were found between psychopathology and the errors made.

  7. Voice Collection under Different Spectrum

    Directory of Open Access Journals (Sweden)

    Min Li

    2013-05-01

    Full Text Available According to the short-time Fourier transform theory and principle of digital filtering, this paper established a mathematical model called collection of voice signal collection at different spectrum. The voice signal was a non-stationary process, while the standard Fourier transform only applied to the periodic signal, transient signals or stationary random signal. Therefore, the standard Fourier transform could not be directly used for the speech signal. By controlling the input different types and parameters, this paper analyzed the collected original voice signal spectrum with the use of MATLAB software platform. At the same time, it realized the extraction, recording and playback of the speech signal at different frequencies. Therefore, the waveforms could be displayed obviously on the graphic user interface and voice effect could be more clearly. Meanwhile, the result was verified by the hardware platforms, which consisted of TMS320VC5509A [1] chip and TLV320AIC23 voice chip. The results showed that the extraction of voice signal under different spectrum model was scientific, rational and effective.

  8. Variants of Phosphotriesterase for the Enhanced Detoxification of the Chemical Warfare Agent VR.

    Science.gov (United States)

    Bigley, Andrew N; Mabanglo, Mark F; Harvey, Steven P; Raushel, Frank M

    2015-09-08

    The V-type organophosphorus nerve agents are among the most hazardous compounds known. Previous efforts to evolve the bacterial enzyme phosphotriesterase (PTE) for the hydrolytic decontamination of VX resulted in the identification of the variant L7ep-3a, which has a kcat value more than 2 orders of magnitude higher than that of wild-type PTE for the hydrolysis of VX. Because of the relatively small size of the O-ethyl, methylphosphonate center in VX, stereoselectivity is not a major concern. However, the Russian V-agent, VR, contains a larger O-isobutyl, methylphosphonate center, making stereoselectivity a significant issue since the SP-enantiomer is expected to be significantly more toxic than the RP-enantiomer. The three-dimensional structure of the L7ep-3a variant was determined to a resolution of 2.01 Å (PDB id: 4ZST ). The active site of the L7ep-3a mutant has revealed a network of hydrogen bonding interactions between Asp-301, Tyr-257, Gln-254, and the hydroxide that bridges the two metal ions. A series of new analogues that mimic VX and VR has helped to identify critical structural features for the development of new enzyme variants that are further enhanced for the catalytic detoxification of VR and VX. The best of these mutants has been shown to have a reversed stereochemical preference for the hydrolysis of VR-chiral center analogues. This mutant hydrolyzes the two enantiomers of VR 160- and 600-fold faster than wild-type PTE hydrolyzes the SP-enantiomer of VR.

  9. The impact of voice on speech realization

    Directory of Open Access Journals (Sweden)

    Jelka Breznik

    2014-12-01

    Full Text Available The study discusses spoken literary language and the impact of voice on speech realization. The voice consists of a sound made by a human being using the vocal folds for talking, singing, laughing, crying, screaming… The human voice is specifically the part of human sound production in which the vocal folds (vocal cords are the primary sound source. Our voice is our instrument and identity card. How does the voice (voice tone affect others and how do they respond, positively or negatively? How important is voice (voice tone in communication process? The study presents how certain individuals perceive voice. The results of the research on the relationships between the spoken word, excellent speaker, voice and description / definition / identification of specific voices done by experts in the field of speech and voice as well as non-professionals are presented. The study encompasses two focus groups. One consists of amateurs (non-specialists in the field of speech or voice who have no knowledge in this field and the other consists of professionals who work with speech or language or voice. The questions were intensified from general to specific, directly related to the topic. The purpose of such a method of questioning was to create relaxed atmosphere, promote discussion, allow participants to interact, complement, and to set up self-listening and additional comments.

  10. Towards very large vocabulary word recognition

    Science.gov (United States)

    Waibel, A.

    1982-11-01

    In this paper, preliminary considerations and some experimental results are presented in an effort to design Very Large Vocabulary Recognition (VLVR) systems. We will first consider the applicability of current recognition techniques and argue their inadequacy for VLVR. Possible alternate strategies will be explored and their potential usefulness statistically evaluated. Our results indicate that suprasegmental cues such as syllabification, stress patterns, rhythmic patterns, rhythmic patterns and the voiced - unvoiced patterns in the syllables of a word provide powerful mechanisms for search space reduction. Suprasegmental feature could thus operate in a complementary fashion to segmental features.

  11. Does knowing speaker sex facilitate vowel recognition at short durations?

    Science.gov (United States)

    Smith, David R R

    2014-05-01

    A man, woman or child saying the same vowel do so with very different voices. The auditory system solves the complex problem of extracting what the man, woman or child has said despite substantial differences in the acoustic properties of their voices. Much of the acoustic variation between the voices of men and woman is due to changes in the underlying anatomical mechanisms for producing speech. If the auditory system knew the sex of the speaker then it could potentially correct for speaker sex related acoustic variation thus facilitating vowel recognition. This study measured the minimum stimulus duration necessary to accurately discriminate whether a brief vowel segment was spoken by a man or woman, and the minimum stimulus duration necessary to accuately recognise what vowel was spoken. Results showed that reliable vowel recognition precedesreliable speaker sex discrimination, thus questioning the use of speaker sex information in compensating for speaker sex related acoustic variation in the voice. Furthermore, the pattern of performance across experiments where the fundamental frequency and formant frequency information of speaker's voices were systematically varied, was markedly different depending on whether the task was speaker-sex discrimination or vowel recognition. This argues for there being little relationship between perception of speaker sex (indexical information) and perception of what has been said (linguistic information) at short durations. Copyright © 2014 Elsevier B.V. All rights reserved.

  12. On the definition and interpretation of voice selective activation in the temporal cortex

    Directory of Open Access Journals (Sweden)

    Anja eBethmann

    2014-07-01

    Full Text Available Regions along the superior temporal sulci and in the anterior temporal lobes have been found to be involved in voice processing. It has even been argued that parts of the temporal cortices serve as voice-selective areas. Yet, evidence for voice-selective activation in the strict sense is still missing. The current fMRI study aimed at assessing the degree of voice-specific processing in different parts of the superior and middle temporal cortices. To this end, voices of famous persons were contrasted with widely different categories, which were sounds of animals and musical instruments. The argumentation was that only brain regions with statistically proven absence of activation by the control stimuli may be considered as candidates for voice-selective areas. Neural activity was found to be stronger in response to human voices in all analyzed parts of the temporal lobes except for the middle and posterior STG. More importantly, the activation differences between voices and the other environmental sounds increased continuously from the mid-posterior STG to the anterior MTG. Here, only voices but not the control stimuli excited an increase of the BOLD response above a resting baseline level. The findings are discussed with reference to the function of the anterior temporal lobes in person recognition and the general question on how to define selectivity of brain regions for a specific class of stimuli or tasks. In addition, our results corroborate recent assumptions about the hierarchical organization of auditory processing building on a processing stream from the primary auditory cortices to anterior portions of the temporal lobes.

  13. Toward adaptive VR simulators combining visual, haptic, and brain-computer interfaces.

    Science.gov (United States)

    Lécuyer, Anatole; George, Laurent; Marchal, Maud

    2013-01-01

    The next generation of VR simulators could take into account a novel input: the user's mental state, as measured with electrodes and a brain-computer interface. One illustration of this promising path is a project that adapted a guidance system's force feedback to the user's mental workload in real time. A first application of this approach is a medical training simulator that provides virtual assistance that adapts to the trainee's mental activity. Such results pave the way to VR systems that will automatically reconfigure and adapt to their users' mental states and cognitive processes.

  14. Computational Virtual Reality (VR) as a human-computer interface in the operation of telerobotic systems

    Science.gov (United States)

    Bejczy, Antal K.

    1995-01-01

    This presentation focuses on the application of computer graphics or 'virtual reality' (VR) techniques as a human-computer interface tool in the operation of telerobotic systems. VR techniques offer very valuable task realization aids for planning, previewing and predicting robotic actions, operator training, and for visual perception of non-visible events like contact forces in robotic tasks. The utility of computer graphics in telerobotic operation can be significantly enhanced by high-fidelity calibration of virtual reality images to actual TV camera images. This calibration will even permit the creation of artificial (synthetic) views of task scenes for which no TV camera views are available.

  15. A Virtual Assembly System for Aero-engines Based on VR

    Institute of Scientific and Technical Information of China (English)

    GAO Ying; GUO Yun-peng; SHAO Ya-nan; XU Zhi-guo

    2008-01-01

    In this paper, a synthesized technique is explored through design of a virtual experiment system for main aeroengine based on VR, which attempts to integrate some VR techniques, such as 3D stereo simulation and modeling, 3D scene management and human-computer virtual assembly based on data glove and collision detection. The geometry modeling and instantiation technique, virtual hand modeling and data glove interaction technique are discussed chiefly. Finally, taking aim at assembling and dismantling experiment system of a certain aero-engine, interface technology is expounded also.

  16. Building tomorrow's nuclear power plants with 4{sup +}D VR technology

    Energy Technology Data Exchange (ETDEWEB)

    Lee, Il S.; Yoon, Sang H.; Shim, Kyu W.; Yu, Yong H.; Suh, Kune Y. [PhiloSOPhIA, Inc., Seoul (Korea, Republic of)

    2002-04-15

    There continues to be an increasing demand of electricity around the globe to fuel the industrial growth and to promote the human welfare. The economic activities have brought about richness in our material and cultural lives, in which process the electric power has been at the heart of the versatile energy sources. In order to timely and competitively respond to rapidly changing energy environment in the twenty-first century there is a growing need to build the advanced nuclear power plants in the unlimited workspace of virtual reality (VR) prior to commissioning. One can then realistically evaluate their construction time and cost per varying methods and options available from the leading-edge technology. In particular a great deal of efforts have yet to be made for time- and cost-dependent plant simulation and dynamically coupled database construction in the VR space. The operator training and personnel education may also benefit from the VR technology. The present work is being proposed in the three-dimensional space and time plus cost coordinates, i. e. four plus dimensional (4{sup +}D) coordinates. The 4{sup +}D VR application will enable the nuclear industry to narrow the technological gap from the other leading industries that have long since been employing the VR engineering. The 4{sup +}D technology will help nurture public understanding of the special discipline of nuclear power plants. The technology will also facilitate public access to the knowledge on the nuclear science and engineering which has so far been monopolized by the academia, national laboratories and the heavy industry. The 4{sup +}D virtual design and construction will open up the new horizon for revitalization of the nuclear industry over the globe in the foreseeable future. Considering the long construction and operation time for the nuclear power plants, the preliminary VR simulation capability for the plants will supply the vital information not only for the actual design and

  17. Vrănceanu Connections and Foliations with Bundle-Like Metrics

    Indian Academy of Sciences (India)

    Aurel Bejancu; Hani Reda Farran

    2008-02-01

    We show that the Vrănceanu connection which was initially introduced on non-holonomic manifolds can be used to study the geometry of foliated manifolds. We prove that a foliation is totally geodesic with bundle-like metric if and only if this connection is a metric one. We introduce the notion of a foliated Riemannian manifold of constant transversal Vrănceanu curvature and the notion of a transversal Einstein foliated Riemannian manifold. The geometry of these two classes of manifolds is studied and the relationship between them is determined.

  18. Industrial Applications of Automatic Speech Recognition Systems

    Directory of Open Access Journals (Sweden)

    Dr. Jayashri Vajpai

    2016-03-01

    Full Text Available Current trends in developing technologies form important bridges to the future, fortified by the early and productive use of technology for enriching the human life. Speech signal processing, which includes automatic speech recognition, synthetic speech, and natural language processing, is beginning to have a significant impact on business, industry and ease of operation of personal computers. Apart from this, it facilitates the deeper understanding of complex mechanism of functioning of human brain. Advances in speech recognition technology, over the past five decades, have enabled a wide range of industrial applications. Yet today's applications provide a small preview of a rich future for speech and voice interface technology that will eventually replace keyboards with microphones for designing human machine interface for providing easy access to increasingly intelligent machines. It also shows how the capabilities of speech recognition systems in industrial applications are evolving over time to usher in the next generation of voice-enabled services. This paper aims to present an effective survey of the speech recognition technology described in the available literature and integrate the insights gained during the process of study of individual research and developments. The current applications of speech recognition for real world and industry have also been outlined with special reference to applications in the areas of medical, industrial robotics, forensic, defence and aviation

  19. 基于VR-Force的协同空战效能评估研究%Evaluation effectiveness of cooperate air combat based on VR-Force

    Institute of Scientific and Technical Information of China (English)

    左家亮; 杨任农; 寇雅楠; 张滢

    2011-01-01

    In the air combat, through integrating the main factor of the entity in the combat, and evaluating the hit probability of the entity in theory. On the assign task of the air combat based on the VR-Force develop environment is researched. According to the plan of the task, after edit campaign plan for every entity in the air combat, simulation experiment is done according the air combat plan in VR-Force, evaluate the efficiency of the air combat plan. At last, through comparing the result of the simulation, the value and meaning of the scientific and reasonable task assign plan in the air to air combat is illuminated.%在空战对抗中,通过综合参战实体各项战技指标,从理论上拟合单个实体对对方的杀伤概率.基于VR-Force开发环境,进行协同空战任务分配研究.根据任务分配方案,对每个参战实体制定对应的作战计划,按此方案制作作战想定,进行战场推演,实现效能评估.实验仿真结果与理论推导说明了科学合理的任务分配方案在协同空战中的作用和意义.

  20. Hands-free human-machine interaction with voice

    Science.gov (United States)

    Juang, B. H.

    2001-05-01

    Voice is natural communication interface between a human and a machine. The machine, when placed in today's communication networks, may be configured to provide automation to save substantial operating cost, as demonstrated in AT&T's VRCP (Voice Recognition Call Processing), or to facilitate intelligent services, such as virtual personal assistants, to enhance individual productivity. These intelligent services often need to be accessible anytime, anywhere (e.g., in cars when the user is in a hands-busy-eyes-busy situation or during meetings where constantly talking to a microphone is either undersirable or impossible), and thus call for advanced signal processing and automatic speech recognition techniques which support what we call ``hands-free'' human-machine communication. These techniques entail a broad spectrum of technical ideas, ranging from use of directional microphones and acoustic echo cancellatiion to robust speech recognition. In this talk, we highlight a number of key techniques that were developed for hands-free human-machine communication in the mid-1990s after Bell Labs became a unit of Lucent Technologies. A video clip will be played to demonstrate the accomplishement.

  1. Implementing Virtual Reality Technology as an Effective Web Based Kiosk: Darulaman's Teacher Training College Tour (Ipda Vr Tour)

    Science.gov (United States)

    Fadzil, Azman

    2006-01-01

    At present, the development of Virtual Reality (VR) technology is expanding due to the importance and needs to use the 3D elements and 360 degrees panorama in expressing a clearer picture to consumers in various fields such as education, military, medicine, entertainment and so on. The web based VR kiosk project in Darulaman's Teacher Training…

  2. Stabilization of Clostridium perfringens collagenase mRNA by VR-RNA-dependent cleavage in 5' leader sequence.

    Science.gov (United States)

    Obana, Nozomu; Shirahama, Yu; Abe, Kimihiro; Nakamura, Kouji

    2010-09-01

    The small RNA (sRNA), VR-RNA that is directly regulated by the VirR/VirS two-component system, regulates many genes including toxin genes such as collagenase (colA) and phospholipase C (plc) in Clostridium perfringens. Although the VR-RNA 3' region is sufficient to regulate the colA and plc genes, the molecular mechanism of toxin gene regulation by VR-RNA remains unclear. Here, we found that colA mRNA is cleaved at position -79 and -78 from the A of the first codon (ATG) in the presence of VR-RNA. The processed transcripts were stable compared with longer intact transcripts. On the other hand, colA mRNA was labile in a VR-RNA-deficient strain, and processed transcripts were undetectable. The stability and processing of colA mRNA were restored by transformation of the 3' region of VR-RNA-expression vector. The 3' region of VR-RNA and colA mRNA had significant complementation and interacted in vitro. These results show that VR-RNA base pairs with colA mRNA and induces cleavage in the 5' untranslated region (UTR) of colA mRNA, which leads to the stabilization of colA mRNA and the activation of colA expression. © 2010 Blackwell Publishing Ltd.

  3. Voice recognition software can be used for scientific articles

    DEFF Research Database (Denmark)

    Pommergaard, Hans-Christian; Huang, Chenxi; Burcharth, Jacob

    2015-01-01

    be an alternative. The purpose of this study was to evaluate the out-of-the-box accuracy of VRS. METHODS: Eleven young researchers without dictation experience dictated the first draft of their own scientific article after thorough preparation according to a pre-defined schedule. The dictate transcribed by VRS...... with a median score of five (range: 3-9), which was improved with the addition of 5,000 words. CONCLUSION: The out-of-the-box performance of VRS was acceptable and improved after additional words were added. Further studies are needed to investigate the effect of additional software accuracy training....

  4. Practical Voice Recognition for the Aircraft Cockpit Project

    Data.gov (United States)

    National Aeronautics and Space Administration — This proposal responds to the urgent need for improved pilot interfaces in the modern aircraft cockpit. Recent advances in aircraft equipment bring tremendous...

  5. Indonesian Automatic Speech Recognition For Command Speech Controller Multimedia Player

    Directory of Open Access Journals (Sweden)

    Vivien Arief Wardhany

    2014-12-01

    Full Text Available The purpose of multimedia devices development is controlling through voice. Nowdays voice that can be recognized only in English. To overcome the issue, then recognition using Indonesian language model and accousticc model and dictionary. Automatic Speech Recognizier is build using engine CMU Sphinx with modified english language to Indonesian Language database and XBMC used as the multimedia player. The experiment is using 10 volunteers testing items based on 7 commands. The volunteers is classifiedd by the genders, 5 Male & 5 female. 10 samples is taken in each command, continue with each volunteer perform 10 testing command. Each volunteer also have to try all 7 command that already provided. Based on percentage clarification table, the word “Kanan” had the most recognize with percentage 83% while “pilih” is the lowest one. The word which had the most wrong clarification is “kembali” with percentagee 67%, while the word “kanan” is the lowest one. From the result of Recognition Rate by male there are several command such as “Kembali”, “Utama”, “Atas “ and “Bawah” has the low Recognition Rate. Especially for “kembali” cannot be recognized as the command in the female voices but in male voice that command has 4% of RR this is because the command doesn’t have similar word in english near to “kembali” so the system unrecognize the command. Also for the command “Pilih” using the female voice has 80% of RR but for the male voice has only 4% of RR. This problem is mostly because of the different voice characteristic between adult male and female which male has lower voice frequencies (from 85 to 180 Hz than woman (165 to 255 Hz.The result of the experiment showed that each man had different number of recognition rate caused by the difference tone, pronunciation, and speed of speech. For further work needs to be done in order to improving the accouracy of the Indonesian Automatic Speech Recognition system

  6. Towards Real-Time Speech Emotion Recognition for Affective E-Learning

    Science.gov (United States)

    Bahreini, Kiavash; Nadolski, Rob; Westera, Wim

    2016-01-01

    This paper presents the voice emotion recognition part of the FILTWAM framework for real-time emotion recognition in affective e-learning settings. FILTWAM (Framework for Improving Learning Through Webcams And Microphones) intends to offer timely and appropriate online feedback based upon learner's vocal intonations and facial expressions in order…

  7. Word Intelligibility in Multi-voice Singing: The Influence of Chorus Size.

    Science.gov (United States)

    Condit-Schultz, Nathaniel; Huron, David

    2017-01-01

    This study investigated how the intelligibility of sung words is influenced by the number of singers in a choral music style. The study used repeated measures factorial. One hundred forty-nine participants listened to recordings of spoken and sung English words and attempted to identify the words. Each stimuli word was sung or spoken in sync by either one, four, eight, sixteen, or twenty-seven members of a high-quality Soprano Alto Tenor Bass (SATB) choir. In general, single-voice word recognition was higher than multi-voice word recognition in the sung condition. However, the difference between four concurrent singers and the full choir was negligible; that is, reduced intelligibility with multiple singers shows little sensitivity to the number of singers. The principal effect of voice density on intelligibility is found to occur with coda consonants-a result consistent with the importance many choral conductors attribute to coordinating word offsets. In particular, the plosives /b/, /d/, /g/, and /p/ are easily confused. Coda liquids (/l/,/r/) were also found to be a source of confusion. Finally, an increasing density of voices appears to have a facilitating effect for the coda nasal /m/. Groups of four or more choral singers do appear to be less intelligible than single singers, although the observed effect is modest. However, increasing the number of singers in a choral texture beyond four singers does not appear to further degrade intelligibility. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  8. [A comparative study of pathological voice based on traditional acoustic characteristics and nonlinear features].

    Science.gov (United States)

    Gan, Deying; Hu, Weiping; Zhao, Bingxin

    2014-10-01

    By analyzing the mechanism of pronunciation, traditional acoustic parameters, including fundamental frequency, Mel frequency cepstral coefficients (MFCC), linear prediction cepstrum coefficient (LPCC), frequency perturbation, amplitude perturbation, and nonlinear characteristic parameters, including entropy (sample entropy, fuzzy entropy, multi-scale entropy), box-counting dimension, intercept and Hurst, are extracted as feature vectors for identification of pathological voice. Seventy-eight normal voice samples and 73 pathological voice samples for /a/, and 78 normal samples and 80 pathological samples for /i/ are recognized based on support vector machine (SVM). The results showed that compared with traditional acoustic parameters, nonlinear characteristic parameters could be well used to distinguish between healthy and pathological voices, and the recognition rates for /a/ were all higher than those for /i/ except for multi-scale entropy. That is why the /a/ sound data is used widely in related research at home and abroad for obtaining better identification of pathological voices. Adopting multi-scale entropy for /i/ could obtain higher recognition rate than /a/ between healthy and pathological samples, which may provide some useful inspiration for evaluating vocal compensatory function.

  9. Mechanics of human voice production and control

    Science.gov (United States)

    Zhang, Zhaoyan

    2016-01-01

    As the primary means of communication, voice plays an important role in daily life. Voice also conveys personal information such as social status, personal traits, and the emotional state of the speaker. Mechanically, voice production involves complex fluid-structure interaction within the glottis and its control by laryngeal muscle activation. An important goal of voice research is to establish a causal theory linking voice physiology and biomechanics to how speakers use and control voice to communicate meaning and personal information. Establishing such a causal theory has important implications for clinical voice management, voice training, and many speech technology applications. This paper provides a review of voice physiology and biomechanics, the physics of vocal fold vibration and sound production, and laryngeal muscular control of the fundamental frequency of voice, vocal intensity, and voice quality. Current efforts to develop mechanical and computational models of voice production are also critically reviewed. Finally, issues and future challenges in developing a causal theory of voice production and perception are discussed. PMID:27794319

  10. Facial Recognition

    National Research Council Canada - National Science Library

    Mihalache Sergiu; Stoica Mihaela-Zoica

    2014-01-01

    .... From birth, faces are important in the individual's social interaction. Face perceptions are very complex as the recognition of facial expressions involves extensive and diverse areas in the brain...

  11. Fingerprint recognition

    OpenAIRE

    Diefenderfer, Graig T.

    2006-01-01

    The use of biometrics is an evolving component in today's society. Fingerprint recognition continues to be one of the most widely used biometric systems. This thesis explores the various steps present in a fingerprint recognition system. The study develops a working algorithm to extract fingerprint minutiae from an input fingerprint image. This stage incorporates a variety of image pre-processing steps necessary for accurate minutiae extraction and includes two different methods of ridge thin...

  12. Native voice, self-concept and the moral case for personalized voice technology.

    Science.gov (United States)

    Nathanson, Esther

    2017-01-01

    Purpose (1) To explore the role of native voice and effects of voice loss on self-concept and identity, and survey the state of assistive voice technology; (2) to establish the moral case for developing personalized voice technology. Methods This narrative review examines published literature on the human significance of voice, the impact of voice loss on self-concept and identity, and the strengths and limitations of current voice technology. Based on the impact of voice loss on self and identity, and voice technology limitations, the moral case for personalized voice technology is developed. Results Given the richness of information conveyed by voice, loss of voice constrains expression of the self, but the full impact is poorly understood. Augmentative and alternative communication (AAC) devices facilitate communication but, despite advances in this field, voice output cannot yet express the unique nuances of individual voice. The ethical principles of autonomy, beneficence and equality of opportunity establish the moral responsibility to invest in accessible, cost-effective, personalized voice technology. Conclusions Although further research is needed to elucidate the full effects of voice loss on self-concept, identity and social functioning, current understanding of the profoundly negative impact of voice loss establishes the moral case for developing personalized voice technology. Implications for Rehabilitation Rehabilitation of voice-disordered patients should facilitate self-expression, interpersonal connectedness and social/occupational participation. Proactive questioning about the psychological and social experiences of patients with voice loss is a valuable entry point for rehabilitation planning. Personalized voice technology would enhance sense of self, communicative participation and autonomy and promote shared healthcare decision-making. Further research is needed to identify the best strategies to preserve and strengthen identity and sense of

  13. Children's Voice or Children's Voices? How Educational Research Can Be at the Heart of Schooling

    Science.gov (United States)

    Stern, Julian

    2015-01-01

    There are problems with considering children and young people in schools as quite separate individuals, and with considering them as members of a single collectivity. The tension is represented in the use of "voice" and "voices" in educational debates. Voices in dialogue, in contrast to "children's voice", are…

  14. Voice complaints, risk factors for voice problems and history of voice problems in relation to puberty in female student teachers.

    NARCIS (Netherlands)

    Thomas, G.; Jong, F.I.C.R.S. de; Kooijman, P.G.C.; Donders, A.R.T.; Cremers, C.W.R.J.

    2006-01-01

    The aim of the study was to estimate voice complaints, risk factors for voice complaints and history of voice problems in student teachers before they embarked on their professional teaching career. A cross-sectional questionnaire survey was performed among female student teachers. The response rate

  15. Voice complaints, risk factors for voice problems and history of voice problems in relation to puberty in female student teachers.

    NARCIS (Netherlands)

    Thomas, G.; Jong, F.I.C.R.S. de; Kooijman, P.G.C.; Donders, A.R.T.; Cremers, C.W.R.J.

    2006-01-01

    The aim of the study was to estimate voice complaints, risk factors for voice complaints and history of voice problems in student teachers before they embarked on their professional teaching career. A cross-sectional questionnaire survey was performed among female student teachers. The response rate

  16. Utility of lead aVR for identifying the culprit lesion in acute myocardial infarction

    DEFF Research Database (Denmark)

    Kühl, Jørgen Tobias; Berg, Ronan M G

    2009-01-01

    -segment elevation AMI (STEMI). Pooled data showed a sensitivity of 47%, a specificity of 96%, a positive predicative value (PPV) of 91% and a NPV of 69%. Five studies examined if ST-segment depression (STD) in lead aVR is valuable for discerning lesions in the circumflex artery from those in the right coronary...

  17. Application of MCNP for neutronic calculations at VR-1 training reactor

    Science.gov (United States)

    Huml, Ondřej; Rataj, Jan; Bílý, Tomáš

    2014-06-01

    The paper presents utilization of Monte Carlo MCNP transport code for neutronic calculations of training reactor VR-1. Results of calculations are compared with results of measurements realized during last few critical experiments with various reactor core configurations. Very good agreement between calculations and measurements is observed.

  18. VR-Based Gamification of Communication Training and Oral Examination in a Second Language

    Science.gov (United States)

    Reitz, Liesa; Sohny, Aline; Lochmann, Gerrit

    2016-01-01

    The authors present a novel way of oral language training by embedding the English as a foreign language (EFL) learning process into a generic 3D Cooperative Virtual Reality (VR) Game. Due to lack of time, resources and innovation, the language classroom is limited in its possibilities of promoting authentic communication. Therefore, the…

  19. Interpretation of Simulations in Interactive VR Environments: Depth Perception in Cave and Panorama

    DEFF Research Database (Denmark)

    Mullins, Michael

    2006-01-01

     Virtual reality (VR) applications are transforming the way architecture is conceived and produced. By introducing an open and inclusive approach, they encourage a creative dialogue with the users of residential schemes and other buildings and allow competition juries a more thorough understanding...

  20. Neurocognition, presence and acceptance of a VR programme for psychotic patients: a correlational study.

    Science.gov (United States)

    Rus-Calafell, Mar; Gutiérrez-Maldonado, José; Ribas-Sabaté, Joan

    2013-01-01

    Patients with psychosis exhibit a wide range of cognitive deficits which are associated with poor functioning and poor outcomes in psychosocial interventions. Recently, virtual reality (VR) has been demonstrated to be a useful tool for treatment and rehabilitation of these patients. We have developed and applied an integrated VR programme to improve social skills in people with schizophrenia: the Soskitrain. The aim of the present study is to evaluate the relationship between patients' cognitive deficits, their sense of presence and their ratings of the programme's acceptability. Twelve clinically stabilized outpatients with a well-established diagnosis of schizophrenia or schizoaffective disorder underwent neuropsychological assessment prior to treatment, while after the intervention they completed a questionnaire about their sense of presence and the acceptability of the VR programme. Post-treatment results revealed a high sense of presence among patients, as well as good verisimilitude and high acceptance of the virtual environments. In addition, there were significant negative correlations between sense of presence and deficits in both delayed verbal learning and processing speed. The paper discusses the implications of cognitive impairment for the experience and acceptance of VR when treating psychotic patients.

  1. Polymorphisms of CHAT but not TFAM or VR22 are Associated with Alzheimer Disease Risk.

    Science.gov (United States)

    Gao, Lili; Zhang, Yan; Deng, Jinghua; Yu, Wenbing; Yu, Yunxia

    2016-06-07

    BACKGROUND Alzheimer disease (AD) is a chronic neurodegenerative disease that is one of the most prevalent health problems among seniors. The cause of AD has not yet been elucidated, but many risk factors have been identified that might contribute to the pathogenesis and prognosis of AD. We conducted a meta-analysis of studies involving CHAT, TFAM, and VR22 polymorphisms and AD susceptibility to further understand the pathogenesis of AD. MATERIAL AND METHODS PubMed/Medline, Embase, Web of Science, the Cochrane Library, and Google Scholar were searched for relevant articles. Rs1880676, rs2177369, rs3810950, and rs868750 of CHAT; rs1937 and rs2306604 of TFAM; and rs10997691 and rs7070570 of VR22 are studied in this meta-analysis. RESULTS A total of 51 case-control studies with 16 446 cases and 16 057 controls were enrolled. For CHAT, rs2177369 (G>A) in whites and rs3810950 (G>A) in Asians were found to be associated with AD susceptibility. No association was detected between rs1880676 and rs868750 and AD risk. For TFAM and VR22, no significant association was detected in studied single-nucleotide polymorphisms (SNPs). CONCLUSIONS Rs2177369 and rs3810950 of CHAT are associated with AD susceptibility, but rs1880676 and rs868750 are not. Rs1937 and rs2306604 of TFAM, and rs10997691 and rs7070570 of VR22 are not significantly associated with AD risk.

  2. VizieR Online Data Catalog: V1044 Her VR differential photometry (Lu+, 2016)

    Science.gov (United States)

    Lu, H.; Zhang, L.; Han, X. L.; Pi, Q.; Wang, D.

    2016-11-01

    V1044 Her was observed in the VR bands on May 22, 23 and 24, 2015 using the 60-cm reflecting telescope at Xinglong Station of the National Astronomical Observatories of China (NAOC). This telescope was equipped with a 1024x1024 pixel CCD and the standard Johnson UBVRI filters. (1 data file).

  3. VR-Based Gamification of Communication Training and Oral Examination in a Second Language

    Science.gov (United States)

    Reitz, Liesa; Sohny, Aline; Lochmann, Gerrit

    2016-01-01

    The authors present a novel way of oral language training by embedding the English as a foreign language (EFL) learning process into a generic 3D Cooperative Virtual Reality (VR) Game. Due to lack of time, resources and innovation, the language classroom is limited in its possibilities of promoting authentic communication. Therefore, the…

  4. Generating Alternative Engineering Designs by Integrating Desktop VR with Genetic Algorithms

    Science.gov (United States)

    Chandramouli, Magesh; Bertoline, Gary; Connolly, Patrick

    2009-01-01

    This study proposes an innovative solution to the problem of multiobjective engineering design optimization by integrating desktop VR with genetic computing. Although, this study considers the case of construction design as an example to illustrate the framework, this method can very much be extended to other engineering design problems as well.…

  5. Generating Alternative Engineering Designs by Integrating Desktop VR with Genetic Algorithms

    Science.gov (United States)

    Chandramouli, Magesh; Bertoline, Gary; Connolly, Patrick

    2009-01-01

    This study proposes an innovative solution to the problem of multiobjective engineering design optimization by integrating desktop VR with genetic computing. Although, this study considers the case of construction design as an example to illustrate the framework, this method can very much be extended to other engineering design problems as well.…

  6. Creating widely accessible spatial interfaces: mobile VR for managing persistent pain.

    Science.gov (United States)

    Schroeder, David; Korsakov, Fedor; Jolton, Joseph; Keefe, Francis J; Haley, Alex; Keefe, Daniel F

    2013-01-01

    Using widely accessible VR technologies, researchers have implemented a series of multimodal spatial interfaces and virtual environments. The results demonstrate the degree to which we can now use low-cost (for example, mobile-phone based) VR environments to create rich virtual experiences involving motion sensing, physiological inputs, stereoscopic imagery, sound, and haptic feedback. Adapting spatial interfaces to these new platforms can open up exciting application areas for VR. In this case, the application area was in-home VR therapy for patients suffering from persistent pain (for example, arthritis and cancer pain). For such therapy to be successful, a rich spatial interface and rich visual aesthetic are particularly important. So, an interdisciplinary team with expertise in technology, design, meditation, and the psychology of pain collaborated to iteratively develop and evaluate several prototype systems. The video at http://youtu.be/mMPE7itReds demonstrates how the sine wave fitting responds to walking motions, for a walking-in-place application.

  7. Quick Statistics about Voice, Speech, and Language

    Science.gov (United States)

    ... here Home » Health Info » Statistics and Epidemiology Quick Statistics About Voice, Speech, Language Voice, Speech, Language, and ... no 205. Hyattsville, MD: National Center for Health Statistics. 2015. Hoffman HJ, Li C-M, Losonczy K, ...

  8. Introduction: Textual and contextual voices of translation

    DEFF Research Database (Denmark)

    2017-01-01

    Voices – marks of the tangle of subjectivities involved in textual processes – constitute the very fabric of texts in general and translations in particular. The title of this book, Textual and Contextual Voices of Translation, refers both to textual voices, that is, the voices found within...... the translated texts, and to contextual voices, that is, the voices of those involved in shaping, commenting, or otherwise influencing the textual voices. The latter appear in prefaces, reviews, and other texts that surround the translated texts and provide them with a context. Our main claim is that studying...... both the textual and contextual voices helps us better understand and explain the complexity of both the translation process and the translation product. The dovetailed approach to translation research that is advocated in this book aims at highlighting the diversity of participants, power positions...

  9. Speaker's voice as a memory cue.

    Science.gov (United States)

    Campeanu, Sandra; Craik, Fergus I M; Alain, Claude

    2015-02-01

    Speaker's voice occupies a central role as the cornerstone of auditory social interaction. Here, we review the evidence suggesting that speaker's voice constitutes an integral context cue in auditory memory. Investigation into the nature of voice representation as a memory cue is essential to understanding auditory memory and the neural correlates which underlie it. Evidence from behavioral and electrophysiological studies suggest that while specific voice reinstatement (i.e., same speaker) often appears to facilitate word memory even without attention to voice at study, the presence of a partial benefit of similar voices between study and test is less clear. In terms of explicit memory experiments utilizing unfamiliar voices, encoding methods appear to play a pivotal role. Voice congruency effects have been found when voice is specifically attended at study (i.e., when relatively shallow, perceptual encoding takes place). These behavioral findings coincide with neural indices of memory performance such as the parietal old/new recollection effect and the late right frontal effect. The former distinguishes between correctly identified old words and correctly identified new words, and reflects voice congruency only when voice is attended at study. Characterization of the latter likely depends upon voice memory, rather than word memory. There is also evidence to suggest that voice effects can be found in implicit memory paradigms. However, the presence of voice effects appears to depend greatly on the task employed. Using a word identification task, perceptual similarity between study and test conditions is, like for explicit memory tests, crucial. In addition, the type of noise employed appears to have a differential effect. While voice effects have been observed when white noise is used at both study and test, using multi-talker babble does not confer the same results. In terms of neuroimaging research modulations, characterization of an implicit memory effect

  10. Virtual Reality: Bringing the Awe of Our Science into The Classroom with VR

    Science.gov (United States)

    Bell, R. E.; Turrin, M.; Frearson, N.; Boghosian, A.; Ferrini, V. L.; Simpson, F.

    2016-12-01

    The geosciences are rich in imagery, making them compelling material for immersive teaching experiences. We often work in remote locations, places where few others are able to travel. Flat 2 D images from the field have served explorers and scientists well from the lantern slides brought back from Antarctica to the images scientists and educators now use in powerpoint presentations. These images provide a backdrop to introduce the experience for formal classes and informal presentations. Our stories from the field bring the setting alive for the participants. The travelers presented and the audience passively listened. Immersive learning opportunities are much more powerful than lecturing. We have enlisted both VR and drone imagery to bring learners fully into the experience of science. A 360 VR image brings the viewer into the moment of discovery. Both have been shown to create an active learning setting fully under the learner's control; they explore at their own pace and following their own interest. This learning `sticks', becoming part of the participant's own unique experience in the space. We are building VR images of field experiences and VR data immersion experiences that will transport people into new locations, building a field experience that they can not only see but fully explore. Through VR we introduce new experiences that showcase our science, our careers and our collaborations. Users can spin the view up to see the helicopter landing in a remote field location by the ice. Spin to the right and see a colleague collecting a reading from instruments that have been pulled from the LC130 aircraft. Turn the view to the left and see the harsh windswept environment along the edge of an ice shelf. Look down and note that you feet are encased in snow boots to keep them warm and stable on the ice. The viewer is in the field as part of the science team. Learning in the classroom and through social media is now fully 360 and fully immersive.

  11. 20 CFR 411.405 - When does an agreement between an EN and the State VR agency have to be in place?

    Science.gov (United States)

    2010-04-01

    ... State VR agency have to be in place? 411.405 Section 411.405 Employees' Benefits SOCIAL SECURITY...' Participation Agreements Between Employment Networks and State Vr Agencies § 411.405 When does an agreement between an EN and the State VR agency have to be in place? Each EN must have an agreement with the State...

  12. 20 CFR 411.400 - Can an EN to which a beneficiary's ticket is assigned refer the beneficiary to a State VR agency...

    Science.gov (United States)

    2010-04-01

    ... assigned refer the beneficiary to a State VR agency for services? 411.400 Section 411.400 Employees... Rehabilitation Agencies' Participation Referrals by Employment Networks to State Vr Agencies § 411.400 Can an EN to which a beneficiary's ticket is assigned refer the beneficiary to a State VR agency for services...

  13. 20 CFR 411.390 - What does a State VR agency do if a beneficiary to whom it is already providing services has a...

    Science.gov (United States)

    2010-04-01

    ... 20 Employees' Benefits 2 2010-04-01 2010-04-01 false What does a State VR agency do if a... VR agency do if a beneficiary to whom it is already providing services has a ticket that is available for assignment? If a beneficiary who is receiving services from the State VR agency under an existing...

  14. 20 CFR 411.582 - Can a State VR agency receive payment under the cost reimbursement payment system if a continuous...

    Science.gov (United States)

    2010-04-01

    ... 20 Employees' Benefits 2 2010-04-01 2010-04-01 false Can a State VR agency receive payment under... Systems § 411.582 Can a State VR agency receive payment under the cost reimbursement payment system if a...? Yes. If a State VR agency provides services to a beneficiary under 34 CFR part 361, and elects payment...

  15. 20 CFR 411.375 - Does a State VR agency continue to provide services under the requirements of the State plan...

    Science.gov (United States)

    2010-04-01

    ... 20 Employees' Benefits 2 2010-04-01 2010-04-01 false Does a State VR agency continue to provide... Rehabilitation Agencies' Participation Participation in the Ticket to Work Program § 411.375 Does a State VR.... The State VR agency must continue to provide services under the requirements of the State plan...

  16. 20 CFR 411.365 - How does a State VR agency notify us about its choice of a payment system for use when...

    Science.gov (United States)

    2010-04-01

    ... 20 Employees' Benefits 2 2010-04-01 2010-04-01 false How does a State VR agency notify us about... Rehabilitation Agencies' Participation Participation in the Ticket to Work Program § 411.365 How does a State VR agency notify us about its choice of a payment system for use when functioning as an EN? (a) The State VR...

  17. Voicing Consciousness: The Mind in Writing

    Science.gov (United States)

    Luce-Kapler, Rebecca; Catlin, Susan; Sumara, Dennis; Kocher, Philomene

    2011-01-01

    In this paper, the authors investigate the enduring power of voice as a concept in writing pedagogy. They argue that one can benefit from considering Elbow's assertion that both text and voice be considered as important aspects of written discourse. In particular, voice is a powerful metaphor for the material, social and historical nature of…

  18. Understanding the 'Anorexic Voice' in Anorexia Nervosa.

    Science.gov (United States)

    Pugh, Matthew; Waller, Glenn

    2016-07-20

    In common with individuals experiencing a number of disorders, people with anorexia nervosa report experiencing an internal 'voice'. The anorexic voice comments on the individual's eating, weight and shape and instructs the individual to restrict or compensate. However, the core characteristics of the anorexic voice are not known. This study aimed to develop a parsimonious model of the voice characteristics that are related to key features of eating disorder pathology and to determine whether patients with anorexia nervosa fall into groups with different voice experiences. The participants were 49 women with full diagnoses of anorexia nervosa. Each completed validated measures of the power and nature of their voice experience and of their responses to the voice. Different voice characteristics were associated with current body mass index, duration of disorder and eating cognitions. Two subgroups emerged, with 'weaker' and 'stronger' voice experiences. Those with stronger voices were characterized by having more negative eating attitudes, more severe compensatory behaviours, a longer duration of illness and a greater likelihood of having the binge-purge subtype of anorexia nervosa. The findings indicate that the anorexic voice is an important element of the psychopathology of anorexia nervosa. Addressing the anorexic voice might be helpful in enhancing outcomes of treatments for anorexia nervosa, but that conclusion might apply only to patients with more severe eating psychopathology. Copyright © 2016 John Wiley & Sons, Ltd.

  19. Voice and culture: A prospect theory approach

    NARCIS (Netherlands)

    Paddock, E.L.; Ko, Junsu; Cropanzano, R.; Bagger, J.; El Akremi, A.; Camerman, A.; Greguras, G. J.; Mladinic, A.; Moliner, C.; Nam, K.; Törnblom, K.; Van den Bos, Kees

    2015-01-01

    The present study examines the congruence of individuals' minimum preferred amounts of voice with the prospect theory value function across nine countries. Accounting for previously ignored minimum preferred amounts of voice and actual voice amounts integral to testing the steepness of gain and loss

  20. Finding Voice: Learning about Language and Power

    Science.gov (United States)

    Christensen, Linda

    2011-01-01

    Christensen discusses why teachers need to teach students "voice" in its social and political context, to show the intersection of voice and power, to encourage students to ask, "Whose voices get heard? Whose are marginalized?" As Christensen writes, "Once students begin to understand that Standard English is one language among many, we can help…

  1. Analyzing the mediated voice - a datasession

    DEFF Research Database (Denmark)

    Lawaetz, Anna

    Broadcasted voices are technologically manipulated. In order to achieve a certain autencity or sound of “reality” paradoxically the voices are filtered and trained in order to reach the listeners. This “mis-en-scene” is important knowledge when it comes to the development of a consistent method o...... of analysis of the mediated voice...

  2. Analyzing the mediated voice - a datasession

    DEFF Research Database (Denmark)

    Lawaetz, Anna

    Broadcasted voices are technologically manipulated. In order to achieve a certain autencity or sound of “reality” paradoxically the voices are filtered and trained in order to reach the listeners. This “mis-en-scene” is important knowledge when it comes to the development of a consistent method...... of analysis of the mediated voice...

  3. Voice and culture: A prospect theory approach

    NARCIS (Netherlands)

    Paddock, E.L.; Ko, Junsu; Cropanzano, R.; Bagger, J.; El Akremi, A.; Camerman, A.; Greguras, G. J.; Mladinic, A.; Moliner, C.; Nam, K.; Törnblom, K.; Van den Bos, Kees

    2015-01-01

    The present study examines the congruence of individuals' minimum preferred amounts of voice with the prospect theory value function across nine countries. Accounting for previously ignored minimum preferred amounts of voice and actual voice amounts integral to testing the steepness of gain and loss

  4. "Voice Forum" The Human Voice as Primary Instrument in Music Therapy

    DEFF Research Database (Denmark)

    Pedersen, Inge Nygaard; Storm, Sanne

    2009-01-01

    Aspects will be drawn on the human voice as tool for embodying our psychological and physiological state, and attempting integration of feelings. Presentations and dialogues on different methods and techniques in "Therapy related body-and voice work.", as well as the human voice as a tool for non...... for nonverbal orientation and information both to our selves and others. Focus on training on the voice instrument, the effect and impact of the human voice, and listening perspectives...

  5. Voice-Specialized Speech-Language Pathologist's Criteria for Discharge from Voice Therapy.

    Science.gov (United States)

    Gillespie, Amanda I; Gartner-Schmidt, Jackie

    2017-08-07

    No standard protocol exists to determine when a patient is ready and able to be discharged from voice therapy. The aim of the present study was to determine what factors speech-language pathologists (SLPs) deem most important when discharging a patient from voice therapy. A second aim was to determine if responses differed based on years of voice experience. Step 1: Seven voice-specialized SLPs generated a list of items thought to be relevant to voice therapy discharge. Step 2: Fifty voice-specialized SLPs rated each item on the list in terms of importance in determining discharge from voice therapy. Step 1: Four themes emerged-outcome measures, laryngeal appearance, SLP perceptions, and patient factors-as important items when determining discharge from voice therapy. Step 2: The top five most important criteria for discharge readiness were that the patient had to be able to (1) independently use a better voice (transfer), (2) function with his or her new voice production in activities of daily living (transfer), (3) differentiate between good and bad voice, (4) take responsibility for voice, and (5) sound better from baseline. Novice and experienced clinicians agreed between 94% and 97% concerning what was deemed "very important." SLPs agree that a patient's ability to use voice techniques in conversation and real-life situations outside of the therapy room are the most important determinants for voice therapy discharge. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  6. Multi-thread Parallel Speech Recognition for Mobile Applications

    Directory of Open Access Journals (Sweden)

    LOJKA Martin

    2014-05-01

    Full Text Available In this paper, the server based solution of the multi-thread large vocabulary automatic speech recognition engine is described along with the Android OS and HTML5 practical application examples. The basic idea was to bring speech recognition available for full variety of applications for computers and especially for mobile devices. The speech recognition engine should be independent of commercial products and services (where the dictionary could not be modified. Using of third-party services could be also a security and privacy problem in specific applications, when the unsecured audio data could not be sent to uncontrolled environments (voice data transferred to servers around the globe. Using our experience with speech recognition applications, we have been able to construct a multi-thread speech recognition serverbased solution designed for simple applications interface (API to speech recognition engine modified to specific needs of particular application.

  7. The development of the Spanish verb ir into auxiliary of voice

    DEFF Research Database (Denmark)

    Vinther, Thora

    2005-01-01

    spanish, syntax, grammaticalisation, past participle, passive voice, middle voice, language development......spanish, syntax, grammaticalisation, past participle, passive voice, middle voice, language development...

  8. Objective Voice Parameters in Colombian School Workers with Healthy Voices

    Directory of Open Access Journals (Sweden)

    Lady Catherine Cantor Cutiva

    2015-09-01

    Full Text Available Objectives: To characterize the objective voice parameters among school workers, and to identi­fy associated factors of three objective voice parameters, namely fundamental frequency, sound pressure level and maximum phonation time. Materials and methods: We conducted a cross-sectional study among 116 Colombian teachers and 20 Colombian non-teachers. After signing the informed consent form, participants filled out a questionnaire. Then, a voice sample was recorded and evaluated perceptually by a speech therapist and by objective voice analysis with praat software. Short-term environmental measurements of sound level, temperature, humi­dity, and reverberation time were conducted during visits at the workplaces, such as classrooms and offices. Linear regression analysis was used to determine associations between individual and work-related factors and objective voice parameters. Results: Compared with men, women had higher fundamental frequency (201 Hz for teachers and 209 for non-teachers vs. 120 Hz for teachers and 127 for non-teachers and sound pressure level (82 dB vs. 80 dB, and shorter maximum phonation time (around 14 seconds vs. around 16 seconds. Female teachers younger than 50 years of age evidenced a significant tendency to speak with lower fundamental frequen­cy and shorter mpt compared with female teachers older than 50 years of age. Female teachers had significantly higher fundamental frequency (66 Hz, higher sound pressure level (2 dB and short phonation time (2 seconds than male teachers. Conclusion: Female teachers younger than 50 years of age had significantly lower F0 and shorter mpt compared with those older than 50 years of age. The multivariate analysis showed that gender was a much more important determinant of variations in F0, spl and mpt than age and teaching occupation. Objectively measured temperature also contributed to the changes on spl among school workers.

  9. Virtual Reality in Health System: Beyond Entertainment. A Mini-Review on the Efficacy of VR During Cancer Treatment.

    Science.gov (United States)

    Chirico, Andrea; Lucidi, Fabio; De Laurentiis, Michele; Milanese, Carla; Napoli, Alessandro; Giordano, Antonio

    2016-02-01

    Virtual reality (VR), a computer-generated virtual environment, has been increasingly used in the entertainment world becoming a very new evolving field, but VR technology has also found a variety of applications in the biomedical field. VR can offer to subjects a safe environment within which to carry on different interventions ranging from the rehabilitation of discharged patients directly at home, to the support of hospitalized patients during different procedures and also of oncological inpatient subjects. VR appears as a promising tool for support and monitoring treatments in cancer patients influencing psychological and physiological functions. The aim of this systematic review is to provide an overview of all the studies that used VR intervention on cancer patients and analyze their main findings. Nineteen studies across nearly a thousand articles were identified that explored effects of VR interventions on cancer patients. Although these studies varied greatly in setting and design, this review identified some overarching themes. Results found that VR improved patients' emotional well-being, and diminished cancer-related psychological symptoms. The studies explored various relevant variables including different types of settings (i.e., during chemotherapy, during pain procedures, during hospitalization). Here, we point to the need of a global and multi-disciplinary approach aimed at analyzing the effects of VR taking advantage of the new technology systems like biosensors as well as electroencephalogram monitoring pre, during, and after intervention. Devoting more attention to bio-physiological variables, standardized procedures, extending duration to longitudinal studies and adjusting for motion sickness related to VR treatment need to become standard of this research field. © 2015 Wiley Periodicals, Inc.

  10. Playful Interaction with Voice Sensing Modular Robots

    DEFF Research Database (Denmark)

    Heesche, Bjarke; MacDonald, Ewen; Fogh, Rune

    2013-01-01

    This paper describes a voice sensor, suitable for modular robotic systems, which estimates the energy and fundamental frequency, F0, of the user’s voice. Through a number of example applications and tests with children, we observe how the voice sensor facilitates playful interaction between...... children and two different robot configurations. In future work, we will investigate if such a system can motivate children to improve voice control and explore how to extend the sensor to detect emotions in the user’s voice....

  11. Recognition and Exteriority: Towards a Recognition-Theoretic Account of Globalization

    Directory of Open Access Journals (Sweden)

    Sebastian Purcell

    2011-06-01

    Full Text Available This essay aims to extend Paul Ricœur’s account of recognition to address some of the concerns of globalization, especially those voiced by Enrique Dussel. The extension is accomplished in two parts.  First, Dussel’s account of spatial existence as dwelling is reviewed as it is pertinent to the concerns of globalization.  Next, it is demonstrated that each of the aspects of Ricœur’s account of recognition may be given a spatial re-articulation.  The results thus establish an outline of how recognition theory might address some of the concerns of globalization.  The essay concludes with several consequences for the modification of recognition politics as one finds it in the late work of Ricœur and in Axel Honneth’s ongoing inquiries. 

  12. Application Of t-Cherry Junction Trees in Pattern Recognition

    Directory of Open Access Journals (Sweden)

    Edith Kovacs

    2010-06-01

    Full Text Available Pattern recognition aims to classify data (patterns based ei-
    ther on a priori knowledge or on statistical information extracted from the data. In this paper we will concentrate on statistical pattern recognition using a new probabilistic approach which makes possible to select the so called 'informative' features. We develop a pattern recognition algorithm which is based on the conditional independence structure underlying the statistical data. Our method was succesfully applied on a real problem of recognizing Parkinson's disease on the basis of voice disorders.

  13. Monitoring outcomes for the Medicare Advantage program: methods and application of the VR-12 for evaluation of plans.

    Science.gov (United States)

    Kazis, Lewis E; Selim, Alfredo J; Rogers, William; Qian, Shirley X; Brazier, John

    2012-01-01

    The Veterans RAND 12-Item Health Survey (VR-12) is one of the major patient-reported outcomes for ranking the Medicare Advantage (MA) plans in the Health Outcomes Survey (HOS). Approaches for scoring physical and mental health are given using contemporary norms and regression estimators. A new metric approach for the VR-12 called the "VR-6D" is presented with case-mix adjustments for monitoring plans that combine utilities and mortality. Results show that the models for ranking health outcomes of the plans are robust and credible. Future directions include the use of utilities for evaluating and ranking of MA plans.

  14. VOICE QUALITY BEFORE AND AFTER THYROIDECTOMY

    Directory of Open Access Journals (Sweden)

    Dora CVELBAR

    2016-04-01

    Full Text Available Introduction: Voice disorders are a well-known complication which is often associated with thyroid gland diseases and because voice is still the basic mean of communication it is very important to maintain its quality healthy. Objectives: The aim of this study referred to questions whether there is a statistically significant difference between results of voice self-assessment, perceptual voice assessment and acoustic voice analysis before and after thyroidectomy and whether there are statistically significant correlations between variables of voice self-assessment, perceptual assessment and acoustic analysis before and after thyroidectomy. Methods: This scientific research included 12 participants aged between 41 and 76. Voice self-assessment was conducted with the help of Croatian version of Voice Handicap Index (VHI. Recorded reading samples were used for perceptual assessment and later evaluated by two clinical speech and language therapists. Recorded samples of phonation were used for acoustic analysis which was conducted with the help of acoustic program Praat. All of the data was processed through descriptive statistics and nonparametric statistical methods. Results: Results showed that there are statistically significant differences between results of voice self-assessments and results of acoustic analysis before and after thyroidectomy. Statistically significant correlations were found between variables of perceptual assessment and acoustic analysis. Conclusion: Obtained results indicate the importance of multidimensional, preoperative and postoperative assessment. This kind of assessment allows the clinician to describe all of the voice features and provides appropriate recommendation for further rehabilitation to the patient in order to optimize voice outcomes.

  15. Beyond Insularity: Releasing the Voices.

    Science.gov (United States)

    Greene, Maxine

    1993-01-01

    Aspects of English-as-a-Second-Language are discussed from the standpoint of a teacher-educator with a particular interest in philosophy, the arts, and humanities and what they signify for the schools. The idea of giving voice to all viewpoints and sociocultural circumstances is considered for content learning and heterogeneous grouping. (Contains…

  16. A voice and nothing more

    DEFF Research Database (Denmark)

    Mebus, Andreas Nozic Lindgren

    2012-01-01

    Andreas Mebus fokuserer herefter på et helt konkret aspekt af talen, nemlig ”stemmen” i sin artikel ”A voice and nothing more – en filosofisk udredning af stemmen”. Gennem Mladen Dolars teori om stemmen, redegør Mebus for de forskellige aspekter ved stemmen; som bærer af mening, som æstetisk...

  17. Voice, Citizenship, and Civic Action

    DEFF Research Database (Denmark)

    Tufte, Thomas

    2014-01-01

    In recent years the world has experienced a resurgence in practices of bottom-up communication for social change, a plethora of agency in which claims for voice and citizenship through massive civic action have conquered center stage in the public debate. This resurgence has sparked a series...

  18. The Performing Voice of Radio

    DEFF Research Database (Denmark)

    Lawaetz, Anna

    The ongoing international development of opening media archives for researchers as well as for broader audiences calls for a closer discussion of the mediated voice and how to analyse it. Which parameters can be analysed and which parameters are not covered by the analysis? Furthermore, how do we...

  19. Voice and choice by delegation

    NARCIS (Netherlands)

    van de Bovenkamp, H.; Vollaard, H.; Trappenburg, M.; Grit, K

    2013-01-01

    In many Western countries, options for citizens to influence public services are increased to improve the quality of services and democratize decision making. Possibilities to influence are often cast into Albert Hirschman's taxonomy of exit (choice), voice, and loyalty. In this article we identify

  20. Work-related voice disorder

    Directory of Open Access Journals (Sweden)

    Paulo Eduardo Przysiezny

    2015-04-01

    Full Text Available INTRODUCTION: Dysphonia is the main symptom of the disorders of oral communication. However, voice disorders also present with other symptoms such as difficulty in maintaining the voice (asthenia, vocal fatigue, variation in habitual vocal fundamental frequency, hoarseness, lack of vocal volume and projection, loss of vocal efficiency, and weakness when speaking. There are several proposals for the etiologic classification of dysphonia: functional, organofunctional, organic, and work-related voice disorder (WRVD.OBJECTIVE: To conduct a literature review on WRVD and on the current Brazilian labor legislation.METHODS: This was a review article with bibliographical research conducted on the PubMed and Bireme databases, using the terms "work-related voice disorder", "occupational dysphonia", "dysphonia and labor legislation", and a review of labor and social security relevant laws.CONCLUSION: WRVD is a situation that frequently is listed as a reason for work absenteeism, functional rehabilitation, or for prolonged absence from work. Currently, forensic physicians have no comparative parameters to help with the analysis of vocal disorders. In certain situations WRVD may cause, work disability. This disorder may be labor-related, or be an adjuvant factor to work-related diseases.

  1. The Performing Voice of Radio

    DEFF Research Database (Denmark)

    Lawaetz, Anna

    The ongoing international development of opening media archives for researchers as well as for broader audiences calls for a closer discussion of the mediated voice and how to analyse it. Which parameters can be analysed and which parameters are not covered by the analysis? Furthermore, how do we...

  2. Adolescent Leadership: The Female Voice

    Science.gov (United States)

    Archard, Nicole

    2013-01-01

    This research investigated the female adolescent view of leadership by giving voice to student leaders through focus group discussions. The questions: What is leadership? Where/how was leadership taught?, and How was leadership practised? were explored within the context of girls' schools located in Australia, with one school located in South…

  3. Voicing children's critique and utopias

    DEFF Research Database (Denmark)

    Husted, Mia; Lind, Unni

    2016-01-01

    , designed to accommodate children's participation through graphic illustrations of young children's critique and utopias. The study is informed by a commitment to democratic participation and processes (Reason and Bradbury 2001, Gunnarsson et al. 2016). Ethical guidelines implied dialogues and discussions......, children's voice, critique and utopias, pedagogical work...

  4. Women's Voices in Experiential Education.

    Science.gov (United States)

    Warren, Karen, Ed.

    This book is a collection of feminist analyses of various topics in experiential education, particularly as it applies to outdoors and adventure education, as well as practical examples of how women's experiences can contribute to the field as a whole. Following an introduction, "The Quilt of Women's Voices" (Maya Angelou), the 25…

  5. Forensic Automatic Speaker Recognition Based on Likelihood Ratio Using Acoustic-phonetic Features Measured Automatically

    Directory of Open Access Journals (Sweden)

    Huapeng Wang

    2015-01-01

    Full Text Available Forensic speaker recognition is experiencing a remarkable paradigm shift in terms of the evaluation framework and presentation of voice evidence. This paper proposes a new method of forensic automatic speaker recognition using the likelihood ratio framework to quantify the strength of voice evidence. The proposed method uses a reference database to calculate the within- and between-speaker variability. Some acoustic-phonetic features are extracted automatically using the software VoiceSauce. The effectiveness of the approach was tested using two Mandarin databases: A mobile telephone database and a landline database. The experiment's results indicate that these acoustic-phonetic features do have some discriminating potential and are worth trying in discrimination. The automatic acoustic-phonetic features have acceptable discriminative performance and can provide more reliable results in evidence analysis when fused with other kind of voice features.

  6. Vowel recognition by fuzzy inference and application to recognition of continuous Korean speech. Fuzzy suiron ni yoru boin ninshiki to kankokugo renzoku onsei eno oyo

    Energy Technology Data Exchange (ETDEWEB)

    Choi, W.K.; Akizuki, K. (Waseda Univ., Tokyo (Japan)); Lee, H.H. (Fukuoka Inst. of Tech., Fukuoka (Japan))

    1991-05-20

    The target of voice recognition is to recognize continuous speech which is effective for speech recognition of unspecified persons. As a new matching method, the variations of feature parameters of speakers are represented as fuzzy variables to express the variation by membership functions. It is a new pattern matching method of fuzzy inference using feature parameters, fuzzy relation and synthesis of each formant, and the fuzzy rule. It is a recognition method for the inference of best formant which matches the fact by providing each characteristic quantity and fuzzy rule for composite calculation. For consonant recognition, pitch, logarithmic energies, zero crossing rates, etc. are used which represent features of each formant. KOSRES 2, recognition system for continuous Korean speech, was structured using this method which was subjected to recognition experiments on continuous Korean speech, and the recognition method by fuzzy inference is found to be effective for speech recognition of unspecified persons. 8 refs., 9 figs., 3 tabs.

  7. Earthquake Disaster Simulation for an Urban Area, with GIS, CAD,FEA, and VR Integration

    Institute of Scientific and Technical Information of China (English)

    XU Feng; CHEN Xuping; REN Aizhu; LU Xinzheng

    2008-01-01

    For a disaster whose scale includes an urban area, it is difficult to be studied with physical ex-periments. Numerical simulation is found a very efficient tool for such problem. This paper aims at developing an integrated urban earthquake simulation system (UESS) that uses GIS as the model source, CAD as the model generating tools, FEA as damage prediction, and virtual reality (VR) as the post-process platform. Anautomatic procedure was developed to buildup the 3D structural model of buildings in an urban area, as well as to simulate their earthquake performances, from the digital map of GIS. And the simulation results were presented in an integrated interface with a GIS view-port for position finding, a CAD view-port for 3D struc-tural damage identification, and a VR view-port for 3D dynamic structural vibration display. An urban example with more than 7000 buildings was select to demonstrate the feasibility of proposed system.

  8. Adsorption of molecular gases on porous materials in the SAFT-VR approximation

    CERN Document Server

    Castro, M; Martinez, A; Rosu, H C; 10.1016/j.physa.2010.04.028

    2010-01-01

    A simple molecular thermodynamic approach is applied to the study of the adsorption of gases of chain molecules on solid surfaces. We use a model based on the Statistical Associating Fluid Theory for Variable Range (SAFT-VR) potentials [A. Gil-Villegas, A. Galindo, P. J. Whitehead, S. J. Mills, G. Jackson, A. N. Burgess, J. Chem. Phys. 106 (1997) 4168] that we extend by including a quasi-two-dimensional approximation to describe the adsorption properties of this type of real gases [A. Martinez, M. Castro, C. McCabe, A. Gil-Villegas, J. Chem. Phys. 126 (2007) 074707]. The model is applied to ethane, ethylene, propane, and carbon dioxide adsorbed on activated carbon and silica gel, which are porous media of significant industrial interest. We show that the adsorption isotherms obtained by means of the present SAFT-VR modeling are in fair agreement with the experimental results provided in the literature

  9. The use of PC based VR in clinical medicine: the VREPAR projects.

    Science.gov (United States)

    Riva, G; Bacchetta, M; Baruffi, M; Borgomainerio, E; Defrance, C; Gatti, F; Galimberti, C; Fontaneto, S; Marchi, S; Molinari, E; Nugues, P; Rinaldi, S; Rovetta, A; Ferretti, G S; Tonci, A; Wann, J; Vincelli, F

    1999-01-01

    Virtual reality (VR) is an emerging technology that alters the way individuals interact with computers: a 3D computer-generated environment in which a person can move about and interact as if he actually was inside it. Given to the high computational power required to create virtual environments, these are usually developed on expensive high-end workstations. However, the significant advances in PC hardware that have been made over the last three years, are making PC-based VR a possible solution for clinical assessment and therapy. VREPAR - Virtual Reality Environments for Psychoneurophysiological Assessment and Rehabilitation - are two European Community funded projects (Telematics for health - HC 1053/HC 1055 - http://www.psicologia.net) that are trying to develop a modular PC-based virtual reality system for the medical market. The paper describes the rationale of the developed modules and the preliminary results obtained.

  10. Design and simulation of satellite attitude control system based on Simulink and VR

    Science.gov (United States)

    Zhang, Yang; Gan, Qingbo; Kang, Jingshu

    2016-01-01

    In order to research satellite attitude control system design and visual simulation, the simulation framework of satellite dynamics and attitude control using Simulink were established. The design of satellite earth-oriented control system based on quaternion feedback was completed. The 3D scene based on VR was created and models in the scene were driven by simulation data of Simulink. By coordinate transformation. successful observing the scene in inertial coordinate system, orbit coordinate system and body coordinate system. The result shows that application of simulation method of Simulink combined with VR in the design of satellite attitude control system field, has the advantages of high confidence level, hard real-time property, multi-perspective and multi-coordinate system observing the scene, and improves the comprehensibility and accuracy of the design.

  11. Needle bending in a VR-puncture training system using a 6DOF haptic device.

    Science.gov (United States)

    Färber, Matthias; Dahmke, Thorsten; Bohn, Christian A; Handels, Heinz

    2009-01-01

    The use of virtual reality techniques opens up new perspectives to support and improve the puncture training in medical education. In this work a 3D VR-Simulator for the training of lumbar and ascites punctures has been extended to support the bending of the puncture needle. For this purpose the needle is designed as an angular spring model. The forces that restrict the user from bending the needle are calculated using a multiproxy technique and given to the user via a 6DOF haptic device (Sensable Phantom Premium 1.5). Proxy based haptic volume rendering is used to calculate the proxy movement. This way it is possible to integrate original CT-patient data into the rendering process and generate forces from structures that have not been segmented. The bending technique has been integrated in a VR-training system for puncture interventions and shows good results concerning update rate and user acceptance.

  12. A VR Based Anti-accident Training System for Airport’s Power

    Directory of Open Access Journals (Sweden)

    Gu GuoDong

    2013-05-01

    Full Text Available A virtual environment of airport’s power supply is very useful for anti-accident training. In this paper, an effective method of anti-accident training for airport’s power supply based on Delta3D VR engine was proposed. The architecture of airport’s power supply training system was introduced. The electric closet’s operating simulation was accomplished based on the device’s operation logic. The model of fault and relay protection was designed according to electrical characteristics. A finite automaton model of diesel engine was presented based on operating conditions for virtual operation. Finally, a VR based airport’s power supply system was built and verified in worker’s training.

  13. Asymmetries in the exploitation of phonetic features for word recognition.

    Science.gov (United States)

    Martin, Alexander; Peperkamp, Sharon

    2015-04-01

    French listeners' reliance on voicing, manner, and place was tested in a mispronunciation detection task. Mispronounced words were more likely to be recognized when the mispronunciation concerned voicing rather than manner or place. This indicates that listeners rely less on the former than on the latter for the purposes of word recognition. Further, the role of visual cues to phonetic features was explored by the task being conducted in both an audio-only and an audiovisual version, but no effect of modality was found. Discussion focuses on crosslinguistic comparisons and lexical factors that might influence the weight of individual features.

  14. The segmentation of the HMD market: optics for smart glasses, smart eyewear, AR and VR headsets

    Science.gov (United States)

    Kress, Bernard; Saeedi, Ehsan; Brac-de-la-Perriere, Vincent

    2014-09-01

    This paper reviews the various optical technologies that have been developed to implement HMDs (Head Mounted Displays), both as AR (Augmented Reality) devices, VR (Virtual Reality) devices and more recently as smart glasses, smart eyewear or connected glasses. We review the typical requirements and optical performances of such devices and categorize them into distinct groups, which are suited for different (and constantly evolving) market segments, and analyze such market segmentation.

  15. Development of Haptic Needle for VR Based Injection Training System Using Simulated Patient.

    Science.gov (United States)

    Nishio, Kohei; Nakaguchi, Toshiyia

    2016-01-01

    VR based injection training system using standardized patient was proposed. This system uses haptic needle (HNSP) which can represent haptic force. Since the previously proposed system had a few problems with haptic reproduction, biological reproducibility needed to be improved. In order to solve these problems, we developed a new HNSP to replace the haptic needle. In addition, we conducted two validation studies for the HNSP.

  16. Goodbye spreadsheets : no matter your oil and gas project, PetroVR can replace unwieldy in-house spreadsheets

    Energy Technology Data Exchange (ETDEWEB)

    Chandler, G.

    2009-03-15

    This article described Caesar Systems' newest version of the Petroleum Ventures and Risk (PetroVR) decision software that is now commercially available for the oil and gas industry. PetroVR is field-proven to enable exploration and production project teams to simulate reservoir-to-market business opportunities. The software helps operators gather project information, build a development model and calculate cost and production profiles. It automates key activities that technical teams once performed manually with ordinary spreadsheets. With PetroVR, project evaluations that previously took weeks or months to complete can now be performed in a matter of hours or days with more accuracy. Although PetroVR contains sophisticated modeling capabilities, it has a user friendly interface. 3 figs.

  17. Toward a Next Generation of Widely Accessible Spatial Interfaces: Mobile VR Environments for Patients with Persistent Pain

    Science.gov (United States)

    Schroeder, David; Korsakov, Fedor; Jolton, Joseph; Keefe, Francis J.; Haley, Alex; Keefe, Daniel F.

    2015-01-01

    We present a series of multi-modal spatial interfaces and virtual environments that can be implemented with widely accessible virtual reality (VR) technologies. The results demonstrate and evaluate the new degree to which rich virtual experiences involving motion sensing, physiological inputs, stereoscopic imagery, sound, and haptic feedback can now be created using low-cost (e.g., mobile phone based) VR environments. Adapting spatial interfaces to these new platforms can open up exciting new application areas for VR. This is demonstrated through a series of prototype systems aimed at delivering in-home VR therapies to patients suffering from persistent pain conditions (e.g. arthritis pain, cancer pain). A rich spatial interface and visual aesthetic is particularly important for the success of these applications; thus an interdisciplinary team with expertise in technology, design, meditation, and the psychology of pain worked together to iteratively develop and evaluate the current prototypes. PMID:24807994

  18. An automatic speech recognition system with speaker-independent identification support

    Science.gov (United States)

    Caranica, Alexandru; Burileanu, Corneliu

    2015-02-01

    The novelty of this work relies on the application of an open source research software toolkit (CMU Sphinx) to train, build and evaluate a speech recognition system, with speaker-independent support, for voice-controlled hardware applications. Moreover, we propose to use the trained acoustic model to successfully decode offline voice commands on embedded hardware, such as an ARMv6 low-cost SoC, Raspberry PI. This type of single-board computer, mainly used for educational and research activities, can serve as a proof-of-concept software and hardware stack for low cost voice automation systems.

  19. Proximal pulmonary vein stenosis detection in pediatric patients: value of multiplanar and 3-D VR imaging evaluation.

    Science.gov (United States)

    Lee, Edward Y; Jenkins, Kathy J; Muneeb, Muhammad; Marshall, Audrey C; Tracy, Donald A; Zurakowski, David; Boiselle, Phillip M

    2013-08-01

    One of the important benefits of using multidetector computed tomography (MDCT) is its capability to generate high-quality two-dimensional (2-D) multiplanar (MPR) and three-dimensional (3-D) images from volumetric and isotropic axial CT data. However, to the best of our knowledge, no results have been published on the potential diagnostic role of multiplanar and 3-D volume-rendered (VR) images in detecting pulmonary vein stenosis, a condition in which MDCT has recently assumed a role as the initial noninvasive imaging modality of choice. The purpose of this study was to compare diagnostic accuracy and interpretation time of axial, multiplanar and 3-D VR images for detection of proximal pulmonary vein stenosis in children, and to assess the potential added diagnostic value of multiplanar and 3-D VR images. We used our hospital information system to identify all consecutive children (VR images. Final diagnosis was confirmed by angiographic findings. Diagnostic accuracy was compared using the z-test. Confidence level of diagnosis (scale 1-5, 5 = highest), perceived added diagnostic value (scale 1-5, 5 = highest), and interpretation time of multiplanar or 3-D VR images were compared using paired t-tests. Interobserver agreement was measured using the chance-corrected kappa coefficient. The final study population consisted of 28 children (15 boys and 13 girls; mean age: 5.2 months). Diagnostic accuracy based on 116 individual pulmonary veins for detection of proximal pulmonary vein stenosis was 72.4% (84 of 116) for axial MDCT images, 77.5% (90 of 116 cases) for multiplanar MDCT images, and 93% (108 of 116 cases) for 3-D VR images with significantly higher accuracy with 3-D VR compared to axial (z = 4.17, P VR images (mean level: 4.6) compared to axial MDCT images (mean level: 1.7) and multiplanar MDCT images (mean level: 2.0) (paired t-tests, P VR images (mean added diagnostic value: 4.7) were found to provide added diagnostic value for detecting

  20. Virtual Iraq: initial results from a VR exposure therapy application for combat-related PTSD.

    Science.gov (United States)

    Rizzo, Albert A; Graap, Ken; Perlman, Karen; McLay, Robert N; Rothbaum, Barbara O; Reger, Greg; Parsons, Thomas; Difede, Joann; Pair, Jarrell

    2008-01-01

    Post Traumatic Stress Disorder (PTSD) is reported to be caused by traumatic events that are outside the range of usual human experience including (but not limited to) military combat, violent personal assault, being kidnapped or taken hostage and terrorist attacks. Initial data suggests that at least 1 out of 6 Iraq War veterans are exhibiting symptoms of depression, anxiety and PTSD. Virtual Reality (VR) delivered exposure therapy for PTSD has been used with reports of positive outcomes. The aim of the current paper is to present the rationale and brief description of a Virtual Iraq PTSD VR therapy application and present initial findings from its use with PTSD patients. Thus far, Virtual Iraq consists of a series of customizable virtual scenarios designed to represent relevant Middle Eastern VR contexts for exposure therapy, including a city and desert road convoy environment. User-centered design feedback needed to iteratively evolve the system was gathered from returning Iraq War veterans in the USA and from a system deployed in Iraq and tested by an Army Combat Stress Control Team. Clinical trials are currently underway at Ft. Lewis, Camp Pendleton, Emory University, Weill Cornell Medical College, Walter Reed Army Medical Center, San Diego Naval Medical Center and 12 other sites.

  1. DEEP SPACE: High Resolution VR Platform for Multi-user Interactive Narratives

    Science.gov (United States)

    Kuka, Daniela; Elias, Oliver; Martins, Ronald; Lindinger, Christopher; Pramböck, Andreas; Jalsovec, Andreas; Maresch, Pascal; Hörtner, Horst; Brandl, Peter

    DEEP SPACE is a large-scale platform for interactive, stereoscopic and high resolution content. The spatial and the system design of DEEP SPACE are facing constraints of CAVETM-like systems in respect to multi-user interactive storytelling. To be used as research platform and as public exhibition space for many people, DEEP SPACE is capable to process interactive, stereoscopic applications on two projection walls with a size of 16 by 9 meters and a resolution of four times 1080p (4K) each. The processed applications are ranging from Virtual Reality (VR)-environments to 3D-movies to computationally intensive 2D-productions. In this paper, we are describing DEEP SPACE as an experimental VR platform for multi-user interactive storytelling. We are focusing on the system design relevant for the platform, including the integration of the Apple iPod Touch technology as VR control, and a special case study that is demonstrating the research efforts in the field of multi-user interactive storytelling. The described case study, entitled "Papyrate's Island", provides a prototypical scenario of how physical drawings may impact on digital narratives. In this special case, DEEP SPACE helps us to explore the hypothesis that drawing, a primordial human creative skill, gives us access to entirely new creative possibilities in the domain of interactive storytelling.

  2. Visual Perspective and Feedback Guidance for VR Free-Throw Training.

    Science.gov (United States)

    Covaci, Alexandra; Olivier, Anne-Hélène; Multon, Franck

    2015-01-01

    Accurate distance perception and natural interactions are mandatory conditions when training precision aiming tasks in VR. However, many factors specific to virtual environments (VEs) lead to differences in the way users execute a motor task in VR versus the real world. To investigate these differences, the authors performed a study on basketball beginners' free-throw performance in VEs under different visual conditions. Although the success rate is not statistically different, some adaptations occurred in the way the users performed the task, depending on the visual conditions. In the third-person perspective visual condition, the release parameters indicate that the users more accurately estimated distance to target. Adding visual guidance information (gradual depth information showing the ideal ball trajectory) also led to more natural motor behavior. The final aim of this study was to develop a reliable basketball free-throw training system in VEs, so the authors compared beginners' performances in VR with experts' models of performance. Their results show that most of the performance variables tended to evolve closer to the experts' performance during the training in the VE.

  3. IMG/VR: a database of cultured and uncultured DNA Viruses and retroviruses.

    Science.gov (United States)

    Paez-Espino, David; Chen, I-Min A; Palaniappan, Krishna; Ratner, Anna; Chu, Ken; Szeto, Ernest; Pillay, Manoj; Huang, Jinghua; Markowitz, Victor M; Nielsen, Torben; Huntemann, Marcel; K Reddy, T B; Pavlopoulos, Georgios A; Sullivan, Matthew B; Campbell, Barbara J; Chen, Feng; McMahon, Katherine; Hallam, Steve J; Denef, Vincent; Cavicchioli, Ricardo; Caffrey, Sean M; Streit, Wolfgang R; Webster, John; Handley, Kim M; Salekdeh, Ghasem H; Tsesmetzis, Nicolas; Setubal, Joao C; Pope, Phillip B; Liu, Wen-Tso; Rivers, Adam R; Ivanova, Natalia N; Kyrpides, Nikos C

    2017-01-04

    Viruses represent the most abundant life forms on the planet. Recent experimental and computational improvements have led to a dramatic increase in the number of viral genome sequences identified primarily from metagenomic samples. As a result of the expanding catalog of metagenomic viral sequences, there exists a need for a comprehensive computational platform integrating all these sequences with associated metadata and analytical tools. Here we present IMG/VR (https://img.jgi.doe.gov/vr/), the largest publicly available database of 3908 isolate reference DNA viruses with 264 413 computationally identified viral contigs from >6000 ecologically diverse metagenomic samples. Approximately half of the viral contigs are grouped into genetically distinct quasi-species clusters. Microbial hosts are predicted for 20 000 viral sequences, revealing nine microbial phyla previously unreported to be infected by viruses. Viral sequences can be queried using a variety of associated metadata, including habitat type and geographic location of the samples, or taxonomic classification according to hallmark viral genes. IMG/VR has a user-friendly interface that allows users to interrogate all integrated data and interact by comparing with external sequences, thus serving as an essential resource in the viral genomics community. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  4. Semantic Entity-Component State Management Techniques to Enhance Software Quality for Multimodal VR-Systems.

    Science.gov (United States)

    Fischbach, Martin; Wiebusch, Dennis; Latoschik, Marc Erich

    2017-04-01

    Modularity, modifiability, reusability, and API usability are important software qualities that determine the maintainability of software architectures. Virtual, Augmented, and Mixed Reality (VR, AR, MR) systems, modern computer games, as well as interactive human-robot systems often include various dedicated input-, output-, and processing subsystems. These subsystems collectively maintain a real-time simulation of a coherent application state. The resulting interdependencies between individual state representations, mutual state access, overall synchronization, and flow of control implies a conceptual close coupling whereas software quality asks for a decoupling to develop maintainable solutions. This article presents five semantics-based software techniques that address this contradiction: Semantic grounding, code from semantics, grounded actions, semantic queries, and decoupling by semantics. These techniques are applied to extend the well-established entity-component-system (ECS) pattern to overcome some of this pattern's deficits with respect to the implied state access. A walk-through of central implementation aspects of a multimodal (speech and gesture) VR-interface is used to highlight the techniques' benefits. This use-case is chosen as a prototypical example of complex architectures with multiple interacting subsystems found in many VR, AR and MR architectures. Finally, implementation hints are given, lessons learned regarding maintainability pointed-out, and performance implications discussed.

  5. Design of a Physiology-Sensitive VR-Based Social Communication Platform for Children With Autism.

    Science.gov (United States)

    Kuriakose, Selvia; Lahiri, Uttama

    2017-08-01

    Individuals with autism are often characterized by impairments in communication, reciprocal social interaction and explicit expression of their affective states. In conventional techniques, a therapist adjusts the intervention paradigm by monitoring the affective state e.g., anxiety of these individuals for effective floor-time-therapy. Conventional techniques, though powerful, are observation-based and face resource limitations. Technology-assisted systems can provide a quantitative, individualized rehabilitation platform. Presently-available systems are designed primarily to chain learning via aspects of one's performance alone restricting individualization. Specifically, these systems are not sensitive to one's anxiety. Our presented work seeks to bridge this gap by developing a novel VR-based interactive system with Anxiety-Sensitive adaptive technology. Specifically, such a system is capable of objectively identifying and quantifying one's anxiety level from real-time biomarkers, along with performance metrics. In turn it can adaptively respond in an individualized manner to foster improved social communication skills. In our present research, we have used Virtual Reality (VR) to design a proof-of-concept application that exposes participants to social tasks of varying challenges. Results of a preliminary usability study indicate the potential of our VR-based Anxiety-Sensitive system to foster improved task performance, thereby serving as a potent complementary tool in the hands of therapist.

  6. The Determination Of Reddening From Intrinsic VR Colors Of RR Lyrae Stars

    CERN Document Server

    Kunder, Andrea; Layden, Andrew

    2009-01-01

    New R-band observations of 21 local field RR Lyrae variable stars are used to explore the reliability of minimum light (V-R) colors as a tool for measuring interstellar reddening. For each star, R-band intensity mean magnitudes and light amplitudes are presented. Corresponding V-band light curves from the literature are supplemented with the new photometry, and (V-R) colors at minimum light are determined for a subset of these stars as well as for other stars in the literature. Two different definitions of minimum light color are examined, one which uses a Fourier decomposition to the V and R light curves to find (V-R) at minimum V-band light, (V-R)_{min}^F, and the other which uses the average color between the phase interval 0.5-0.8, (V-R)_{min}^{\\phi(0.5-0.8)}. From 31 stars with a wide range of metallicities and pulsation periods, the mean dereddened RR Lyrae color at minimum light is (V-R)_{min,0}^F = 0.28 pm 0.02 mag and (V-R)_{min,0}^{\\phi(0.5-0.8)} = 0.27 pm 0.02 mag. As was found by Guldenschuh et al...

  7. Comparing acoustic and perceptual voice parameters in female teachers based on voice complaints

    Directory of Open Access Journals (Sweden)

    Maryam Faghani Abukeili

    2014-04-01

    Full Text Available Background and Aim: Teachers are a large group of professional voice users that several risk factors and voice demands causes various voice complaints among them. As the voice is multidimensional, the aim of this study was acoustic and perceptual measurement of teachers’ voice and comparing the findings between two groups with many and few voice complaints.Methods: Sixty female teachers of high school in Sari, north of Iran, were chosen by available sampling to participate in this cross-sectional study. According to a voice complaints questionnaire, 21 subjects located in few voice complaints and 31 in many voice complaints group. After a working day, subjects completed a voice self-assessment questionnaire. Also, teachers’voice were recorded during three tasks including sustained vowels /a/ and /i/, text reading and conversational speech. Acoustic parameters were analyzed by Praat software and 2 speech-language pathalogists performed auditory-perceptual assessment by GRBAS ( Grade, Roughness, Breathiness, Asthenia, Strain scale. Results: Comparing of the voice self-assessment between the two groups demonstrated statistically significant difference (p<0.05; however results of the acoustic and auditory-perceptual measurement did not show significant diffrence.Conclusion: Despite prevalent voice problems in teachers, there are various conditions in terms of complaints and assessments methods. In this study, only a remarkable deviation documented in the client-based assessments in many voice compliants group in comparison with few voice compliants, which would be probably related to different individual’s perception of voice problem between two groups. These results support paying attention to self-assessments in clinical process of voice problems.

  8. The Role of Occupational Voice Demand and Patient-Rated Impairment in Predicting Voice Therapy Adherence.

    Science.gov (United States)

    Ebersole, Barbara; Soni, Resha S; Moran, Kathleen; Lango, Miriam; Devarajan, Karthik; Jamal, Nausheen

    2017-07-11

    Examine the relationship among the severity of patient-perceived voice impairment, perceptual dysphonia severity, occupational voice demand, and voice therapy adherence. Identify clinical predictors of increased risk for therapy nonadherence. A retrospective cohort study of patients presenting with a chief complaint of persistent dysphonia at an interdisciplinary voice center was done. The Voice Handicap Index-10 (VHI-10) and the Voice-Related Quality of Life (V-RQOL) survey scores, clinician rating of dysphonia severity using the Grade score from the Grade, Roughness Breathiness, Asthenia, and Strain scale, occupational voice demand, and patient demographics were tested for associations with therapy adherence, defined as completion of the treatment plan. Classification and Regression Tree (CART) analysis was performed to establish thresholds for nonadherence risk. Of 166 patients evaluated, 111 were recommended for voice therapy. The therapy nonadherence rate was 56%. Occupational voice demand category, VHI-10, and V-RQOL scores were the only factors significantly correlated with therapy adherence (P occupational voice demand are significantly more likely to be nonadherent with therapy than those with high occupational voice demand (P 40 is a significant cutoff point for predicting therapy nonadherence (P Occupational voice demand and patient perception of impairment are significantly and independently correlated with therapy adherence. A VHI-10 score of ≤9 or a V-RQOL score of >40 is a significant cutoff point for predicting nonadherence risk. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  9. Facial Recognition

    Directory of Open Access Journals (Sweden)

    Mihalache Sergiu

    2014-05-01

    Full Text Available During their lifetime, people learn to recognize thousands of faces that they interact with. Face perception refers to an individual's understanding and interpretation of the face, particularly the human face, especially in relation to the associated information processing in the brain. The proportions and expressions of the human face are important to identify origin, emotional tendencies, health qualities, and some social information. From birth, faces are important in the individual's social interaction. Face perceptions are very complex as the recognition of facial expressions involves extensive and diverse areas in the brain. Our main goal is to put emphasis on presenting human faces specialized studies, and also to highlight the importance of attractiviness in their retention. We will see that there are many factors that influence face recognition.

  10. Toward a Next Generation of Widely Accessible Spatial Interfaces: Mobile VR Environments for Patients with Persistent Pain

    OpenAIRE

    Schroeder, David; Korsakov, Fedor; Jolton, Joseph; Keefe, Francis J.; Haley, Alex; Keefe, Daniel F.

    2013-01-01

    We present a series of multi-modal spatial interfaces and virtual environments that can be implemented with widely accessible virtual reality (VR) technologies. The results demonstrate and evaluate the new degree to which rich virtual experiences involving motion sensing, physiological inputs, stereoscopic imagery, sound, and haptic feedback can now be created using low-cost (e.g., mobile phone based) VR environments. Adapting spatial interfaces to these new platforms can open up exciting new...

  11. VOICE ACTIVATED MULTIPROCESSOR EMBEDDED SYSTEM TO IMPROVE THE CONTROL OF A MOTORIZED WHEELCHAIR

    Directory of Open Access Journals (Sweden)

    SANGMESHWAR S. KENDRE,

    2010-11-01

    Full Text Available The main idea of this work is to process analog voice signal. The theme is implemented for controlling the wheelchair by voice through speech processing using Hawkboard (OMAP processor. The adopted model is based on grouping a ARM and a DSP processor for speech enhancement with a voice recognition module for isolated word and speaker dependent. The Texas Instruments OMAP-L138 is integrated in order to enhance the quality of speech signal by reducing noise and connected with the wheelchair for processing of the voice signal. The Hawkboard denoises speech signal and HMC2007 recognizes the commnads. It also generates different desired signals according to the spoken words which further used to control the movement of wheelchair, a vector of information on the context given by a set of sensors for security actions. Six words are recognized which are start, forward, reverse, left, right, stop. In order to gain in time design, experiments have shown that the best way is to choose a speech recognition kit and to adapt it to the application. The result at the end show the efficiency of the system.

  12. Design and implementation of a user-oriented speech recognition interface: the synergy of technology and human factors

    NARCIS (Netherlands)

    Kloosterman, Sietse H.

    1994-01-01

    The design and implementation of a user-oriented speech recognition interface are described. The interface enables the use of speech recognition in so-called interactive voice response systems which can be accessed via a telephone connection. In the design of the interface a synergy of technology

  13. DESIGN AND IMPLEMENTATION OF A USER-ORIENTED SPEECH RECOGNITION INTERFACE - THE SYNERGY OF TECHNOLOGY AND HUMAN-FACTORS

    NARCIS (Netherlands)

    KLOOSTERMAN, SH

    The design and implementation of a user-oriented speech recognition interface are described. The interface enables the use of speech recognition in so-called interactive voice response systems which can be accessed via a telephone connection. In the design of the interface a synergy of technology

  14. Development and Feasibility of a Virtual Reality Task for the Cognitive Assessment of Older Adults: The ECO-VR.

    Science.gov (United States)

    Oliveira, Camila R; Lopes Filho, Brandel José P; Sugarman, Michael A; Esteves, Cristiane S; Lima, Margarida Maria B M P; Moret-Tatay, Carmen; Irigaray, Tatiana Q; Argimon, Irani Iracema L

    2016-12-13

    Cognitive assessment with virtual reality (VR) may have superior ecological validity for older adults compared to traditional pencil-and-paper cognitive assessment. However, few studies have reported the development of VR tasks. The aim of this study was to present the development, feasibility, content validity, and preliminary evidence of construct validity of an ecological task of cognitive assessment for older adults in VR (ECO-VR). The tasks were prepared based on theoretical and clinical backgrounds. We had 29 non-expert judges identify virtual visual stimuli and three-dimensional scenarios, and five expert judges assisted with content analysis and developing instructions. Finally, six older persons participated in three pilot studies and thirty older persons participated in the preliminary study to identify construct validity evidence. Data were analyzed by descriptive statistics and partial correlation. Target stimuli and three-dimensional scenarios were judged adequate and the content analysis demonstrated that ECO-VR evaluates temporo-spatial orientation, memory, language and executive functioning. We made significant changes to the instructions after the pilot studies to increase comprehensibility and reduce the completion time. The total score of ECO-VR was positively correlated mainly with performance in executive function (r = .172, p cognitive assessment in older adults, as well as content and construct validity evidences.

  15. CloVR: A virtual machine for automated and portable sequence analysis from the desktop using cloud computing

    Directory of Open Access Journals (Sweden)

    Arze Cesar

    2011-08-01

    Full Text Available Abstract Background Next-generation sequencing technologies have decentralized sequence acquisition, increasing the demand for new bioinformatics tools that are easy to use, portable across multiple platforms, and scalable for high-throughput applications. Cloud computing platforms provide on-demand access to computing infrastructure over the Internet and can be used in combination with custom built virtual machines to distribute pre-packaged with pre-configured software. Results We describe the Cloud Virtual Resource, CloVR, a new desktop application for push-button automated sequence analysis that can utilize cloud computing resources. CloVR is implemented as a single portable virtual machine (VM that provides several automated analysis pipelines for microbial genomics, including 16S, whole genome and metagenome sequence analysis. The CloVR VM runs on a personal computer, utilizes local computer resources and requires minimal installation, addressing key challenges in deploying bioinformatics workflows. In addition CloVR supports use of remote cloud computing resources to improve performance for large-scale sequence processing. In a case study, we demonstrate the use of CloVR to automatically process next-generation sequencing data on multiple cloud computing platforms. Conclusion The CloVR VM and associated architecture lowers the barrier of entry for utilizing complex analysis protocols on both local single- and multi-core computers and cloud systems for high throughput data processing.

  16. Application of MPVR and TL-VR with 64-row MDCT in neonates with congenital EA and distal TEF.

    Science.gov (United States)

    Wen, Yang; Peng, Yun; Zhai, Ren-You; Li, Ying-Zi

    2011-03-28

    To assess the application of multiple planar volume reconstruction (MPVR) and three-dimensional (3D) transparency lung volume rendering (TL-VR) with 64-row multidetector-row computed tomography (MDCT) in neonates with congenital esophageal atresia (EA) and distal tracheoesophageal fistula (TEF). Twenty neonates (17 boys, 3 girls) with EA and distal TEF at a mean age of 4.6 d (range 1-16 d) were enrolled in this study. A helical scan of 64-row MDCT was performed at the 64 mm × 0.625 mm collimation. EA and TEF were reconstructed with MPVR and TL-VR, respectively. Initial diagnosis of EA was made by chest radiography showing the inserted catheter in the proximal blind-ended esophageal pouch. Manifestations of MDCT images were compared with the findings at surgery. MDCT showed the proximal and distal esophageal pouches in 20 cases. No significant difference was observed in gaps between the proximal and distal esophageal pouches detected by MPVR and TL-VR. The lengths of gaps between the proximal and distal esophageal pouches detected by MPVR and TL-VR correlated well with the findings at surgery (R = 0.87, P VR images showed the orifice of TEF in 4 cases. EA and distal TEF can be reconstructed using MPVR and TL-VR of 64-row MDCT, which is a noninvasive technique to demonstrate the distal esophageal pouches and inter-pouch distance in neonates with EA and distal TEF.

  17. CloVR: a virtual machine for automated and portable sequence analysis from the desktop using cloud computing.

    Science.gov (United States)

    Angiuoli, Samuel V; Matalka, Malcolm; Gussman, Aaron; Galens, Kevin; Vangala, Mahesh; Riley, David R; Arze, Cesar; White, James R; White, Owen; Fricke, W Florian

    2011-08-30

    Next-generation sequencing technologies have decentralized sequence acquisition, increasing the demand for new bioinformatics tools that are easy to use, portable across multiple platforms, and scalable for high-throughput applications. Cloud computing platforms provide on-demand access to computing infrastructure over the Internet and can be used in combination with custom built virtual machines to distribute pre-packaged with pre-configured software. We describe the Cloud Virtual Resource, CloVR, a new desktop application for push-button automated sequence analysis that can utilize cloud computing resources. CloVR is implemented as a single portable virtual machine (VM) that provides several automated analysis pipelines for microbial genomics, including 16S, whole genome and metagenome sequence analysis. The CloVR VM runs on a personal computer, utilizes local computer resources and requires minimal installation, addressing key challenges in deploying bioinformatics workflows. In addition CloVR supports use of remote cloud computing resources to improve performance for large-scale sequence processing. In a case study, we demonstrate the use of CloVR to automatically process next-generation sequencing data on multiple cloud computing platforms. The CloVR VM and associated architecture lowers the barrier of entry for utilizing complex analysis protocols on both local single- and multi-core computers and cloud systems for high throughput data processing.

  18. Penguins use the two-voice system to recognize each other.

    Science.gov (United States)

    Aubin, T; Jouventin, P; Hildebrand, C

    2000-06-07

    The sound-producing structure in birds is the syrinx, which is usually a two-part organ located at the junction of the bronchi. As each branch of the syrinx produces sound independently, many birds have two acoustic sources. Thirty years ago, we had anatomical, physiological and acoustical evidence of this two-voice phenomenon but no function was known. In songbirds, often these two voices with their respective harmonics are not activated simultaneously but they are obvious in large penguins and generate a beat pattern which varies between individuals. The emperor penguin breeds during the Antarctic winter, incubating and carrying its egg on its feet. Without the topographical cue of a nest, birds identify each other only by vocal means when switching duties during incubation or chick rearing. To test whether the two-voice system contains the identity code, we played back the modified call of their mate to both adults and also the modified call of their parents to chicks. Both the adults and the chicks replied to controls (two voices) but not to modified signals (one voice being experimentally suppressed). Our experiments demonstrate that the beat generated by the interaction of these two fundamental frequencies conveys information about individual identity and also propagates well through obstacles, being robust to sound degradation through the medium of bodies in a penguin colony. The two-voice structure is also clear in the call of other birds such as the king penguin, another non-nesting species, but not in the 14 other nesting penguins. We concluded that the two-voice phenomenon functions as an individual recognition system in species using few if any landmarks to meet. In penguins, this coding process, increasing the call complexity and resisting sound degradation, has evolved in parallel with the loss of territoriality.

  19. Classifying human voices by using hybrid SFX time-series preprocessing and ensemble feature selection.

    Science.gov (United States)

    Fong, Simon; Lan, Kun; Wong, Raymond

    2013-01-01

    Voice biometrics is one kind of physiological characteristics whose voice is different for each individual person. Due to this uniqueness, voice classification has found useful applications in classifying speakers' gender, mother tongue or ethnicity (accent), emotion states, identity verification, verbal command control, and so forth. In this paper, we adopt a new preprocessing method named Statistical Feature Extraction (SFX) for extracting important features in training a classification model, based on piecewise transformation treating an audio waveform as a time-series. Using SFX we can faithfully remodel statistical characteristics of the time-series; together with spectral analysis, a substantial amount of features are extracted in combination. An ensemble is utilized in selecting only the influential features to be used in classification model induction. We focus on the comparison of effects of various popular data mining algorithms on multiple datasets. Our experiment consists of classification tests over four typical categories of human voice data, namely, Female and Male, Emotional Speech, Speaker Identification, and Language Recognition. The experiments yield encouraging results supporting the fact that heuristically choosing significant features from both time and frequency domains indeed produces better performance in voice classification than traditional signal processing techniques alone, like wavelets and LPC-to-CC.

  20. Familiar Speaker Recognition

    Science.gov (United States)

    2012-05-01

    linguistic information (such as a laughter or stutter ) may allow the listener to quickly identify the speaker. For speakers that are less familiar to...EXPERIMENTAL RESULTS Figure 2 shows the results for Sessions 1-6. Remember that there are 25 voices and for every session, each voice is presented twice...Thus, a 90% correct means that out of 50 voice presentations , the listener identified the correct speaker 45 times. It was always a forced choice

  1. Perturbation Measures of Voice: A Comparative Study between Multi-Dimensional Voice Program and Praat

    National Research Council Canada - National Science Library

    Maryn, Youri; Corthals, Paul; De Bodt, Marc; Van Cauwenberge, Paul; Deliyski, Dimitar

    2009-01-01

    .... In the present study, perturbation measures provided by two computer systems (a purpose-built professional voice analysis apparatus and a personal computer-based system for acoustic voice assessment...

  2. [The smokers voice self assessment based on Voice Handicap Index (VHI)].

    Science.gov (United States)

    Wiskirska-Woźnica, Bozena; Wojnowski, Waldemar

    2009-01-01

    Complex voice assessment due to European Laryngeal Society proposals (2000) contains voice self estimation based on the Polish version of the Voice Handicap Index (VHI). This study focuses on the relation between voice handicap and smoking in dysphonic patients, who are using voice professionally. Thirty outpatient (25 female and 5 male, aged 40 to 55 years) voice department attendees suffering from professional dysphonia took part in this study. All patients after phoniatric examination completed the Polish version of the Voice Handicap Index (VHI). The questions concern functional, emotional and physical complains due to dysphonia. Most of smokers did not complain of dysphonia related problems comparing to non smokers. Even the scores of functional and emotional scales of VHI in smokers shown better results (less handicap) than in nonsmokers. Smoking does not affect patients handicap due to dysphonia measured in the Voice Handicap Index.

  3. Qos and Voice Over IP

    Directory of Open Access Journals (Sweden)

    Adrian GHENCEA

    Full Text Available As Voice over Internet Protocol (VoIP technology matures, companies are increasingly adopting it to cut costs, improve efficiency and enhance customer service. Using the Internet as an existing network for integrating data and telecom systems through intelligent VoIP, a range of benefits results: lower long distance costs, cost cuts in cabling processes and more flexible telephony management. However, as voice over IP services grow in popularity, major threats arise: this rapid growth leads to traffic congestion, security is jeopardizedand the poor quality of calls affects communication. The objective of this article is to present all the elements that can affect voicequality in a VoIP network and to provide methods for solving them. A detailed analysis to minimize the impact of implementation of QoS will be made, and at the end solutions to management strategies will be proposed.

  4. Effect of Religiosity on Voice.

    Science.gov (United States)

    Hamdan, Abdul-Latif; Khneisser, Gebran; Dowli, Alex; Ziade, Georges; Tamim, Hani

    2017-04-01

    The aim of this study was to examine the relationship between religiosity and phonatory behavior. A total of 186 participants participated in a survey that included four sections: demographic data, extent of religiosity, history of dysphonia, phonatory behavior and laryngeal manipulation, in addition to the Voice Handicap Index (VHI-10). There was no significant association between the prevalence of phonatory symptoms and any of the religiosity questions. There was no significant association between phonatory behavior, history of laryngeal manipulation and any of the religiosity questions. There was also no significant association between the score of the Voice Handicap Index and any of the five religiosity questions. There is no association between religiosity and prevalence of phonatory disturbances, phonotraumatic behavior and/or history of laryngeal manipulation.

  5. Feature based sliding window technique for face recognition

    Science.gov (United States)

    Javed, Muhammad Younus; Mohsin, Syed Maajid; Anjum, Muhammad Almas

    2010-02-01

    Human beings are commonly identified by biometric schemes which are concerned with identifying individuals by their unique physical characteristics. The use of passwords and personal identification numbers for detecting humans are being used for years now. Disadvantages of these schemes are that someone else may use them or can easily be forgotten. Keeping in view of these problems, biometrics approaches such as face recognition, fingerprint, iris/retina and voice recognition have been developed which provide a far better solution when identifying individuals. A number of methods have been developed for face recognition. This paper illustrates employment of Gabor filters for extracting facial features by constructing a sliding window frame. Classification is done by assigning class label to the unknown image that has maximum features similar to the image stored in the database of that class. The proposed system gives a recognition rate of 96% which is better than many of the similar techniques being used for face recognition.

  6. Multimodal approaches for emotion recognition: a survey

    Science.gov (United States)

    Sebe, Nicu; Cohen, Ira; Gevers, Theo; Huang, Thomas S.

    2005-01-01

    Recent technological advances have enabled human users to interact with computers in ways previously unimaginable. Beyond the confines of the keyboard and mouse, new modalities for human-computer interaction such as voice, gesture, and force-feedback are emerging. Despite important advances, one necessary ingredient for natural interaction is still missing-emotions. Emotions play an important role in human-to-human communication and interaction, allowing people to express themselves beyond the verbal domain. The ability to understand human emotions is desirable for the computer in several applications. This paper explores new ways of human-computer interaction that enable the computer to be more aware of the user's emotional and attentional expressions. We present the basic research in the field and the recent advances into the emotion recognition from facial, voice, and physiological signals, where the different modalities are treated independently. We then describe the challenging problem of multimodal emotion recognition and we advocate the use of probabilistic graphical models when fusing the different modalities. We also discuss the difficult issues of obtaining reliable affective data, obtaining ground truth for emotion recognition, and the use of unlabeled data.

  7. Voice disorders in mucosal leishmaniasis.

    Directory of Open Access Journals (Sweden)

    Ana Cristina Nunes Ruas

    Full Text Available INTRODUCTION: Leishmaniasis is considered as one of the six most important infectious diseases because of its high detection coefficient and ability to produce deformities. In most cases, mucosal leishmaniasis (ML occurs as a consequence of cutaneous leishmaniasis. If left untreated, mucosal lesions can leave sequelae, interfering in the swallowing, breathing, voice and speech processes and requiring rehabilitation. OBJECTIVE: To describe the anatomical characteristics and voice quality of ML patients. MATERIALS AND METHODS: A descriptive transversal study was conducted in a cohort of ML patients treated at the Laboratory for Leishmaniasis Surveillance of the Evandro Chagas National Institute of Infectious Diseases-Fiocruz, between 2010 and 2013. The patients were submitted to otorhinolaryngologic clinical examination by endoscopy of the upper airways and digestive tract and to speech-language assessment through directed anamnesis, auditory perception, phonation times and vocal acoustic analysis. The variables of interest were epidemiologic (sex and age and clinic (lesion location, associated symptoms and voice quality. RESULTS: 26 patients under ML treatment and monitored by speech therapists were studied. 21 (81% were male and five (19% female, with ages ranging from 15 to 78 years (54.5+15.0 years. The lesions were distributed in the following structures 88.5% nasal, 38.5% oral, 34.6% pharyngeal and 19.2% laryngeal, with some patients presenting lesions in more than one anatomic site. The main complaint was nasal obstruction (73.1%, followed by dysphonia (38.5%, odynophagia (30.8% and dysphagia (26.9%. 23 patients (84.6% presented voice quality perturbations. Dysphonia was significantly associated to lesions in the larynx, pharynx and oral cavity. CONCLUSION: We observed that vocal quality perturbations are frequent in patients with mucosal leishmaniasis, even without laryngeal lesions; they are probably associated to disorders of some

  8. English Spoken Language & Voice Quality

    OpenAIRE

    Folsberg, Jens; Nielsen, Charlotte; Brusokaite, Giedre; Beinkamp, Line; Bach Jensen, Niels; Aalbæk Jensen, Sandra

    2013-01-01

    This project investigates the way language and accents are depicted in animated features and how linguistic stereotypes can be used in the process of character construction. In order to look into that, examples from four movies, produced by two studios, have been selected; the American Disney studios being represented by The Lion King (1994) and Up (2009), and the Japanese Studio Ghibli being represented by Ponyo (2008) and Howl's Moving Castle (2004). Voice qualities and specific accents ...

  9. Tracheostomy cannulas and voice prosthesis

    OpenAIRE

    2011-01-01

    Cannulas and voice prostheses are mechanical aids for patients who had to undergo tracheotomy or laryngectomy for different reasons. For better understanding of the function of those artificial devices, first the indications and particularities of the previous surgical intervention are described in the context of this review. Despite the established procedure of percutaneous dilatation tracheotomy e.g. in intensive care units, the application of epithelised tracheostomas has its own position,...

  10. The voice of the voiceless

    Directory of Open Access Journals (Sweden)

    Carlos Morales

    2014-12-01

    Full Text Available Women’s voice, these are stories of a group of women and their partners who lived part of a particular historical period agriculture during the twentieth century, the hacienda. The present work attempts to reflect their everyday life, reinterpreted from nowadays as a reflection of collective memory, in the big farms where they grew by putting in scene their roles in juxtaposition to the roles of male tradition.

  11. Voices in (and around the Museum: Introduction

    Directory of Open Access Journals (Sweden)

    Ruth Holt

    2012-08-01

    Full Text Available The voice already plays an important role in contemporary art. This introductory paper summarises a series of four sessions in which speakers explored the place of the voice in the museum context. It became clear that the voice not only offered richness in interpretation of and response to other museum artefacts but was itself an artefact meriting conservation  and interpretation.

  12. Speak Up! But don't strain your voice

    Science.gov (United States)

    ... Disorders Speak Up! But don't strain your voice Past Issues / Fall 2008 Table of Contents For ... Javascript on. A clinical trial at the NIDCD Voice Center gave Sherdina Jones tools to limit voice ...

  13. Speaking up in groups: a cross-level study of group voice climate and voice.

    Science.gov (United States)

    Morrison, Elizabeth Wolfe; Wheeler-Smith, Sara L; Kamdar, Dishan

    2011-01-01

    Despite a growing body of research on employee voice—defined as the discretionary communication of ideas, suggestions, or opinions intended to improve organizational or unit functioning—the effects of shared or collective-level cognitions have received scant attention. There has also been relatively little research on voice within work groups. Our goal in this study was to address these important gaps by focusing on the effects of group-level beliefs about voice (i.e., group voice climate) on individual voice behavior within work groups. We conducted a cross-level investigation of voice behavior within 42 groups of engineers from a large chemical company. Consistent with our hypotheses, group voice climate was highly predictive of voice and explained variance beyond the effects of individual-level identification and satisfaction, and procedural justice climate. Also consistent with predictions, the effect of identification on voice was stronger in groups with favorable voice climates. These findings provide evidence that voice is shaped not just by individual attitudes and perceptions of the work context, as past research has shown, but also by group-level beliefs. The results also highlight the importance of broadening our conceptual models of voice to include shared cognitions and of conducting additional cross-level research on voice.

  14. The electrolarynx: voice restoration after total laryngectomy.

    Science.gov (United States)

    Kaye, Rachel; Tang, Christopher G; Sinclair, Catherine F

    2017-01-01

    The ability to speak and communicate with one's voice is a unique human characteristic and is fundamental to many activities of daily living, such as talking on the phone and speaking to loved ones. When the larynx is removed during a total laryngectomy (TL), loss of voice can lead to a devastating decrease in a patient's quality of life, and precipitate significant frustration over their inability to communicate with others effectively. Over the past 50 years there have been many advances in techniques of voice restoration after TL. Currently, there are three main methods of voice restoration: the electrolarynx, esophageal speech, and tracheoesophageal speech through a tracheoesophageal puncture (TEP) with voice prosthesis. Although TEP voice is the current gold standard for vocal rehabilitation, a significant minority of patients cannot use or obtain TEP speech for various reasons. As such, the electrolarynx is a viable and useful alternative for these patients. This article will focus on voice restoration using an electrolarynx with the following objectives: 1) To provide an understanding of the importance of voice restoration after total laryngectomy. 2) To discuss how the electrolarynx may be used to restore voice following total laryngectomy. 3) To outline some of the current electrolarynx devices available, including their mechanism of action and limitations. 4) To compare pros and cons of electrolaryngeal speech to TEP and esophageal speech.

  15. The Performing Voice of the Audiobook

    DEFF Research Database (Denmark)

    Pedersen, Birgitte Stougaard; Have, Iben

    2014-01-01

    will be based on a reception aesthetic and phenomenological approach, the latter as presented by Don Ihde in his book Listening and Voice. Phenomenologies of Sound , and my analytical sketches will be related to theoretical statements concerning the understanding of voice and media (Cavarero, Dolar, La......Belle, Neumark). Finally, the article will discuss the specific artistic combination and our auditory experience of mediated human voices and sculpturally projected faces in an art museum context under the general conditions of the societal panophonia of disembodied and mediated voices, as promoted by Steven...

  16. Methods of Translating the English passive voice

    Institute of Scientific and Technical Information of China (English)

    张李丽

    2009-01-01

    @@ 一、Comparisons between Voices in English and Chinese In English as in many other languages,the passive voice is the form of a transitive verb whose grammatical sabjOct serves as the patient,receiving the action of the verb.The passive voice is typically contrasted with the active voice,which is the form of a transitive verb whose subject serves as the agent,performing the action of the verb.The subject of a verb in the passive voice corresponds to the object of the same verb in the active voice.English's passive voice is periphrastic;that is,it does not have a one-word form.Rather,it is formed using a form of the auxiliary verb be together with a verb's past participle.The passive voice is widely used in English when it is unnecessary,undesirable,or impossible to ilame the agent of an action,or when the passive voice is needed to link the text better.

  17. Prevalence of voice disorders among future teachers.

    Science.gov (United States)

    Simberg, S; Laine, A; Sala, E; Rönnemaa, A M

    2000-06-01

    An epidemiological study was conducted in order to find out the prevalence of voice disorders among students studying to be teachers. Vocal symptoms were inquired of 226 students. Their voices were assessed perceptually by a speech therapist and those who had abnormal voice quality or reported several vocal symptoms were referred to a clinical examination by a laryngologist. The results showed that 20% of this population reported two or more vocal symptoms during the previous year and that 19% had an organic voice disorder. This reinforces the need for clinical evaluation of students with vocal symptoms and more vocal training in the teacher education programs.

  18. V2S: Voice to Sign Language Translation System for Malaysian Deaf People

    Science.gov (United States)

    Mean Foong, Oi; Low, Tang Jung; La, Wai Wan

    The process of learning and understand the sign language may be cumbersome to some, and therefore, this paper proposes a solution to this problem by providing a voice (English Language) to sign language translation system using Speech and Image processing technique. Speech processing which includes Speech Recognition is the study of recognizing the words being spoken, regardless of whom the speaker is. This project uses template-based recognition as the main approach in which the V2S system first needs to be trained with speech pattern based on some generic spectral parameter set. These spectral parameter set will then be stored as template in a database. The system will perform the recognition process through matching the parameter set of the input speech with the stored templates to finally display the sign language in video format. Empirical results show that the system has 80.3% recognition rate.

  19. The voice conveys emotion in ten globalized cultures and one remote village in Bhutan.

    Science.gov (United States)

    Cordaro, Daniel T; Keltner, Dacher; Tshering, Sumjay; Wangchuk, Dorji; Flynn, Lisa M

    2016-02-01

    With data from 10 different globalized cultures and 1 remote, isolated village in Bhutan, we examined universals and cultural variations in the recognition of 16 nonverbal emotional vocalizations. College students in 10 nations (Study 1) and villagers in remote Bhutan (Study 2) were asked to match emotional vocalizations to 1-sentence stories of the same valence. Guided by previous conceptualizations of recognition accuracy, across both studies, 7 of the 16 vocal burst stimuli were found to have strong or very strong recognition in all 11 cultures, 6 vocal bursts were found to have moderate recognition, and 4 were not universally recognized. All vocal burst stimuli varied significantly in terms of the degree to which they were recognized across the 11 cultures. Our discussion focuses on the implications of these results for current debates concerning the emotion conveyed in the voice.

  20. Influence of classroom acoustics on the voice levels of teachers with and without voice problems: a field study

    DEFF Research Database (Denmark)

    Pelegrin Garcia, David; Lyberg-Åhlander, Viveka; Rydell, Roland

    2010-01-01

    Many teachers suffer from voice problems and classroom acoustics has been considered as one of the potential hazards for this. The present study examines how classroom acoustics interacts with the voices of 14 teachers without voice problems and 13 teachers with voice problems. The assessment...... of Reverberation Time and Voice Support were measured in the 30 empty classrooms of the study. An empirical model shows that the measured voice levels depended on the activity noise levels and the voice support. Teachers with and without voice problems were differently affected by the voice support...... of the classroom. The results thus suggest that teachers with voice problems are more aware of classroom acoustic conditions than their healthy colleagues and make use of the more supportive rooms to lower their voice levels. This behavior may result from an adaptation process of the teachers with voice problems...

  1. The recognition of work

    OpenAIRE

    Nierling, Linda

    2007-01-01

    The following article argues that recognition structures in work relations differ significantly in the sphere of paid work in contrast to unpaid work in private spheres. According to the systematic approach on recognition of Axel Honneth three different levels of recognition are identified: the interpersonal recognition, organisational recognition and societal recognition. Based on this framework it can be stated that recognition structures in the sphere of paid work and in private spheres di...

  2. Prosody recognition in adults with high-functioning autism spectrum disorders: from psychoacoustics to cognition.

    Science.gov (United States)

    Globerson, Eitan; Amir, Noam; Kishon-Rabin, Liat; Golan, Ofer

    2015-04-01

    Prosody is an important tool of human communication, carrying both affective and pragmatic messages in speech. Prosody recognition relies on processing of acoustic cues, such as the fundamental frequency of the voice signal, and their interpretation according to acquired socioemotional scripts. Individuals with autism spectrum disorders (ASD) show deficiencies in affective prosody recognition. These deficiencies have been mostly associated with general difficulties in emotion recognition. The current study explored an additional association between affective prosody recognition in ASD and auditory perceptual abilities. Twenty high-functioning male adults with ASD and 32 typically developing male adults, matched on age and verbal abilities undertook a battery of auditory tasks. These included affective and pragmatic prosody recognition tasks, two psychoacoustic tasks (pitch direction recognition and pitch discrimination), and a facial emotion recognition task, representing nonvocal emotion recognition. Compared with controls, the ASD group demonstrated poorer performance on both vocal and facial emotion recognition, but not on pragmatic prosody recognition or on any of the psychoacoustic tasks. Both groups showed strong associations between psychoacoustic abilities and prosody recognition, both affective and pragmatic, although these were more pronounced in the ASD group. Facial emotion recognition predicted vocal emotion recognition in the ASD group only. These findings suggest that auditory perceptual abilities, alongside general emotion recognition abilities, play a significant role in affective prosody recognition in ASD.

  3. The design of a digital voice data compression technique for orbiter voice channels

    Science.gov (United States)

    1975-01-01

    Voice bandwidth compression techniques were investigated to anticipate link margin difficulties in the shuttle S-band communication system. It was felt that by reducing the data rate on each voice channel from the baseline 24 (or 32) Kbps to 8 Kbps, additional margin could be obtained. The feasibility of such an alternate voice transmission system was studied. Several factors of prime importance that were addressed are: (1) achieving high quality voice at 8 Kbps; (2) performance in the presence of the anticipated shuttle cabin environmental noise; (3) performance in the presence of the anticipated channel error statistics; and (4) minimal increase in size, weight, and power over the current baseline voice processor.

  4. Impact of fetal-neonatal iron deficiency on recognition memory at two months of age

    Science.gov (United States)

    Geng, Fengji; Mai, Xiaoqin; Zhan, Jianying; Xu, Lin; Zhao, Zhengyan; Georgieff, Michael; Shao, Jie; Lozoff, Betsy

    2015-01-01

    Objective To assess the effects of fetal-neonatal iron deficiency on recognition memory in early infancy. Perinatal iron deficiency delays or disrupts hippocampal development in animal models and thus may impair related neural functions in human infants, such as recognition memory. Study design Event-related potentials were used in an auditory recognition memory task to compare 2-month-old Chinese infants with iron sufficiency or deficiency at birth. Fetal- neonatal iron deficiency was defined two ways: high zinc protoporphyrin/heme ratio (ZPP/H > 118 μmol/mol) or low serum ferritin (iron deficiency as defined by high cord ZPP/H but not low ferritin. Comparing 35 infants with iron deficiency (ZPP/H > 118 μmol/mol) to 92 with lower ZPP/H (iron-sufficient), only infants with iron sufficiency showed larger LSW amplitude for stranger’s voice than mother’s voice in frontal-central and parietal-occipital locations, indicating the recognition of mother’s voice. Conclusions Infants with iron sufficiency showed electrophysiological evidence of recognizing their mother’s voice, whereas infants with fetal-neonatal iron deficiency did not. Their poorer auditory recognition memory at two months of age is consistent with effects of fetal-neonatal iron deficiency on the developing hippocampus. PMID:26382625

  5. Using the computer-driven VR environment to promote experiences of natural world immersion

    Science.gov (United States)

    Frank, Lisa A.

    2013-03-01

    In December, 2011, over 800 people experienced the exhibit, :"der"//pattern for a virtual environment, created for the fully immersive CAVETM at the University of Wisconsin-Madison. This exhibition took my nature-based photographic work and reinterpreted it for virtual reality (VR).Varied responses such as: "It's like a moment of joy," or "I had to see it twice," or "I'm still thinking about it weeks later" were common. Although an implied goal of my 2D artwork is to create a connection that makes viewers more aware of what it means to be a part of the natural world, these six VR environments opened up an unexpected area of inquiry that my 2D work has not. Even as the experience was mediated by machines, there was a softening at the interface between technology and human sensibility. Somehow, for some people, through the unlikely auspices of a computer-driven environment, the project spoke to a human essence that they connected with in a way that went beyond all expectations and felt completely out of my hands. Other interesting behaviors were noted: in some scenarios some spoke of intense anxiety, acrophobia, claustrophobia-even fear of death when the scene took them underground. These environments were believable enough to cause extreme responses and disorientation for some people; were fun, pleasant and wonder-filled for most; and were liberating, poetic and meditative for many others. The exhibition seemed to promote imaginative skills, creativity, emotional insight, and environmental sensitivity. It also revealed the CAVETM to be a powerful tool that can encourage uniquely productive experiences. Quite by accident, I watched as these nature-based environments revealed and articulated an essential relationship between the human spirit and the physical world. The CAVETM is certainly not a natural space, but there is clear potential to explore virtual environments as a path to better and deeper connections between people and nature. We've long associated contact

  6. 20 CFR 411.310 - How does an entity other than a State VR agency apply to be an EN and who will determine whether...

    Science.gov (United States)

    2010-04-01

    ... 20 Employees' Benefits 2 2010-04-01 2010-04-01 false How does an entity other than a State VR... Employment Networks § 411.310 How does an entity other than a State VR agency apply to be an EN and who will determine whether an entity qualifies as an EN? (a) An entity other than a State VR agency applies by...

  7. Assessing the Ability of a VR-Based Assembly Task Simulation to Evaluate Physical Risk Factors.

    Science.gov (United States)

    Pontonnier, Charles; Samani, Afshin; Badawi, Marwan; Madeleine, Pascal; Dumont, Georges

    2014-05-01

    Nowadays the process of workstation design tends to include assessment steps in a virtual environment (VE) to evaluate the ergonomic features. These approaches are cost-effective and convenient since working directly on the digital mock-up in a VE is preferable to constructing a real physical mock-up in a real environment (RE). This study aimed at understanding the ability of a VR-based assembly tasks simulator to evaluate physical risk factors in ergonomics. Sixteen subjects performed simplified assembly tasks in RE and VE. Motion of the upper body and five muscle electromyographic activities were recorded to compute normalized and averaged objective indicators of discomfort, that is, rapid upper limb assessment score, averaged muscle activations, and total task time. Rated perceived exertion (RPE) and a questionnaire were used as subjective indicators of discomfort. The timing regime and complexity of the assembly tasks were investigated as within-subject factors. The results revealed significant differences between measured indicators in RE and VE. While objective measures indicated lower activity and exposure in VE, the subjects experienced more discomfort than in RE. Fairly good correlation levels were found between RE and VE for six of the objective indicators. This study clearly demonstrates that ergonomic studies of assembly tasks using VR are still challenging. Indeed, objective and subjective measurements of discomfort that are usually used in ergonomics to minimize the risks of work-related musculoskeletal disorders development exhibit opposite trends in RE and VE. Nevertheless, the high level of correlation found during this study indicates that the VR-based simulator can be used for such assessments.

  8. Muscular tension and body posture in relation to voice handicap and voice quality in teachers with persistent voice complaints.

    Science.gov (United States)

    Kooijman, P G C; de Jong, F I C R S; Oudes, M J; Huinck, W; van Acht, H; Graamans, K

    2005-01-01

    The aim of this study was to investigate the relationship between extrinsic laryngeal muscular hypertonicity and deviant body posture on the one hand and voice handicap and voice quality on the other hand in teachers with persistent voice complaints and a history of voice-related absenteeism. The study group consisted of 25 female teachers. A voice therapist assessed extrinsic laryngeal muscular tension and a physical therapist assessed body posture. The assessed parameters were clustered in categories. The parameters in the different categories represent the same function. Further a tension/posture index was created, which is the summation of the different parameters. The different parameters and the index were related to the Voice Handicap Index (VHI) and the Dysphonia Severity Index (DSI). The scores of the VHI and the individual parameters differ significantly except for the posterior weight bearing and tension of the sternocleidomastoid muscle. There was also a significant difference between the individual parameters and the DSI, except for tension of the cricothyroid muscle and posterior weight bearing. The score of the tension/posture index correlates significantly with both the VHI and the DSI. In a linear regression analysis, the combination of hypertonicity of the sternocleidomastoid, the geniohyoid muscles and posterior weight bearing is the most important predictor for a high voice handicap. The combination of hypertonicity of the geniohyoid muscle, posterior weight bearing, high position of the hyoid bone, hypertonicity of the cricothyroid muscle and anteroposition of the head is the most important predictor for a low DSI score. The results of this study show the higher the score of the index, the higher the score of the voice handicap and the worse the voice quality is. Moreover, the results are indicative for the importance of assessment of muscular tension and body posture in the diagnosis of voice disorders.

  9. The Minnesota pelvic trainer: a hybrid VR/physical pelvis for providing virtual mentorship.

    Science.gov (United States)

    Konchada, Vamsi; Shen, Yunhe; Burke, Dan; Argun, Omer B; Weinhaus, Anthony; Erdman, Arthur G; Sweet, Robert M

    2011-01-01

    Obtaining accurate understanding of three dimensional structures and their relationships is important in learning human anatomy. To leverage the learning advantages of using both physical and virtual models, we built a hybrid platform consisting of virtual and mannequin pelvis, motion tracking interface, anatomy and pathology knowledge base. The virtual mentorship concept is to allow learners to conveniently manipulate and explore the virtual pelvic structures through the mannequin model and VR interface, and practice on anatomy identification tasks and pathology quizzes more intuitively and interactively than in a traditional self-study classroom, and to reduce the demands of access to dissection lab or wet lab.

  10. Asynchronous teaching of psychomotor skills through VR annotations: evaluation in digital rectal examination.

    Science.gov (United States)

    Rissanen, Mikko J; Kume, Naoto; Kuroda, Yoshihiro; Kuroda, Tomohiro; Yoshimura, Koji; Yoshihara, Hiroyuki

    2008-01-01

    Many VR technology based training systems use expert's motion data as the training aid, but would not provide any short-cut to teaching medical skills that do not depend on exact motions. Earlier we presented Annotated Simulation Records (ASRs), which can be used to encapsulate experts' insight on psychomotor skills. Annotations made to behavioural parameters in training simulators enable asynchronous teaching instead of just motion training in a proactive way to the learner. We evaluated ASRs for asynchronous teaching of Digital Rectal Examination (DRE) with 3 urologists and 8 medical students. The ASRs were found more effective than motion-based training with verbal feedback.

  11. Using the Voice to Design Ceramics

    DEFF Research Database (Denmark)

    Hansen, Flemming Tvede; Jensen, Kristoffer

    2011-01-01

    SoundShaping, a system to create ceramics from the human voice. Based on a generic audio feature extraction system, and the principal component analysis to ensure that the pertinent information in the voice is used, a 3D shape is created using simple geometric rules. This shape is output to a 3D printer...

  12. The Voice of Conscience in Rousseau's Emile

    Science.gov (United States)

    Kodelja, Zdenko

    2015-01-01

    According to Rousseau, conscience and conscience alone can elevate human beings to a level above that of animals. It is conscience, understood as infallible judge of good and bad, which makes man like God. Conscience itself is, in this context, understood as divine, as an "immortal and celestial voice". Therefore, if the voice of…

  13. Why Is My Voice Changing? (For Teens)

    Science.gov (United States)

    ... enter puberty earlier or later than others. How Deep Will My Voice Get? How deep a guy's voice gets depends on his genes: ... of Use Notice of Nondiscrimination Visit the Nemours Web site. Note: All information on TeensHealth® is for ...

  14. Gender in voice perception in autism.

    NARCIS (Netherlands)

    Groen, W.B.; Orsouw, L van; Zwiers, M.; Swinkels, S.; Gaag, R.J. van der; Buitelaar, J.K.

    2008-01-01

    Deficits in the perception of social stimuli may contribute to the characteristic impairments in social interaction in high functioning autism (HFA). Although the cortical processing of voice is abnormal in HFA, it is unclear whether this gives rise to impairments in the perception of voice gender.

  15. Speaking with the voice of authority

    CERN Multimedia

    2002-01-01

    GPB Consulting has developed a scientific approach to voice coaching. A digital recording of the voice is sent to a lab in Switzerland and analyzed by a computer programme designed by a doctor of psychology and linguistics and a scientist at CERN (1 page).

  16. The Voice of Conscience in Rousseau's Emile

    Science.gov (United States)

    Kodelja, Zdenko

    2015-01-01

    According to Rousseau, conscience and conscience alone can elevate human beings to a level above that of animals. It is conscience, understood as infallible judge of good and bad, which makes man like God. Conscience itself is, in this context, understood as divine, as an "immortal and celestial voice". Therefore, if the voice of…

  17. Epidemiology of voice problems in Dutch teachers

    NARCIS (Netherlands)

    de Jong, F. I. C. R. S.; Kooijman, P. G. C.; Thomas, G.; Huinck, W. J.; Graamans, K.; Schutte, H. K.

    2006-01-01

    In order to assess voice complaints and absence from work due to voice problems among teachers of primary and secondary education, as well as among a control group, 2,117 questionnaires were analysed. The total group consisted of 1,878 teachers and 239 controls. Female teachers more frequently repor

  18. Voice Deviations and Coexisting Communication Disorders.

    Science.gov (United States)

    St. Louis, Kenneth O.; And Others

    1992-01-01

    This study examined the coexistence of other communicative disorders with voice disorders in about 3,400 children in grades 1-12 at 100 sites throughout the United States. The majority of voice-disordered children had coexisting articulation deviations and also differed from controls on two language measures and mean pure-tone hearing thresholds.…

  19. Using the Voice to Design Ceramics

    DEFF Research Database (Denmark)

    Hansen, Flemming Tvede; Jensen, Kristoffer

    2011-01-01

    SoundShaping, a system to create ceramics from the human voice. Based on a generic audio feature extraction system, and the principal component analysis to ensure that the pertinent information in the voice is used, a 3D shape is created using simple geometric rules. This shape is output to a 3D printer...

  20. Epidemiology of voice problems in Dutch teachers.

    NARCIS (Netherlands)

    Jong, F.I.C.R.S. de; Kooijman, P.G.C.; Thomas, G.; Huinck, W.J.; Graamans, K.; Schutte, H.K.

    2006-01-01

    In order to assess voice complaints and absence from work due to voice problems among teachers of primary and secondary education, as well as among a control group, 2,117 questionnaires were analysed. The total group consisted of 1,878 teachers and 239 controls. Female teachers more frequently repor