WorldWideScience

Sample records for voice recognition systems

  1. Effects of emotional and perceptual-motor stress on a voice recognition system's accuracy: An applied investigation

    Science.gov (United States)

    Poock, G. K.; Martin, B. J.

    1984-02-01

    This was an applied investigation examining the ability of a speech recognition system to recognize speakers' inputs when the speakers were under different stress levels. Subjects were asked to speak to a voice recognition system under three conditions: (1) normal office environment, (2) emotional stress, and (3) perceptual-motor stress. Results indicate a definite relationship between voice recognition system performance and the type of low stress reference patterns used to achieve recognition.

  2. Motorcycle Start-stop System based on Intelligent Biometric Voice Recognition

    Science.gov (United States)

    Winda, A.; E Byan, W. R.; Sofyan; Armansyah; Zariantin, D. L.; Josep, B. G.

    2017-03-01

    The mechanical key currently used in motorcycles is prone to burglary and to being stolen or misplaced. Intelligent biometric voice recognition is proposed as an alternative to this mechanism. The proposed system decides whether the voice belongs to the user and whether the word uttered by the user is ‘On’ or ‘Off’. The decision is sent to an Arduino in order to start or stop the engine. The recorded voice is processed to obtain features that are later used as input to the proposed system. Mel-Frequency Cepstral Coefficients (MFCC) are adopted as the feature extraction technique. The extracted features are then used as input to the SVM-based identifier. Experimental results confirm the effectiveness of the proposed intelligent voice recognition and word recognition system. They show that the proposed method produces good training and testing accuracy, 99.31% and 99.43%, respectively. Moreover, the proposed system shows a false rejection rate (FRR) and false acceptance rate (FAR) of 0.18% and 17.58%, respectively. For intelligent word recognition, the training and testing accuracies are 100% and 96.3%, respectively.
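
The two error rates quoted in this record can be illustrated with a minimal sketch. It assumes the usual definitions (FRR is the share of genuine-user attempts that are rejected, FAR is the share of impostor attempts that are accepted); the function name `far_frr` and the trial format are hypothetical and are not taken from the paper.

```python
def far_frr(trials):
    """Return (FAR, FRR) as fractions.

    trials: iterable of (is_genuine, accepted) boolean pairs, one per
    verification attempt. This trial format is an assumption made for
    illustration only.
    """
    genuine = [accepted for is_genuine, accepted in trials if is_genuine]
    impostor = [accepted for is_genuine, accepted in trials if not is_genuine]
    if not genuine or not impostor:
        raise ValueError("need at least one genuine and one impostor trial")
    # False rejection: a genuine user was not accepted.
    frr = sum(1 for accepted in genuine if not accepted) / len(genuine)
    # False acceptance: an impostor was accepted.
    far = sum(1 for accepted in impostor if accepted) / len(impostor)
    return far, frr


# Ten genuine attempts (one rejected) and five impostor attempts (one accepted).
example_trials = (
    [(True, True)] * 9 + [(True, False)]
    + [(False, False)] * 4 + [(False, True)]
)
example_far, example_frr = far_frr(example_trials)
```

Multiplying either fraction by 100 gives the percentage form used in the abstract.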

  3. Neural mechanisms for voice recognition

    NARCIS (Netherlands)

    Andics, A.V.; McQueen, J.M.; Petersson, K.M.; Gal, V.; Rudas, G.; Vidnyanszky, Z.

    2010-01-01

    We investigated neural mechanisms that support voice recognition in a training paradigm with fMRI. The same listeners were trained on different weeks to categorize the mid-regions of voice-morph continua as an individual's voice. Stimuli implicitly defined a voice-acoustics space, and training expli

  4. WHEEL CHAIR USING VOICE RECOGNITION

    OpenAIRE

    Manish Kumar Yadav*; Rajat Kumar; Santosh Yadav; Ravindra Prajapati; Prof. Kshirsagar

    2016-01-01

    The widespread prevalence of lost limbs and impaired sensing is a major present-day concern, owing to wars, accidents, age and health problems. This omni-directional wheelchair was designed to let less able elderly people move more flexibly in narrow spaces, such as elevators or small aisles. The wheelchair is developed to help disabled patients by using a speech recognition system to control the movement of the wheelchair in different directions through voice commands and also the simple movement of t...

  5. Application of Voice Recognition Input to Decision Support Systems

    Science.gov (United States)

    1988-12-01

    namely, a Bark-scale frequency warping and the incorporation of suprasegmental energy information. All distortion measures and their modifications were...lowest score; (2) Whereas the addition of suprasegmental energy information helped the recognition performance, the use of gain and absolute loudness

  6. Voice integrated systems

    Science.gov (United States)

    Curran, P. Mike

    1977-01-01

    The program at Naval Air Development Center was initiated to determine the desirability of interactive voice systems for use in airborne weapon systems crew stations. A voice recognition and synthesis system (VRAS) was developed and incorporated into a human centrifuge. The speech recognition aspect of VRAS was developed using a voice command system (VCS) developed by Scope Electronics. The speech synthesis capability was supplied by a Votrax VS-5 speech synthesis unit built by Vocal Interface. The effects of simulated flight on automatic speech recognition were determined by repeated trials in the VRAS-equipped centrifuge. The relationship between voice quality and vibration, G, O2 mask, mission duration, and cockpit temperature was determined. The results showed that: (1) voice quality degrades after 0.5 hours with an O2 mask; (2) voice quality degrades under high vibration; and (3) voice quality degrades under high levels of G. The voice quality studies are summarized. These results were obtained with a baseline of 80 percent recognition accuracy with VCS.

  7. FILTWAM and Voice Emotion Recognition

    NARCIS (Netherlands)

    Bahreini, Kiavash; Nadolski, Rob; Westera, Wim

    2014-01-01

    This paper introduces the voice emotion recognition part of our framework for improving learning through webcams and microphones (FILTWAM). This framework enables multimodal emotion recognition of learners during game-based learning. The main goal of this study is to validate the use of microphone d

  9. Voice Activity Detector of Wake-Up-Word Speech Recognition System Design on FPGA

    Directory of Open Access Journals (Sweden)

    Veton Z. Këpuska

    2014-12-01

    A typical speech recognition system is push-to-talk operated and requires manual activation. However, for those who use hands-busy applications, movement may be restricted or impossible. One alternative is to use a speech-only interface. The proposed method, called Wake-Up-Word Speech Recognition (WUW-SR), utilizes a speech-only interface. A WUW-SR system would allow the user to activate systems (cell phone, computer, etc.) with only speech commands instead of manual activation. The trend in WUW-SR hardware design is towards implementing a complete system on a single chip intended for various applications. This paper presents an experimental FPGA design and implementation of a novel architecture of a real-time feature extraction processor that includes a Voice Activity Detector (VAD) and feature extraction (MFCC, LPC, and ENH_MFCC). In the WUW-SR system, the recognizer front-end with VAD is located at the terminal, which is typically connected over a data network (e.g., to a server) for remote back-end recognition. VAD is responsible for segmenting the signal into speech-like and non-speech-like segments. For any given frame, VAD reports one of two possible states: VAD_ON or VAD_OFF. The back-end is then responsible for scoring the features that are segmented during the VAD_ON stage. The most important characteristic of the presented design is that it should guarantee virtually 100% correct rejection for non-WUW words (out-of-vocabulary words, OOV) while maintaining a correct acceptance rate of 99.9% or higher (in-vocabulary words, INV). This requirement sets WUW-SR apart from other speech recognition tasks because no existing system can guarantee 100% reliability by any measure.
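
The per-frame VAD_ON / VAD_OFF decision described in this record can be sketched in software. The abstract does not say how the FPGA design reaches its decision, so the sketch below swaps in a plain short-time energy threshold as a stand-in; the function name `frame_energy_vad`, the frame length and the threshold value are all hypothetical.

```python
def frame_energy_vad(samples, frame_len=160, threshold=0.01):
    """Label each non-overlapping frame as "VAD_ON" or "VAD_OFF".

    Assumption: a frame counts as speech-like when its mean squared
    amplitude exceeds `threshold`. This is a simple energy detector,
    not the detector used in the paper. Trailing samples that do not
    fill a whole frame are ignored.
    """
    states = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        energy = sum(x * x for x in frame) / frame_len
        states.append("VAD_ON" if energy > threshold else "VAD_OFF")
    return states


# One silent frame, one loud frame, then a partial frame that is dropped.
example_signal = [0.0] * 160 + [0.5] * 160 + [0.0] * 100
example_states = frame_energy_vad(example_signal)
```

In a pipeline like the one the abstract outlines, only frames labelled VAD_ON would be passed on for feature scoring by the back-end.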

  10. Voice congruency facilitates word recognition.

    Directory of Open Access Journals (Sweden)

    Sandra Campeanu

    Behavioral studies of spoken word memory have shown that context congruency facilitates both word and source recognition, though the level at which context exerts its influence remains equivocal. We measured event-related potentials (ERPs) while participants performed both types of recognition task with words spoken in four voices. Two voice parameters (i.e., gender and accent) varied between speakers, with the possibility that none, one or two of these parameters was congruent between study and test. Results indicated that reinstating the study voice at test facilitated both word and source recognition, compared to similar or no context congruency at test. Behavioral effects were paralleled by two ERP modulations. First, in the word recognition test, the left parietal old/new effect showed a positive deflection reflective of context congruency between study and test words. Namely, the same speaker condition provided the most positive deflection of all correctly identified old words. In the source recognition test, a right frontal positivity was found for the same speaker condition compared to the different speaker conditions, regardless of response success. Taken together, the results of this study suggest that the benefit of context congruency is reflected behaviorally and in ERP modulations traditionally associated with recognition memory.

  11. Voice recognition software for clinical use.

    Science.gov (United States)

    Korn, K

    1998-11-01

    Current-generation voice recognition products truly offer the promise of voice recognition systems that are financially and operationally acceptable for use in a health care facility. Although the initial capital outlay for the purchase of such equipment may be substantial, the long-term benefit is felt to outweigh the expense. The ability to utilize computer equipment for educational purposes and information management alone helps to rationalize the cost. In addition, it is important to remember that the Internet has become a substantial source of information which provides another functional use for this equipment. Although one can readily see the implication for such a program in clinical practice, other uses for the program should not be overlooked. Uses far beyond the writing of clinic notes and correspondence can be easily envisioned. Utilization of voice recognition software offers clinical practices the ability to produce quality printed records in a timely and cost-effective manner. After learning procedures for the selected product and appropriately formatting word processing software and printers, printed progress notes should be able to be produced in less time than traditional dictation and transcription methods. Although certain procedures and practices may need to be altered, or may preclude optimal utilization of this type of system, many advantages are apparent. It is recommended that facilities consider utilization of voice recognition products such as Dragon Systems' NaturallySpeaking software, or at least consider a trial of this method with one of the limited-feature products, if current dictation practices are unsatisfactory or excessively costly. Free downloadable trial software or single user software can provide a reduced-cost method for trial evaluation of such products if a major commitment is not felt to be desired. A list of voice recognition software manufacturer web sites may be accessed through the following: http

  12. Human voice recognition depends on language ability.

    Science.gov (United States)

    Perrachione, Tyler K; Del Tufo, Stephanie N; Gabrieli, John D E

    2011-07-29

    The ability to recognize people by their voice is an important social behavior. Individuals differ in how they pronounce words, and listeners may take advantage of language-specific knowledge of speech phonology to facilitate recognizing voices. Impaired phonological processing is characteristic of dyslexia and thought to be a basis for difficulty in learning to read. We tested voice-recognition abilities of dyslexic and control listeners for voices speaking listeners' native language or an unfamiliar language. Individuals with dyslexia exhibited impaired voice-recognition abilities compared with controls only for voices speaking their native language. These results demonstrate the importance of linguistic representations for voice recognition. Humans appear to identify voices by making comparisons between talkers' pronunciations of words and listeners' stored abstract representations of the sounds in those words.

  13. Implicit multisensory associations influence voice recognition.

    Directory of Open Access Journals (Sweden)

    Katharina von Kriegstein

    2006-10-01

    Natural objects provide partially redundant information to the brain through different sensory modalities. For example, voices and faces both give information about the speech content, age, and gender of a person. Thanks to this redundancy, multimodal recognition is fast, robust, and automatic. In unimodal perception, however, only part of the information about an object is available. Here, we addressed whether, even under conditions of unimodal sensory input, crossmodal neural circuits that have been shaped by previous associative learning become activated and underpin a performance benefit. We measured brain activity with functional magnetic resonance imaging before, during, and after participants learned to associate either sensory redundant stimuli, i.e. voices and faces, or arbitrary multimodal combinations, i.e. voices and written names, ring tones, and cell phones or brand names of these cell phones. After learning, participants were better at recognizing unimodal auditory voices that had been paired with faces than those paired with written names, and association of voices with faces resulted in an increased functional coupling between voice and face areas. No such effects were observed for ring tones that had been paired with cell phones or names. These findings demonstrate that brief exposure to ecologically valid and sensory redundant stimulus pairs, such as voices and faces, induces specific multisensory associations. Consistent with predictive coding theories, associative representations become thereafter available for unimodal perception and facilitate object recognition. These data suggest that for natural objects effective predictive signals can be generated across sensory systems and proceed by optimization of functional connectivity between specialized cortical sensory modules.

  14. Voice Recognition in Face-Blind Patients.

    Science.gov (United States)

    Liu, Ran R; Pancaroglu, Raika; Hills, Charlotte S; Duchaine, Brad; Barton, Jason J S

    2016-04-01

    Right or bilateral anterior temporal damage can impair face recognition, but whether this is an associative variant of prosopagnosia or part of a multimodal disorder of person recognition is an unsettled question, with implications for cognitive and neuroanatomic models of person recognition. We assessed voice perception and short-term recognition of recently heard voices in 10 subjects with impaired face recognition acquired after cerebral lesions. All 4 subjects with apperceptive prosopagnosia due to lesions limited to fusiform cortex had intact voice discrimination and recognition. One subject with bilateral fusiform and anterior temporal lesions had a combined apperceptive prosopagnosia and apperceptive phonagnosia, the first such described case. Deficits indicating a multimodal syndrome of person recognition were found only in 2 subjects with bilateral anterior temporal lesions. All 3 subjects with right anterior temporal lesions had normal voice perception and recognition, 2 of whom performed normally on perceptual discrimination of faces. This confirms that such lesions can cause a modality-specific associative prosopagnosia.

  15. Automatic Speech Recognition Systems for the Evaluation of Voice and Speech Disorders in Head and Neck Cancer

    Directory of Open Access Journals (Sweden)

    Andreas Maier

    2010-01-01

    In patients suffering from head and neck cancer, speech intelligibility is often restricted. For assessment and outcome measurements, automatic speech recognition systems have previously been shown to be appropriate for objective and quick evaluation of intelligibility. In this study we investigate the applicability of the method to speech disorders caused by head and neck cancer. Intelligibility was quantified by speech recognition on recordings of a standard text read by 41 German laryngectomized patients with cancer of the larynx or hypopharynx and 49 German patients who had suffered from oral cancer. The speech recognition provides the percentage of correctly recognized words of a sequence, that is, the word recognition rate. Automatic evaluation was compared to perceptual ratings by a panel of experts and to an age-matched control group. Both patient groups showed significantly lower word recognition rates than the control group. Automatic speech recognition yielded word recognition rates which complied with experts' evaluation of intelligibility on a significant level. Automatic speech recognition serves as a good means with low effort to objectify and quantify the most important aspect of pathologic speech: the intelligibility. The system was successfully applied to voice and speech disorders.
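
The word recognition rate named in this record (the percentage of correctly recognized words of a sequence) can be illustrated with a minimal sketch. It assumes that "correctly recognized" words are counted by aligning the recognizer output with the reference text via a longest common subsequence; the study's own alignment procedure is not described in the abstract, and the function name `word_recognition_rate` is hypothetical.

```python
def word_recognition_rate(reference, hypothesis):
    """Percentage of reference words that also appear, in order, in the hypothesis.

    Assumption: correct words are counted as the length of the longest
    common subsequence (LCS) of the two word sequences.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference text must contain at least one word")
    # lcs[i][j] holds the LCS length of ref[:i] and hyp[:j].
    lcs = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i, ref_word in enumerate(ref, start=1):
        for j, hyp_word in enumerate(hyp, start=1):
            if ref_word == hyp_word:
                lcs[i][j] = lcs[i - 1][j - 1] + 1
            else:
                lcs[i][j] = max(lcs[i - 1][j], lcs[i][j - 1])
    return 100.0 * lcs[-1][-1] / len(ref)


# Four of the six reference words survive recognition, in order.
example_rate = word_recognition_rate(
    "the north wind and the sun",
    "the north win and sun",
)
```

Because insertions in the recognizer output are not penalized here, the value is a recognition rate rather than a word accuracy.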

  16. Voice Recognition: A New Assessment Tool?

    Science.gov (United States)

    Jones, Darla

    2005-01-01

    This article presents the results of a study conducted in Anchorage, Alaska, that evaluated the accuracy and efficiency of using voice recognition (VR) technology to collect oral reading fluency data for classroom-based assessments. The primary research question was as follows: Is voice recognition technology a valid and reliable alternative to…

  17. A self-teaching image processing and voice-recognition-based, intelligent and interactive system to educate visually impaired children

    Science.gov (United States)

    Iqbal, Asim; Farooq, Umar; Mahmood, Hassan; Asad, Muhammad Usman; Khan, Akrama; Atiq, Hafiz Muhammad

    2010-02-01

    A self-teaching system based on image processing and voice recognition is developed to educate visually impaired children, chiefly in their primary education. The system comprises a computer, a vision camera, an ear speaker and a microphone. The camera, attached to the computer system, is mounted on the ceiling opposite (at the required angle) the desk on which the book is placed. Sample images and voices in the form of instructions and commands for English and Urdu alphabets, numeric digits, operators and shapes are already stored in the database. A blind child first reads the embossed character (object) with the help of fingers and then speaks the answer (the name of the character, shape, etc.) into the microphone. When the microphone receives the child's voice command, the camera takes an image, which is processed by a MATLAB® program developed with the help of the Image Acquisition and Image Processing toolboxes; the program generates a response or the required set of instructions for the child via the ear speaker, resulting in self-education of a visually impaired child. A speech recognition program is also developed in MATLAB® with the help of the Data Acquisition and Signal Processing toolboxes; it records and processes the command of the blind child.

  18. Electrolarynx Voice Recognition Utilizing Pulse Coupled Neural Network

    Directory of Open Access Journals (Sweden)

    Fatchul Arifin

    2010-08-01

    Laryngectomized patients cannot speak normally because their vocal cords have been removed. The easiest option for such patients to speak again is electrolarynx speech. This tool is placed on the lower chin. Vibration of the neck while speaking is used to produce sound. Meanwhile, "voice recognition" technology has been growing very rapidly. It is expected that this technology can also be used by laryngectomized patients who use an electrolarynx. This paper describes a system for electrolarynx speech recognition. The two main parts of the system are feature extraction and pattern recognition. The Pulse Coupled Neural Network (PCNN) is used to extract the features and characteristics of electrolarynx speech. The effect of varying β (one of the PCNN parameters) was also examined. A multilayer perceptron is used to recognize the sound patterns. Two kinds of recognition are conducted in this paper: speech recognition and speaker recognition. Speech recognition recognizes specific speech from any speaker, whereas speaker recognition recognizes specific speech from a specific person. The system ran well. Electrolarynx speech recognition was tested by recognizing "A" and "not A" voices; the results showed that the system had 94.4% validation. Electrolarynx speaker recognition was tested by recognizing the word "saya" from several different speakers; the results showed that the system had 92.2% validation. The best β parameter of PCNN for electrolarynx recognition is 3.

  19. Design and Implementation of Monophones and Triphones-Based Speech Recognition Systems for Voice Activated Telephony

    Directory of Open Access Journals (Sweden)

    Rupayan Das

    2013-07-01

    Speech recognition is the ability of a machine or program to convert spoken words into their equivalent text form. Nowadays, most recognition systems use Hidden Markov Models for modeling the spoken utterances. In this paper we have implemented two speaker-independent speech recognition systems which include all the words required for dialing a phone. The systems contain 42 words, including the digits from zero to nine and the names of 20 persons. A total of 16,800 utterances have been used for training each system. The two systems are able to recognize continuous speech and are implemented with monophones and triphones using HTK. Experimental results show an accuracy of 74.11% for monophone-based models and 93.77% for triphone-based models.

  20. Noise Robust Speech Recognition Applied to Voice-Driven Wheelchair

    Science.gov (United States)

    Sasou, Akira; Kojima, Hiroaki

    2009-12-01

    Conventional voice-driven wheelchairs usually employ headset microphones that are capable of achieving sufficient recognition accuracy, even in the presence of surrounding noise. However, such interfaces require users to wear sensors such as a headset microphone, which can be an impediment, especially for the hand disabled. Conversely, it is also well known that the speech recognition accuracy drastically degrades when the microphone is placed far from the user. In this paper, we develop a noise robust speech recognition system for a voice-driven wheelchair. This system can achieve almost the same recognition accuracy as the headset microphone without wearing sensors. We verified the effectiveness of our system in experiments in different environments, and confirmed that our system can achieve almost the same recognition accuracy as the headset microphone without wearing sensors.

  1. A basic study on application of voice recognition input to an electronic nursing record system -evaluation of the function as an input interface-.

    Science.gov (United States)

    Marukami, Terutaka; Tani, Shoko; Matsuda, Atsuko; Takemoto, Keiko; Shindo, Akiko; Inada, Hiroshi

    2012-06-01

    As computerization in the nursing field has progressed, electronic nursing record systems are gradually being introduced in medical institutions in Japan. Although electronic nursing record systems are expected to reduce the nursing workload, present systems rely on conventional keyboard input, which poses problems of input time and operability for nurses who are unfamiliar with computers. In the present study, we conducted a basic study on the application of voice recognition input to an electronic nursing record system. Voice input has recently been introduced to electronic medical record systems in a few clinics. However, the entered information cannot so far be processed, because medical record information must be entered as free text. Therefore, in this study we devised a template for an electronic nursing record system and introduced it to the system to allow simple information entry and easy processing of the entered information. Furthermore, an input experiment with volunteer subjects was carried out to evaluate voice input with the template as an input interface for an electronic nursing record system. The results of the experiment revealed that voice input was clearly faster than keyboard input and that the operability of voice input was superior to that of keyboard input, even though none of the subjects had prior experience with voice input. These results suggest that our method, voice input using the template we devised, might be useful as an input interface for an electronic nursing record system.

  2. A meta-analysis of in-vehicle and nomadic voice-recognition system interaction and driving performance.

    Science.gov (United States)

    Simmons, Sarah M; Caird, Jeff K; Steel, Piers

    2017-09-01

    Driver distraction is a growing and pervasive issue that requires multiple solutions. Voice-recognition (V-R) systems may decrease the visual-manual (V-M) demands of a wide range of in-vehicle system and smartphone interactions. However, the degree that V-R systems integrated into vehicles or available in mobile phone applications affect driver distraction is incompletely understood. A comprehensive meta-analysis of experimental studies was conducted to address this knowledge gap. To meet study inclusion criteria, drivers had to interact with a V-R system while driving and doing everyday V-R tasks such as dialing, initiating a call, texting, emailing, destination entry or music selection. Coded dependent variables included detection, reaction time, lateral position, speed and headway. Comparisons of V-R systems with baseline driving and/or a V-M condition were also coded. Of 817 identified citations, 43 studies involving 2000 drivers and 183 effect sizes (r) were analyzed in the meta-analysis. Compared to baseline, driving while interacting with a V-R system is associated with increases in reaction time and lane positioning, and decreases in detection. When V-M systems were compared to V-R systems, drivers had slightly better performance with the latter system on reaction time, lane positioning and headway. Although V-R systems have some driving performance advantages over V-M systems, they have a distraction cost relative to driving without any system at all. The pattern of results indicates that V-R systems impose moderate distraction costs on driving. In addition, drivers minimally engage in compensatory performance adjustments such as reducing speed and increasing headway while using V-R systems. Implications of the results for theory, design guidelines and future research are discussed. Copyright © 2017 Elsevier Ltd. All rights reserved.

  3. Machine Recognition vs Human Recognition of Voices

    Science.gov (United States)

    2012-05-01

    recognized. The accuracy of speaker recognition for disyllables was 87%. For monosyllables, it was 81%, consonant- vowel excerpts were 63%, and... vowel excerpts were 56%. Thus, they demonstrated that the identification performance decreased as the number of phonemes decreased. In [2], the...will still sound natural and the performance of listeners could be tied directly to the degradation of particular frequencies. If the performance

  4. Improving Speaker Recognition by Biometric Voice Deconstruction

    Directory of Open Access Journals (Sweden)

    Luis Miguel Mazaira-Fernández

    2015-09-01

    Person identification, especially in critical environments, has always been a subject of great interest. However, it has gained a new dimension in a world threatened by a new kind of terrorism that uses social networks (e.g. YouTube) to broadcast its message. In this new scenario, classical identification methods (such as fingerprints or face recognition) have been forcedly replaced by alternative biometric characteristics such as voice, as sometimes this is the only feature available. This paper presents a new methodology to characterize speakers. The methodology benefits from the advances achieved in recent years in understanding and modelling voice production. The paper hypothesizes that a gender-dependent characterization of speakers, combined with the use of a new set of biometric parameters extracted from the components resulting from the deconstruction of the voice into its glottal source and vocal tract estimates, will enhance recognition rates when compared to classical approaches. A general description of the main hypothesis and the methodology followed to extract gender-dependent extended biometric parameters is given. Experimental validation is carried out both on a highly controlled acoustic condition database, and on a mobile phone network recorded under non-controlled acoustic conditions.

  5. Improving Speaker Recognition by Biometric Voice Deconstruction.

    Science.gov (United States)

    Mazaira-Fernandez, Luis Miguel; Álvarez-Marquina, Agustín; Gómez-Vilda, Pedro

    2015-01-01

    Person identification, especially in critical environments, has always been a subject of great interest. However, it has gained a new dimension in a world threatened by a new kind of terrorism that uses social networks (e.g., YouTube) to broadcast its message. In this new scenario, classical identification methods (such as fingerprints or face recognition) have been forcedly replaced by alternative biometric characteristics such as voice, as sometimes this is the only feature available. The present study benefits from the advances achieved during last years in understanding and modeling voice production. The paper hypothesizes that a gender-dependent characterization of speakers combined with the use of a set of features derived from the components, resulting from the deconstruction of the voice into its glottal source and vocal tract estimates, will enhance recognition rates when compared to classical approaches. A general description about the main hypothesis and the methodology followed to extract the gender-dependent extended biometric parameters is given. Experimental validation is carried out both on a highly controlled acoustic condition database, and on a mobile phone network recorded under non-controlled acoustic conditions.

  6. Voice Recognition Technology: Has It Come of Age?

    Directory of Open Access Journals (Sweden)

    Joseph R. Zumalt

    2005-12-01

    Voice recognition software allows computer users to bypass their keyboards and use their voices to enter text. While the library literature is somewhat silent about voice recognition technology, the medical and legal communities have reported some success using it. Voice recognition software was tested for dictation accuracy and usability within an agriculture library at the University of Illinois. Dragon NaturallySpeaking 8.0 was found to be more accurate than speech recognition within Microsoft Office 2003. Helpful Web sites and a short history regarding this breakthrough technology are included.

  7. Temporal voice areas exist in autism spectrum disorder but are dysfunctional for voice identity recognition

    Science.gov (United States)

    Borowiak, Kamila; von Kriegstein, Katharina

    2016-01-01

    The ability to recognise the identity of others is a key requirement for successful communication. Brain regions that respond selectively to voices exist in humans from early infancy on. Currently, it is unclear whether dysfunction of these voice-sensitive regions can explain voice identity recognition impairments. Here, we used two independent functional magnetic resonance imaging studies to investigate voice processing in a population that has been reported to have no voice-sensitive regions: autism spectrum disorder (ASD). Our results refute the earlier report that individuals with ASD have no responses in voice-sensitive regions: Passive listening to vocal, compared to non-vocal, sounds elicited typical responses in voice-sensitive regions in the high-functioning ASD group and controls. In contrast, the ASD group had a dysfunction in voice-sensitive regions during voice identity but not speech recognition in the right posterior superior temporal sulcus/gyrus (STS/STG)—a region implicated in processing complex spectrotemporal voice features and unfamiliar voices. The right anterior STS/STG correlated with voice identity recognition performance in controls but not in the ASD group. The findings suggest that right STS/STG dysfunction is critical for explaining voice recognition impairments in high-functioning ASD and show that ASD is not characterised by a general lack of voice-sensitive responses. PMID:27369067

  8. Secure Recognition of Voice-Less Commands Using Videos

    Science.gov (United States)

    Yau, Wai Chee; Kumar, Dinesh Kant; Weghorn, Hans

    Interest in voice recognition technologies for internet applications is growing due to the flexibility of speech-based communication. The major drawback with the use of sound for internet access with computers is that the commands will be audible to other people in the vicinity. This paper examines a secure and voice-less method for recognition of speech-based commands using video without evaluating sound signals. The proposed approach represents mouth movements in the video data using 2D spatio-temporal templates (STT). Zernike moments (ZM) are computed from STT and fed into support vector machines (SVM) to be classified into one of the utterances. The experimental results demonstrate that the proposed technique produces a high accuracy of 98% in a phoneme classification task. The proposed technique is demonstrated to be invariant to global variations of illumination level. Such a system is useful for securely interpreting user commands for internet applications on mobile devices.

  9. Building Domain Specific Languages for Voice Recognition Applications

    Directory of Open Access Journals (Sweden)

    Cristian IONITA

    2008-01-01

    This paper presents a method of implementing voice recognition for the control of software applications. The proposed solutions transform a subset of natural language into commands recognized by the application, using a formal language defined by means of a context-free grammar. The paper closes by presenting how voice recognition and voice synthesis for the Romanian language are integrated in Windows applications.
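
    The grammar-to-command idea in this record can be sketched as follows. This is a minimal illustration only, not the paper's implementation: the grammar, its vocabulary, and the function names `parse` and `to_command` are all hypothetical.

```python
# Hypothetical toy context-free grammar for spoken commands.
# Keys are non-terminals; anything that is not a key is a terminal word.
GRAMMAR = {
    "command": [["action", "object"], ["action", "article", "object"]],
    "action": [["open"], ["close"], ["save"]],
    "article": [["the"]],
    "object": [["file"], ["window"]],
}


def parse(symbol, tokens, pos):
    """Yield (tree, next_pos) for every way `symbol` derives tokens from `pos`."""
    if symbol not in GRAMMAR:  # terminal: must match the current token
        if pos < len(tokens) and tokens[pos] == symbol:
            yield symbol, pos + 1
        return
    for production in GRAMMAR[symbol]:
        # Expand the production left to right, keeping all partial matches.
        partials = [([], pos)]
        for part in production:
            partials = [
                (kids + [tree], nxt)
                for kids, p in partials
                for tree, nxt in parse(part, tokens, p)
            ]
        for kids, end in partials:
            yield (symbol, kids), end


def to_command(utterance):
    """Map recognized text to an (action, object) pair, or None if not in the grammar."""
    tokens = utterance.lower().split()
    for tree, end in parse("command", tokens, 0):
        if end == len(tokens):  # accept only parses that consume every word
            leaves = {child[0]: child[1][0] for child in tree[1]}
            return leaves["action"], leaves["object"]
    return None
```

    Restricting input to a small grammar like this means any utterance outside the defined subset is rejected rather than guessed at, which is the usual motivation for grammar-constrained command recognition.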

  10. Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques

    CERN Document Server

    Muda, Lindasalwa; Elamvazuthi, I

    2010-01-01

    Digital processing of the speech signal and voice recognition algorithms are very important for fast and accurate automatic voice recognition technology. The voice is a signal of infinite information. Direct analysis and synthesis of the complex voice signal is difficult because of the amount of information contained in the signal. Therefore digital signal processing steps such as Feature Extraction and Feature Matching are introduced to represent the voice signal. Several methods, such as Linear Predictive Coding (LPC), Hidden Markov Model (HMM), and Artificial Neural Network (ANN), are evaluated with a view to identifying a straightforward and effective method for the voice signal. The extraction and matching process is implemented right after the pre-processing or filtering of the signal is performed. Mel Frequency Cepstral Coefficients (MFCCs), a non-parametric method for modelling the human auditory perception system, are utilized as the extraction technique. The non-linear sequence alignment known as Dynamic Time Warping (DTW) intro...
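
    The feature-matching step named in this record, Dynamic Time Warping, can be sketched as below. This is the textbook algorithm with a Euclidean frame distance, offered as an assumption about the general technique rather than the authors' code; the function name `dtw_distance` is illustrative, and the inputs stand in for per-frame feature vectors such as MFCCs.

```python
def dtw_distance(a, b):
    """Dynamic time warping cost between two sequences of feature vectors.

    a, b: lists of equal-dimension numeric vectors (one vector per frame).
    Returns the minimum cumulative Euclidean distance over all monotonic
    alignments of the two sequences.
    """
    inf = float("inf")
    n, m = len(a), len(b)
    # cost[i][j] = best cumulative cost aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            frame_dist = sum((x - y) ** 2 for x, y in zip(a[i - 1], b[j - 1])) ** 0.5
            cost[i][j] = frame_dist + min(
                cost[i - 1][j],      # a advances, b repeats a frame
                cost[i][j - 1],      # b advances, a repeats a frame
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]
```

    Because frames may repeat in the alignment, the same word spoken at half speed aligns to its template at zero cost, which is why DTW suits utterances of varying duration.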

  11. Familiar Person Recognition: Is Autonoetic Consciousness More Likely to Accompany Face Recognition Than Voice Recognition?

    Science.gov (United States)

    Barsics, Catherine; Brédart, Serge

    2010-11-01

    Autonoetic consciousness is a fundamental property of human memory, enabling us to experience mental time travel, to recollect past events with a feeling of self-involvement, and to project ourselves in the future. Autonoetic consciousness is a characteristic of episodic memory. By contrast, awareness of the past associated with a mere feeling of familiarity or knowing relies on noetic consciousness, depending on semantic memory integrity. Present research was aimed at evaluating whether conscious recollection of episodic memories is more likely to occur following the recognition of a familiar face than following the recognition of a familiar voice. Recall of semantic information (biographical information) was also assessed. Previous studies that investigated the recall of biographical information following person recognition used faces and voices of famous people as stimuli. In this study, the participants were presented with personally familiar people's voices and faces, thus avoiding the presence of identity cues in the spoken extracts and allowing a stricter control of frequency exposure with both types of stimuli (voices and faces). In the present study, the rate of retrieved episodic memories, associated with autonoetic awareness, was significantly higher from familiar faces than familiar voices even though the level of overall recognition was similar for both these stimuli domains. The same pattern was observed regarding semantic information retrieval. These results and their implications for current Interactive Activation and Competition person recognition models are discussed.

  12. Impact of PACS and Voice-Recognition Reporting on the Education of Radiology Residents

    OpenAIRE

    Gutierrez, Antonio J.; Mullins, Mark E.; Novelline, Robert A.

    2005-01-01

    Rationale and Objectives: The introduction of picture archiving and communication system (PACS) has decreased the time needed to interpret radiology examinations resulting in an increased workflow. Because of concerns that the increase in exam throughput and the use of voice recognition may have a negative impact upon radiology resident education, a survey was conducted to assess the impact of PACS and voice recognition. Materials and Methods: Residents at four diagnostic radiology training p...

  13. The Neuropsychology of Familiar Person Recognition from Face and Voice

    Directory of Open Access Journals (Sweden)

    Guido Gainotti

    2014-05-01

    Prosopagnosia has been considered for a long period of time as the most important and almost exclusive disorder in the recognition of familiar people. In recent years, however, this conviction has been undermined by the description of patients showing a concomitant defect in the recognition of familiar faces and voices as a consequence of lesions encroaching upon the right anterior temporal lobe (ATL). These new data have obliged researchers to reconsider on one hand the construct of ‘associative prosopagnosia’ and on the other hand current models of people recognition. A systematic review of the patterns of familiar people recognition disorders observed in patients with right and left ATL lesions has shown that in patients with right ATL lesions face familiarity feelings and the retrieval of person-specific semantic information from faces are selectively affected, whereas in patients with left ATL lesions the defect selectively concerns famous people naming. Furthermore, some patients with right ATL lesions and intact face familiarity feelings show a defect in the retrieval of person-specific semantic knowledge greater from face than from name. These data are at variance with current models assuming: (a) that familiarity feelings are generated at the level of person identity nodes (PINs) where information processed by various sensory modalities converge, and (b) that PINs provide a modality-free gateway to a single semantic system, where information about people is stored in an amodal format. They suggest, on the contrary: (a) that familiarity feelings are generated at the level of modality-specific recognition units; (b) that face and voice recognition units are represented more in the right than in the left ATLs; (c) that person-specific information based on a convergence of perceptual information is mainly stored in the right ATL, whereas verbally-mediated person-specific information is represented in the left ATL.

  14. LABORATORY VOICE DATA ENTRY SYSTEM.

    Energy Technology Data Exchange (ETDEWEB)

    PRAISSMAN, J.L.; SUTHERLAND, J.C.

    2003-04-01

    We have assembled a system using a personal computer workstation equipped with standard office software, an audio system, speech recognition software and an inexpensive radio-based wireless microphone that permits laboratory workers to enter or modify data while performing other work. Speech recognition permits users to enter data while their hands are holding equipment or they are otherwise unable to operate a keyboard. The wireless microphone allows unencumbered movement around the laboratory without a "tether" that might interfere with equipment or experimental procedures. To evaluate the potential of voice data entry in a laboratory environment, we developed a prototype relational database that records the disposal of radionuclides and/or hazardous chemicals. Current regulations in our laboratory require that each such item being discarded must be inventoried and documents must be prepared that summarize the contents of each container used for disposal. Using voice commands, the user enters items into the database as each is discarded. Subsequently, the program prepares the required documentation.

  15. Superior voice recognition in a patient with acquired prosopagnosia and object agnosia.

    Science.gov (United States)

    Hoover, Adria E N; Démonet, Jean-François; Steeves, Jennifer K E

    2010-11-01

    Anecdotally, it has been reported that individuals with acquired prosopagnosia compensate for their inability to recognize faces by using other person identity cues such as hair, gait or the voice. Are they therefore superior at the use of non-face cues, specifically voices, to person identity? Here, we empirically measure person and object identity recognition in a patient with acquired prosopagnosia and object agnosia. We quantify person identity (face and voice) and object identity (car and horn) recognition for visual, auditory, and bimodal (visual and auditory) stimuli. The patient is unable to recognize faces or cars, consistent with his prosopagnosia and object agnosia, respectively. He is perfectly able to recognize people's voices and car horns and bimodal stimuli. These data show a reverse shift in the typical weighting of visual over auditory information for audiovisual stimuli in a compromised visual recognition system. Moreover, the patient shows selectively superior voice recognition compared to the controls revealing that two different stimulus domains, persons and objects, may not be equally affected by sensory adaptation effects. This also implies that person and object identity recognition are processed in separate pathways. These data demonstrate that an individual with acquired prosopagnosia and object agnosia can compensate for the visual impairment and become quite skilled at using spared aspects of sensory processing. In the case of acquired prosopagnosia it is advantageous to develop a superior use of voices for person identity recognition in everyday life. Copyright © 2010 Elsevier Ltd. All rights reserved.

  16. Face and Voice Recognition Algorithms of Sign-in System for Underground Coalmine

    Institute of Scientific and Technical Information of China (English)

    王君; 李成武; 杨茜; 刘世森

    2012-01-01

    Safety accidents occur in coal mines from time to time. A sign-in management system can track the personnel entering and leaving the mine promptly and accurately, which supports safe production and timely rescue. When a traditional sign-in management system is used in a mine, dim lighting, dust adhering to faces, and interfering noise lower the detection rate of the recognition method. To address this, the paper proposes a face recognition method that combines the Karhunen-Loeve (KL) transform with a Tree-Augmented Naive Bayesian network (TAN) classifier and is supplemented by voice recognition. Morphological filtering quickly removes most of the useless background, which speeds up processing and makes the feature points more prominent. The system automatically selects face recognition or voice recognition according to the specific environment, which raises recognition accuracy. The simulation results show that the recognition method combined with voice reduces computational complexity, improves the personnel recognition rate, and enhances adaptability.

  17. When the face fits: recognition of celebrities from matching and mismatching faces and voices.

    Science.gov (United States)

    Stevenage, Sarah V; Neil, Greg J; Hamlin, Iain

    2014-01-01

    The results of two experiments are presented in which participants engaged in a face-recognition or a voice-recognition task. The stimuli were face-voice pairs in which the face and voice were co-presented and were either "matched" (same person), "related" (two highly associated people), or "mismatched" (two unrelated people). Analysis in both experiments confirmed that accuracy and confidence in face recognition was consistently high regardless of the identity of the accompanying voice. However accuracy of voice recognition was increasingly affected as the relationship between voice and accompanying face declined. Moreover, when considering self-reported confidence in voice recognition, confidence remained high for correct responses despite the proportion of these responses declining across conditions. These results converged with existing evidence indicating the vulnerability of voice recognition as a relatively weak signaller of identity, and results are discussed in the context of a person-recognition framework.

  18. The Neuropsychology of Familiar Person Recognition from Face and Voice

    OpenAIRE

    2014-01-01

    Prosopagnosia has been considered for a long period of time as the most important and almost exclusive disorder in the recognition of familiar people. In recent years, however, this conviction has been undermined by the description of patients showing a concomitant defect in the recognition of familiar faces and voices as a consequence of lesions encroaching upon the right anterior temporal lobe (ATL). These new data have obliged researchers to reconsider on one hand the construct of ‘associa...

  19. Emotional Recognition in Autism Spectrum Conditions from Voices and Faces

    Science.gov (United States)

    Stewart, Mary E.; McAdam, Clair; Ota, Mitsuhiko; Peppe, Sue; Cleland, Joanne

    2013-01-01

    The present study reports on a new vocal emotion recognition task and assesses whether people with autism spectrum conditions (ASC) perform differently from typically developed individuals on tests of emotional identification from both the face and the voice. The new test of vocal emotion contained trials in which the vocal emotion of the sentence…

  20. Voice recognition software can be used for scientific articles

    DEFF Research Database (Denmark)

    Pommergaard, Hans-Christian; Huang, Chenxi; Burcharth, Jacob;

    2015-01-01

    INTRODUCTION: Dictation of scientific articles has been recognised as an efficient method for producing high-quality, first article drafts. However, standardised transcription service by a secretary may not be available for all researchers and voice recognition software (VRS) may therefore...

  1. Recognition of voice commands using adaptation of foreign language speech recognizer via selection of phonetic transcriptions

    Science.gov (United States)

    Maskeliunas, Rytis; Rudzionis, Vytautas

    2011-06-01

    In recent years various commercial speech recognizers have become available. These recognizers make it possible to develop applications incorporating various speech recognition techniques easily and quickly. All of these commercial recognizers are typically targeted at widely spoken languages with large market potential; however, it may be possible to adapt available commercial recognizers for use in environments where less widely spoken languages are used. Since most commercial recognition engines are closed systems, the only avenue for adaptation is to select suitable phonetic transcriptions that map between the two languages. This paper deals with methods for finding phonetic transcriptions of Lithuanian voice commands so that they can be recognized using English speech engines. The experimental evaluation showed that it is possible to find phonetic transcriptions that enable the recognition of Lithuanian voice commands with a recognition accuracy of over 90%.

  2. Pegembangan Game dengan Menggunakan Teknologi Voice Recognition Berbasis Android (Development of an Android-Based Game Using Voice Recognition Technology)

    Directory of Open Access Journals (Sweden)

    Franky Hadinata Marpaung

    2014-06-01

    The purpose of this research is to create a new kind of game using technology that is rarely used in current games. It is developed as an entertainment medium and also a social medium in which users can play games together via multiplayer mode. This research uses the Scrum development method, since it supports small-scale development teams and incremental delivery of software during development. Using this game application, users can play and watch interesting animations by controlling them with their voice, listen to the character imitating the user's voice, and play various mini games in either single-player or multiplayer mode via a Bluetooth connection. The conclusion is that the game application My Name is Dug uses voice recognition and inter-device connection as its main features. It also has various mini games that support both single player and multiplayer.

  3. Acoustic cues for the recognition of self-voice and other-voice

    Directory of Open Access Journals (Sweden)

    Mingdi Xu

    2013-10-01

    Self-recognition, being indispensable for successful social communication, has become a major focus in current social neuroscience. The physical aspects of the self are most typically manifested in the face and voice. Compared with the wealth of studies on self-face recognition, self-voice recognition (SVR) has not gained much attention. Converging evidence has suggested that the fundamental frequency (F0) and formant structures serve as the key acoustic cues for other-voice recognition (OVR). However, little is known about which, and how, acoustic cues are utilized for SVR as opposed to OVR. To address this question, we independently manipulated the F0 and formant information of recorded voices and investigated their contributions to SVR and OVR. Japanese participants were presented with recorded vocal stimuli and were asked to identify the speaker: either themselves or one of their peers. Six groups of 5 peers of the same sex participated in the study. Under conditions where the formant information was fully preserved and where only the frequencies lower than the third formant (F3) were retained, accuracies of SVR deteriorated significantly with the modulation of the F0, and the results were comparable for OVR. By contrast, under a condition where only the frequencies higher than F3 were retained, the accuracy of SVR was significantly higher than that of OVR throughout the range of F0 modulations, and the F0 scarcely affected the accuracies of SVR and OVR. Our results indicate that while both F0 and formant information are involved in SVR, as well as in OVR, the advantage of SVR is manifested only when major formant information for speech intelligibility is absent. These findings imply the robustness of self-voice representation, possibly by virtue of auditory familiarity and other factors such as its association with motor/articulatory representation.

  4. A Voice Operated Tour Planning System for Autonomous Mobile Robots

    Directory of Open Access Journals (Sweden)

    Charles V. Smith III

    2010-06-01

    Control systems driven by voice recognition software have been implemented before but lacked a context-driven approach to generating relevant responses and actions. A partially voice-activated control system for mobile robotics is presented that allows an autonomous robot to interact with people and the environment in a meaningful way, while dynamically creating customized tours. Many existing control systems also require substantial training for voice application. The proposed system requires little to no training and is adaptable to chaotic environments. The traversable area is mapped once, and from that map a fully customized route is generated for the user.

  5. Controlling An Electric Car Starter System Through Voice

    Directory of Open Access Journals (Sweden)

    A.B. Muhammad Firdaus

    2015-04-01

    These days the automobile has become one of the best-known modes of transportation because a large number of Malaysians can afford to own a car. There are numerous choices of in-car technologies on the market. One of these technologies is the voice-controlled system. Voice recognition is the process of automatically recognizing a certain statement spoken by a specific speaker, based on individual information included in speech waves. This paper aims to make a car controlled by the human voice. An essential pre-processing step in voice recognition systems is to detect the presence of noise. Sensitivity to speech variability, insufficient recognition accuracy, and susceptibility to mimicry are among the main technical obstacles that prevent the widespread adoption of speech-based recognition systems. Voice recognition systems work reasonably well in quiet conditions but poorly in noisy conditions or over distorted channels. The key focus of the project is to control an electric car starter system.

  6. METHODS FOR QUALITY ENHANCEMENT OF USER VOICE SIGNAL IN VOICE AUTHENTICATION SYSTEMS

    Directory of Open Access Journals (Sweden)

    O. N. Faizulaieva

    2014-03-01

    The rationale for using the voice of a computer system user in the authentication process is established. The scientific task of improving the signal-to-noise ratio of the user's voice signal in the authentication system is considered. The object of study is the process of input and extraction of the voice signal of an authentication system user in computer systems and networks. Methods and means for input and extraction of the voice signal against external interference signals are researched. Methods for enhancing the quality of the user's voice signal in voice authentication systems are suggested. As modern computers, including mobile ones, have two-channel audio cards, the use of two microphones is proposed for the voice signal input of the authentication system. The task of forming a lobe of the microphone array in the desired range of voice signal registration (100 Hz to 8 kHz) is also solved. Using the directional properties of the proposed microphone array makes it possible to reduce the influence of external interference signals by a factor of two to three in the frequency range from 4 to 8 kHz. The possibilities for implementing space-time processing of the recorded signals using constant and adaptive weighting factors are investigated. The simulation results of the proposed system for input and extraction of signals during digital processing of narrowband signals are presented. The proposed solutions make it possible to improve the signal-to-noise ratio of the recorded useful signals by 10 to 20 dB under the influence of external interference signals in the frequency range from 4 to 8 kHz. The results may be useful to specialists working in the field of voice recognition and speaker discrimination.

  7. Voice recognition software can be used for scientific articles

    DEFF Research Database (Denmark)

    Pommergaard, Hans-Christian; Huang, Chenxi; Burcharth, Jacob

    2015-01-01

    INTRODUCTION: Dictation of scientific articles has been recognised as an efficient method for producing high-quality, first article drafts. However, standardised transcription service by a secretary may not be available for all researchers and voice recognition software (VRS) may therefore be an alternative. The purpose of this study was to evaluate the out-of-the-box accuracy of VRS. METHODS: Eleven young researchers without dictation experience dictated the first draft of their own scientific article after thorough preparation according to a pre-defined schedule. The dictate transcribed by VRS...

  8. Voice recognition software can be used for scientific articles

    DEFF Research Database (Denmark)

    Pommergaard, Hans-Christian; Huang, Chenxi; Burcharth, Jacob;

    2015-01-01

    INTRODUCTION: Dictation of scientific articles has been recognised as an efficient method for producing high-quality, first article drafts. However, standardised transcription service by a secretary may not be available for all researchers and voice recognition software (VRS) may therefore be an alternative. The purpose of this study was to evaluate the out-of-the-box accuracy of VRS. METHODS: Eleven young researchers without dictation experience dictated the first draft of their own scientific article after thorough preparation according to a pre-defined schedule. The dictate transcribed by VRS was compared with the same dictate transcribed by an experienced research secretary, and the effect of adding words to the vocabulary of the VRS was investigated. The number of errors per hundred words was used as outcome. Furthermore, three experienced researchers assessed the subjective readability using...

  9. Enhancing nursing practice by utilizing voice recognition for direct documentation.

    Science.gov (United States)

    Fratzke, Jason; Tucker, Sharon; Shedenhelm, Heidi; Arnold, Jackie; Belda, Tom; Petera, Michael

    2014-02-01

    Innovative strategies that preserve nursing time for direct patient care activities are needed. This study examined the utility, feasibility, and acceptability of voice recognition (VR) software to document nursing care and patient outcomes in an electronic health record in a simulated nursing care environment. A phase 1 trial included 5 iterative experiments with observations and nurse participant feedback to allow enhancements to the speech detection capabilities and refinement of the technology, software, and processes. Utility ratings improved over time; however, interference on nursing care remained a concern throughout. Nurse participants favored keyboard entry electronic health record, largely due to software and technical issues, but also relative to the culture shift the new technology brings to nursing practice. Successful adoption of VR technology by nursing will be dependent on receptiveness of the nurses and perceived benefits, timely access to education and training, and minimization of barriers to using the software.

  10. Remote Voice Detection System

    Science.gov (United States)

    2007-06-25

    back to the laser Doppler vibrometer and the digital camera, respectively. Mechanical beam steering mirror modules, such as galvanometer steering ... mirror module 43 in accordance with this invention. An appropriate galvanometer-based tracker system has been used for tracking eye motion during laser

  11. Multimodal user input to supervisory control systems - Voice-augmented keyboard

    Science.gov (United States)

    Mitchell, Christine M.; Forren, Michelle G.

    1987-01-01

    The use of a voice-augmented keyboard input modality is evaluated in a supervisory control application. An implementation of voice recognition technology in supervisory control is proposed: voice is used to request display pages, while the keyboard is used to input system reconfiguration commands. Twenty participants controlled GT-MSOCC, a high-fidelity simulation of the operator interface to a NASA ground control system, via a workstation equipped with either a single keyboard or a voice-augmented keyboard. Experimental results showed that in all cases where significant performance differences occurred, performance with the voice-augmented keyboard modality was inferior to and had greater variance than the keyboard-only modality. These results suggest that current moderately priced voice recognition systems are an inappropriate human-computer interaction technology in supervisory control systems.

  12. Frequency and analysis of non-clinical errors made in radiology reports using the National Integrated Medical Imaging System voice recognition dictation software.

    Science.gov (United States)

    Motyer, R E; Liddy, S; Torreggiani, W C; Buckley, O

    2016-11-01

    Voice recognition (VR) dictation of radiology reports has become the mainstay of reporting in many institutions worldwide. Despite benefit, such software is not without limitations, and transcription errors have been widely reported. Evaluate the frequency and nature of non-clinical transcription error using VR dictation software. Retrospective audit of 378 finalised radiology reports. Errors were counted and categorised by significance, error type and sub-type. Data regarding imaging modality, report length and dictation time was collected. 67 (17.72 %) reports contained ≥1 errors, with 7 (1.85 %) containing 'significant' and 9 (2.38 %) containing 'very significant' errors. A total of 90 errors were identified from the 378 reports analysed, with 74 (82.22 %) classified as 'insignificant', 7 (7.78 %) as 'significant', 9 (10 %) as 'very significant'. 68 (75.56 %) errors were 'spelling and grammar', 20 (22.22 %) 'missense' and 2 (2.22 %) 'nonsense'. 'Punctuation' error was most common sub-type, accounting for 27 errors (30 %). Complex imaging modalities had higher error rates per report and sentence. Computed tomography contained 0.040 errors per sentence compared to plain film with 0.030. Longer reports had a higher error rate, with reports >25 sentences containing an average of 1.23 errors per report compared to 0-5 sentences containing 0.09. These findings highlight the limitations of VR dictation software. While most error was deemed insignificant, there were occurrences of error with potential to alter report interpretation and patient management. Longer reports and reports on more complex imaging had higher error rates and this should be taken into account by the reporting radiologist.
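
    The percentages quoted in this record can be re-derived from the raw counts it reports (378 reports, 90 errors). The short check below uses only those counts; the helper name `pct` and the dictionary labels are illustrative.

```python
def pct(part, whole):
    """Percentage of `whole` represented by `part`, rounded to two decimals."""
    return round(100 * part / whole, 2)


REPORTS = 378  # finalised radiology reports audited
ERRORS = 90    # total errors found across those reports

# Counts taken from the abstract.
errors_by_significance = {"insignificant": 74, "significant": 7, "very significant": 9}
errors_by_type = {"spelling and grammar": 68, "missense": 20, "nonsense": 2}

# Each breakdown should account for every error exactly once.
assert sum(errors_by_significance.values()) == ERRORS
assert sum(errors_by_type.values()) == ERRORS

share_of_reports_with_errors = pct(67, REPORTS)
significance_shares = {k: pct(v, ERRORS) for k, v in errors_by_significance.items()}
type_shares = {k: pct(v, ERRORS) for k, v in errors_by_type.items()}
```

    Note the two different denominators: report-level rates are out of 378 reports, while the significance and type breakdowns are out of 90 errors.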

  13. Engine sound recognition system based on BP and ARM

    Institute of Scientific and Technical Information of China (English)

    姜愉

    2012-01-01

    To address automatic fee charging at highway toll stations and large toll parking lots, this paper introduces the design of an embedded engine sound recognition system based on ARM9 and embedded Linux, grounded in BP neural network recognition theory. The design uses an S3C2410 microprocessor and the embedded Linux operating system; the cross-compiled C-language engine sound recognition program is ported to the operating system's file system to achieve real-time recognition of engine sound. The paper describes the system's overall hardware and software framework and gives the results of identifying the car type from real-time engine sound input. Field experiments confirm the system's accuracy, real-time performance, and validity.

  14. Arabic Speech Recognition System using CMU-Sphinx4

    CERN Document Server

    Satori, H; Chenfour, N

    2007-01-01

    In this paper we present the creation of an Arabic version of an Automated Speech Recognition (ASR) system. This system is based on the open source Sphinx-4 from Carnegie Mellon University, a speech recognition system based on discrete hidden Markov models (HMMs). We investigate the changes that must be made to the model to adapt it to Arabic voice recognition. Keywords: Speech recognition, Acoustic model, Arabic language, HMMs, CMUSphinx-4, Artificial intelligence.

  15. EXPERIMENTAL STUDY OF FIRMWARE FOR INPUT AND EXTRACTION OF USER’S VOICE SIGNAL IN VOICE AUTHENTICATION SYSTEMS

    Directory of Open Access Journals (Sweden)

    O. N. Faizulaieva

    2014-09-01

    This work addresses the scientific task of improving the signal-to-noise ratio of a user's voice signal in computer systems and networks during voice authentication. The object of study is the process of input and extraction of the voice signal of an authentication system user in computer systems and networks. Methods and means for input and extraction of the voice signal against a background of external interference signals are investigated. Ways of improving the quality of the user's voice signal in voice authentication systems are investigated experimentally. Firmware for an experimental unit for input and extraction of the user's voice signal under external interference is considered. Because modern computers, including mobile ones, have a two-channel audio card, two microphones are used for voice signal input. The distance between the sound-wave sensors is 20 mm, which forms a single directional-pattern lobe of the microphone array in the desired range of voice signal registration (from 100 Hz to 8 kHz). According to the experimental results, using the directional properties of the proposed microphone array together with space-time processing of the recorded signals with constant and adaptive weighting factors made it possible to reduce the influence of interference signals considerably. The results of the firmware experiments on input and extraction of the user's voice signal under external interference are shown. The proposed solutions make it possible to improve the signal-to-noise ratio of the recorded useful signals up to 20 dB under external interference signals in the frequency range from 4 to 8 kHz. The results may be useful to specialists working in the field of voice recognition and speaker discrimination.
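
    As a rough illustration of the two-microphone geometry described above, the sketch below computes the half-wavelength spatial-aliasing limit for a 20 mm spacing and applies a plain delay-and-sum combination with constant weights. It is a minimal sketch under stated assumptions, not the authors' firmware: the speed of sound (343 m/s), the plane-wave model, and the function names are all assumptions, and the adaptive weighting mentioned in the abstract is not reproduced.

```python
import numpy as np

SPEED_OF_SOUND = 343.0  # m/s; assumed value for air at roughly 20 degrees C


def max_unaliased_frequency(spacing_m, c=SPEED_OF_SOUND):
    """Highest frequency whose half wavelength is still at least the mic spacing."""
    return c / (2.0 * spacing_m)


def inter_mic_delay(spacing_m, angle_rad, c=SPEED_OF_SOUND):
    """Arrival-time difference (s) for a plane wave; angle 0 is broadside."""
    return spacing_m * np.sin(angle_rad) / c


def delay_and_sum(x1, x2, fs, delay_s):
    """Delay channel x1 by delay_s (fractional delays allowed) and average with x2.

    The delay is applied as a linear phase shift in the frequency domain,
    so it is circular over the length of the buffer.
    """
    n = len(x1)
    freqs = np.fft.rfftfreq(n, d=1.0 / fs)
    spectrum = np.fft.rfft(x1) * np.exp(-2j * np.pi * freqs * delay_s)
    shifted = np.fft.irfft(spectrum, n=n)
    return 0.5 * (shifted + x2)
```

    With a 20 mm spacing this limit comes out at about 8.6 kHz, which sits just above the 8 kHz upper bound quoted in the abstract. Signals arriving from the steered direction add coherently, while signals from other directions are partly cancelled.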

  16. Embodied Transcription: A Creative Method for Using Voice-Recognition Software

    Science.gov (United States)

    Brooks, Christine

    2010-01-01

    Voice-recognition software is designed to be used by one user (voice) at a time, requiring a researcher to speak all of the words of a recorded interview to achieve transcription. Thus, the researcher becomes a conduit through which interview material is inscribed as written word. Embodied Transcription acknowledges performative and interpretative…

  17. Voice recognition software can be used for scientific articles.

    Science.gov (United States)

    Pommergaard, Hans-Christian; Huang, Chenxi; Burcharth, Jacob; Rosenberg, Jacob

    2015-02-01

    Dictation of scientific articles has been recognised as an efficient method for producing high-quality, first article drafts. However, standardised transcription service by a secretary may not be available for all researchers and voice recognition software (VRS) may therefore be an alternative. The purpose of this study was to evaluate the out-of-the-box accuracy of VRS. Eleven young researchers without dictation experience dictated the first draft of their own scientific article after thorough preparation according to a pre-defined schedule. The dictate transcribed by VRS was compared with the same dictate transcribed by an experienced research secretary, and the effect of adding words to the vocabulary of the VRS was investigated. The number of errors per hundred words was used as outcome. Furthermore, three experienced researchers assessed the subjective readability using a Likert scale (0-10). Dragon Nuance Premium version 12.5 was used as VRS. The median number of errors per hundred words was 18 (range: 8.5-24.3), which improved when 15,000 words were added to the vocabulary. Subjective readability assessment showed that the texts were understandable with a median score of five (range: 3-9), which was improved with the addition of 5,000 words. The out-of-the-box performance of VRS was acceptable and improved after additional words were added. Further studies are needed to investigate the effect of additional software accuracy training.

  18. Who gets credit for input? Demographic and structural status cues in voice recognition.

    Science.gov (United States)

    Howell, Taeya M; Harrison, David A; Burris, Ethan R; Detert, James R

    2015-11-01

    The authors investigate the employee features that, alongside overall voice expression, affect supervisors' voice recognition. Drawing primarily from status characteristics and network position theories, the authors propose and find in a study of 693 employees from 89 different credit union units that supervisors are more likely to credit those reporting the same amount of voice if the employees have higher ascribed or assigned (by the organization) status--cued by demographic variables such as majority ethnicity and full-time work hours. Further, supervisors are more likely to recognize voice from employees who have higher achieved status--cued by their centrality in informal social structures. The authors also find that even when certain groups of lower status employees speak up more, they cannot compensate for the negative effect of their demographic membership on voice recognition by their boss. The authors underscore how recognition of employee voice by supervisors matters for employees. It carries (mediates) the effects of voice expression and status onto performance evaluations 1 year later, which means that demographic differences in the assignment of credit for voice can serve as an implicit pathway for discrimination.

  19. Voice-Controlled Artificial Handspeak System

    Directory of Open Access Journals (Sweden)

    Carlo Fonda

    2014-01-01

    A man-machine interaction project is described which aims to establish an automated voice to sign language translator for communication with the deaf using integrated open technologies. The first prototype consists of a robotic hand designed with OpenSCAD and manufactured with a low-cost 3D printer, which smoothly reproduces the alphabet of the sign language controlled by voice only. The core automation comprises an Arduino UNO controller used to activate a set of servo motors that follow instructions from a Raspberry Pi mini-computer running the open source speech recognition engine Julius. We discuss its features, limitations and possible future developments.

  20. Voice and GPS Based Navigation System For Visually Impaired

    Directory of Open Access Journals (Sweden)

    Harsha Gawari

    2014-04-01

    The paper presents the architecture and implementation of a system that helps visually impaired people to navigate. The system uses GPS and voice recognition along with obstacle avoidance to guide the visually impaired user. The user issues a command and receives the direction response as audio signals. Latitude and longitude values are received continuously from the GPS receiver. Directions are given to the user by means of audio signals. An obstacle detector helps the user to avoid obstacles by sending an audio message. GPS receivers use the NMEA standard. Advances in voice recognition make it easier for visually impaired users to issue commands regarding directions.
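
    To make the GPS step concrete, the sketch below extracts latitude and longitude from an NMEA GGA sentence and converts them to signed decimal degrees. It is an illustrative sketch only: the abstract does not say which NMEA sentences the system reads, the function names are hypothetical, and the sample sentence in the usage note is a commonly used textbook example rather than data from the paper. The checksum after `*` is not verified here.

```python
def nmea_to_decimal(value: str, hemisphere: str) -> float:
    """Convert an NMEA ddmm.mmmm (or dddmm.mmmm) field to signed decimal degrees."""
    raw = float(value)
    degrees = int(raw // 100)          # leading digits are whole degrees
    minutes = raw - degrees * 100      # remaining digits are minutes
    decimal = degrees + minutes / 60.0
    # Southern and western hemispheres are negative by convention.
    return -decimal if hemisphere in ("S", "W") else decimal


def parse_gga(sentence: str):
    """Return (latitude, longitude) in decimal degrees from a GGA sentence.

    Raises ValueError if the sentence is not GGA or the position fields
    are empty (for example, when the receiver has no fix).
    """
    fields = sentence.strip().split("*")[0].split(",")
    if not fields[0].endswith("GGA"):
        raise ValueError("not a GGA sentence")
    latitude = nmea_to_decimal(fields[2], fields[3])
    longitude = nmea_to_decimal(fields[4], fields[5])
    return latitude, longitude
```

    For example, `parse_gga("$GPGGA,123519,4807.038,N,01131.000,E,1,08,0.9,545.4,M,46.9,M,,*47")` yields a latitude near 48.1173 and a longitude near 11.5167 degrees.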

  1. Voice identity recognition: functional division of the right STS and its behavioral relevance.

    Science.gov (United States)

    Schall, Sonja; Kiebel, Stefan J; Maess, Burkhard; von Kriegstein, Katharina

    2015-02-01

    The human voice is the primary carrier of speech but also a fingerprint for person identity. Previous neuroimaging studies have revealed that speech and identity recognition is accomplished by partially different neural pathways, despite the perceptual unity of the vocal sound. Importantly, the right STS has been implicated in voice processing, with different contributions of its posterior and anterior parts. However, the time point at which vocal and speech processing diverge is currently unknown. Also, the exact role of the right STS during voice processing is so far unclear because its behavioral relevance has not yet been established. Here, we used the high temporal resolution of magnetoencephalography and a speech task control to pinpoint transient behavioral correlates: we found, at 200 msec after stimulus onset, that activity in right anterior STS predicted behavioral voice recognition performance. At the same time point, the posterior right STS showed increased activity during voice identity recognition in contrast to speech recognition whereas the left mid STS showed the reverse pattern. In contrast to the highly speech-sensitive left STS, the current results highlight the right STS as a key area for voice identity recognition and show that its anatomical-functional division emerges around 200 msec after stimulus onset. We suggest that this time point marks the speech-independent processing of vocal sounds in the posterior STS and their successful mapping to vocal identities in the anterior STS.

  2. Improving Quality of Voice Conversion Systems

    Science.gov (United States)

    Farhid, M.; Tinati, M. A.

    New improvement schemes for voice conversion are proposed in this paper. We take human factor cepstral coefficients (HFCC), a modification of MFCC that uses the known relationship between center frequency and critical bandwidth from human psychoacoustics to decouple filter bandwidth from filter spacing, as the basic feature. We propose a U/V (unvoiced/voiced) decision rule such that two sets of codebooks are used to capture the difference between unvoiced and voiced segments of the source speaker. Moreover, we apply three schemes to refine the synthesized voice: pitch refinement, energy equalization, and frame concatenation. The acceptable performance of the voice conversion system is verified through an ABX listening test and MOS grading.

  3. DLMS Voice Data Entry.

    Science.gov (United States)

    1980-06-01

    between operator and computer displayed on ADM-3A 20c A-I Possible Hardware Configuration for a Multistation Cartographic VDES ...this program a Voice Recognition System (VRS) which can be used to explore the use of voice data entry (VDE) in the DLMS or other cartographic data...Multi-Station Cartographic Voice Data Entry System An engineering development model voice data entry system (VDES) could be most efficiently

  4. Ability for voice recognition is a marker for dyslexia in children.

    Science.gov (United States)

    Perea, Manuel; Jiménez, María; Suárez-Coalla, Paz; Fernández, Nohemí; Viña, Cecilia; Cuetos, Fernando

    2014-01-01

    A recent voice recognition experiment conducted by Perrachione, Del Tufo, and Gabrieli (2011) revealed that, in normal adult readers, the accuracy at identifying human voices was better in the participants' mother tongue than in an unfamiliar language, while this difference was absent in a group of adults with dyslexia. This pattern favored a view of dyslexia as due to "fundamentally impoverished native-language phonological representations." To further examine this issue, we conducted two voice recognition experiments, one with children with/without dyslexia, and the other with adults with/without dyslexia. Results revealed that children/adults with dyslexia were less accurate at identifying voices than normal readers and, importantly, this effect was independent of language. These data are more consistent with the assumption of dyslexia as due to a deficit in multisensory integration rather than a deficit based on impoverished native-language phonologically based representations.

  5. Literature Review of Voice Recognition and Generation Technology for Army Helicopter Applications.

    Science.gov (United States)

    1984-08-01

    support up this conclusion (Jay, 1981; Coler, 1983). Based upon the research presented, the following statements can be made: a. When flight control...dB) must be overcome by the voice recognizer (Coler, 1983). The effects of noise on voice recognition were the topic of a study performed at...noise when the subject was also required to perform a tracking task and enter data (Coler, 1983). Performance was evaluated for three different

  6. Human Emotion Recognition System

    Directory of Open Access Journals (Sweden)

    Dilbag Singh

    2012-08-01

    This paper discusses the application of facial-expression feature extraction combined with a neural network for the recognition of different facial emotions (happy, sad, angry, fear, surprised, neutral, etc.). Humans are capable of producing thousands of facial actions during communication that vary in complexity, intensity, and meaning. The paper analyses the limitations of an existing system, emotion recognition using brain activity. Using an existing simulator, the author achieved 97 percent accuracy, and the approach is easier and simpler than emotion recognition using brain activity. The proposed system relies on the human face, since the face also reflects brain activity and emotions. A neural network is used for better results. The paper closes with a comparison of the existing human emotion recognition system with the new one.

  7. Analysis of the influence of sound signal processing parameters on the quality voice command recognition

    Directory of Open Access Journals (Sweden)

    L. P. Dyuzhayev

    2014-04-01

    Introduction. Voice control of devices requires recognition of single (isolated) voice commands. Typically, this control method requires highly reliable voice recognition (at least 95% accuracy). Voice commands are often spoken in very noisy conditions. Currently known speech recognition methods and algorithms do not make it possible to determine clearly which sound signal parameters give the best results. The main part. The first level of voice recognition consists of preprocessing and extraction of acoustic features that have a number of useful properties: they are easily calculated and provide a compact representation of voice commands that is resistant to noise interference. On the next level, the given command is looked up in the reference dictionary. To obtain MFCC coefficients, the input file is divided into frames. Each frame is weighted by a window function and processed by a discrete Fourier transform. The resulting frequency-domain representation of the signal is divided into ranges using a set of triangular filters. The last step is a discrete cosine transform. Dynamic time warping yields a value that is the inverse of the degree of similarity between the given command and a reference. Conclusions. The research shows that, for voice command recognition, optimum results in terms of quality/performance can be achieved using the following sound signal processing parameters: 8 kHz sample rate, frame duration 70–120 ms, Hamming window weighting function, and 512 Fourier samples.
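
    The MFCC and dynamic time warping steps listed in this abstract can be sketched as follows. This is a minimal NumPy sketch, not the authors' implementation. Assumptions not taken from the abstract: the mel-scale formula, 20 filters, 12 coefficients, a hop of 256 samples, a logarithm applied to the filter energies before the cosine transform (standard in MFCC pipelines but not mentioned above), and a frame length of 512 samples (64 ms at 8 kHz, slightly below the reported 70–120 ms) so that each frame fits a 512-point transform.

```python
import numpy as np


def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def mel_filterbank(n_filters, n_fft, fs):
    """Triangular filters evenly spaced on the mel scale, shape (n_filters, n_fft // 2 + 1)."""
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(fs / 2.0), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_points) / fs).astype(int)
    fbank = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        left, centre, right = bins[i - 1], bins[i], bins[i + 1]
        for k in range(left, centre):          # rising edge
            fbank[i - 1, k] = (k - left) / max(centre - left, 1)
        for k in range(centre, right):         # falling edge
            fbank[i - 1, k] = (right - k) / max(right - centre, 1)
    return fbank


def mfcc(signal, fs=8000, frame_len=512, hop=256, n_fft=512, n_filters=20, n_coeffs=12):
    """Frame -> Hamming window -> DFT power -> mel filters -> log -> DCT-II."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hamming(frame_len)
    fbank = mel_filterbank(n_filters, n_fft, fs)
    # DCT-II basis written out explicitly to avoid a SciPy dependency.
    n = np.arange(n_filters)
    dct = np.cos(np.pi * np.outer(np.arange(n_coeffs), 2 * n + 1) / (2.0 * n_filters))
    feats = np.empty((n_frames, n_coeffs))
    for t in range(n_frames):
        frame = signal[t * hop: t * hop + frame_len] * window
        power = np.abs(np.fft.rfft(frame, n=n_fft)) ** 2
        feats[t] = dct @ np.log(fbank @ power + 1e-10)
    return feats


def dtw_distance(a, b):
    """Accumulated Euclidean cost of the best monotonic alignment of two feature sequences."""
    n, m = len(a), len(b)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = np.linalg.norm(a[i - 1] - b[j - 1])
            cost[i, j] = d + min(cost[i - 1, j], cost[i, j - 1], cost[i - 1, j - 1])
    return cost[n, m]
```

    A command would then be matched by computing `dtw_distance` between its MFCC sequence and each reference in the dictionary and picking the smallest value, since a lower distance means a higher similarity.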

  8. Voice Quality in Mobile Telecommunication System

    Directory of Open Access Journals (Sweden)

    Evaldas Stankevičius

    2013-05-01

    The article deals with methods for measuring the quality of voice transmitted over the mobile network, as well as related problems, algorithms and options. It presents the voice quality measurement system that was created and discusses its adequacy and efficiency. The author also presents the results of applying the system under the optimal hardware configuration. Under almost ideal conditions, the system evaluates voice quality with an average MOS estimate of 3.85, while the standardized TEMS Investigation 9.0 gives an average MOS estimate of 4.05. Next, the article discusses the implementation of a voice quality predictor and investigates it using nonlinear and linear methods for predicting the dependence of voice quality on mobile network settings. Nonlinear prediction using an artificial neural network resulted in a correlation coefficient of 0.62, while linear prediction using least mean squares resulted in a correlation coefficient of 0.57. An analytical expression of voice quality in terms of three network parameters (BER, C/I, RSSI) is given as well. Article in Lithuanian.

  9. Body expressions influence recognition of emotions in the face and voice.

    Science.gov (United States)

    Van den Stock, Jan; Righart, Ruthger; de Gelder, Beatrice

    2007-08-01

    The most familiar emotional signals consist of faces, voices, and whole-body expressions, but so far research on emotions expressed by the whole body is sparse. The authors investigated recognition of whole-body expressions of emotion in three experiments. In the first experiment, participants performed a body expression-matching task. Results indicate good recognition of all emotions, with fear being the hardest to recognize. In the second experiment, two alternative forced choice categorizations of the facial expression of a compound face-body stimulus were strongly influenced by the bodily expression. This effect was a function of the ambiguity of the facial expression. In the third experiment, recognition of emotional tone of voice was similarly influenced by task irrelevant emotional body expressions. Taken together, the findings illustrate the importance of emotional whole-body expressions in communication either when viewed on their own or, as is often the case in realistic circumstances, in combination with facial expressions and emotional voices.

  10. Smart Homes with Voice Activated Systems for Disabled People

    Directory of Open Access Journals (Sweden)

    Bekir Busatlic

    2017-02-01

    Smart home refers to the application of various technologies to semi-unsupervised home control. It refers to systems that control temperature, lighting, door locks, windows and many other appliances. The aim of this study was to design a system that uses existing technology to showcase how it can benefit people with disabilities. This work uses only off-the-shelf products (smart home devices and controllers, speech recognition technology, open-source code libraries). The Voice Activated Smart Home application was developed to demonstrate online grocery shopping and home control using voice commands, and was tested by measuring its effectiveness in performing tasks as well as its efficiency in recognizing user speech input.

  11. Voice-Controlled Artificial Handspeak System

    Directory of Open Access Journals (Sweden)

    Jonathan Gatti

    2014-04-01

    A man-machine interaction project is described which aims to establish an automated voice to sign language translator for communication with the deaf using integrated open technologies. The first prototype consists of a robotic hand designed with OpenSCAD and manufactured with a low-cost 3D printer, which smoothly reproduces the alphabet of the sign language controlled by voice only. The core automation comprises an Arduino UNO controller used to activate a set of servo motors that follow instructions from a Raspberry Pi mini-computer running the open source speech recognition engine Julius. We discuss its features, limitations and possible future developments.

  12. Industrial Applications of Automatic Speech Recognition Systems

    Directory of Open Access Journals (Sweden)

    Dr. Jayashri Vajpai

    2016-03-01

    Current trends in developing technologies form important bridges to the future, fortified by the early and productive use of technology for enriching human life. Speech signal processing, which includes automatic speech recognition, synthetic speech, and natural language processing, is beginning to have a significant impact on business, industry and the ease of operation of personal computers. Apart from this, it facilitates a deeper understanding of the complex mechanisms by which the human brain functions. Advances in speech recognition technology over the past five decades have enabled a wide range of industrial applications. Yet today's applications provide only a small preview of a rich future for speech and voice interface technology that will eventually replace keyboards with microphones in human-machine interfaces, providing easy access to increasingly intelligent machines. The paper also shows how the capabilities of speech recognition systems in industrial applications are evolving over time to usher in the next generation of voice-enabled services. This paper aims to present an effective survey of the speech recognition technology described in the available literature and to integrate the insights gained while studying individual research and developments. The current applications of speech recognition in the real world and in industry are also outlined, with special reference to applications in the medical, industrial robotics, forensic, defence and aviation areas.

  13. Touchless palmprint recognition systems

    CERN Document Server

    Genovese, Angelo; Scotti, Fabio

    2014-01-01

    This book examines the context, motivation and current status of biometric systems based on the palmprint, with a specific focus on touchless and less-constrained systems. It covers new technologies in this rapidly evolving field and is one of the first comprehensive books on palmprint recognition systems. It discusses the research literature and the most relevant industrial applications of palmprint biometrics, including the low-cost solutions based on webcams. The steps of biometric recognition are described in detail, including acquisition setups, algorithms, and evaluation procedures. Const

  14. Introduction to Arabic Speech Recognition Using CMUSphinx System

    CERN Document Server

    Satori, H; Chenfour, N

    2007-01-01

    In this paper Arabic is investigated from the point of view of the speech recognition problem. We propose a novel approach to building an Arabic Automated Speech Recognition (ASR) system. This system is based on the open source CMU Sphinx-4 from Carnegie Mellon University. CMU Sphinx is a large-vocabulary, speaker-independent, continuous speech recognition system based on discrete hidden Markov models (HMMs). We build a model using utilities from the open source CMU Sphinx. We demonstrate the possible adaptability of this system to Arabic voice recognition.

  15. Evaluation of an Intelligent Assistive Technology for Voice Navigation of Spreadsheets

    CERN Document Server

    Flood, Derek; Caffery, Fergal Mc; Bishop, Brian

    2008-01-01

    An integral part of spreadsheet auditing is navigation. For sufferers of Repetitive Strain Injury who need to use voice recognition technology this navigation can be highly problematic. To counter this the authors have developed an intelligent voice navigation system, iVoice, which replicates common spreadsheet auditing behaviours through simple voice commands. This paper outlines the iVoice system and summarizes the results of a study to evaluate iVoice when compared to a leading voice recognition technology.

  16. (Almost) Word for Word: As Voice Recognition Programs Improve, Students Reap the Benefits

    Science.gov (United States)

    Smith, Mark

    2006-01-01

    Voice recognition software is hardly new--attempts at capturing spoken words and turning them into written text have been available to consumers for about two decades. But what was once an expensive and highly unreliable tool has made great strides in recent years, perhaps most recognized in programs such as Nuance's Dragon NaturallySpeaking…

  17. Automatic stereoscopic system for person recognition

    Science.gov (United States)

    Murynin, Alexander B.; Matveev, Ivan A.; Kuznetsov, Victor D.

    1999-06-01

    A biometric access control system based on identification of the human face is presented. The system performs remote measurements of the necessary face features. Two different scenarios of system behavior are implemented. The first assumes verification of personal data entered by the visitor from a console using a keyboard or card reader. The system functions as an automatic checkpoint that strictly controls access of different visitors. The other scenario makes it possible to identify visitors without any personal identifier or pass. Only the person's biometrics are used to identify the visitor. The recognition system automatically finds the necessary identification information previously stored in the database. Two laboratory models of the recognition system were developed. The models are designed to use different information types and sources. In addition to stereoscopic images input to the computer from cameras, the models can use voice data and some physical characteristics such as the person's height, measured by the imaging system.

  18. Pattern Recognition Methods and Features Selection for Speech Emotion Recognition System

    Science.gov (United States)

    Partila, Pavol; Voznak, Miroslav; Tovarek, Jaromir

    2015-01-01

    The impact of the classification method and feature selection on speech emotion recognition accuracy is discussed in this paper. Selecting the correct parameters in combination with the classifier is an important part of reducing the computational complexity of the system. This step is especially necessary for systems that will be deployed in real-time applications. The reason for developing and improving speech emotion recognition systems is their wide usability in today's automatic voice-controlled systems. The Berlin database of emotional recordings was used in this experiment. Classification accuracy of artificial neural networks, k-nearest neighbours, and Gaussian mixture models is measured considering the selection of prosodic, spectral, and voice quality features. The purpose was to find an optimal combination of methods and group of features for stress detection in human speech. The research contribution lies in the design of the speech emotion recognition system with regard to its accuracy and efficiency. PMID:26346654

  19. Pattern Recognition Methods and Features Selection for Speech Emotion Recognition System

    Directory of Open Access Journals (Sweden)

    Pavol Partila

    2015-01-01

    The impact of the classification method and feature selection on speech emotion recognition accuracy is discussed in this paper. Selecting the correct parameters in combination with the classifier is an important part of reducing the computational complexity of the system. This step is especially necessary for systems that will be deployed in real-time applications. The reason for developing and improving speech emotion recognition systems is their wide usability in today's automatic voice-controlled systems. The Berlin database of emotional recordings was used in this experiment. Classification accuracy of artificial neural networks, k-nearest neighbours, and Gaussian mixture models is measured considering the selection of prosodic, spectral, and voice quality features. The purpose was to find an optimal combination of methods and group of features for stress detection in human speech. The research contribution lies in the design of the speech emotion recognition system with regard to its accuracy and efficiency.

  20. Speech Recognition System For Robotic Control And Movement

    Directory of Open Access Journals (Sweden)

    Biraja Nalini Rout

    2015-08-01

    In the current scenario, voice and data recognition is one of the most sought-after fields in artificial intelligence and robotic engineering. The idea centres on deriving a voice-to-voice intelligent system that operates purely on audio voice instructions, using a specialized voice recognition module, a microcontroller, a set of wheels and a movable arm. Real-time voice inputs are fed to the VR module, which processes the audio signals and produces output in audio format. The system includes an IDE for both Windows and UNIX-based operating systems for manipulating and processing instructions at both software and hardware levels. The system can also perform a basic set of manual operations decided by the expert system. The VR module processes the data using a multilayer perceptron to generate the required result. The movable arm picks and places objects according to the given voice instructions. Its uses include substituting for manual work at both personal and professional levels.

  1. Voice recognition through phonetic features with Punjabi utterances

    Science.gov (United States)

    Kaur, Jasdeep; Juglan, K. C.; Sharma, Vishal; Upadhyay, R. K.

    2017-07-01

    This paper deals with perception and disorders of speech in the Punjabi language. In view of the importance of voice identification, various parameters of speaker identification have been studied. The speech material was recorded with a tape recorder in the speakers' normal and disguised modes of utterance. From the recorded speech material, utterances free from noise and similar defects were selected for auditory and acoustic spectrographic analysis. The comparison of normal and disguised speech of seven subjects is reported. The fundamental frequency (F0) at similar places, plosive duration at certain phonemes, amplitude ratio (A1:A2), etc. were compared in normal and disguised speech. It was found that the formant frequency of normal and disguised speech remains almost the same only if it is compared at the position of the same vowel quality and quantity. If the vowel is more closed or more open in the disguised utterance, the formant frequency changes in comparison to the normal utterance. The amplitude ratio (A1:A2) is found to be speaker dependent. It remains unchanged in the disguised utterance. However, this value may shift in the disguised utterance if cross-sectioning is not done at the same location.

  2. Voice emotion recognition by cochlear-implanted children and their normally-hearing peers.

    Science.gov (United States)

    Chatterjee, Monita; Zion, Danielle J; Deroche, Mickael L; Burianek, Brooke A; Limb, Charles J; Goren, Alison P; Kulkarni, Aditya M; Christensen, Julie A

    2015-04-01

    Despite their remarkable success in bringing spoken language to hearing impaired listeners, the signal transmitted through cochlear implants (CIs) remains impoverished in spectro-temporal fine structure. As a consequence, pitch-dominant information such as voice emotion, is diminished. For young children, the ability to correctly identify the mood/intent of the speaker (which may not always be visible in their facial expression) is an important aspect of social and linguistic development. Previous work in the field has shown that children with cochlear implants (cCI) have significant deficits in voice emotion recognition relative to their normally hearing peers (cNH). Here, we report on voice emotion recognition by a cohort of 36 school-aged cCI. Additionally, we provide for the first time, a comparison of their performance to that of cNH and NH adults (aNH) listening to CI simulations of the same stimuli. We also provide comparisons to the performance of adult listeners with CIs (aCI), most of whom learned language primarily through normal acoustic hearing. Results indicate that, despite strong variability, on average, cCI perform similarly to their adult counterparts; that both groups' mean performance is similar to aNHs' performance with 8-channel noise-vocoded speech; that cNH achieve excellent scores in voice emotion recognition with full-spectrum speech, but on average, show significantly poorer scores than aNH with 8-channel noise-vocoded speech. A strong developmental effect was observed in the cNH with noise-vocoded speech in this task. These results point to the considerable benefit obtained by cochlear-implanted children from their devices, but also underscore the need for further research and development in this important and neglected area. This article is part of a Special Issue entitled .

  3. Recognition disorders for famous faces and voices: a review of the literature and normative data of a new test battery.

    Science.gov (United States)

    Quaranta, Davide; Piccininni, Chiara; Carlesimo, Giovanni Augusto; Luzzi, Simona; Marra, Camillo; Papagno, Costanza; Trojano, Luigi; Gainotti, Guido

    2016-03-01

    Several anatomo-clinical investigations have shown that familiar face recognition disorders not due to high level perceptual defects are often observed in patients with lesions of the right anterior temporal lobe (ATL). The meaning of these findings is, however, controversial, because some authors claim that these patients show pure instances of modality-specific 'associative prosopagnosia', whereas other authors maintain that in these patients voice recognition is also impaired and that these patients have a 'multimodal person recognition disorder'. To solve the problem of the nature of famous faces recognition disorders in patients affected by right ATL lesions, it is therefore very important to verify with formal tests if these patients are or are not able to recognize others by voice, but a direct comparison between the two modalities is hindered by the fact that voice recognition is more difficult than face recognition. To circumvent this difficulty, we constructed a test battery in which subjects were requested to recognize the same persons (well-known at the national level) through their faces and voices, evaluating familiarity and identification processes. The present paper describes the 'Famous People Recognition Battery' and reports the normative data necessary to clarify the nature of person recognition disorders observed in patients affected by right ATL lesions.

  4. Voice Activated Cockpit Management Systems: Voice-Flight NexGen Project

    Data.gov (United States)

    National Aeronautics and Space Administration — Speaking to the cockpit as a method of system management in flight can become an effective interaction method, since voice communication is very efficient. Automated...

  5. Practical applications of interactive voice technologies: Some accomplishments and prospects

    Science.gov (United States)

    Grady, Michael W.; Hicklin, M. B.; Porter, J. E.

    1977-01-01

    A technology assessment of the application of computers and electronics to complex systems is presented. Three existing systems which utilize voice technology (speech recognition and speech generation) are described. Future directions in voice technology are also described.

  6. DSP Based System for Real time Voice Synthesis Applications Development

    CERN Document Server

    Arsinte, Radu; Miron, Costin

    2008-01-01

    This paper describes an experimental system designed for the development of real-time voice synthesis applications. The system is composed of a DSP coprocessor card equipped with a TMS320C25 or TMS320C50 chip, a voice acquisition module (ADDA2), a host computer (IBM-PC compatible), and specific software tools.

  7. Increase in Organization Effectiveness Using Voice Analysis: The System Approach

    Directory of Open Access Journals (Sweden)

    Lina Bartkienė

    2011-04-01

    Full Text Available The main purpose of this article is to analyze literature related to system theory and to present a system for increasing organization effectiveness using voice analysis. The concepts of the system approach were analyzed, and the definition of the system, its components and classification were discussed. Following the principles of system theory, a system for increasing organization effectiveness using voice analysis was designed. Each element was briefly discussed, i.e. processes influencing the employee, the environment, voice analysis system, expert system, prime and final organizational effectiveness. In addition, the relations between these elements were identified. Article in Lithuanian

  8. Examining the effects of variation in emotional tone of voice on spoken word recognition.

    Science.gov (United States)

    Krestar, Maura L; McLennan, Conor T

    2013-09-01

    Emotional tone of voice (ETV) is essential for optimal verbal communication. Research has found that the impact of variation in nonlinguistic features of speech on spoken word recognition differs according to a time course. In the current study, we investigated whether intratalker variation in ETV follows the same time course in two long-term repetition priming experiments. We found that intratalker variability in ETVs affected reaction times to spoken words only when processing was relatively slow and difficult, not when processing was relatively fast and easy. These results provide evidence for the use of both abstract and episodic lexical representations for processing within-talker variability in ETV, depending on the time course of spoken word recognition.

  9. Voice Matching Using Genetic Algorithm

    Directory of Open Access Journals (Sweden)

    Abhishek Bal

    2014-03-01

    Full Text Available In this paper, the use of a Genetic Algorithm (GA) for voice recognition is described. The practical application of GAs to the solution of engineering problems is a rapidly emerging approach in the field of control engineering and signal processing. Genetic algorithms are useful for searching large and poorly defined spaces in a multi-directional way. Voice is a signal carrying a very large amount of information. Digital processing of the voice signal is very important for automatic voice recognition technology. Nowadays, voice processing is very important in security mechanisms because voices can be mimicked. Studying voice feature extraction is therefore necessary in the military, hospitals, telephone systems, investigation bureaus, etc. In order to extract valuable information from the voice signal, make decisions on the process, and obtain results, the data needs to be manipulated and analyzed. In this paper, if the instant voice is not matched with the same person's reference voices in the database, then the GA is applied between two randomly chosen reference voices. The instant voice is then compared again with the result of the GA, which includes its three main steps: selection, crossover and mutation. We illustrate our approach with different samples of human voices from our institution.
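
    The three GA steps named in the abstract (selection, crossover, mutation) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and not taken from the paper: voices are represented as short numeric feature vectors, fitness is the Euclidean distance to the instant voice, and the names `ga_match`, `crossover` and `mutate` are hypothetical.

```python
import random

def distance(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5

def crossover(p1, p2, rng):
    """Single-point crossover: head of p1 joined to tail of p2."""
    point = rng.randrange(1, len(p1))
    return p1[:point] + p2[point:]

def mutate(vec, rng, rate=0.1, scale=0.05):
    """Add small Gaussian noise to each component with probability `rate`."""
    return [x + rng.gauss(0, scale) if rng.random() < rate else x for x in vec]

def ga_match(instant, ref_a, ref_b, generations=50, pop_size=20, seed=0):
    """Evolve candidates from two reference vectors toward the instant voice.

    Assumption: fitness is the distance to `instant`; the paper's actual
    fitness function is not given in the abstract.
    """
    rng = random.Random(seed)
    population = [list(ref_a), list(ref_b)]
    while len(population) < pop_size:
        population.append(mutate(crossover(ref_a, ref_b, rng), rng))
    for _ in range(generations):
        # Selection: keep the half of the population closest to the target.
        population.sort(key=lambda v: distance(v, instant))
        survivors = population[: pop_size // 2]
        # Crossover and mutation refill the population from the survivors.
        children = []
        while len(survivors) + len(children) < pop_size:
            p1, p2 = rng.sample(survivors, 2)
            children.append(mutate(crossover(p1, p2, rng), rng))
        population = survivors + children
    best = min(population, key=lambda v: distance(v, instant))
    return best, distance(best, instant)
```

    Because the best candidates always survive selection, the returned distance is never worse than that of the closer of the two original reference vectors.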

  10. V2S: Voice to Sign Language Translation System for Malaysian Deaf People

    Science.gov (United States)

    Mean Foong, Oi; Low, Tang Jung; La, Wai Wan

    The process of learning and understanding sign language may be cumbersome to some. This paper therefore proposes a solution by providing a voice (English language) to sign language translation system using speech and image processing techniques. Speech processing, which includes speech recognition, is the study of recognizing the words being spoken, regardless of who the speaker is. This project uses template-based recognition as the main approach, in which the V2S system first needs to be trained with speech patterns based on some generic spectral parameter set. These spectral parameter sets are then stored as templates in a database. The system performs the recognition process by matching the parameter set of the input speech with the stored templates, and finally displays the sign language in video format. Empirical results show that the system has an 80.3% recognition rate.
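
    The template-matching step described above can be sketched as a nearest-template search. This is an illustrative sketch only: the feature values, the word labels and the function name `nearest_template` are hypothetical, and Euclidean distance is an assumed similarity measure that the abstract does not specify.

```python
def nearest_template(features, templates):
    """Return (label, distance) of the stored template closest to `features`.

    `templates` maps a word label to its stored spectral parameter vector.
    Euclidean distance is an assumption made for this sketch.
    """
    best_label, best_dist = None, float("inf")
    for label, template in templates.items():
        dist = sum((a - b) ** 2 for a, b in zip(features, template)) ** 0.5
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label, best_dist

# Hypothetical templates for two trained words.
TEMPLATES = {
    "hello": [0.9, 0.1, 0.3],
    "thanks": [0.2, 0.8, 0.5],
}
```

    In a system like the one described, the returned label would then select the sign language video to display.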

  11. Speech Recognition Technology Applied to Intelligent Mobile Navigation System

    Institute of Scientific and Technical Information of China (English)

    2002-01-01

    The capability of human-computer interaction reflects the degree of intelligence of a mobile navigation system. In this paper, the navigation data and functions of a mobile navigation system are divided into system commands and non-system commands, and a group of speech commands is then abstracted. The paper applies speech recognition technology to an intelligent mobile navigation system to process speech commands, and examines in depth the integration of speech recognition technology with mobile navigation systems. Navigation operations can be performed by speech commands, which makes human-computer interaction easy during navigation. The speech command interface of the navigation system is implemented with Dutty ++ Software, which is based on the IBM ViaVoice speech recognition system. In navigation experiments, navigation could be done almost without a keyboard, which showed that human-computer interaction by speech commands is very convenient and that reliability is also higher.

  12. It doesn't matter what you say: FMRI correlates of voice learning and recognition independent of speech content.

    Science.gov (United States)

    Zäske, Romi; Awwad Shiekh Hasan, Bashar; Belin, Pascal

    2017-09-01

    Listeners can recognize newly learned voices from previously unheard utterances, suggesting the acquisition of high-level speech-invariant voice representations during learning. Using functional magnetic resonance imaging (fMRI) we investigated the anatomical basis underlying the acquisition of voice representations for unfamiliar speakers independent of speech, and their subsequent recognition among novel voices. Specifically, listeners studied voices of unfamiliar speakers uttering short sentences and subsequently classified studied and novel voices as "old" or "new" in a recognition test. To investigate "pure" voice learning, i.e., independent of sentence meaning, we presented German sentence stimuli to non-German speaking listeners. To disentangle stimulus-invariant and stimulus-dependent learning, during the test phase we contrasted a "same sentence" condition in which listeners heard speakers repeating the sentences from the preceding study phase, with a "different sentence" condition. Voice recognition performance was above chance in both conditions although, as expected, performance was higher for same than for different sentences. During study phases activity in the left inferior frontal gyrus (IFG) was related to subsequent voice recognition performance and same versus different sentence condition, suggesting an involvement of the left IFG in the interactive processing of speaker and speech information during learning. Importantly, at test reduced activation for voices correctly classified as "old" compared to "new" emerged in a network of brain areas including temporal voice areas (TVAs) of the right posterior superior temporal gyrus (pSTG), as well as the right inferior/middle frontal gyrus (IFG/MFG), the right medial frontal gyrus, and the left caudate. 
    This effect of voice novelty did not interact with sentence condition, suggesting a role of temporal voice-selective areas and extra-temporal areas in the explicit recognition of learned voice identity.

  13. Revisiting vocal perception in non-human animals: a review of vowel discrimination, speaker voice recognition, and speaker normalization

    Directory of Open Access Journals (Sweden)

    Buddhamas Kriengwatana

    2015-01-01

    Full Text Available The extent to which human speech perception evolved by taking advantage of predispositions and pre-existing features of vertebrate auditory and cognitive systems remains a central question in the evolution of speech. This paper reviews asymmetries in vowel perception, speaker voice recognition, and speaker normalization in non-human animals – topics that have not been thoroughly discussed in relation to the abilities of non-human animals, but are nonetheless important aspects of vocal perception. Throughout this paper we demonstrate that addressing these issues in non-human animals is relevant and worthwhile because many non-human animals must deal with similar issues in their natural environment. That is, they must also discriminate between similar-sounding vocalizations, determine signaler identity from vocalizations, and resolve signaler-dependent variation in vocalizations from conspecifics. Overall, we find that, although plausible, the current evidence is insufficiently strong to conclude that directional asymmetries in vowel perception are specific to humans, or that non-human animals can use voice characteristics to recognize human individuals. However, we do find some indication that non-human animals can normalize speaker differences. Accordingly, we identify avenues for future research that would greatly improve and advance our understanding of these topics.

  14. Penguins use the two-voice system to recognize each other.

    Science.gov (United States)

    Aubin, T; Jouventin, P; Hildebrand, C

    2000-06-07

    The sound-producing structure in birds is the syrinx, which is usually a two-part organ located at the junction of the bronchi. As each branch of the syrinx produces sound independently, many birds have two acoustic sources. Thirty years ago, we had anatomical, physiological and acoustical evidence of this two-voice phenomenon but no function was known. In songbirds, often these two voices with their respective harmonics are not activated simultaneously but they are obvious in large penguins and generate a beat pattern which varies between individuals. The emperor penguin breeds during the Antarctic winter, incubating and carrying its egg on its feet. Without the topographical cue of a nest, birds identify each other only by vocal means when switching duties during incubation or chick rearing. To test whether the two-voice system contains the identity code, we played back the modified call of their mate to both adults and also the modified call of their parents to chicks. Both the adults and the chicks replied to controls (two voices) but not to modified signals (one voice being experimentally suppressed). Our experiments demonstrate that the beat generated by the interaction of these two fundamental frequencies conveys information about individual identity and also propagates well through obstacles, being robust to sound degradation through the medium of bodies in a penguin colony. The two-voice structure is also clear in the call of other birds such as the king penguin, another non-nesting species, but not in the 14 other nesting penguins. We concluded that the two-voice phenomenon functions as an individual recognition system in species using few if any landmarks to meet. In penguins, this coding process, increasing the call complexity and resisting sound degradation, has evolved in parallel with the loss of territoriality.

  15. Educational Pedagogy Explored: Attachment, Voice, and Students’ Limited Recognition of the Purpose of Writing

    Directory of Open Access Journals (Sweden)

    Rebecca A. Fairchild

    2013-07-01

    Full Text Available The following teacher research case-study involved an exploration of educational pedagogy by working with a freshman composition student at a college university. All data collected for the study was gathered during the 2013 spring semester. The study was driven by an inquiry based approach where the researcher determined the center of focus that arose from an exploration of the student as a writer through a survey, a classroom observation, multiple one-on-one meetings, and email conversations. The focus area that arose was the student’s limited recognition that writing was done solely for school purposes. Related puzzlements stemming from this focus area included the student’s lack of attachment and lack of voice in her writing. The conclusive data provided insights for how to educate students in future classrooms regarding how vital it is for students to be able to attach themselves to their work.

  16. VOICE ACTIVATED MULTIPROCESSOR EMBEDDED SYSTEM TO IMPROVE THE CONTROL OF A MOTORIZED WHEELCHAIR

    Directory of Open Access Journals (Sweden)

    SANGMESHWAR S. KENDRE

    2010-11-01

    Full Text Available The main idea of this work is to process an analog voice signal. The work controls a wheelchair by voice through speech processing on a Hawkboard (OMAP processor). The adopted model groups an ARM and a DSP processor for speech enhancement with a voice recognition module for isolated-word, speaker-dependent recognition. The Texas Instruments OMAP-L138 is integrated to enhance the quality of the speech signal by reducing noise, and is connected to the wheelchair for processing of the voice signal. The Hawkboard denoises the speech signal and the HMC2007 recognizes the commands. It also generates different desired signals according to the spoken words, which are further used to control the movement of the wheelchair, together with a vector of context information given by a set of sensors for security actions. Six words are recognized: start, forward, reverse, left, right, and stop. To save design time, experiments have shown that the best way is to choose a speech recognition kit and adapt it to the application. The results show the efficiency of the system.

  17. A Real-Time Face Motion Based Approach towards Modeling Socially Assistive Wireless Robot Control with Voice Recognition

    Directory of Open Access Journals (Sweden)

    Abhinaba Bhattacharjee

    2015-10-01

    Full Text Available The robotics domain has a few general design requirements that call for the close integration of planning, sensing, control and modeling, and the robot must take into account the interactions between itself, its task and its surrounding environment. Considering these fundamental configurations, the main motive is to design a system with user-friendly interfaces that can control embedded robotic systems by natural means. While earlier works have focused primarily on issues such as manipulation and navigation, this proposal presents a conceptual and intuitive approach towards man-machine interaction in order to provide a secured live biometric logical authorization for user access, while interacting intelligently with the control station to navigate advanced gesture-controlled wireless robotic prototypes or mobile surveillance systems along desired directions through required displacements. The approach is based on tracking real-time 3-dimensional face motions using skin tone segmentation and maximum area considerations of segmented face-like blobs, or on directing the system with voice commands using real-time speech recognition. The system implementation requires designing a user interface to communicate wirelessly between the control station and the prototypes, either by accessing the internet over an encrypted Wi-Fi Protected Access (WPA) connection via an HTML web page for communicating with face motions, or with the help of natural voice commands like “Trace 5 squares”, “Trace 10 triangles”, “Move 10 meters”, etc., evaluated on an iRobot Create over Bluetooth connectivity using a Bluetooth Access Module (BAM). Such an implementation can prove highly effective for designing systems for elderly aid and for maneuvering by the physically challenged.
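
    Once a speech recognizer has produced text, commands such as “Trace 5 squares” or “Move 10 meters” need to be turned into structured instructions. The following is a minimal sketch of that step only; the grammar (verb, count, target), the name `parse_command` and the returned dictionary keys are assumptions for illustration and do not describe the paper's implementation.

```python
import re

# Assumed grammar: "<trace|move> <integer> <word>", case-insensitive.
COMMAND_PATTERN = re.compile(r"^\s*(trace|move)\s+(\d+)\s+([a-z]+)\s*$", re.IGNORECASE)

def parse_command(text):
    """Parse recognized speech text into a command dict, or None if no match."""
    match = COMMAND_PATTERN.match(text)
    if match is None:
        return None
    return {
        "action": match.group(1).lower(),
        "count": int(match.group(2)),
        "target": match.group(3).lower(),
    }
```

    Returning `None` for unrecognized text lets the caller ignore the utterance or ask the user to repeat it, rather than sending an ambiguous instruction to the robot.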

  18. Internet-Based System for Voice Communication With the ISS

    Science.gov (United States)

    Chamberlain, James; Myers, Gerry; Clem, David; Speir, Terri

    2005-01-01

    The Internet Voice Distribution System (IVoDS) is a voice-communication system that comprises mainly computer hardware and software. The IVoDS was developed to supplement and eventually replace the Enhanced Voice Distribution System (EVoDS), which, heretofore, has constituted the terrestrial subsystem of a system for voice communications among crewmembers of the International Space Station (ISS), workers at the Payloads Operations Center at Marshall Space Flight Center, principal investigators at diverse locations who are responsible for specific payloads, and others. The IVoDS utilizes a communication infrastructure of NASA and NASA-related intranets in addition to, as its name suggests, the Internet. Whereas the EVoDS utilizes traditional circuit-switched telephony, the IVoDS is a packet-data system that utilizes a voice over Internet protocol (VOIP). Relative to the EVoDS, the IVoDS offers advantages of greater flexibility and lower cost for expansion and reconfiguration. The IVoDS is an extended version of a commercial Internet-based voice conferencing system that enables each user to participate in only one conference at a time. In the IVoDS, a user can receive audio from as many as eight conferences simultaneously while sending audio to one of them. The IVoDS also incorporates administrative controls, beyond those of the commercial system, that provide greater security and control of the capabilities and authorizations for talking and listening afforded to each user.

  19. An automatic speech recognition system with speaker-independent identification support

    Science.gov (United States)

    Caranica, Alexandru; Burileanu, Corneliu

    2015-02-01

    The novelty of this work relies on the application of an open source research software toolkit (CMU Sphinx) to train, build and evaluate a speech recognition system, with speaker-independent support, for voice-controlled hardware applications. Moreover, we propose to use the trained acoustic model to successfully decode offline voice commands on embedded hardware, such as an ARMv6 low-cost SoC, Raspberry PI. This type of single-board computer, mainly used for educational and research activities, can serve as a proof-of-concept software and hardware stack for low cost voice automation systems.

  20. Transmission by an Embedded System with Enhancements in Voice Processing Technologies

    Directory of Open Access Journals (Sweden)

    G.Sitha Annapurna

    2014-03-01

    Full Text Available The paper reports a robot that can transmit data such as video, audio, and images and that can be controlled by the human voice. There are two embedded systems: the first is the robot controlling system (MASTER), which is used to control the robot; the second is the voice-controlled robot (SLAVE), which responds to the instructions coming from the controlling system. These two embedded systems communicate wirelessly. Any one of several wireless protocols, such as IR, NFC, Bluetooth, Zigbee, or Wi-Fi, can be used to establish a bridge between the MASTER and the SLAVE. The voice-controlled robot understands the instructions with the help of the voice recognition system. Sphinx-4 is a speech recognition system written entirely in the Java programming language. Sphinx-4 started out as a port of Sphinx-3 to the Java programming language, but evolved into a recognizer designed to be much more flexible than Sphinx-3, thus becoming an excellent platform for speech research. Sphinx-4 is an HMM-based speech recognizer; HMM stands for Hidden Markov Model, a type of statistical model used in HMM-based speech recognizers.

  1. An audiovisual emotion recognition system

    Science.gov (United States)

    Han, Yi; Wang, Guoyin; Yang, Yong; He, Kun

    2007-12-01

    Human emotions can be expressed by many bio-symbols; speech and facial expression are two of them. Both are regarded as emotional information, which plays an important role in human-computer interaction. Based on our previous studies on emotion recognition, an audiovisual emotion recognition system is developed and presented in this paper. The system is designed for real-time practice, which is ensured by several integrated modules. These modules include speech enhancement for eliminating noise, rapid face detection for locating the face in the background image, example-based shape learning for facial feature alignment, and an optical-flow-based tracking algorithm for facial feature tracking. It is known that irrelevant features and high dimensionality of the data can hurt the performance of a classifier. Rough set-based feature selection is a good method for dimension reduction. Thus 13 speech features out of 37 and 10 facial features out of 33 are selected to represent emotional information, and 52 audiovisual features are selected due to the synchronization when speech and video are fused together. The experimental results demonstrate that this system performs well in real-time practice and has a high recognition rate. Our results also show that multi-module fused recognition will become the trend of emotion recognition in the future.

  2. FINGER-VEIN RECOGNITION SYSTEMS

    Directory of Open Access Journals (Sweden)

    A.Haritha Deepthi

    2015-10-01

    Full Text Available As personal and organizational private information becomes ever easier to access, the demand for a simple, convenient, efficient, and highly secure authentication system has increased. To meet these data protection requirements, biometrics, which uses human physiological or behavioral characteristics for personal identification, has emerged as a solution. However, most biometric systems have high complexity in both time and space. We therefore use a real-time finger-vein recognition system for authentication. In this paper we implement the finger-vein recognition concept using MATLAB R2013a. The features used are lacunarity distance and blanket dimension distance. This approach is more accurate than conventional methods.

  3. A new VOX technique for reducing noise in voice communication systems. [voice operated keying]

    Science.gov (United States)

    Morris, C. F.; Morgan, W. C.; Shack, P. E.

    1974-01-01

    A VOX technique for reducing noise in voice communication systems is described which is based on the separation of voice signals into contiguous frequency-band components with the aid of an adaptive VOX in each band. It is shown that this processing scheme can effectively reduce both wideband and narrowband quasi-periodic noise since the threshold levels readjust themselves to suppress noise that exceeds speech components in each band. Results are reported for tests of the adaptive VOX, and it is noted that improvements can still be made in such areas as the elimination of noise pulses, phoneme reproduction at high-noise levels, and the elimination of distortion introduced by phase delay.
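
    The idea of an adaptive gate in each frequency band can be sketched as follows. This is a simplified illustration under stated assumptions, not the authors' circuit: each frame is a list of per-band signal levels, the first frame is assumed to contain noise only, and the names `adaptive_vox`, `margin` and `adapt` are hypothetical.

```python
def adaptive_vox(frames, margin=2.0, adapt=0.1):
    """Gate each band independently against a self-adjusting noise floor.

    frames: list of frames, each a list of non-negative per-band levels.
    A band passes only while its level exceeds margin * noise_floor;
    while the gate is closed, the floor tracks the observed level.
    """
    if not frames:
        return []
    floors = list(frames[0])  # assumption: first frame is noise only
    output = []
    for frame in frames:
        gated = []
        for band, level in enumerate(frame):
            if level > margin * floors[band]:
                gated.append(level)  # gate open: treat as speech
            else:
                gated.append(0.0)    # gate closed: suppress the band
                # Readjust the floor toward the current noise level.
                floors[band] += adapt * (level - floors[band])
        output.append(gated)
    return output
```

    Because each band keeps its own floor, a narrowband noise source raises the threshold only in the band it occupies and leaves the other bands open for speech.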

  4. Offline arabic character recognition system

    Institute of Scientific and Technical Information of China (English)

    2003-01-01

    Several languages use the Arabic alphabet, and Arabic scripts present challenges because the letter shape is context sensitive. For the past three decades, there has been mounting interest among researchers in this problem. In this paper we present an Arabic character recognition system and the sequence of steps for recognizing Arabic text. These steps are discussed separately, and previous research work on each step is reviewed. We also give some samples of Arabic fonts.

  5. IBM Voice Conversion Systems for 2007 TC-STAR Evaluation

    Institute of Scientific and Technical Information of China (English)

    SHUANG Zhiwei; Raimo Bakis; QIN Yong

    2008-01-01

    This paper proposes a novel voice conversion method by frequency warping. The frequency warping function is generated based on mapping formants of the source speaker and the target speaker. In addition to frequency warping, fundamental frequency adjustment, spectral envelope equalization, breathiness addition, and duration modification are also used to improve the similarity to the target speaker. The proposed voice conversion method needs only a very small amount of training data for generating the warping function, thereby greatly facilitating its application. Systems based on the proposed method were used for the 2007 TC-STAR intra-lingual voice conversion evaluation for English and Spanish and a cross-lingual voice conversion evaluation for Spanish. The evaluation results show that the proposed method can achieve a much better quality of converted speech than other methods as well as a good balance between quality and similarity. The IBM1 system was ranked No. 1 for English evaluation and No. 2 for Spanish evaluation. Evaluation results also show that the proposed method is a convenient and competitive method for cross-lingual voice conversion tasks.
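
    A frequency warping function built from formant correspondences can be sketched as a piecewise-linear map. This is an assumed form chosen for illustration; the abstract does not state the shape of the warping function. The formant values, the Nyquist frequency and the name `make_warp` are hypothetical.

```python
def make_warp(source_formants, target_formants, nyquist=8000.0):
    """Build a piecewise-linear warp mapping source formants to target formants.

    Assumptions: formants are distinct, given in Hz, lie strictly between
    0 and `nyquist`, and the endpoints 0 and `nyquist` map to themselves.
    """
    xs = [0.0] + sorted(source_formants) + [nyquist]
    ys = [0.0] + sorted(target_formants) + [nyquist]

    def warp(freq):
        freq = min(max(freq, 0.0), nyquist)  # clamp into the valid range
        for i in range(len(xs) - 1):
            if xs[i] <= freq <= xs[i + 1]:
                t = (freq - xs[i]) / (xs[i + 1] - xs[i])
                return ys[i] + t * (ys[i + 1] - ys[i])
        return freq

    return warp
```

    With source formants at 500, 1500 and 2500 Hz mapped to 600, 1700 and 2700 Hz, a source frequency of 1000 Hz falls halfway between the first two anchors and is warped to 1150 Hz.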

  6. Study and recognition of pathological voice

    Institute of Scientific and Technical Information of China (English)

    陈承义; 高俊芬

    2013-01-01

    By analyzing the mechanism of voice production, traditional acoustic parameters of normal and pathological voices (fundamental frequency, formants, and Mel-frequency cepstral coefficients (MFCC)) and nonlinear feature parameters (box-counting dimension and intercept) are extracted as the feature vector set for pathological voice recognition. A Gaussian mixture model (GMM) is used to model and recognize 156 normal voice samples and 146 pathological voice samples. The results show that the nonlinear feature parameters, box-counting dimension and intercept, distinguish well between normal and pathological voices, and that their combination with the traditional acoustic parameters fundamental frequency and formants achieves a recognition rate of 92.60%.

  7. A singing voices synthesis system to characterize vocal registers using ARX-LF model

    OpenAIRE

    Motoda, Hiroki; Akagi, Masato

    2013-01-01

    This paper proposes a singing voice synthesis system to synthesize singing voices having the characteristics of vocal registers, such as vocal fry, modal and falsetto. Humans can sing songs naturally over a wide range of frequencies by training how to use vocal fold vibrations to represent vocal registers. However, even state-of-the-art singing voice synthesis systems cannot produce vocal registers appropriately. Naturalness of the synthesized singing voices using these systems is reduced in low and h...

  8. Texture based iris recognition system

    Science.gov (United States)

    Mehrotra, Hunny; Gupta, Phalguni; Kaushik, Anil K.

    2008-04-01

    The paper proposes an efficient iris recognition algorithm, obtained through the fusion of the Haar wavelet and the circular Mellin operator. The recognition system preprocesses the captured iris image to remove the effect of holes or spots of light lying on the pupillary region, which create problems in pupil localization. The processed image is localized by detecting inner and outer boundaries from the pupil center using the maximum value of the spectrum image. The eyelids are then detected by fitting a 3rd-degree polynomial to the suitable edge segments, and the region occluded by eyelids is removed from the normalized iris image. The features of the iris pattern are extracted using the Haar wavelet and the circular Mellin operator. The Haar wavelet decomposition reduces the size of the feature vector, while the circular Mellin operator is used for rotation- and scale-invariant feature extraction. The features are compared using the Hamming distance method, and the fusion is done at decision level using the conjunction rule. The recognizer is found to be more robust, with an accuracy above 95%.
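
    The comparison and fusion steps can be illustrated with a short sketch: a normalized Hamming distance between binary feature codes, and a conjunction (AND) rule that accepts a match only if both feature types agree. The thresholds and the names `hamming_distance` and `conjunction_match` are assumptions for illustration; the abstract gives no threshold values.

```python
def hamming_distance(code_a, code_b):
    """Fraction of positions at which two equal-length binary codes differ."""
    if len(code_a) != len(code_b):
        raise ValueError("codes must have the same length")
    return sum(a != b for a, b in zip(code_a, code_b)) / len(code_a)

def conjunction_match(haar_probe, haar_ref, mellin_probe, mellin_ref,
                      haar_threshold=0.3, mellin_threshold=0.3):
    """Decision-level fusion: accept only if BOTH matchers accept.

    Thresholds are hypothetical placeholders, not values from the paper.
    """
    haar_ok = hamming_distance(haar_probe, haar_ref) <= haar_threshold
    mellin_ok = hamming_distance(mellin_probe, mellin_ref) <= mellin_threshold
    return haar_ok and mellin_ok
```

    A conjunction rule trades a somewhat higher false rejection rate for a lower false acceptance rate, since an impostor must pass both matchers.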

  9. Gender Based Emotion Recognition System for Telugu Rural Dialects Using Hidden Markov Models

    CERN Document Server

    D, Prasad Reddy P V G; Srinivas, Y; Brahmaiah, P

    2010-01-01

    Automatic emotion recognition in speech is a research area with a wide range of applications in human interactions. The basic mathematical tool used for emotion recognition is pattern recognition, which involves three operations, namely, pre-processing, feature extraction and classification. This paper introduces a procedure for emotion recognition using Hidden Markov Models (HMM), which are used to distinguish five emotional states: anger, surprise, happiness, sadness and neutral state. The approach is based on standard speech recognition technology using continuous hidden Markov models, with a selection of low-level features and the design of the recognition system. An emotional speech database from Telugu Rural Dialects of Andhra Pradesh (TRDAP) was designed using several speakers' voices comprising the emotional states. The accuracy of recognizing five different emotions for both genders of classification is 80% for anger-emotion which is achieved by using the best combination of 39-dimensional feature vector for every f...
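
    HMM-based classification typically trains one model per class and assigns an utterance to the class whose model gives the highest likelihood. The sketch below shows that decision rule with the forward algorithm on a discrete-observation HMM. This is a deliberate simplification: the paper uses continuous HMMs over 39-dimensional features, and the model parameters, emotion labels and function names here are hypothetical.

```python
def forward_likelihood(observations, start, trans, emit):
    """Likelihood of a discrete observation sequence under an HMM.

    start[s]: initial probability of state s
    trans[p][s]: probability of moving from state p to state s
    emit[s][o]: probability of emitting symbol o in state s
    """
    n_states = len(start)
    alpha = [start[s] * emit[s][observations[0]] for s in range(n_states)]
    for symbol in observations[1:]:
        alpha = [
            sum(alpha[p] * trans[p][s] for p in range(n_states)) * emit[s][symbol]
            for s in range(n_states)
        ]
    return sum(alpha)

def classify(observations, models):
    """Return the label whose HMM assigns the highest likelihood."""
    return max(models, key=lambda label: forward_likelihood(observations, *models[label]))
```

    For long sequences a practical implementation would work with log probabilities or scaled forward variables to avoid numerical underflow.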

  10. A speech recognition system for data collection in precision agriculture

    Science.gov (United States)

    Dux, David Lee

    Agricultural producers have shown interest in collecting detailed, accurate, and meaningful field data through field scouting, but scouting is labor intensive. They use yield monitor attachments to collect weed and other field data while driving equipment. However, distractions from using a keyboard or buttons while driving can lead to driving errors or missed data points. At Purdue University, researchers have developed an ASR system to allow equipment operators to collect georeferenced data while keeping hands and eyes on the machine during harvesting and to ease georeferencing of data collected during scouting. A notebook computer retrieved locations from a GPS unit and displayed and stored data in Excel. A headset microphone with a single earphone collected spoken input while allowing the operator to hear outside sounds. One-, two-, or three-word commands activated appropriate VBA macros. Four speech recognition products were chosen based on hardware requirements and ability to add new terms. After training, speech recognition accuracy was 100% for Kurzweil VoicePlus and Verbex Listen for the 132 vocabulary words tested, during tests walking outdoors or driving an ATV. Scouting tests were performed by carrying the system in a backpack while walking in soybean fields. The system recorded a point or a series of points with each utterance. Boundaries of points showed problem areas in the field and single points marked rocks and field corners. Data were displayed as an Excel chart to show a real-time map as data were collected. The information was later displayed in a GIS over remote sensed field images. Field corners and areas of poor stand matched, with voice data explaining anomalies in the image. The system was tested during soybean harvest by using voice to locate weed patches. A harvester operator with little computer experience marked points by voice when the harvester entered and exited weed patches or areas with poor crop stand. The operator found the

  11. The Cambridge Mindreading Face-Voice Battery for Children (CAM-C): complex emotion recognition in children with and without autism spectrum conditions.

    Science.gov (United States)

    Golan, Ofer; Sinai-Gavrilov, Yana; Baron-Cohen, Simon

    2015-01-01

    Difficulties in recognizing emotions and mental states are central characteristics of autism spectrum conditions (ASC). However, emotion recognition (ER) studies have focused mostly on recognition of the six 'basic' emotions, usually using still pictures of faces. This study describes a new battery of tasks for testing recognition of nine complex emotions and mental states from video clips of faces and from voice recordings taken from the Mindreading DVD. This battery (the Cambridge Mindreading Face-Voice Battery for Children or CAM-C) was given to 30 high-functioning children with ASC, aged 8 to 11, and to 25 matched controls. The ASC group scored significantly lower than controls on complex ER from faces and voices. In particular, participants with ASC had difficulty with six out of nine complex emotions. Age was positively correlated with all task scores, and verbal IQ was correlated with scores in the voice task. CAM-C scores were negatively correlated with parent-reported level of autism spectrum symptoms. Children with ASC show deficits in recognition of complex emotions and mental states from both facial and vocal expressions. The CAM-C may be a useful test for endophenotypic studies of ASC and is one of the first to use dynamic stimuli as an assay to reveal the ER profile in ASC. It complements the adult version of the CAM Face-Voice Battery, thus providing opportunities for developmental assessment of social cognition in autism.

  12. Voice Technology Design Guides for Navy Training Systems.

    Science.gov (United States)

    1983-03-01

    This project was directed toward gathering information about applications of automated speech technology (AST) and...environmental events, automated performance measurement, and a strong voice interaction between the trainee and the system. Both successes and difficulties have...80-C-0057-1 Strong interaction with the user community is necessary throughout the curriculum development. A subject matter expert, ideally, should

  13. A General Summary of the Research Status Quo of Voice Emotion Recognition

    Institute of Scientific and Technical Information of China (English)

    何秉羲

    2015-01-01

    Starting from the concept and process of voice emotion recognition, this article gives a comprehensive account of the research results achieved in recent years at each stage of the recognition process, and discusses prospects for future research and development.

  14. Evaluation of MPEG-7-Based Audio Descriptors for Animal Voice Recognition over Wireless Acoustic Sensor Networks.

    Science.gov (United States)

    Luque, Joaquín; Larios, Diego F; Personal, Enrique; Barbancho, Julio; León, Carlos

    2016-05-18

    Environmental audio monitoring is a huge area of interest for biologists all over the world. This is why several audio monitoring systems have been proposed in the literature, which can be classified into two different approaches: acquisition and compression of all audio patterns in order to send them as raw data to a main server; or specific recognition systems based on audio patterns. The first approach has the drawback of a large amount of information to be stored on a main server. Moreover, this information requires considerable effort to analyze. The second approach has the drawback of poor scalability when new patterns need to be detected. To overcome these limitations, this paper proposes an environmental Wireless Acoustic Sensor Network architecture focused on the use of generic descriptors based on the MPEG-7 standard. These descriptors prove suitable for recognizing different patterns, allowing high scalability. The proposed parameters have been tested on recognizing different behaviors of two anuran species that live in Spanish natural parks, the Epidalea calamita and Alytes obstetricans toads, and show high classification performance.

  15. Kannada character recognition system using neural network

    Science.gov (United States)

    Kumar, Suresh D. S.; Kamalapuram, Srinivasa K.; Kumar, Ajay B. R.

    2013-03-01

    Handwriting recognition has been one of the active and challenging research areas in the field of pattern recognition. It has numerous applications, including reading aids for the blind, bank cheque processing, and conversion of handwritten documents into structured text. There is still an insufficient number of works on Indian language character recognition, especially for Kannada script, one of 15 major scripts in India. In this paper an attempt is made to recognize handwritten Kannada characters using feed-forward neural networks. A handwritten Kannada character is resized to 20x30 pixels. The resized character is used for training the neural network. Once training is completed, the same character is given as input to the neural network with different numbers of neurons in the hidden layer, and the recognition accuracy rates for different Kannada characters are calculated and compared. The results show that the proposed system yields good recognition accuracy rates comparable to those of other handwritten character recognition systems.

  16. Practical automatic Arabic license plate recognition system

    Science.gov (United States)

    Mohammad, Khader; Agaian, Sos; Saleh, Hani

    2011-02-01

    Since the 1970s, the need for automatic license plate recognition (ALPR) systems has been increasing. A license plate recognition system is an automatic system that is able to recognize a license plate number extracted from image sensors. Specifically, ALPR systems are used in conjunction with various transportation systems in application areas such as law enforcement (e.g. speed limit enforcement) and commercial uses such as parking enforcement, automatic toll payment, private and public entrances, border control, and theft and vandalism control. Vehicle license plate recognition has been intensively studied in many countries. Because different types of license plates are in use, the requirements for an automatic license plate recognition system differ from country to country [License plate detection using cluster run length smoothing algorithm]. Generally, an automatic license plate localization and recognition system is made up of three modules: license plate localization, character segmentation, and optical character recognition. This paper presents an Arabic license plate recognition system that is insensitive to character size, font, shape, and orientation, with an extremely high accuracy rate. The proposed system is based on a combination of enhancement, license plate localization, morphological processing, and feature vector extraction using the Haar transform. The system is fast because alphabetic characters and numerals are classified based on the license plate organization. Experimental results for license plates of two different Arab countries show an average of 99% successful license plate localization and recognition in a total of more than 20 different images captured from a complex outdoor environment. The run time is shorter than that of conventional and many state-of-the-art methods.

  17. Speech recognition systems on the Cell Broadband Engine

    Energy Technology Data Exchange (ETDEWEB)

    Liu, Y; Jones, H; Vaidya, S; Perrone, M; Tydlitat, B; Nanda, A

    2007-04-20

    In this paper we describe our design, implementation, and first results of a prototype connected-phoneme-based speech recognition system on the Cell Broadband Engine™ (Cell/B.E.). Automatic speech recognition decodes speech samples into plain text (other representations are possible) and must process samples at real-time rates. Fortunately, the computational tasks involved in this pipeline are highly data-parallel and can receive significant hardware acceleration from vector-streaming architectures such as the Cell/B.E. Identifying and exploiting these parallelism opportunities is challenging, but also critical to improving system performance. We observed, from our initial performance timings, that a single Cell/B.E. processor can recognize speech from thousands of simultaneous voice channels in real time, a channel density that is orders of magnitude greater than the capacity of existing software speech recognizers based on CPUs (central processing units). This result emphasizes the potential for Cell/B.E.-based speech recognition and will likely lead to the future development of production speech systems using Cell/B.E. clusters.

  18. Interactive Voice/Web Response System in clinical research.

    Science.gov (United States)

    Ruikar, Vrishabhsagar

    2016-01-01

    Emerging technologies in the computer and telecommunication industries have eased access to computers through the telephone. An Interactive Voice/Web Response System (IxRS) is a user-friendly system for end users, with complex, tailored programs at its backend. The backend programs are specially tailored to be easy for users to understand. The clinical research industry has experienced a revolution in data capture methodologies over time. Over the past couple of decades, different systems have evolved toward modern technologies and tools, for example, Electronic Data Capture, IxRS, and electronic patient-reported outcomes.

  19. Voice Interactive Systems Technology (VIST) Research.

    Science.gov (United States)

    1984-01-01

    UNCLASSIFIED SECURITY CLASSIFICATION OF THIS PAGE...Ground DEC PDP-11/45 SYSTEM GENERATION INFORMATION RSX-11M Operating System Initialization Command File: File Name: INIT7.CMD Contents: SET/SPEED=TT7...

  20. Scientific Bases of Human-Machine Communication by Voice

    Science.gov (United States)

    Schafer, Ronald W.

    1995-10-01

    The scientific bases for human-machine communication by voice are in the fields of psychology, linguistics, acoustics, signal processing, computer science, and integrated circuit technology. The purpose of this paper is to highlight the basic scientific and technological issues in human-machine communication by voice and to point out areas of future research opportunity. The discussion is organized around the following major issues in implementing human-machine voice communication systems: (i) hardware/software implementation of the system, (ii) speech synthesis for voice output, (iii) speech recognition and understanding for voice input, and (iv) usability factors related to how humans interact with machines.

  1. Wireless Controlled Methods via Voice and Internet (e-mail) for Home Automation System

    Directory of Open Access Journals (Sweden)

    R.A.Ramlee

    2013-08-01

    This paper presents a wireless Home Automation System (HAS) that is mainly operated from a computer. The system offers several control methods for controlling the target electrical appliances, to meet users' needs both at home and away. The computer application is designed for Microsoft Windows and integrates speech recognition voice control using the Microsoft Speech Application Programming Interface (SAPI). The voice control method is more convenient, especially for blind and paralyzed users at home. The system performs short-distance control using wireless Bluetooth technology and long-distance control using a Simple Mail Transfer Protocol (SMTP) email control method. Short-distance control is control performed inside the house, whereas long-distance control can be performed from anywhere by devices that have a browser or email application and internet access. The system is intended to control electrical appliances at home with a relatively low-cost design, a user-friendly interface, and ease of installation.
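
    As a rough illustration of the email-based long-distance control path, the sketch below builds and parses a command message with Python's standard `email` package. The subject tag `HAS-COMMAND`, the `appliance=STATE` body format, and both function names are hypothetical; the abstract does not describe the message format the authors used.

```python
from email import message_from_string
from email.message import EmailMessage

COMMAND_SUBJECT = "HAS-COMMAND"  # hypothetical tag, not from the paper


def build_command_email(sender, recipient, appliance, state):
    """Build an email whose body carries one 'appliance=STATE' command."""
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = recipient
    msg["Subject"] = COMMAND_SUBJECT
    msg.set_content(f"{appliance}={state}")
    return msg


def parse_command_email(raw_text):
    """Return (appliance, state) for a valid command email, else None."""
    msg = message_from_string(raw_text)
    if msg["Subject"] != COMMAND_SUBJECT:
        return None
    appliance, _, state = msg.get_payload().strip().partition("=")
    appliance, state = appliance.strip(), state.strip().upper()
    if not appliance or state not in {"ON", "OFF"}:
        return None
    return appliance, state
```

    In a deployment, the message returned by `build_command_email` could be handed to `smtplib.SMTP.send_message`; the server address and credentials are left out here because they depend on the installation.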

  2. Real Time Implementation Of Face Recognition System

    Directory of Open Access Journals (Sweden)

    Megha Manchanda

    2014-10-01

    This paper proposes a face recognition method using PCA for real-time implementation. Security is gaining importance as people increasingly have to remember passwords and carry cards. Such measures, however, are becoming less secure and less practical, leading to increasing interest in biometric techniques. Face recognition is among the important subjects in biometric systems. It is particularly useful for security and has been widely used and developed in many countries. This study aims to achieve face recognition by detecting human faces in real time, based on the Principal Component Analysis (PCA) algorithm.

  3. Automatic Speech Acquisition and Recognition for Spacesuit Audio Systems

    Science.gov (United States)

    Ye, Sherry

    2015-01-01

    NASA has a widely recognized but unmet need for novel human-machine interface technologies that can facilitate communication during astronaut extravehicular activities (EVAs), when loud noises and strong reverberations inside spacesuits make communication challenging. WeVoice, Inc., has developed a multichannel signal-processing method for speech acquisition in noisy and reverberant environments that enables automatic speech recognition (ASR) technology inside spacesuits. The technology reduces noise by exploiting differences between the statistical nature of signals (i.e., speech) and noise that exists in the spatial and temporal domains. As a result, ASR accuracy can be improved to the level at which crewmembers will find the speech interface useful. System components and features include beam forming/multichannel noise reduction, single-channel noise reduction, speech feature extraction, feature transformation and normalization, feature compression, and ASR decoding. Arithmetic complexity models were developed and will help designers of real-time ASR systems select proper tasks when confronted with constraints in computational resources. In Phase I of the project, WeVoice validated the technology. The company further refined the technology in Phase II and developed a prototype for testing and use by suited astronauts.

  4. Traffic-Sign Recognition Systems

    CERN Document Server

    Escalera, Sergio

    2011-01-01

    This work presents a full generic approach to the detection and recognition of traffic signs. The approach is based on the latest computer vision methods for object detection, and on powerful methods for multiclass classification. The challenge was to robustly detect a set of different sign classes in real time, and to classify each detected sign into a large, extensible set of classes. To address this challenge, several state-of-the-art methods were developed that can be used for different recognition problems. Following an introduction to the problems of traffic sign detection and categoriza

  5. Hierarchical Recognition Scheme for Human Facial Expression Recognition Systems

    Directory of Open Access Journals (Sweden)

    Muhammad Hameed Siddiqi

    2013-12-01

    Over the last decade, human facial expression recognition (FER) has emerged as an important research area. Several factors make FER a challenging research problem. These include varying light conditions in training and test images; the need for automatic and accurate face detection before feature extraction; and high similarity among different expressions, which makes it difficult to distinguish these expressions with high accuracy. This work implements a hierarchical linear discriminant analysis-based facial expression recognition (HL-FER) system to tackle these problems. Unlike previous systems, the HL-FER uses a pre-processing step to eliminate light effects, incorporates a new automatic face detection scheme, employs methods to extract both global and local features, and utilizes a HL-FER to overcome the problem of high similarity among different expressions. Unlike most previous works, which were evaluated using a single dataset, the performance of the HL-FER is assessed using three publicly available datasets under three different experimental settings: n-fold cross validation based on subjects for each dataset separately; n-fold cross validation based on datasets; and, finally, a last set of experiments to assess the effectiveness of each module of the HL-FER separately. A weighted average recognition accuracy of 98.7% across three different datasets, using three classifiers, indicates the success of employing the HL-FER for human FER.

  6. Application of Multi-Tier Applications Technology Datasnap in Designing a System of Automatic Segmentation and Recognition of Speech Signal

    Directory of Open Access Journals (Sweden)

    Yedilkhan N. Amirgaliyev

    2016-03-01

    In this paper we address current issues in the development and application of systems for automatic identification and segmentation of speech signals. The basic criteria for the shortcomings of such systems are formulated. The types of speech recognition systems are reviewed, and the optimum architecture for them is described, including information on what is used in leading IT companies. The possibility of using multi-tier architectures for solving speech recognition problems, and their advantages, are considered. A practical implementation of a multi-tier architecture based on DataSnap technology in a voice recognition system for geo search in the Kazakh language is also described.

  7. Automatic speech recognition (ASR) and its use as a tool for assessment or therapy of voice, speech, and language disorders.

    Science.gov (United States)

    Kitzing, Peter; Maier, Andreas; Ahlander, Viveka Lyberg

    2009-01-01

    In general opinion computerized automatic speech recognition (ASR) seems to be regarded as a method only to accomplish transcriptions from spoken language to written text and as such quite insecure and rather cumbersome. However, due to great advances in computer technology and informatics methodology ASR has nowadays become quite dependable and easier to handle, and the number of applications has increased considerably. After some introductory background information on ASR a number of applications of great interest for professionals in voice, speech, and language therapy are pointed out. In the foreseeable future, the keyboard and mouse will by means of ASR technology be replaced in many functions by a microphone as the human-computer interface, and the computer will talk back via its loud-speaker. It seems important that professionals engaged in the care of oral communication disorders take part in this development so their clients may get the optimal benefit from this new technology.

  8. Human factors research problems in electronic voice warning system design

    Science.gov (United States)

    Simpson, C. A.; Williams, D. H.

    1975-01-01

    The speech messages issued by voice warning systems must be carefully designed in accordance with general principles of human decision making processes, human speech comprehension, and the conditions in which the warnings can occur. The operator's effectiveness must not be degraded by messages that are either inappropriate or difficult to comprehend. Important experimental variables include message content, linguistic redundancy, signal/noise ratio, interference with concurrent tasks, and listener expectations generated by the pragmatic or real world context in which the messages are presented.

  9. Portable EGG recording system based on a digital voice recorder.

    Science.gov (United States)

    Jang, J-K; Shieh, M-J; Kuo, T-S; Jaw, F-S

    2009-01-01

    Cutaneous electrogastrogram (EGG) recording offers the benefit of non-invasive gastrointestinal diagnosis. With long-term ambulatory recording of signals, researchers and clinicians could have more opportunities to investigate and analyse paroxysmal or acute symptoms. A portable EGG system based on a digital voice recorder (DVR) is designed for long-term recording of cutaneous EGG signals. The system consists of electrodes, an EGG amplifier, a modulator, and a DVR. Online monitoring and off-line acquisition of EGG are handled by software. A special design employing an integrated timer circuit is used to modulate the EGG frequency to meet the input requirements of the DVR. This approach involves low supply voltage and low power consumption. Software demodulation is used to simplify the complexity of the system, and is helpful in reducing the size of the portable device. By using surface-mount devices (SMD) and a low-power design, the system is robust, compact, and suitable for long-term portable recording. As a result, researchers can record an ambulatory EGG signal by means of the proposed circuits in conjunction with an up-to-date voice-recording device.

  10. Speech Recognition System Architecture for Gujarati Language

    National Research Council Canada - National Science Library

    Jinal H Tailor; Dipti B Shah

    2016-01-01

    .... To achieve good accuracy and efficiency of Automatic Speech Recognition (ASR) system for Indian Gujarati language is challenging task due to its morphology, language barriers, different dialects, and unavailability of resources...

  11. A Unique Wavelet Steganography Based Voice Biometric Protection Scheme

    Directory of Open Access Journals (Sweden)

    Sanjaypande M. B

    2013-03-01

    Voice biometrics is an easy and cost-effective biometric technique that requires minimal hardware and software complexity. A general voice biometric system needs a voice phrase from the user, which is processed with a Mel filter, and vector-quantized features are extracted. Vector quantization reduces the codebook size but decreases recognition accuracy. We therefore propose a voice biometric system in which the non-quantized codebooks of the voice file are matched with the spoken phrase. To secure such direct voice samples, we embed the voice file in a randomly selected image using a DWT technique. Impostors are exposed only to images and are unaware of the voice files. We show that the technique produces better efficiency than the VQ-based technique.

  12. Research of Speech Recognition System Based on Matlab

    Institute of Scientific and Technical Information of China (English)

    王彪

    2011-01-01

    A speech recognition system based on Matlab is designed. Its main functions are recording, playback, and preprocessing of voice signals, segmented filtering, feature extraction, and speech recognition. Experiments verify that the system meets the requirement of recognizing simple speech, but some aspects still need improvement, such as whether more complex speech can be recognized in complex environments.

  13. Automatic TLI recognition system, general description

    Energy Technology Data Exchange (ETDEWEB)

    Lassahn, G.D.

    1997-02-01

    This report is a general description of an automatic target recognition system developed at the Idaho National Engineering Laboratory for the Department of Energy. A user's manual is a separate volume, Automatic TLI Recognition System, User's Guide, and a programmer's manual is Automatic TLI Recognition System, Programmer's Guide. This system was designed as an automatic target recognition system for fast screening of large amounts of multi-sensor image data, based on low-cost parallel processors. This system naturally incorporates image data fusion, and it gives uncertainty estimates. It is relatively low cost, compact, and transportable. The software is easily enhanced to expand the system's capabilities, and the hardware is easily expandable to increase the system's speed. In addition to its primary function as a trainable target recognition system, this is also a versatile, general-purpose tool for image manipulation and analysis, which can be either keyboard-driven or script-driven. This report includes descriptions of three variants of the computer hardware, a description of the mathematical basis of the training process, and a description with examples of the system capabilities.

  14. A Multi-View Face Recognition System

    Institute of Scientific and Technical Information of China (English)

    张永越; 彭振云; 等

    1997-01-01

    In many automatic face recognition systems, posture constraints are a key factor preventing their application. In this paper a series of strategies is described to achieve a system that enables face recognition under varying pose. These approaches include multi-view face modeling, threshold-image-based face feature detection, affine-transformation-based face posture normalization, and template-matching-based face identification. Combining all of these strategies, a pose-invariant face recognition system is designed successfully. Using a 75 MHz Pentium PC and a database of 75 individuals with 15 images per person, and 225 test images with various postures, a very good recognition rate of 96.89% is obtained.

  15. Kannada Character Recognition System A Review

    CERN Document Server

    Indira, K

    2010-01-01

    Intensive research has been done on optical character recognition (OCR), and a large number of articles have been published on this topic during the last few decades. Many commercial OCR systems are now available on the market, but most of these systems work for Roman, Chinese, Japanese, and Arabic characters. There is no sufficient number of works on Indian language character recognition, especially for Kannada script, one of 12 major scripts in India. This paper presents a review of existing work on printed Kannada script and its results. The characteristics of Kannada script and of the Kannada Character Recognition (KCR) system are discussed in detail. Finally, fusion at the classifier level is proposed to increase the recognition accuracy.

  16. Distributed information-processing system with voice control based on OS Android

    Directory of Open Access Journals (Sweden)

    E. V. Apolonov

    2012-12-01

    Introduction: Trends in the growth of ACS and AIS and their use in everyday life are discussed. The need for a voice mode of human interaction with AIS is noted. Network integration of AIS makes it possible to combine their resources and contributes to progress in speech recognition. The emergence of smartphones and their widespread use motivates using them as personal voice terminals for access to distributed information networks. Main part: The use of Android-based personal portable mobile devices (PPMD) as terminals and as autonomous units, and of Windows-based stationary PCs as servers of a distributed data-processing system (DDPS) with voice control, is considered. Criteria for selecting the PPMD and the OS of client terminals, as well as requirements for the DDPS and its structure, are formulated. Concepts for building the DDPS using "client - server" and "many clients - many servers" technologies are presented. The concepts of a PPMD virtual interface and a server virtual interface are proposed. Communication between threads within the process of the PPMD virtual interface of a client terminal, and interaction between client and server processes in autonomous mode and in DDPS mode, are considered. Results are given of experimental tests of a DDPS prototype exchanging data between Windows and Android clients and a Windows server; the accuracy and reliability of the embedded solutions and the scalability of the DDPS are confirmed. Conclusions: Modern Android-based PPMD can be used as terminal devices for building various specialized voice-controlled DDPS with "client - server" and "many clients - many servers" technologies. The APIs of PPMD with different OSs can be unified by implementing a virtual PPMD interface. Data exchange between DDPS processes is best implemented through Berkeley sockets technology, which is supported by most modern operating
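
    The Berkeley sockets data exchange recommended in this abstract can be sketched with Python's standard `socket` module. The length-prefixed framing, the `ACK:` reply, and all function names are assumptions made for illustration; the abstract does not describe the protocol the authors implemented. A connected `socketpair` stands in for a real client and server so that the sketch runs without a network.

```python
import socket
import struct


def send_message(sock, payload):
    """Send one message as a 4-byte big-endian length followed by the payload."""
    sock.sendall(struct.pack("!I", len(payload)) + payload)


def recv_exact(sock, n):
    """Read exactly n bytes, looping because recv may return fewer."""
    chunks = []
    while n > 0:
        chunk = sock.recv(n)
        if not chunk:
            raise ConnectionError("peer closed the connection early")
        chunks.append(chunk)
        n -= len(chunk)
    return b"".join(chunks)


def recv_message(sock):
    """Receive one length-prefixed message."""
    (length,) = struct.unpack("!I", recv_exact(sock, 4))
    return recv_exact(sock, length)


def demo():
    """Exchange one request and one reply between two connected sockets."""
    client, server = socket.socketpair()
    try:
        send_message(client, "turn on lamp".encode("utf-8"))
        request = recv_message(server)
        send_message(server, b"ACK:" + request)  # hypothetical reply format
        return recv_message(client)
    finally:
        client.close()
        server.close()
```

    The same `send_message` and `recv_message` helpers would work unchanged over TCP sockets created with `socket.create_connection` on the client and `accept` on the server, which is what makes the sockets API a convenient common layer across operating systems.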

  17. Evaluation of voice codecs for the Australian mobile satellite system

    Science.gov (United States)

    Bundrock, Tony; Wilkinson, Mal

    1990-01-01

    The evaluation procedure to choose a low bit rate voice coding algorithm is described for the Australian land mobile satellite system. The procedure is designed to assess both the inherent quality of the codec under 'normal' conditions and its robustness under 'severe' conditions. For the assessment, normal conditions were chosen to be random bit error rate with added background acoustic noise and the severe condition is designed to represent burst error conditions when mobile satellite channel suffers from signal fading due to roadside vegetation. The assessment is divided into two phases. First, a reduced set of conditions is used to determine a short list of candidate codecs for more extensive testing in the second phase. The first phase conditions include quality and robustness and codecs are ranked with a 60:40 weighting on the two. Second, the short listed codecs are assessed over a range of input voice levels, BERs, background noise conditions, and burst error distributions. Assessment is by subjective rating on a five level opinion scale and all results are then used to derive a weighted Mean Opinion Score using appropriate weights for each of the test conditions.
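
    The two scoring steps in this abstract (a 60:40 weighted ranking in phase one and a weighted Mean Opinion Score in phase two) amount to simple weighted averages. The sketch below assumes the 60:40 split applies to quality and robustness in that order, and uses made-up condition names and weights; the abstract gives neither the per-condition weights nor any scores.

```python
def phase_one_score(quality, robustness, w_quality=0.6, w_robustness=0.4):
    """Combine quality and robustness ratings with a 60:40 weighting.

    Which factor receives the 60% share is an assumption here.
    """
    return w_quality * quality + w_robustness * robustness


def weighted_mos(scores, weights):
    """Weighted Mean Opinion Score over test conditions.

    scores and weights are dicts keyed by condition name; scores are
    ratings on the five-level opinion scale.
    """
    if set(scores) != set(weights):
        raise ValueError("scores and weights must cover the same conditions")
    total_weight = sum(weights.values())
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    return sum(scores[c] * weights[c] for c in scores) / total_weight
```

    For instance, a codec rated 4.0 under a condition with weight 3 and 2.0 under a condition with weight 1 would receive a weighted MOS of (4.0 x 3 + 2.0 x 1) / 4 = 3.5.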

  18. Practical vision based degraded text recognition system

    Science.gov (United States)

    Mohammad, Khader; Agaian, Sos; Saleh, Hani

    2011-02-01

    Rapid growth and progress in the medical, industrial, security and technology fields mean more and more consideration for the use of camera-based optical character recognition (OCR). Applying OCR to scanned documents is quite mature, and there are many commercial and research products available on this topic. These products achieve acceptable recognition accuracy and reasonable processing times, especially with trained software and constrained text characteristics. Even though the application space for OCR is huge, it is quite challenging to design a single system that is capable of performing automatic OCR for text embedded in an image irrespective of the application. Challenges for OCR systems include images taken under natural real-world conditions, surface curvature, text orientation, font, size, lighting conditions, and noise. These and many other conditions make it extremely difficult to achieve reasonable character recognition. Performance of conventional OCR systems drops dramatically as the degradation level of the text image quality increases. In this paper, a new recognition method is proposed to recognize solid or dotted line degraded characters. The degraded text string is localized and segmented using a new algorithm. The new method was implemented and tested using a development framework system that is capable of performing OCR on camera-captured images. The framework allows parameter tuning of the image-processing algorithm based on a training set of camera-captured text images. Novel methods were used for enhancement, text localization, and segmentation, which enables building a custom system that is capable of performing automatic OCR and can be used for different applications. The developed framework system includes new image enhancement, filtering, and segmentation techniques, which enabled higher recognition accuracies, faster processing time, and lower energy consumption compared with the best state of the art published

  19. [Research on Barrier-free Home Environment System Based on Speech Recognition].

    Science.gov (United States)

    Zhu, Husheng; Yu, Hongliu; Shi, Ping; Fang, Youfang; Jian, Zhuo

    2015-10-01

    The number of people with physical disabilities is increasing year by year, and the trend of population aging is becoming more and more serious. In order to improve quality of life, a control system for an accessible home environment was developed that lets patients with serious disabilities control home electrical devices by voice. The control system includes a central control platform, a speech recognition module, a terminal operation module, etc. The system combines speech recognition control technology and wireless information transmission technology with embedded mobile computing technology, and interconnects the lamp, electronic locks, alarms, TV and other electrical devices in the home environment as a whole system through a wireless network node. The experimental results showed that the speech recognition success rate was more than 84% in the home environment.

  20. Image enhancement method for fingerprint recognition system.

    Science.gov (United States)

    Li, Shunshan; Wei, Min; Tang, Haiying; Zhuang, Tiange; Buonocore, Michael

    2005-01-01

    Image enhancement plays an important role in fingerprint recognition systems. In this paper, a fingerprint image enhancement method, a refined Gabor filter, is presented. This enhancement method can connect ridge breaks, ensures that the maximal gray values are located at the ridge center, and can compensate for nonlinear deformations. The results show that it can improve the performance of image enhancement.

  1. Humoral pattern recognition and the complement system.

    Science.gov (United States)

    Degn, S E; Thiel, S

    2013-08-01

    In the context of immunity, pattern recognition is the art of discriminating friend from foe and innocuous from noxious. The basis of discrimination is the existence of evolutionarily conserved patterns on microorganisms, which are intrinsic to these microorganisms and necessary for their function and existence. Such immutable or slowly evolving patterns are ideal handles for recognition and have been targeted by early cellular immune defence mechanisms such as Toll-like receptors, NOD-like receptors, RIG-I-like receptors, C-type lectin receptors and by humoral defence mechanisms such as the complement system. Complement is a proteolytic cascade system comprising around 35 different soluble and membrane-bound proteins. It constitutes a central part of the innate immune system, mediating several major innate effector functions and modulating adaptive immune responses. The complement cascade proceeds via controlled, limited proteolysis and conformational changes of constituent proteins through three activation pathways: the classical pathway, the alternative pathway and the lectin pathway, which converge in common effector functions. Here, we review the nature of the pattern recognition molecules involved in complement activation, as well as their close relatives with no or unknown capacity for activating complement. We proceed to examine the composition of the pattern recognition complexes involved in complement activation, focusing on those of the lectin pathway, and arrive at a new model for their mechanism of operation, supported by recently emerging evidence.

  2. Voice Quality Estimation in Combined Radio-VoIP Networks for Dispatching Systems

    Directory of Open Access Journals (Sweden)

    Jiri Vodrazka

    2016-01-01

    Full Text Available The modelling, assessment, and planning of voice quality is well mastered, both theoretically and practically, for common voice communication systems, especially for public fixed and mobile telephone networks, including Next Generation Networks (NGN), that is, internet protocol based networks. This article seeks to contribute to voice quality modelling, assessment, and planning for dispatching communication systems based on Internet Protocol (IP) and private radio networks. The network plan, a correction to the E-model calculation, and default values for the model are presented and discussed.
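
    For orientation, the standard E-model (ITU-T G.107) combines impairment terms into a transmission rating factor R and maps R to an estimated Mean Opinion Score. The sketch below shows only that standard structure; the correction proposed in the article is not given in the abstract and is not reproduced here, and the function names are illustrative.

```python
def r_factor(ro: float, i_s: float, i_d: float, ie_eff: float, a: float) -> float:
    """Transmission rating factor: R = Ro - Is - Id - Ie_eff + A."""
    return ro - i_s - i_d - ie_eff + a


def r_to_mos(r: float) -> float:
    """Map the rating factor R to an estimated MOS (standard G.107 mapping)."""
    if r <= 0:
        return 1.0
    if r >= 100:
        return 4.5
    return 1.0 + 0.035 * r + r * (r - 60.0) * (100.0 - r) * 7.0e-6


if __name__ == "__main__":
    # 93.2 is the rating obtained with all G.107 parameters at their defaults.
    print(round(r_to_mos(93.2), 2))
```

    Any network-specific correction would enter through the impairment terms before the R-to-MOS mapping is applied.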

  3. A neuromorphic system for video object recognition.

    Science.gov (United States)

    Khosla, Deepak; Chen, Yang; Kim, Kyungnam

    2014-01-01

    Automated video object recognition is a topic of emerging importance in both defense and civilian applications. This work describes an accurate and low-power neuromorphic architecture and system for real-time automated video object recognition. Our system, Neuromorphic Visual Understanding of Scenes (NEOVUS), is inspired by computational neuroscience models of feed-forward object detection and classification pipelines for processing visual data. The NEOVUS architecture is inspired by the ventral (what) and dorsal (where) streams of the mammalian visual pathway and integrates retinal processing, object detection based on form and motion modeling, and object classification based on convolutional neural networks. The object recognition performance and energy use of the NEOVUS was evaluated by the Defense Advanced Research Projects Agency (DARPA) under the Neovision2 program using three urban area video datasets collected from a mix of stationary and moving platforms. These datasets are challenging and include a large number of objects of different types in cluttered scenes, with varying illumination and occlusion conditions. In a systematic evaluation of five different teams by DARPA on these datasets, the NEOVUS demonstrated the best performance with high object recognition accuracy and the lowest energy consumption. Its energy use was three orders of magnitude lower than two independent state of the art baseline computer vision systems. The dynamic power requirement for the complete system mapped to commercial off-the-shelf (COTS) hardware that includes a 5.6 Megapixel color camera processed by object detection and classification algorithms at 30 frames per second was measured at 21.7 Watts (W), for an effective energy consumption of 5.45 nanoJoules (nJ) per bit of incoming video. These unprecedented results show that the NEOVUS has the potential to revolutionize automated video object recognition toward enabling practical low-power and mobile video processing

  4. Intelligent recognitive systems in nanomedicine.

    Science.gov (United States)

    Culver, Heidi; Daily, Adam; Khademhosseini, Ali; Peppas, Nicholas

    2014-05-01

    There is a bright future in the development and utilization of nanoscale systems based on intelligent materials that can respond to external input providing a beneficial function. Specific functional groups can be incorporated into polymers to make them responsive to environmental stimuli such as pH, temperature, or varying concentrations of biomolecules. The fusion of such "intelligent" biomaterials with nanotechnology has led to the development of powerful therapeutic and diagnostic platforms. For example, targeted release of proteins and chemotherapeutic drugs has been achieved using pH-responsive nanocarriers while biosensors with ultra-trace detection limits are being made using nanoscale, molecularly imprinted polymers. The efficacy of therapeutics and the sensitivity of diagnostic platforms will continue to progress as unique combinations of responsive polymers and nanomaterials emerge.

  5. Automatic TLI recognition system, user's guide

    Energy Technology Data Exchange (ETDEWEB)

    Lassahn, G.D.

    1997-02-01

    This report describes how to use an automatic target recognition system (version 14). In separate volumes are a general description of the ATR system, Automatic TLI Recognition System, General Description, and a programmer's manual, Automatic TLI Recognition System, Programmer's Guide.

  6. A Neuromorphic System for Video Object Recognition

    Directory of Open Access Journals (Sweden)

    Deepak Khosla

    2014-11-01

    Full Text Available Automated video object recognition is a topic of emerging importance in both defense and civilian applications. This work describes an accurate and low-power neuromorphic architecture and system for real-time automated video object recognition. Our system, Neuromorphic Visual Understanding of Scenes (NEOVUS), is inspired by recent findings in computational neuroscience on feed-forward object detection and classification pipelines for processing and extracting relevant information from visual data. The NEOVUS architecture is inspired by the ventral (what) and dorsal (where) streams of the mammalian visual pathway and combines retinal processing, form-based and motion-based object detection, and convolutional neural nets based object classification. Our system was evaluated by the Defense Advanced Research Projects Agency (DARPA) under the NEOVISION2 program on a variety of urban area video datasets collected from both stationary and moving platforms. The datasets are challenging as they include a large number of targets in cluttered scenes with varying illumination and occlusion conditions. The NEOVUS system was also mapped to commercially available off-the-shelf hardware. The dynamic power requirement for the system that includes a 5.6 Mpixel retinal camera processed by object detection and classification algorithms at 30 frames per second was measured at 21.7 Watts (W), for an effective energy consumption of 5.4 nanoJoules (nJ) per bit of incoming video. In a systematic evaluation of five different teams by DARPA on three aerial datasets, the NEOVUS demonstrated the best performance with the highest recognition accuracy and at least three orders of magnitude lower energy consumption than two independent state of the art computer vision systems. These unprecedented results show that the NEOVUS has the potential to revolutionize automated video object recognition towards enabling practical low-power and mobile video processing applications.
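
    The energy-per-bit figure can be checked with simple arithmetic from the quoted power, resolution, and frame rate. The abstract does not state the pixel bit depth; the sketch below assumes 24 bits per pixel (8-bit RGB), which is an assumption and not a value from the paper. Under that assumption the result is about 5.4 nJ per bit, consistent with the reported figure.

```python
def energy_per_bit_nj(power_w: float, megapixels: float, fps: float,
                      bits_per_pixel: int) -> float:
    """Energy per incoming video bit, in nanojoules."""
    bits_per_second = megapixels * 1e6 * fps * bits_per_pixel
    return power_w / bits_per_second * 1e9


if __name__ == "__main__":
    # 21.7 W, 5.6 Mpixel, 30 frames/s are from the abstract; 24 bpp is assumed.
    print(round(energy_per_bit_nj(21.7, 5.6, 30, 24), 2))
```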

  7. Cross domains Arabic named entity recognition system

    Science.gov (United States)

    Al-Ahmari, S. Saad; Abdullatif Al-Johar, B.

    2016-07-01

    Named Entity Recognition (NER) plays an important role in many Natural Language Processing (NLP) applications such as Information Extraction (IE), Question Answering (QA), Text Clustering, Text Summarization and Word Sense Disambiguation. This paper presents the development and implementation of a domain-independent system to recognize three types of Arabic named entities. The system is based on a set of domain-independent grammar rules, an Arabic part-of-speech tagger, gazetteers, and lists of trigger words. The experimental results show that the system performed as well as other systems, with better results in some cases of cross-domain corpora.

  8. An Intelligent Multilingual Mouse Gesture Recognition System

    Directory of Open Access Journals (Sweden)

    Nidal F. Shilbayeh

    2005-01-01

    Full Text Available A comprehensive mouse gesture system is designed and tested successfully. The system is based on the UNIPEN algorithm in terms of mouse movements and applies its geometrical principles such as angles and transposition steps. The system incorporates neural networks as its learning and recognition engine. The designed algorithm is capable of translating not only discrete gesture moves but also continuous sentences and complete paragraphs. A Hopfield network is also used for initial learning to add language independence to the system.

  9. Statistical feature extraction based iris recognition system

    Indian Academy of Sciences (India)

    ATUL BANSAL; RAVINDER AGARWAL; R K SHARMA

    2016-05-01

    Iris recognition systems have been proposed by numerous researchers using different feature extraction techniques for accurate and reliable biometric authentication. In this paper, a statistical feature extraction technique based on correlation between adjacent pixels has been proposed and implemented. A Hamming distance based metric has been used for matching. Performance of the proposed iris recognition system (IRS) has been measured by recording false acceptance rate (FAR) and false rejection rate (FRR) at different thresholds in the distance metric. System performance has been evaluated by computing statistical features along two directions, namely, the radial direction of the circular iris region and the angular direction extending from pupil to sclera. Experiments have also been conducted to study the effect of the number of statistical parameters on FAR and FRR. Results obtained from the experiments based on different sets of statistical features of iris images show that there is a significant improvement in equal error rate (EER) when the number of statistical parameters for feature extraction is increased from three to six. Further, it has also been found that increasing radial/angular resolution, with normalization in place, improves EER for the proposed iris recognition system.
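
    The matching and evaluation steps named in this abstract (Hamming-distance matching, then FAR and FRR at a chosen threshold) can be sketched as below. The binary templates and score lists are hypothetical, and the paper's statistical feature extraction is not reproduced.

```python
def hamming_distance(a: list[int], b: list[int]) -> float:
    """Fraction of positions at which two equal-length bit templates differ."""
    if len(a) != len(b):
        raise ValueError("templates must have the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def far_frr(genuine: list[float], impostor: list[float],
            threshold: float) -> tuple[float, float]:
    """Return (FAR, FRR) when a match is accepted if distance <= threshold."""
    # FAR: impostor comparisons wrongly accepted.
    far = sum(d <= threshold for d in impostor) / len(impostor)
    # FRR: genuine comparisons wrongly rejected.
    frr = sum(d > threshold for d in genuine) / len(genuine)
    return far, frr


if __name__ == "__main__":
    print(hamming_distance([1, 0, 1, 1], [1, 1, 1, 0]))
    # Hypothetical distance scores, not data from the paper.
    print(far_frr([0.1, 0.2, 0.4], [0.3, 0.5, 0.6, 0.7], threshold=0.35))
```

    Sweeping the threshold and finding where FAR and FRR cross gives the equal error rate (EER) that the abstract reports on.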

  10. [Continuous speech recognition system for radiological reporting: comparison with experience of dictation].

    Science.gov (United States)

    Ichikawa, Tamaki; Koizumi, Jun; Takahara, Taro; Myojin, Kazunori; Yamashita, Eiko; Nasu, Seiji; Yanagimachi, Noriharu; Imai, Yutaka; Tsukune, Yoshihiko

    2005-10-01

    To compare rates of accuracy of recognition between experienced dictators and inexperienced ones in using an enrollment-less continuous speech recognition (CSR) system of radiological reporting, and to evaluate the usefulness of the system. Twenty board-certified radiologists were classified into 2 groups: a group of 10 members with more than 6 years' experience of conventional dictation by transcriptionist (group A) and a group of 10 members with no experience of dictation (group B). All radiologists created fresh radiological reports on sets of images using free-style dictation in the reports. We counted errors and total words in the reports individually, and compared the rates of accuracy of word recognition in the two groups. We used a CSR system AmiVoice (Advanced Media, Inc., Tokyo, Japan). The average rate of accuracy of word recognition was 96.42 +/- 1.68% in group A and 95.92 +/- 1.15% in group B. There was no significant difference in accuracy rate between the two groups. The accuracy of word recognition was independent of the experience of dictation, and the enrollment-less CSR system of radiological reporting was considered convenient and useful.

  11. Dance recognition system using lower body movement.

    Science.gov (United States)

    Simpson, Travis T; Wiesner, Susan L; Bennett, Bradford C

    2014-02-01

    The current means of locating specific movements in film necessitate hours of viewing, making the task of conducting research into movement characteristics and patterns tedious and difficult. This is particularly problematic for the research and analysis of complex movement systems such as sports and dance. While some systems have been developed to manually annotate film, to date no automated way of identifying complex, full body movement exists. With pattern recognition technology and knowledge of joint locations, automatically describing filmed movement using computer software is possible. This study used various forms of lower body kinematic analysis to identify codified dance movements. We created an algorithm that compares an unknown move with a specified start and stop against known dance moves. Our recognition method consists of classification and template correlation using a database of model moves. This system was optimized to include nearly 90 dance and Tai Chi Chuan movements, producing accurate name identification in over 97% of trials. In addition, the program had the capability to provide a kinematic description of either matched or unmatched moves obtained from classification recognition.

  12. Challenges and Specifications for Robust Face and Gait Recognition Systems for Surveillance Application

    Directory of Open Access Journals (Sweden)

    BUCIU Ioan

    2014-05-01

    Full Text Available Automated person recognition (APR) based on biometric signals addresses the process of automatically recognizing a person according to physiological traits (face, voice, iris, fingerprint, ear shape, body odor, electroencephalogram (EEG), electrocardiogram, or hand geometry) or behavioural patterns (gait, signature, hand-grip, lip movement). The paper briefly presents the current challenges for two specific non-cooperative biometric approaches, namely face and gait biometrics, as well as approaches that combine the two in an attempt to build a more robust system for accurate APR in the context of surveillance applications. Open problems from both sides are also pointed out.

  13. Multi-modal assessment of on-road demand of voice and manual phone calling and voice navigation entry across two embedded vehicle systems.

    Science.gov (United States)

    Mehler, Bruce; Kidd, David; Reimer, Bryan; Reagan, Ian; Dobres, Jonathan; McCartt, Anne

    2016-03-01

    One purpose of integrating voice interfaces into embedded vehicle systems is to reduce drivers' visual and manual distractions with 'infotainment' technologies. However, there is scant research on actual benefits in production vehicles or how different interface designs affect attentional demands. Driving performance, visual engagement, and indices of workload (heart rate, skin conductance, subjective ratings) were assessed in 80 drivers randomly assigned to drive a 2013 Chevrolet Equinox or Volvo XC60. The Chevrolet MyLink system allowed completing tasks with one voice command, while the Volvo Sensus required multiple commands to navigate the menu structure. When calling a phone contact, both voice systems reduced visual demand relative to the visual-manual interfaces, with reductions for drivers in the Equinox being greater. The Equinox 'one-shot' voice command showed advantages during contact calling but had significantly higher error rates than Sensus during destination address entry. For both secondary tasks, neither voice interface entirely eliminated visual demand. Practitioner Summary: The findings reinforce the observation that most, if not all, automotive auditory-vocal interfaces are multi-modal interfaces in which the full range of potential demands (auditory, vocal, visual, manipulative, cognitive, tactile, etc.) need to be considered in developing optimal implementations and evaluating drivers' interaction with the systems. Social Media: In-vehicle voice-interfaces can reduce visual demand but do not eliminate it and all types of demand need to be taken into account in a comprehensive evaluation.

  14. Privacy protection schemes for fingerprint recognition systems

    Science.gov (United States)

    Marasco, Emanuela; Cukic, Bojan

    2015-05-01

    The deployment of fingerprint recognition systems has always raised concerns related to personal privacy. A fingerprint is permanently associated with an individual and, generally, it cannot be reset if compromised in one application. Given that fingerprints are not a secret, potential misuses besides personal recognition represent privacy threats and may lead to public distrust. Privacy mechanisms control access to personal information and limit the likelihood of intrusions. In this paper, image- and feature-level schemes for privacy protection in fingerprint recognition systems are reviewed. Storing only key features of a biometric signature can reduce the likelihood of biometric data being used for unintended purposes. In biometric cryptosystems and biometric-based key release, the biometric component verifies the identity of the user, while the cryptographic key protects the communication channel. In transformation-based approaches, only a transformed version of the original biometric signature is stored. Different applications can use different transforms. Matching is performed in the transformed domain, which enables the preservation of low error rates. Since such templates do not reveal information about individuals, they are referred to as cancelable templates. A compromised template can be re-issued using a different transform. At the image level, de-identification schemes can remove identifiers disclosed for objectives unrelated to the original purpose, while permitting other authorized uses of personal information. Fingerprint images can be de-identified by, for example, mixing fingerprints or removing gender signature. In both cases, degradation of matching performance is minimized.
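
    The idea of a cancelable template (store only a keyed transform, match in the transformed domain, re-issue with a new key) can be illustrated with a toy sketch. The keyed permutation with sign flips used here is an assumption chosen because it preserves Euclidean distances exactly; it is not one of the schemes reviewed in the paper and is not secure on its own, since anyone holding the key can invert it.

```python
import math
import random


def cancelable_transform(features: list[float], key: int) -> list[float]:
    """Apply a key-dependent permutation and sign flip to a feature vector."""
    rng = random.Random(key)  # the key seeds the transform (toy example only)
    order = list(range(len(features)))
    rng.shuffle(order)
    signs = [rng.choice((-1.0, 1.0)) for _ in features]
    return [signs[i] * features[j] for i, j in enumerate(order)]


def match_distance(template_a: list[float], template_b: list[float]) -> float:
    """Euclidean distance between two templates in the same domain."""
    return math.dist(template_a, template_b)


if __name__ == "__main__":
    # Hypothetical feature vectors, not real fingerprint data.
    enrolled = [0.9, 0.1, 0.4, 0.7, 0.3, 0.8, 0.2, 0.6]
    probe = [0.8, 0.2, 0.4, 0.6, 0.3, 0.9, 0.1, 0.6]
    original = match_distance(enrolled, probe)
    transformed = match_distance(cancelable_transform(enrolled, key=1),
                                 cancelable_transform(probe, key=1))
    print(original, transformed)
```

    Because both vectors go through the same keyed transform, matching accuracy is unchanged, while a template issued under a different key is not comparable with the old one.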

  15. Euro Banknote Recognition System for Blind People

    Directory of Open Access Journals (Sweden)

    Larisa Dunai Dunai

    2017-01-01

    Full Text Available This paper presents the development of a portable system with the aim of allowing blind people to detect and recognize Euro banknotes. The developed device is based on a Raspberry Pi electronic instrument and a Raspberry Pi camera, Pi NoIR (No Infrared filter) dotted with additional infrared light, which is embedded into a pair of sunglasses that permit blind and visually impaired people to independently handle Euro banknotes, especially when receiving their cash back when shopping. The banknote detection is based on the modified Viola and Jones algorithms, while the banknote value recognition relies on the Speeded Up Robust Features (SURF) technique. The accuracies of banknote detection and banknote value recognition are 84% and 97.5%, respectively.

  16. Euro Banknote Recognition System for Blind People.

    Science.gov (United States)

    Dunai Dunai, Larisa; Chillarón Pérez, Mónica; Peris-Fajarnés, Guillermo; Lengua Lengua, Ismael

    2017-01-20

    This paper presents the development of a portable system with the aim of allowing blind people to detect and recognize Euro banknotes. The developed device is based on a Raspberry Pi electronic instrument and a Raspberry Pi camera, Pi NoIR (No Infrared filter) dotted with additional infrared light, which is embedded into a pair of sunglasses that permit blind and visually impaired people to independently handle Euro banknotes, especially when receiving their cash back when shopping. The banknote detection is based on the modified Viola and Jones algorithms, while the banknote value recognition relies on the Speeded Up Robust Features (SURF) technique. The accuracies of banknote detection and banknote value recognition are 84% and 97.5%, respectively.

  17. Euro Banknote Recognition System for Blind People

    Science.gov (United States)

    Dunai Dunai, Larisa; Chillarón Pérez, Mónica; Peris-Fajarnés, Guillermo; Lengua Lengua, Ismael

    2017-01-01

    This paper presents the development of a portable system with the aim of allowing blind people to detect and recognize Euro banknotes. The developed device is based on a Raspberry Pi electronic instrument and a Raspberry Pi camera, Pi NoIR (No Infrared filter) dotted with additional infrared light, which is embedded into a pair of sunglasses that permit blind and visually impaired people to independently handle Euro banknotes, especially when receiving their cash back when shopping. The banknote detection is based on the modified Viola and Jones algorithms, while the banknote value recognition relies on the Speeded Up Robust Features (SURF) technique. The accuracies of banknote detection and banknote value recognition are 84% and 97.5%, respectively. PMID:28117703

  18. Non Audio-Video gesture recognition system

    DEFF Research Database (Denmark)

    Craciunescu, Razvan; Mihovska, Albena Dimitrova; Kyriazakos, Sofoklis

    2016-01-01

    Gesture recognition is a topic in computer science and language technology with the goal of interpreting human gestures via mathematical algorithms. Gestures can originate from any bodily motion or state but commonly originate from the face or hand. Current research focus includes on the emotion...... that can be connected to any computer on the market. The paper proposes an equation that relates distance and voltage for the Sharp GP2Y0A21 and GP2D120 sensors when a hand is used as the reflective object. In the end, the presented system is compared with other audio/video system...

  19. System and method for character recognition

    Science.gov (United States)

    Hong, J. P. (Inventor)

    1974-01-01

    A character recognition system is disclosed in which each character in a retina, defining a scanning raster, is scanned with random lines uniformly distributed over the retina. For each type of character to be recognized the system stores a probability density function (PDF) of the random line intersection lengths and/or a PDF of the random line number of intersections. As an unknown character is scanned, the random line intersection lengths and/or the random line number of intersections are accumulated and based on a comparison with the prestored PDFs a classification of the unknown character is performed.

  20. Device-Free Indoor Activity Recognition System

    Directory of Open Access Journals (Sweden)

    Mohammed Abdulaziz Aide Al-qaness

    2016-11-01

    Full Text Available In this paper, we explore the properties of the Channel State Information (CSI) of WiFi signals and present a device-free indoor activity recognition system. Our proposed system uses only one ubiquitous router access point and a laptop as a detection point, while the user is free and needs neither to wear sensors nor to carry devices. The proposed system recognizes six daily activities: walk, crawl, fall, stand, sit, and lie. We have built the prototype with an effective feature extraction method and a fast classification algorithm. The proposed system has been evaluated in a real and complex environment in both line-of-sight (LOS) and non-line-of-sight (NLOS) scenarios, and the results validate the performance of the proposed system.

  1. Automatic TLI recognition system. Part 1: System description

    Energy Technology Data Exchange (ETDEWEB)

    Partin, J.K.; Lassahn, G.D.; Davidson, J.R.

    1994-05-01

    This report describes an automatic target recognition system for fast screening of large amounts of multi-sensor image data, based on low-cost parallel processors. This system uses image data fusion and gives uncertainty estimates. It is relatively low cost, compact, and transportable. The software is easily enhanced to expand the system's capabilities, and the hardware is easily expandable to increase the system's speed. This volume gives a general description of the ATR system.

  2. Exploring the anatomical encoding of voice with a mathematical model of the vocal system.

    Science.gov (United States)

    Assaneo, M Florencia; Sitt, Jacobo; Varoquaux, Gael; Sigman, Mariano; Cohen, Laurent; Trevisan, Marcos A

    2016-11-01

    The faculty of language depends on the interplay between the production and perception of speech sounds. A relevant open question is whether the dimensions that organize voice perception in the brain are acoustical or depend on properties of the vocal system that produced it. One of the main empirical difficulties in answering this question is to generate sounds that vary along a continuum according to the anatomical properties of the vocal apparatus that produced them. Here we use a mathematical model that offers the unique possibility of synthesizing vocal sounds by controlling a small set of anatomically based parameters. In a first stage, the quality of the synthetic voice was evaluated. Using specific time traces for sub-glottal pressure and tension of the vocal folds, the synthetic voices generated perceptual responses that are indistinguishable from those of real speech. The synthesizer was then used to investigate how the auditory cortex responds to the perception of voice depending on the anatomy of the vocal apparatus. Our fMRI results show that sounds are perceived as human vocalizations when produced by a vocal system that follows a simple relationship between the size of the vocal folds and the vocal tract. We found that these anatomical parameters encode the perceptual vocal identity (male, female, child) and show that the brain areas that respond to human speech also encode vocal identity. On the basis of these results, we propose that this low-dimensional model of the vocal system is capable of generating realistic voices and represents a novel tool to explore voice perception with precise control of the anatomical variables that generate speech. Furthermore, the model provides an explanation of how auditory cortices encode voices in terms of the anatomical parameters of the vocal system.

  3. A Massively Parallel Face Recognition System

    Directory of Open Access Journals (Sweden)

    Lahdenoja Olli

    2007-01-01

    Full Text Available We present methods for processing the LBPs (local binary patterns) with massively parallel hardware, especially with the CNN-UM (cellular nonlinear network-universal machine). In particular, we present a framework for implementing a massively parallel face recognition system, including a dedicated highly accurate algorithm suitable for various types of platforms (e.g., CNN-UM and digital FPGA). We study in detail a dedicated mixed-mode implementation of the algorithm and estimate its implementation cost in view of its performance and accuracy restrictions.
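
    For readers unfamiliar with local binary patterns, the basic 3x3 operator can be sketched in a few lines. This is the textbook sequential form, shown only to make the feature concrete; the parallel CNN-UM and FPGA mappings that the paper studies are not represented, and the neighbour ordering is one common convention among several.

```python
def lbp_code(image: list[list[int]], row: int, col: int) -> int:
    """8-bit LBP code of an interior pixel from its 3x3 neighbourhood."""
    center = image[row][col]
    # Neighbours visited clockwise, starting at the top-left corner.
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    code = 0
    for bit, (dr, dc) in enumerate(offsets):
        # A neighbour at least as bright as the centre sets its bit.
        if image[row + dr][col + dc] >= center:
            code |= 1 << bit
    return code


def lbp_histogram(image: list[list[int]]) -> list[int]:
    """256-bin histogram of LBP codes over all interior pixels."""
    hist = [0] * 256
    for r in range(1, len(image) - 1):
        for c in range(1, len(image[0]) - 1):
            hist[lbp_code(image, r, c)] += 1
    return hist


if __name__ == "__main__":
    patch = [[5, 9, 1],
             [4, 6, 7],
             [2, 3, 8]]
    print(lbp_code(patch, 1, 1))
```

    Each pixel's code depends only on its immediate neighbours, which is why the operator suits cell-parallel hardware.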

  4. A Massively Parallel Face Recognition System

    Directory of Open Access Journals (Sweden)

    Ari Paasio

    2006-12-01

    Full Text Available We present methods for processing the LBPs (local binary patterns) with massively parallel hardware, especially with the CNN-UM (cellular nonlinear network-universal machine). In particular, we present a framework for implementing a massively parallel face recognition system, including a dedicated highly accurate algorithm suitable for various types of platforms (e.g., CNN-UM and digital FPGA). We study in detail a dedicated mixed-mode implementation of the algorithm and estimate its implementation cost in view of its performance and accuracy restrictions.

  5. A Development of a System Enables Character Input and PC Operation via Voice for a Physically Disabled Person with a Speech Impediment

    Science.gov (United States)

    Tanioka, Toshimasa; Egashira, Hiroyuki; Takata, Mayumi; Okazaki, Yasuhisa; Watanabe, Kenzi; Kondo, Hiroki

    We have designed and implemented a voice-operated PC support system for a physically disabled person with a speech impediment. Voice operation is an effective method for a physically disabled person with involuntary movement of the limbs and the head. We applied a commercial speech recognition engine to develop our system for practical purposes. Adopting a commercial engine reduces development cost and will help make our system useful to other people with speech impediments. We customized the commercial speech recognition engine so that it can recognize the utterances of a person with a speech impediment. We restricted the words that the recognition engine recognizes and separated target words from similarly pronounced words to avoid misrecognition. The huge number of words registered in commercial speech recognition engines causes frequent misrecognition of utterances by people with speech impediments, because their utterances are unclear and unstable. We solved this problem by narrowing the choice of input down to a small number of words and by registering their ambiguous pronunciations in addition to the original ones. To realize all character input and all PC operations with a small number of words, we designed multiple input modes with categorized dictionaries and introduced two-step input in each mode except numeral input. The system we have developed has reached a practical level. The first author of this paper is physically disabled with a speech impediment. He has been able not only to input characters into a PC but also to operate the Windows system smoothly by using this system. He uses this system in his daily life. This paper was written by him with this system. At present, the speech recognition is customized to him. It is, however, possible to customize it for other users by changing words and registering new pronunciations according to each user's utterances.

  6. Cross domains Arabic named entity recognition system

    KAUST Repository

    Al-Ahmari, S. Saad

    2016-07-11

    Named Entity Recognition (NER) plays an important role in many Natural Language Processing (NLP) applications such as Information Extraction (IE), Question Answering (QA), Text Clustering, Text Summarization and Word Sense Disambiguation. This paper presents the development and implementation of a domain-independent system to recognize three types of Arabic named entities. The system is based on a set of domain-independent grammar rules along with an Arabic part-of-speech tagger, in addition to gazetteers and lists of trigger words. The experimental results show that the system performed as well as other systems, with better results in some cases on cross-domain corpora. © (2016) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.

  7. A commercial large-vocabulary discrete speech recognition system: DragonDictate.

    Science.gov (United States)

    Mandel, M A

    1992-01-01

    DragonDictate is currently the only commercially available general-purpose, large-vocabulary speech recognition system. It uses discrete speech and is speaker-dependent, adapting to the speaker's voice and language model with every word. Its acoustic adaptability is based on a three-level phonology and a stochastic model of production. The phonological levels are phonemes, augmented triphones (phonemes-in-context or PICs), and steady-state spectral slices that are concatenated to approximate the spectra of these PICs (phonetic elements or PELs) and thus of words. Production is treated as a hidden Markov process, which the recognizer has to identify from its output, the spoken word. Findings of practical value to speech recognition are presented from research on six European languages.

  8. Method and System for Object Recognition Search

    Science.gov (United States)

    Duong, Tuan A. (Inventor); Duong, Vu A. (Inventor); Stubberud, Allen R. (Inventor)

    2012-01-01

    A method for object recognition using shape and color features of the object to be recognized. An adaptive architecture is used to recognize and adapt the shape and color features for moving objects to enable object recognition.

  9. Phonological systems of pediatric cochlear implant users: The acquisition of voicing

    Science.gov (United States)

    Chin, Steven B.; Oglesbee, Eric N.; Kirk, Andrew K.; Krug, Joseph E.

    2005-04-01

    Although cochlear implants are primarily auditory prostheses, they have also demonstrated their usefulness as aids to speech production and the acquisition of spoken language in children. This presentation reports on research currently being conducted at the Indiana University Medical Center on the development of phonological systems by children with five or more years of cochlear implant use in English-speaking environments. Characteristics of the feature [voice] will be examined in children with cochlear implants and in two comparison groups: adults with normal hearing and children with normal hearing. Specific aspects of voicing to be discussed include characteristic error patterns, phonetic implementation of the voicing contrast, and phonetic implementation of neutralization of the voicing contrast. Much of the evidence obtained thus far indicates that voicing acquisition in children with cochlear implants is not radically different from that of children with normal hearing. Many differences between the systems of children with cochlear implants and the ambient system thus appear to reflect the children's age as much as their hearing status. [Work supported by grants from the National Institutes of Health to Indiana University: R01DC005594 and R03DC003852.]

  10. Voice, Schooling, Inequality, and Scale

    Science.gov (United States)

    Collins, James

    2013-01-01

    The rich studies in this collection show that the investigation of voice requires analysis of "recognition" across layered spatial-temporal and sociolinguistic scales. I argue that the concepts of voice, recognition, and scale provide insight into contemporary educational inequality and that their study benefits, in turn, from paying attention to…

  11. Comparison of post menopausal voice changes across professional and non-professional users of the voice

    Directory of Open Access Journals (Sweden)

    Pallavi Vishwas Sovani

    2010-12-01

    Full Text Available Menopause effects a permanent change in certain body functions, one of them being voice. Moreover, if the voice is used continuously as a part of one’s occupation, this may further impact postmenopausal voice changes. The present study investigated the impact of menopause and professional voice use, and their interaction effect, on the voice. 92 women were classified into reproductive (52) and postmenopausal (40). Each group was divided into Level II (teachers) and Level IV (clerks) of Koufman and Isaacson’s (1991) classification. Acoustic parameters were analyzed using the VisiPitch III software. Aerodynamic parameters were manually calculated. The VHI (Voice Handicap Index) was also included to improve the face validity of the study. Results suggest that Fo, SFo and MPT reduce post menopause while NHR and VTI increase. Some changes are accelerated in teachers as compared to clerks while some are decelerated. VHI scores of teachers are significantly greater than clerks, though not significantly different across menopause. Thus the presence or absence of voice use in one’s profession differentially affects postmenopausal changes. The study has implications in improving the condition of teachers in India, developing norms for menopausal changes and modifying allowable limits for voice recognition systems in future.

  12. 75 FR 41509 - Notice of Proposed Information Collection for Public Comment; LOCCS Voice Response System Payment...

    Science.gov (United States)

    2010-07-16

    ... System Payment Vouchers for Public and Indian Housing Programs AGENCY: Office of the Assistant Secretary... Payment Vouchers for Public and Indian Housing Programs. OMB Control Number: 2577-0166. Agency form number... voice activated system. The information collected on the payment voucher will also be used as...

  13. Speech Rate Control for Improving Elderly Speech Recognition of Smart Devices

    Directory of Open Access Journals (Sweden)

    SON, G.

    2017-05-01

    Full Text Available Although smart devices have become a widely adopted tool for communication in modern society, they still present a steep learning curve for the elderly. By introducing a voice-based interface using voice recognition technology, smart devices can become more user-friendly and useful to the elderly. However, the voice recognition technology used in current devices is attuned to the voice patterns of the young. Therefore, speech recognition falters when an elderly user speaks into the device. This paper identifies the elderly's improper per-syllable speech rate as a contributor to failures of the voice recognition system. After the speech rate of each syllable was modified, the voice recognition rate increased by 12.3%. This paper demonstrates that by simply modifying the speech rate of each syllable, which is one of the factors that cause errors in voice recognition, the recognition rate can be substantially increased. Such improvements in voice recognition technology can make it easier for the elderly to operate smart devices that will allow them to be more socially connected in a mobile world and access information at their fingertips. It may also be helpful in bridging the communication divide between generations.

  14. Speaker Recognition

    DEFF Research Database (Denmark)

    Mølgaard, Lasse Lohilahti; Jørgensen, Kasper Winther

    2005-01-01

    Speaker recognition is basically divided into speaker identification and speaker verification. Verification is the task of automatically determining if a person really is the person he or she claims to be. This technology can be used as a biometric feature for verifying the identity of a person...... in applications like banking by telephone and voice mail. The focus of this project is speaker identification, which consists of mapping a speech signal from an unknown speaker to a database of known speakers, i.e. the system has been trained with a number of speakers which the system can recognize....

  15. Research on Face Recognition Based on Embedded System

    Directory of Open Access Journals (Sweden)

    Hong Zhao

    2013-01-01

    Full Text Available Because face recognition requires storing a large amount of image feature data and executing complex calculations, it has so far been realized only on high-performance PCs. In this paper, the OpenCV facial Haar-like features were used to identify the face region; Principal Component Analysis (PCA) was employed for quick extraction of face features, and the Euclidean distance was adopted for face recognition. Thus, the data amount and computational complexity of face recognition were reduced effectively, and face recognition could be carried out on an embedded platform. Finally, based on the Tiny6410 embedded platform, an embedded face recognition system was constructed. The test results showed that the system operates stably, achieves a high recognition rate, and can be used for portable and mobile identification and authentication.

  16. Intelligent Home Speech Recognition System Based on NL6621

    Institute of Scientific and Technical Information of China (English)

    王爱芸

    2015-01-01

    Research on intelligent home speech recognition systems is very important for the development of the smart home. Through analysis of embedded speech recognition technology and smart home control technology, voice is recorded with the NL6621 board as the platform and the VS1003 as the audio decoding chip. The Hidden Markov Model (HMM) algorithm is used to carry out voice model training and voice matching, yielding a smart home voice control system. Experiments show that the speech control system has a high recognition rate and real-time performance.

  17. Developing a Credit Recognition System for Chinese Higher Education Institutions

    Science.gov (United States)

    Li, Fuhui

    2015-01-01

    In recent years, a credit recognition system has been developing in Chinese higher education institutions. Much research has been done on this development, but it has been concentrated on system building, barriers/issues and international practices. The relationship between credit recognition system reforms and democratisation of higher education…

  18. Self Assistive Technology for Disabled People – Voice Controlled Wheel Chair and Home Automation System

    Directory of Open Access Journals (Sweden)

    R. Puviarasi

    2014-07-01

    Full Text Available This paper describes the design of an innovative and low-cost self-assistive technology that lets disabled people control a wheelchair and home appliances with voice commands. The proposed system provides an alternative for physically challenged people with quadriplegia who are permanently unable to move their limbs (but who are able to speak and hear) and for elderly people, allowing them to control the motion of the wheelchair and home appliances with their voices and so lead an independent, confident and enjoyable life. The performance of this microcontroller-based, voice-integrated design is evaluated in terms of accuracy and velocity in various environments. The results show that it could be part of an assistive technology that disabled persons can use without a third person’s assistance.

  19. Currency Recognition System Using Image Processing

    Directory of Open Access Journals (Sweden)

    S. M. Saifullah

    2015-11-01

    Full Text Available In the last few years, with great technological advances in color printing, duplicating and scanning, counterfeiting problems have become more serious. In the past, only authorized printing houses were able to make currency paper, but nowadays anyone can print fake banknotes with the help of modern technology such as computers and laser printers. Fake notes are a burning issue in almost every country. Like other countries, Bangladesh has been hit really hard, and counterfeiting has become a very acute problem. Therefore there is a need to design a currency recognition system that can easily tell real from fake banknotes without a time-consuming process. Our system describes an approach for verification of Bangladeshi currency banknotes. The currency is verified using image processing techniques. The approach consists of a number of components, including image processing, image segmentation, feature extraction, and image comparison. The system is designed in MATLAB. Image processing involves changing the nature of an image in order to improve its pictorial information for human interpretation. The image processing software is a collection of functions that extends the capability of the MATLAB numeric computing environment. The result indicates whether the currency is real or fake.

  20. Data Equivalency of an Interactive Voice Response System for Home Assessment of Back Pain and Function

    Directory of Open Access Journals (Sweden)

    William S Shaw

    2007-01-01

    Full Text Available BACKGROUND: Interactive voice response (IVR) systems that collect survey data using automated, push-button telephone responses may be useful to monitor patients’ pain and function at home; however, their equivalency to other data collection methods has not been studied.

  1. Speech recognition system based on LPCC parameter

    Institute of Scientific and Technical Information of China (English)

    王彪

    2012-01-01

    To recognize simple speech, a speech recognition system based on LPCC parameters is designed. Its main functions are recording, playback, and preprocessing of speech signals, segmented filtering, feature extraction, and speech recognition. Simulation experiments verify that the system meets the requirements for recognizing simple speech, but some aspects still need improvement, such as whether more complex speech can be recognized in complex environments.

  2. Random-Profiles-Based 3D Face Recognition System

    Directory of Open Access Journals (Sweden)

    Joongrock Kim

    2014-03-01

    Full Text Available In this paper, a novel nonintrusive three-dimensional (3D) face modeling system for random-profile-based 3D face recognition is presented. Although recent two-dimensional (2D) face recognition systems can achieve a reliable recognition rate under certain conditions, their performance is limited by internal and external changes, such as illumination and pose variation. To address these issues, 3D face recognition, which uses 3D face data, has recently received much attention. However, the performance of 3D face recognition depends heavily on the precision of the acquired 3D face data, and it requires more computational power and storage capacity than 2D face recognition systems. In this paper, we present a nonintrusive 3D face modeling system composed of a stereo vision system and an invisible near-infrared line laser, which can be directly applied to profile-based 3D face recognition. We further propose a novel random-profile-based 3D face recognition method that is memory-efficient and pose-invariant. The experimental results demonstrate that the reconstructed 3D face data consist of more than 50 k 3D points and that the method achieves a reliable recognition rate under pose variation.

  3. Fractal Dimension of Voice-Signal Waveforms

    Institute of Scientific and Technical Information of China (English)

    2002-01-01

    The fractal dimension is one important parameter that characterizes waveforms. In this paper, we derive a new method to calculate the fractal dimension of digital voice-signal waveforms. We show that the fractal dimension is an efficient tool for speaker recognition or speech recognition. It can be used to identify different speakers or distinguish speech. We apply our results to Chinese speaker recognition, and numerical experiments show that the fractal dimension is an efficient parameter to characterize individual Chinese speakers. We have developed a semiautomatic voiceprint analysis system based on the theory of this paper and earlier research.

  4. The role of the medial temporal limbic system in processing emotions in voice and music.

    Science.gov (United States)

    Frühholz, Sascha; Trost, Wiebke; Grandjean, Didier

    2014-12-01

    Subcortical brain structures of the limbic system, such as the amygdala, are thought to decode the emotional value of sensory information. Recent neuroimaging studies, as well as lesion studies in patients, have shown that the amygdala is sensitive to emotions in voice and music. Similarly, the hippocampus, another part of the temporal limbic system (TLS), is responsive to vocal and musical emotions, but its specific roles in emotional processing from music and especially from voices have been largely neglected. Here we review recent research on vocal and musical emotions, and outline commonalities and differences in the neural processing of emotions in the TLS in terms of emotional valence, emotional intensity and arousal, as well as in terms of acoustic and structural features of voices and music. We summarize the findings in a neural framework including several subcortical and cortical functional pathways between the auditory system and the TLS. This framework proposes that some vocal expressions might already receive a fast emotional evaluation via a subcortical pathway to the amygdala, whereas cortical pathways to the TLS are thought to be equally used for vocal and musical emotions. While the amygdala might be specifically involved in a coarse decoding of the emotional value of voices and music, the hippocampus might process more complex vocal and musical emotions, and might have an important role especially for the decoding of musical emotions by providing memory-based and contextual associations.

  5. EasyVoice: Integrating voice synthesis with Skype

    CERN Document Server

    Condado, Paulo A

    2007-01-01

    This paper presents EasyVoice, a system that integrates voice synthesis with Skype. EasyVoice allows a person with voice disabilities to talk with another person located anywhere in the world, removing an important obstacle that affects these people during a phone or VoIP-based conversation.

  6. Validity of jitter measures in non-quasi-periodic voices. Part I: perceptual and computer performances in cycle pattern recognition.

    Science.gov (United States)

    Dejonckere, Philippe; Schoentgen, Jean; Giordano, Andrea; Fraj, Samia; Bocchi, Leonardo; Manfredi, Claudia

    2011-07-01

    The limit of about 5% for reliable quantification of jitter in sustained vowels of dysphonic voices, a widely accepted guideline, deserves critical analysis. The present study pertains to the effect of experience and training on the perceptual (visual) capability of correctly identifying periods in (highly) perturbed signals, and to a comparison of the performance of several programs for voice analysis. Synthesized realistic vowels (/a:/) with exactly known jitter (2.7%-31.5%) are used as material. After selection and training, experienced raters demonstrate excellent agreement in correctly identifying periods up to high values of input jitter. Perceptual rating outperforms all computer programs in accuracy. Most remain reliable up to 10% jitter; one of them correctly measures up to the highest level.
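
    For readers unfamiliar with the percentages quoted in this abstract, the sketch below computes one common definition of jitter, often called local jitter: the mean absolute difference between consecutive cycle lengths divided by the mean cycle length. The abstract does not state which definition the study used, so the formula choice and the function name are assumptions, and the periods are taken as already identified.

```python
def local_jitter_percent(periods):
    """Local jitter in percent from a sequence of glottal cycle lengths.

    `periods` holds consecutive cycle durations in any consistent unit
    (for example milliseconds). At least two cycles are required.
    """
    if len(periods) < 2:
        raise ValueError("need at least two periods")
    # Absolute change in duration from each cycle to the next.
    diffs = [abs(b - a) for a, b in zip(periods, periods[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_period = sum(periods) / len(periods)
    return 100.0 * mean_diff / mean_period


# Cycles alternating between 9 and 11 units differ by 2 around a mean of 10.
print(local_jitter_percent([9, 11, 9, 11]))  # 20.0
```

    The hard part addressed by the study, finding the cycle boundaries in a highly perturbed signal, happens before this calculation and is not covered by the sketch.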

  7. A Multi-Modal Recognition System Using Face and Speech

    Directory of Open Access Journals (Sweden)

    Samir Akrouf

    2011-05-01

    Full Text Available Person recognition has attracted growing interest, especially for security reasons. Recognition performed by a biometric system using a single modality tends to perform worse because of noisy sensor data, restricted degrees of freedom and unacceptable error rates. To alleviate some of these problems we use multimodal biometric systems, which provide better recognition results. By combining different modalities, such as speech, face, fingerprint, etc., we increase the performance of recognition systems. In this paper, we study the fusion of speech and face in a recognition system for taking a final decision (i.e., accept or reject an identity claim). We evaluate the performance of each system separately, then we fuse the results and compare the performances.
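
    The abstract does not say how the face and speech results are fused. As one possible reading, the sketch below uses weighted-sum score fusion followed by a threshold, a common baseline for multimodal biometrics. The function names, the default weight and threshold, and the assumption that both matchers return scores already normalized to [0, 1] are all hypothetical.

```python
def fuse_scores(face_score, speech_score, face_weight=0.5):
    """Weighted sum of two match scores assumed to lie in [0, 1]."""
    if not 0.0 <= face_weight <= 1.0:
        raise ValueError("face_weight must be between 0 and 1")
    return face_weight * face_score + (1.0 - face_weight) * speech_score


def decide(face_score, speech_score, threshold=0.5, face_weight=0.5):
    """Accept the identity claim when the fused score reaches the threshold."""
    fused = fuse_scores(face_score, speech_score, face_weight)
    return "accept" if fused >= threshold else "reject"


# A strong face match can compensate for a weak speech match.
print(decide(0.9, 0.3))  # accept
print(decide(0.2, 0.4))  # reject
```

    In practice the weight and threshold would be tuned on held-out data, trading false acceptances against false rejections.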

  8. A Prototype System for Controlling a Computer by Head Movements and Voice Commands

    CERN Document Server

    Ismail, Anis; Hajjar, Mohammad

    2011-01-01

    This paper introduces a new prototype system for controlling a PC by head movements and voice commands. Our system is a multimodal interface for controlling the computer. The selected modes of interaction are speech and gestures. We are seeing the revolutionary introduction of computers and information technologies into daily practice. Healthy people use a keyboard, mouse, trackball, or touchpad for controlling the PC. However, these peripherals are usually not suitable for handicapped people. They may have problems using these standard peripherals, for example when they suffer from myopathy, or cannot move their hands after an injury. Our system has been developed to provide computer access for people with severe disabilities. This system tracks the computer user's head movements with a video camera and translates them into movements of the mouse pointer on the screen, and translates the voice into button presses. Therefore we are coming with a proposal system that can be used with handicapped people to control t...

  9. Iris analysis for biometric recognition systems

    CERN Document Server

    Bodade, Rajesh M

    2014-01-01

    The book presents three of the most significant areas in Biometrics and Pattern Recognition. A step-by-step approach for design and implementation of Dual Tree Complex Wavelet Transform (DTCWT) plus Rotated Complex Wavelet Filters (RCWF) is discussed in detail. In addition, the book provides a detailed analysis of iris images and two methods of iris segmentation. It also discusses a simplified study of some subspace-based methods and distance measures for iris recognition, backed by empirical studies and statistical success verifications.

  10. Single and Multiple Hand Gesture Recognition Systems: A Comparative Analysis

    Directory of Open Access Journals (Sweden)

    Siddharth Rautaray

    2014-10-01

    Full Text Available With the evolution of higher computing speed, efficient communication technologies, and advanced display techniques, legacy HCI techniques have become obsolete and no longer support an accurate and fast flow of information in present-day computing devices. Hence user-friendly human-machine interfaces for real-time human-computer interaction have to be designed and developed to make man-machine interaction more intuitive. Vision-based hand gesture recognition gives users the ability to interact with computers in more natural and intuitive ways. These gesture recognition systems generally consist of three main modules, namely hand segmentation, hand tracking and gesture recognition from hand features, designed using different image processing techniques and further integrated with different applications. New interfaces based on hand gesture recognition are increasingly used for interaction with computing devices. This paper is an effort to provide a comparative analysis of such real-time vision-based hand gesture recognition systems that are based on interaction using single and multiple hand gestures. Single hand gesture recognition systems (SHGRS) are less complex to implement but constrain the number of distinct gestures, whereas multiple hand gesture recognition systems (MHGRS) allow a large number of gestures through various permutations and combinations of multiple hands. A thorough comparative analysis has also been done on various other vital parameters of the recognition systems.

  11. Semantic Model for Voice Controlled Telephone Dialing and Inquiry Systems

    Institute of Scientific and Technical Information of China (English)

    2000-01-01

    A new scheme is presented to detect a large number of keywords in voice controlled switchboard tasks. The new scheme is based on two stages. In the first stage, N-best syllable candidates with their corresponding acoustic scores are generated by an acoustic recognizer. In the second stage, a semantic model based parser is applied to determine the optimum keywords by searching through the lattice of N-best candidates. The experimental results show that when the spoken input deviates from the predefined syntactic constraints, the parser can also demonstrate high performance. For comparison purposes, the most common way to incorporate the syntactic knowledge of the task directly into the acoustic recognizer in the form of a finite state network is also investigated. Furthermore, to address the sparse-data problems, out-of-domain data in the form of newspaper text are used to obtain a more robust combined semantic model. The experiments show that the combined semantic model can improve the keywords detection rate from 90.07% to 92.91% when 80 ungrammatical sentences which do not conform to the task grammar are used as testing material.

  12. The Study of Application System for Small and Medium CTI Based on Voice Card

    Directory of Open Access Journals (Sweden)

    Zhong Dong

    2016-01-01

    Full Text Available With the rapid development of computer telecommunications integration (CTI) technology, application systems for small and medium CTI are constantly being updated, but their study still lacks a stable and unified model. In this paper, the author analyzes the unified structure platform of application systems for small and medium CTI based on a voice card. The author also introduces a suitable software architecture model and a general procedural framework for such systems using the idea of hierarchical design, which shows the versatility of the architecture. It provides an efficient channel for the development of small and medium CTI.

  13. Ensemble Feature Extraction Modules for Improved Hindi Speech Recognition System

    Directory of Open Access Journals (Sweden)

    Malay Kumar

    2012-05-01

    Full Text Available Speech is the most natural way of communication between human beings. The field of speech recognition raises the intriguing prospect of man-machine conversation, and because of its versatile applications, automatic speech recognition (ASR) systems have been designed. In this paper we present a novel approach to Hindi speech recognition that uses an ensemble of feature extraction modules of ASR systems and combines their outputs with the voting technique ROVER. Experimental results show that the proposed system produces better results than traditional ASR systems.
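
    To illustrate the voting idea behind combining recognizer outputs, the sketch below takes a majority vote slot by slot over hypotheses that are assumed to be already aligned to the same length, with an empty string marking a gap. This is a simplification: ROVER itself first aligns the hypotheses into a word transition network and can also weigh confidence scores, neither of which is shown. The function name and the tie-breaking rule (earliest system wins) are assumptions.

```python
from collections import Counter


def vote(aligned_hypotheses):
    """Majority vote per slot over pre-aligned word sequences.

    Each hypothesis is a list of words; "" marks a gap in the alignment.
    Ties are resolved in favour of the earliest system in the list.
    """
    lengths = {len(h) for h in aligned_hypotheses}
    if len(lengths) != 1:
        raise ValueError("hypotheses must be non-empty and equally long")
    result = []
    for slot in zip(*aligned_hypotheses):
        counts = Counter(slot)
        best = max(counts.values())
        winner = next(word for word in slot if counts[word] == best)
        if winner:  # drop slots where the gap wins the vote
            result.append(winner)
    return result


hypotheses = [
    ["open", "the", "door"],
    ["open", "a", "door"],
    ["open", "the", "floor"],
]
print(vote(hypotheses))  # ['open', 'the', 'door']
```

    Each system makes a different single error here, so the vote recovers a sequence that none of the errors survive in.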

  14. A quality integrated spectral minutiae fingerprint recognition system

    NARCIS (Netherlands)

    Xu, Haiyun; Veldhuis, Raymond N.J.; Kevenaar, Tom A.M.; Akkermans, Anton H.M.

    2009-01-01

    Many fingerprint recognition systems are based on minutiae matching. However, the recognition accuracy of minutiae-based matching algorithms is highly dependent on the fingerprint minutiae quality. Therefore, in this paper, we introduce a quality integrated spectral minutiae algorithm, in which the

  15. FUNDAMENTALS OF SPEAKER RECOGNITION

    OpenAIRE

    ERTAŞ, Figen

    2000-01-01

    The explosive growth of information technology in the last decade has made a considerable impact on the design and construction of systems for human-machine communication, which is becoming increasingly important in many aspects of life. Amongst other speech processing tasks, a great deal of attention has been devoted to developing procedures that identify people from their voices, and the design and construction of speaker recognition systems has been a fascinating enterprise pursued over ma...

  16. FUNDAMENTALS OF SPEAKER RECOGNITION

    Directory of Open Access Journals (Sweden)

    Figen ERTAŞ

    2000-02-01

    Full Text Available The explosive growth of information technology in the last decade has made a considerable impact on the design and construction of systems for human-machine communication, which is becoming increasingly important in many aspects of life. Amongst other speech processing tasks, a great deal of attention has been devoted to developing procedures that identify people from their voices, and the design and construction of speaker recognition systems has been a fascinating enterprise pursued over many decades. This paper introduces speaker recognition in general and discusses its relevant parameters in relation to system performance.

  17. Hands-free speech after surgical voice rehabilitation with a Provox voice prosthesis: experience with the Provox FreeHands HME tracheostoma valve system.

    Science.gov (United States)

    Lorenz, K J; Groll, K; Ackerstaff, A H; Hilgers, F J M; Maier, H

    2007-02-01

    Excellent results have been reported with the use of voice prostheses for the rehabilitation of laryngectomees. Patients, however, consider it a disadvantage that the tracheostoma must be closed manually for speech production. This limits their ability to simultaneously communicate by gesture or to work with both hands. An automatic tracheostoma valve helps patients overcome this problem. We describe a prospective clinical trial evaluating our experience with the Provox FreeHands HME Automatic Tracheostoma Valve system. Twenty-four laryngectomees were randomly selected from the patients who had undergone laryngectomy at the ENT Department. Immediately after being fitted with a Provox FreeHands HME, and again 4 weeks and 6 months later, the patients were asked to complete a questionnaire in order to assess their satisfaction, voice quality, wearing comfort, fixation, potential problems, and the effectiveness of the HME cassette. In addition, we investigated relevant voice quality parameters including dynamic range, frequency range of the speaking voice, and maximum phonation time. Seven patients discontinued the study due to problems of securing the valve to the skin (four patients) or recurrent cancer (three patients). Ten of the remaining 17 patients wore the valve daily for an average of 8.4 h. A total of 88% of the patients considered it a great advantage to be able to speak without having to use their hands. With the Provox FreeHands HME, maximum phonation time was 8.7 (+/-6.2) s and the dynamic range was 21.9 (+/-5.8) decibels. The results show that the Provox FreeHands HME Automatic Tracheostoma Valve system not only allows hands-free speech but is also associated with excellent compliance and good voice rehabilitation.

  18. The linear position tracking servo system using a linear voice-coil motor

    Institute of Scientific and Technical Information of China (English)

    2008-01-01

    Linear servo systems are increasingly used in servo applications. Direct drive technology can greatly increase the bandwidth and the tracking accuracy. A position servo system based on a linear voice-coil motor was designed for a linear oscillation movement application. Besides the conventional position, speed and current control loops, speed and acceleration feed-forward control of the command position signal was also used. Experimental tests confirmed the correctness of the design: the system can track a given periodic sinusoidal position command signal of 15 Hz with high accuracy. The linear voice-coil motor is well suited to short-stroke position tracking applications requiring high dynamic response.
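    To illustrate why speed and acceleration feed-forward helps when tracking a 15 Hz sinusoid, the following sketch simulates a strongly simplified version of such a loop. It is not the design from the paper: the current loop is omitted, the motor is reduced to an ideal force acting on a mass, and the mass, gains, amplitude and time step are all assumed values chosen only for illustration.

```python
import math


def simulate(use_feedforward, freq_hz=15.0, amp=1e-3, dt=1e-5, t_end=0.5):
    """Track x_ref(t) = amp * sin(2*pi*f*t) with a PD position loop.

    Returns the RMS tracking error over the second half of the run,
    after the start-up transient has decayed.
    """
    m = 1.0                       # moving mass in kg (assumed)
    wn = 2 * math.pi * 100.0      # closed-loop natural frequency (assumed)
    kp, kd = wn ** 2, 2 * 0.7 * wn
    w = 2 * math.pi * freq_hz
    x = v = 0.0
    sq_sum, n = 0.0, 0
    steps = int(round(t_end / dt))
    for k in range(steps):
        t = k * dt
        x_ref = amp * math.sin(w * t)
        v_ref = amp * w * math.cos(w * t)
        a_ref = -amp * w * w * math.sin(w * t)
        if k >= steps // 2:
            sq_sum += (x_ref - x) ** 2
            n += 1
        if use_feedforward:
            # Feedback acts on position and speed errors; the reference
            # speed and acceleration are fed forward.
            acc_cmd = a_ref + kd * (v_ref - v) + kp * (x_ref - x)
        else:
            # Feedback only: position error plus damping on measured speed.
            acc_cmd = kp * (x_ref - x) - kd * v
        force = m * acc_cmd
        v += (force / m) * dt     # semi-implicit Euler integration
        x += v * dt
    return math.sqrt(sq_sum / n)


err_with_ff = simulate(True)
err_without_ff = simulate(False)
```

    In this idealized model the feed-forward terms cancel the reference dynamics, so the remaining error comes only from numerical integration; without them the loop lags the 15 Hz command noticeably. A real drive would also have to cope with model mismatch, friction and current-loop dynamics.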

  19. Application of Embedded Speech Recognition System Based on LD3320

    Institute of Scientific and Technical Information of China (English)

    洪家平

    2012-01-01

    A voice interaction system is a user-friendly human-machine interface, and it requires the support of a voice recognition system. The LD3320 is one such voice recognition chip. This article introduces its operating principle and typical applications, and provides the hardware interface circuit and program flowchart for connecting the LD3320 to a microcontroller. As high-end MCUs continue to develop, MCU-based embedded voice interaction systems have good application prospects.

  20. 8th International Conference on Computer Recognition Systems

    CERN Document Server

    Jackowski, Konrad; Kurzynski, Marek; Wozniak, Michał; Zolnierek, Andrzej

    2013-01-01

    Computer recognition systems are nowadays one of the most promising directions in artificial intelligence. This book is the most comprehensive study of this field. It contains a collection of 86 carefully selected articles contributed by experts in pattern recognition. It reports on current research with respect to both methodology and applications. In particular, it includes the following sections: Biometrics; Data Stream Classification and Big Data Analytics; Features, learning, and classifiers; Image processing and computer vision; Medical applications; Miscellaneous applications; Pattern recognition and image processing in robotics; Speech and word recognition. This book is a great reference tool for scientists who deal with the problems of designing computer pattern recognition systems. Its target readers include researchers as well as students of computer science, artificial intelligence or robotics.

  1. Implementation of a Tour Guide Robot System Using RFID Technology and Viterbi Algorithm-Based HMM for Speech Recognition

    Directory of Open Access Journals (Sweden)

    Neng-Sheng Pai

    2014-01-01

    This paper applied speech recognition and RFID technologies to turn an omni-directional mobile robot into a robot with voice control and guide introduction functions. For speech recognition, the speech signals were captured by short-time processing. The speaker first recorded isolated words so that the robot could create a speech database for specific speakers. After pre-processing of this speech database, the cepstrum and delta-cepstrum feature parameters were obtained using linear predictive coefficients (LPC). Then, the Hidden Markov Model (HMM) was used for model training on the speech database, and the Viterbi algorithm was used to find an optimal state sequence as the reference sample for speech recognition. The trained reference model was loaded onto the industrial computer on the robot platform, and the user spoke the isolated words to be tested. After the test words were processed in the same way and compared with the trained reference models, the model whose Viterbi path had the maximum total probability was taken as the recognition result. Finally, the speech recognition and RFID systems were implemented on the omni-directional mobile robot and tested in an actual environment to demonstrate their feasibility and stability.
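    The recognition rule described in this abstract (score each word model with the Viterbi algorithm and keep the model with the highest path probability) can be sketched as follows. This is a generic textbook formulation, not the authors' code. It assumes discrete observation symbols, whereas cepstral features would in practice need vector quantization or continuous emission densities. The names `viterbi` and `recognize`, and the toy models, are hypothetical.

```python
import math


def viterbi(obs, states, start_p, trans_p, emit_p):
    """Return (log-probability, best state path) for an observation sequence."""
    def log(p):
        return math.log(p) if p > 0 else float("-inf")

    # score[s] = best log-probability of any state path ending in state s
    score = {s: log(start_p[s]) + log(emit_p[s][obs[0]]) for s in states}
    back = []  # one dict of best predecessors per time step after the first
    for o in obs[1:]:
        new_score, pointers = {}, {}
        for s in states:
            prev = max(states, key=lambda r: score[r] + log(trans_p[r][s]))
            new_score[s] = score[prev] + log(trans_p[prev][s]) + log(emit_p[s][o])
            pointers[s] = prev
        score = new_score
        back.append(pointers)
    # Trace the best path backwards from the best final state.
    last = max(states, key=lambda s: score[s])
    path = [last]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    path.reverse()
    return score[last], path


def recognize(obs, word_models):
    """Pick the word whose HMM gives the observations the best Viterbi score.

    word_models maps a word to a tuple (states, start_p, trans_p, emit_p).
    """
    return max(word_models, key=lambda w: viterbi(obs, *word_models[w])[0])


# Toy two-state HMM with three discrete observation symbols.
states = ["Healthy", "Fever"]
start_p = {"Healthy": 0.6, "Fever": 0.4}
trans_p = {"Healthy": {"Healthy": 0.7, "Fever": 0.3},
           "Fever": {"Healthy": 0.4, "Fever": 0.6}}
emit_p = {"Healthy": {"normal": 0.5, "cold": 0.4, "dizzy": 0.1},
          "Fever": {"normal": 0.1, "cold": 0.3, "dizzy": 0.6}}
log_p, best_path = viterbi(["normal", "cold", "dizzy"],
                           states, start_p, trans_p, emit_p)
```

    Working in log-probabilities avoids numerical underflow on long observation sequences, which matters once utterances span hundreds of frames.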

  2. Utilization of Internet Protocol-Based Voice Systems in Remote Payload Operations

    Science.gov (United States)

    Chamberlain, Jim; Bradford, Bob; Best, Susan; Nichols, Kelvin

    2002-01-01

    Due to limited crew availability to support science and the large number of experiments to be operated simultaneously, telescience is key to a successful International Space Station (ISS) science program. Crew, operations personnel at NASA centers, and researchers at universities and companies around the world must work closely together to perform scientific experiments on-board ISS. The deployment of reliable high-speed Internet Protocol (IP)-based networks promises to greatly enhance telescience capabilities. These networks are now being used to cost-effectively extend the reach of remote mission support systems. They reduce the need for dedicated leased lines and travel while improving distributed workgroup collaboration capabilities. NASA has initiated use of Voice over Internet Protocol (VoIP) to supplement the existing mission voice communications system used by researchers at their remote sites. The Internet Voice Distribution System (IVoDS) connects remote researchers to mission support "loops" or conferences via NASA networks and Internet 2. Researchers use IVoDS software on personal computers to talk with operations personnel at NASA centers. IVoDS also has the capability, if authorized, to allow researchers to communicate with the ISS crew during experiment operations. IVoDS was developed by Marshall Space Flight Center with contractors & Technology, First Virtual Communications, Lockheed-Martin, and VoIP Group. IVoDS is currently undergoing field-testing with full deployment for up to 50 simultaneous users expected in 2002. Research is being performed in parallel with IVoDS deployment for a next-generation system to qualitatively enhance communications among ISS operations personnel. In addition to the current voice capability, video and data/application-sharing capabilities are being investigated. IVoDS technology is also being considered for mission support systems for programs such as Space Launch Initiative and Homeland Defense.

  3. Automatic TLI recognition system. Part 2: User's guide

    Energy Technology Data Exchange (ETDEWEB)

    Partin, J.K.; Lassahn, G.D.; Davidson, J.R.

    1994-05-01

    This report describes an automatic target recognition system for fast screening of large amounts of multi-sensor image data, based on low-cost parallel processors. This system uses image data fusion and gives uncertainty estimates. It is relatively low cost, compact, and transportable. The software is easily enhanced to expand the system's capabilities, and the hardware is easily expandable to increase the system's speed. This volume is a user's manual for an Automatic Target Recognition (ATR) system. This guide is intended to provide enough information and instruction to allow individuals to use the system for their own applications.

  4. Active Multimodal Sensor System for Target Recognition and Tracking.

    Science.gov (United States)

    Qu, Yufu; Zhang, Guirong; Zou, Zhaofan; Liu, Ziyue; Mao, Jiansen

    2017-06-28

    High accuracy target recognition and tracking systems using a single sensor or a passive multisensor set are susceptible to external interferences and exhibit environmental dependencies. These difficulties stem mainly from limitations to the available imaging frequency bands, and a general lack of coherent diversity of the available target-related data. This paper proposes an active multimodal sensor system for target recognition and tracking, consisting of a visible, an infrared, and a hyperspectral sensor. The system makes full use of its multisensor information collection abilities; furthermore, it can actively control different sensors to collect additional data, according to the needs of the real-time target recognition and tracking processes. This level of integration between hardware collection control and data processing is experimentally shown to effectively improve the accuracy and robustness of the target recognition and tracking system.

  5. The hearing voices network: initial lessons and future directions for mental health professionals and Systems of Care.

    Science.gov (United States)

    Styron, Thomas; Utter, Lauren; Davidson, Larry

    2017-02-01

    For more than two decades, the Hearing Voices Network (HVN) has provided alternative approaches to supporting voice hearers, and an emerging body of research is now confirming their value. HVN approaches present unique opportunities and challenges for mental health professionals and systems of care that work with individuals who hear voices. An overview of the HVN is presented, including its history, principles and approaches. HVN approaches are compared and contrasted with traditional mental health treatments. HVN's potential contribution to the transformation of mental health care is discussed. Directions for future research are presented.

  6. 75 FR 30845 - Request Voucher for Grant Payment and Line of Credit Control System (LOCCS) Voice Response System...

    Science.gov (United States)

    2010-06-02

    ... URBAN DEVELOPMENT Request Voucher for Grant Payment and Line of Credit Control System (LOCCS) Voice... is soliciting public comments on the subject proposal. Payment request vouchers for distribution of.... This Notice Also Lists the Following Information Title of Proposal: Request Voucher for Grant...

  7. An Automatic Number Plate Recognition System under Image Processing

    OpenAIRE

    Sarbjit Kaur

    2016-01-01

    An Automatic Number Plate Recognition (ANPR) system is an application of computer vision and image processing technology that takes a photograph of a vehicle as its input image, extracts the number plate from the whole vehicle image, and outputs the number plate information as text. The ANPR system mainly consists of 4 phases: acquisition of the vehicle image and pre-processing, extraction of the number plate area, character segmentation, and character recognition. The overall accuracy and efficiency of whol...

  8. A Robot Control System Based on Gesture Recognition Using Kinect

    Directory of Open Access Journals (Sweden)

    Biao MA

    2013-05-01

    The Kinect camera is widely used for capturing human body images and for human motion recognition in video games, and some research on gesture recognition already exists. However, to achieve robustness against interference, current recognition algorithms are often complex and slow, and most applications rely on an incomplete gesture library, so not all hand gestures can be recognized. This paper explores a new method and algorithm that can describe all five fingertips of each hand at any time for hand gesture recognition with the Kinect system. The hand images are processed to build hand models, which are then compared with the gesture library for gesture recognition. Once hand gestures are recognized with high accuracy and low computational cost, the corresponding control commands are sent wirelessly from the hand gesture recognition system to a hexagon robot controller, and the hexagon robot changes its shape according to the hand gesture command. Thus the robot can interact with humans promptly through the gesture recognition system.

  9. Individual differences in involvement of the visual object recognition system during visual word recognition.

    Science.gov (United States)

    Laszlo, Sarah; Sacchi, Elizabeth

    2015-01-01

    Individuals with dyslexia often evince reduced activation during reading in left hemisphere (LH) language regions. This can be observed along with increased activation in the right hemisphere (RH), especially in areas associated with object recognition - a pattern referred to as RH compensation. The mechanisms of RH compensation are relatively unclear. We hypothesize that RH compensation occurs when the RH object recognition system is called upon to supplement an underperforming LH visual word form recognition system. We tested this by collecting ERPs while participants with a range of reading abilities viewed words, objects, and word/object ambiguous items (e.g., "SMILE" shaped like a smile). Less experienced readers differentiate words, objects, and ambiguous items less strongly, especially over the RH. We suggest that this lack of differentiation may have negative consequences for dyslexic individuals demonstrating RH compensation.

  10. Optimization Methods in Emotion Recognition System

    Directory of Open Access Journals (Sweden)

    L. Povoda

    2016-09-01

    Emotions play a big role in our everyday communication and contain important information. This work describes a novel method of automatic emotion recognition from textual data. The method is based on well-known data mining techniques, a novel approach based on running SVM (Support Vector Machine) classifiers in parallel, text preprocessing, and 3 optimization methods: sequential elimination of attributes, parameter optimization based on token groups, and a method of extending training data sets during practical testing and final tuning for production release. We outperformed current state-of-the-art methods, and the results were validated on a larger data set (3346 manually labelled samples), which is less prone to overfitting than those used in related works. The accuracy achieved in this work is 86.89% for recognition of 5 emotional classes. The experiments were performed in a real-world helpdesk environment processing the Czech language, but the proposed methodology is general and can be applied to many different languages.

  11. Implementation of Reliable Open Source IRIS Recognition System

    Directory of Open Access Journals (Sweden)

    Dhananjay Ikhar

    2013-12-01

    Reliable automatic recognition of persons has long been an attractive goal. As in all pattern recognition problems, the key issue is the relation between inter-class and intra-class variability: objects can be reliably classified only if the variability among different instances of a given class is less than the variability between different classes. The objective of this paper is to implement an open-source iris recognition system in order to verify the claimed performance of the technology. The development tool used will be MATLAB, and the emphasis will be only on the software for performing recognition and not on the hardware for capturing an eye image. A reliable application development approach will be employed in order to produce results quickly. MATLAB provides an excellent environment, with its image processing toolbox. To test the system, a database of 756 grayscale eye images courtesy of the Chinese Academy of Sciences-Institute of Automation (CASIA) is used. The system is to be composed of a number of sub-systems, which correspond to the stages of iris recognition. These stages are image acquisition, segmentation, normalization and feature encoding. The input to the system will be an eye image, and the output will be an iris template, which will provide a mathematical representation of the iris region. In summary, the objectives in designing the recognition system are: to study different biometrics and their features; to study different recognition systems and their steps; to select a simple and efficient recognition algorithm for implementation; to select a fast and efficient tool for processing; and to apply the implemented algorithm to different databases and determine performance factors.

  12. AN IMPROVED ALGORITHM OF GMM VOICE CONVERSION SYSTEM BASED ON CHANGING THE TIME-SCALE

    Institute of Scientific and Technical Information of China (English)

    Zhou Ying; Zhang Linghua

    2011-01-01

    This paper presents an improved method for a voice conversion system based on Gaussian Mixture Models (GMM) that changes the time-scale of speech. The Speech Transformation and Representation using Adaptive Interpolation of weiGHTed spectrum (STRAIGHT) model is adopted to extract the spectrum features, and the GMMs are trained to generate the conversion function. The spectrum features of a source speech are converted by the conversion function. The time-scale of speech is changed by extracting the converted features and adding them to the spectrum. The converted voice was evaluated by subjective and objective measurements. The results confirm that the transformed speech not only approximates the characteristics of the target speaker, but is also more natural and more intelligible.

  13. Parameterless-Growing-SOM and Its Application to a Voice Instruction Learning System

    Directory of Open Access Journals (Sweden)

    Takashi Kuremoto

    2010-01-01

    An improved self-organizing map (SOM), the parameterless-growing-SOM (PL-G-SOM), is proposed in this paper. To overcome problems in the traditional SOM (Kohonen, 1982), various structure-growing SOMs and parameter-adjusting SOMs have been invented, usually separately. Here, we combine the idea of growing SOMs (Bauer and Villmann, 1997; Dittenbach et al. 2000) and a parameterless SOM (Berglund and Sitte, 2006) into a novel SOM named PL-G-SOM to realize additional learning, optimal neighborhood preservation, and automatic tuning of parameters. The improved SOM is applied to construct a voice instruction learning system for partner robots that adopts a simple reinforcement learning algorithm. The user's voice instructions are first classified by the PL-G-SOM, then the robot chooses an expected action according to a stochastic policy. The policy is adjusted by the reward or punishment given by the user of the robot. A feeling map is also designed to express the degree of learning of voice instructions. Learning and additional learning experiments using instructions in multiple languages, including Japanese, English, Chinese, and Malaysian, confirmed the effectiveness of the proposed system.

  14. Realization of a real-time voice control system for a mobile robot

    Institute of Scientific and Technical Information of China (English)

    高美娟; 杨智鑫; 田景文

    2011-01-01

    Based on a study of speech recognition algorithms, a real-time voice control system for a mobile robot was designed and implemented. The Dynamic Time Warping (DTW) algorithm was chosen as the core recognition method, and the software for each part was programmed on the VC platform. To ensure the real-time behaviour of the system, an automatic recording system with a voice-activated switch was specially designed. The final experiments confirmed that the system performs well in real-time voice control.
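    As an illustration of the DTW matching named in this abstract, the sketch below computes the classic dynamic-programming DTW distance between two one-dimensional sequences and picks the closest stored template. It is a minimal textbook version, not the system described: real speech front ends compare frames of feature vectors rather than scalars, and the command names and template values here are hypothetical.

```python
def dtw_distance(a, b):
    """Dynamic Time Warping distance between two numeric sequences."""
    inf = float("inf")
    n, m = len(a), len(b)
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of: insertion, deletion, or match.
            cost[i][j] = d + min(cost[i - 1][j],
                                 cost[i][j - 1],
                                 cost[i - 1][j - 1])
    return cost[n][m]


def classify(sample, templates):
    """Return the name of the template with the smallest DTW distance."""
    return min(templates, key=lambda name: dtw_distance(sample, templates[name]))


# Hypothetical command templates: a rising and a falling contour.
templates = {"forward": [1, 2, 3], "back": [3, 2, 1]}
command = classify([1, 2, 2, 3, 3], templates)
```

    Because DTW stretches and compresses the time axis, the slower utterance `[1, 2, 2, 3, 3]` still matches the `forward` template exactly, which is the property that makes it attractive for isolated-word commands spoken at varying speed.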

  15. Optical character recognition systems for different languages with soft computing

    CERN Document Server

    Chaudhuri, Arindam; Badelia, Pratixa; K Ghosh, Soumya

    2017-01-01

    The book offers a comprehensive survey of soft-computing models for optical character recognition systems. The various techniques, including fuzzy and rough sets, artificial neural networks and genetic algorithms, are tested using real texts written in different languages, such as English, French, German, Latin, Hindi and Gujarati, which have been extracted from publicly available datasets. The simulation studies, which are reported in detail here, show that soft-computing based modeling of OCR systems performs consistently better than traditional models. Mainly intended as a state-of-the-art survey for postgraduates and researchers in pattern recognition, optical character recognition and soft computing, this book will also be useful for professionals in computer vision and image processing dealing with different issues related to optical character recognition.

  16. Smart Homes with Voice Activated Systems for Disabled People

    National Research Council Canada - National Science Library

    Bekir Busatlic; Nejdet Dogru; Isaac Lera; Enes Sukic

    2017-01-01

    Smart home refers to the application of various technologies to semi-unsupervised home control. It refers to systems that control temperature, lighting, door locks, windows and many other appliances...

  17. The Bellevue Classification System: nursing's voice upon the library shelves.

    Science.gov (United States)

    Mages, Keith C

    2011-01-01

    This article examines the inspiration, construction, and meaning of the Bellevue Classification System (BCS), created during the 1930s for use in the Bellevue School of Nursing Library. Nursing instructor Ann Doyle, with assistance from librarian Mary Casamajor, designed the BCS after consulting with library leaders and examining leading contemporary classification systems, including the Dewey Decimal Classification and Library of Congress, Ballard, and National Health Library classification systems. A close textual reading of the classes, subclasses, and subdivisions of these classification systems against those of the resulting BCS, reveals Doyle's belief that the BCS was created not only to organize the literature, but also to promote the burgeoning intellectualism and professionalism of early twentieth-century American nursing.

  18. A hand vein recognition system based on DSP and CPLD

    Institute of Scientific and Technical Information of China (English)

    KANG Wen-xiong; CHEN Zi-yi; YANG Qing-qiang

    2010-01-01

    The hand vein recognition system based on digital signal processing (DSP) and a complex programmable logic device (CPLD) is designed according to the requirements for equipment volume, accuracy and reaction speed. The overall structure and detailed implementation of the system hardware architecture are discussed in this paper. Moreover, the design philosophy and specific realization of the system software as well as the core algorithms are explored. The recognition system has many good characteristics, such as a high degree of integration, simple structure, flexible programming and convenient application, which make it suitable for circumstances with high requirements for personal identification.

  19. FACELOCK-Lock Control Security System Using Face Recognition-

    Science.gov (United States)

    Hirayama, Takatsugu; Iwai, Yoshio; Yachida, Masahiko

    A security system using biometric person authentication technologies is suited to various high-security situations. The technology based on face recognition has advantages such as lower user resistance and lower stress. However, facial appearances change according to facial pose, expression, lighting, and age. We have developed the FACELOCK security system based on our face recognition methods. Our methods are robust to variations in facial appearance except facial pose. Our system consists of clients and a server. The client communicates with the server through our protocol over a LAN. Users of our system do not need to be careful about their facial appearance.

  20. A Malaysian Vehicle License Plate Localization and Recognition System

    Directory of Open Access Journals (Sweden)

    Ganapathy Velappa

    2008-02-01

    Technological intelligence is a highly sought after commodity even in traffic-based systems. These intelligent systems do not only help in traffic monitoring but also in commuter safety, law enforcement and commercial applications. In this paper, a license plate localization and recognition system for vehicles in Malaysia is proposed. This system is developed based on digital images and can be easily applied to commercial car park systems for the use of documenting access of parking services, secure usage of parking houses and also to prevent car theft issues. The proposed license plate localization algorithm is based on a combination of morphological processes with a modified Hough Transform approach and the recognition of the license plates is achieved by the implementation of the feed-forward backpropagation artificial neural network. Experimental results show an average of 95% successful license plate localization and recognition in a total of 589 images captured from a complex outdoor environment.
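    The abstract does not say how its Hough Transform is modified, so the following sketch shows only the standard line-detecting Hough Transform as background: every edge point votes for all (theta, rho) line parameters passing through it, and the most-voted bin is the dominant line, such as a plate border. The function name `hough_peak`, the bin resolution, and the sample points are assumptions made for illustration.

```python
import math
from collections import Counter


def hough_peak(points, theta_step_deg=1):
    """Return (theta_deg, rho, votes) of the strongest line through points.

    Lines are parameterized as rho = x*cos(theta) + y*sin(theta), with
    theta sampled in whole degrees and rho rounded to the nearest integer.
    """
    votes = Counter()
    for x, y in points:
        for theta_deg in range(0, 180, theta_step_deg):
            theta = math.radians(theta_deg)
            rho = round(x * math.cos(theta) + y * math.sin(theta))
            votes[(theta_deg, rho)] += 1
    (theta_deg, rho), count = votes.most_common(1)[0]
    return theta_deg, rho, count


# Ten edge points on the horizontal line y = 5.
edge_points = [(x, 5) for x in range(0, 100, 10)]
theta_deg, rho, count = hough_peak(edge_points)
```

    For these points the peak lies at theta = 90 degrees and rho = 5, that is, a horizontal line five pixels from the origin. A localization stage would typically look for pairs of such parallel lines that bound a plate-shaped rectangle.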

  1. Clonal Selection Based Artificial Immune System for Generalized Pattern Recognition

    Science.gov (United States)

    Huntsberger, Terry

    2011-01-01

    The last two decades have seen a rapid increase in the application of AIS (Artificial Immune Systems) modeled after the human immune system to a wide range of areas including network intrusion detection, job shop scheduling, classification, pattern recognition, and robot control. JPL (Jet Propulsion Laboratory) has developed an integrated pattern recognition/classification system called AISLE (Artificial Immune System for Learning and Exploration) based on biologically inspired models of B-cell dynamics in the immune system. When used for unsupervised or supervised classification, the method scales linearly with the number of dimensions, has performance that is relatively independent of the total size of the dataset, and has been shown to perform as well as traditional clustering methods. When used for pattern recognition, the method efficiently isolates the appropriate matches in the data set. The paper presents the underlying structure of AISLE and the results from a number of experimental studies.

  2. Dissociating the cortical basis of memory for voices, words and tones.

    Science.gov (United States)

    Stevens, Alexander A

    2004-01-01

    Human speech carries both linguistic content and information about the speaker's identity and affect. While neuroimaging has been used extensively to study verbal memory, there has been little attention to the neural basis of memory for voices. Evidence from studies of aphasia and auditory agnosia suggests that voice memory may rely on anatomically distinct areas in right temporal and parietal lobe regions, but there is little data on the broader neural systems involved in voice memory. The present study tested the hypothesis that the neural systems involved in voice memory are functionally distinct from the systems involved in word recognition and are primarily located in the right cerebral hemisphere. Subjects performed two-back tasks in which they were required to alternately remember the voices speaking (Voice condition), and the words they produced (Word condition). A tone memory condition was also included, as a non-speech comparison. The contrast between the Voice and Word conditions revealed greater Voice-related effects in left temporal, right frontal and right medial parietal areas, while the Word-related effects appeared in left frontal and bilateral parietal areas. These findings map out a partially right-lateralized fronto-parietal network associated with voice memory, which can be distinguished from predominantly left-hemisphere regions associated with verbal working memory. These results provide further evidence that distinct neural systems are associated with the carrier waves of speech and word identity.

  3. The Female Voice: Applications to Bowen's Family Systems Theory.

    Science.gov (United States)

    Knudson-Martin, Carmen

    1994-01-01

    Responds to calls from feminist scholars to address potential biases against women in theories of family therapy. Summarizes findings from studies of female development and integrates findings into expanded model of Bowen's family systems theory. Includes case example comparing expanded model with traditional application of Bowen's theory.…

  4. THE RECOGNITION SYSTEM OF MOVING MACHINE PRINTED MARK/NUMERAL

    Institute of Scientific and Technical Information of China (English)

    Miao Yalin; Miao Xianglin; Bian Zhengzhong; Zhou Jianlong

    2005-01-01

    This paper presents a recognition system for automatic quality control in industrial applications. The purpose of the system is to collect product information (e.g. expiry date, production identification) and verify this information for quality control. The main difficulties are to perform efficient preprocessing of the acquired low-resolution image and to create a simple and fast recognition method for obtaining the product information. In this paper, we propose an efficient recognition method based on the endpoint features and structural characteristics of the numerals. The experimental results show that the proposed method is efficient, robust and reliable for recognizing machine-printed numerals. The system is currently working successfully in a real application and meets the required specifications.

  5. (F)-law collision and system state recognition

    Institute of Scientific and Technical Information of China (English)

    2007-01-01

    Using dual function one direction S-rough sets, this article gives the (f)-law, the (F)-law, law distance, and the concept of a system law collided by the (F)-law. The characteristics presented by the system law collided by the (F)-law, the recognition of these characteristics, and a recognition criterion are also proposed. Dual function one direction S-rough sets are one of the basic forms of function S-rough sets. Their basic theory and application in the study of system law collision are reviewed.

  6. F-Law collision and system state recognition

    Institute of Scientific and Technical Information of China (English)

    Shi Kaiquan; Xu Xiaojing

    2007-01-01

    Using function one direction S-rough sets (function one direction singular rough sets), the f-law, the F-law, the concept of law distance, and the concept of a system law collided by the F-law are given. Using these concepts, the state characteristics presented by a system law collided by the F-law, the recognition of these state characteristics, a recognition criterion, and applications are given. Function one direction S-rough sets are one of the basic forms of function S-rough sets (function singular rough sets). They provide an important theory and method for studying system law collision.

  7. Actuator prototype system by voice commands using free software

    Directory of Open Access Journals (Sweden)

    Jaime Andrango

    2016-06-01

    This prototype system is a software application that uses digital signal processing techniques to extract information from the user's speech, which is then used to switch an actuator on a peripheral computer on or off when vowels are pronounced. The method applies spectral differences. The application uses the parallel port as actuator, with the information recorded in the memory address 378H. This prototype was developed using free software tools for their versatility and dynamism, and to allow other researchers to build on it for further studies.

  8. The effect of different navigation voices on trust and attention while using in-vehicle navigation systems.

    Science.gov (United States)

    Large, David R; Burnett, Gary E

    2014-06-01

    Automobiles are suffused with computers and technology designed to support drivers at all levels of the driving hierarchy. Classic secondary devices, such as in-vehicle navigation systems (IVNS), present strategic and tactical information to drivers. In order to mitigate the potential distraction and workload when interacting with these devices while driving, IVNS often employ voices to deliver navigational instructions. In contrast, voices are used during interpersonal encounters to engage the listener, provide clues about the speaker's personality and make judgments about them, for example, whether to like them and to trust them. A study conducted within a fixed-based medium-fidelity driving simulator investigated if drivers made similar 'personality' attributions to voices emanating from an IVNS and if this subsequently affected how they engaged with the device while driving. Twenty-nine experienced drivers and IVNS users drove to a specified destination with a simulated IVNS and authentically reproduced UK road signage to support their route-finding. Either of two navigation voices were used; one considered 'high-trust' and the other 'low-trust.' Presented with a conflict scenario, where the verbal route guidance differed to the road signs, 22 drivers followed the IVNS instruction rather than the road signs. Of these, the majority were using the 'high-trust' voice. A post-drive questionnaire revealed that, despite the fact that message content and delivery remained equivalent, participants recognized different attributes ('personalities') associated with each of the navigation voices. This influenced their attitudes towards them, including how much they liked them, their preferences for use, and the level of trust that they associated with each voice. While these, so-called, social responses may be invited and indeed encouraged in other contexts, in the automotive domain they are likely to conflict with the intended benefits of using a voice to deliver route

  9. A Review on Different Currency Recognition System for Bangladesh India China and Euro Currency

    Directory of Open Access Journals (Sweden)

    Ahmed Ali Abbasi

    2014-02-01

    Paper currency recognition is one of the important applications of pattern recognition. This application is used to recognize the currency of different countries. A currency recognition system can be used in many places, such as hotels, shops and automated teller machines. The system should be able to classify a paper currency into the correct class to which it belongs. This paper reviews currency recognition systems for different countries (Bangladesh, China and India) and for the Euro, along with the techniques used to develop them: Bangladeshi Currency Recognition System using Negatively Correlated Neural Network; Bangladeshi Currency Recognition System Using Neural Network with Axis Symmetrical Masks; Chinese Currency Recognition System based on BP (Back Propagation) Neural Network Improved by Gene Algorithm; Chinese Currency Recognition by Neural Network; Chinese Currency Recognition based on LBP (Local Binary Pattern); Indian Currency Recognition System based on Heuristic Analysis; and Recognition System for Euro using New Recognition Method.

  10. Arm Motion Recognition and Exercise Coaching System for Remote Interaction

    Directory of Open Access Journals (Sweden)

    Hong Zeng

    2016-01-01

    Arm motion recognition and its related applications have become a promising human-computer interaction modality due to the rapid integration of numerical sensors in modern mobile phones. We implement a mobile-phone-based arm motion recognition and exercise coaching system that can help people carrying mobile phones to exercise anywhere at any time, especially people who have very limited spare time and are constantly traveling across cities. We first design an improved k-means algorithm to cluster the collected 3-axis acceleration and gyroscope data of a person's actions into basic motions. A learning method based on the Hidden Markov Model is then designed to classify and recognize continuous arm motions of both learners and coaches, which also measures the action similarities between them. We implement the system on a MIUI 2S mobile phone and evaluate the system performance and its recognition accuracy.
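
    The clustering step in this abstract can be illustrated with a small sketch. The record does not say how the authors' "improved" k-means differs from the textbook version, so the code below is plain Lloyd's k-means, not their method. The function names, the 3-axis sample values and the choice of k are all hypothetical.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, iters=50, seed=0):
    """Plain Lloyd's k-means (NOT the paper's improved variant).

    Returns (centroids, labels), where labels[i] is the cluster of points[i].
    """
    rng = random.Random(seed)
    # Initialise centroids from k distinct samples.
    centroids = [list(p) for p in rng.sample(points, k)]
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: each sample goes to its nearest centroid.
        labels = [min(range(k), key=lambda j: sq_dist(p, centroids[j]))
                  for p in points]
        # Update step: each centroid becomes the mean of its members.
        new_centroids = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centroids.append(
                    [sum(col) / len(members) for col in zip(*members)])
            else:
                new_centroids.append(centroids[j])  # keep an empty cluster in place
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return centroids, labels


# Hypothetical 3-axis accelerometer samples: three "rest" and three "swing" readings.
samples = [[0.0, 0.0, 1.0], [0.1, 0.0, 1.0], [0.0, 0.1, 0.9],
           [5.0, 5.0, 5.0], [5.1, 5.0, 5.0], [5.0, 4.9, 5.1]]
centroids, labels = kmeans(samples, k=2)
```

    In a pipeline like the one described, the resulting cluster labels would presumably serve as the discrete observation symbols fed to the Hidden Markov Model stage. That link is an assumption, as the abstract does not spell it out.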

  11. Low Energy Physical Activity Recognition System on Smartphones

    Directory of Open Access Journals (Sweden)

    Luis Miguel Soria Morillo

    2015-03-01

    An innovative approach to physical activity recognition based on the use of discrete variables obtained from accelerometer sensors is presented. The system first performs a discretization process for each variable, which allows efficient recognition of activities performed by users using as little energy as possible. To this end, an innovative discretization and classification technique is presented based on the χ² distribution. Furthermore, the entire recognition process is executed on the smartphone, which determines not only the activity performed, but also the frequency at which it is carried out. These techniques and the new classification system presented reduce energy consumption caused by the activity monitoring system. The energy saved increases smartphone usage time to more than 27 h without recharging while maintaining accuracy.

  12. Mandarin recognition over the telephone

    Science.gov (United States)

    Kao, Yuhung

    1996-06-01

    Mandarin Chinese is the official language in China and Taiwan; it is the native language of a quarter of the world population. As the services enabled by speech recognition technology (e.g. telephone voice dialing, information query) become more popular in English, we would like to extend this capability to other languages. Mandarin is one of the major languages under research in our laboratory. This paper describes how we extend our work in English speech recognition into Mandarin. We describe the corpus (Voice Across Taiwan), the training of a complete set of Mandarin syllable models, preliminary performance results and error analysis. A fast prototyping system was built in which a user can write any context-free grammar with no vocabulary restriction; the grammar can then be compiled into recognition models. It enables users to quickly test the performance of a new vocabulary.

  13. Development of a speech recognition system for Spanish broadcast news

    NARCIS (Netherlands)

    Niculescu, Andreea; Jong, de Franciska

    2008-01-01

    This paper reports on the development process of a speech recognition system for Spanish broadcast news within the MESH FP6 project. The system uses the SONIC recognizer developed at the Center for Spoken Language Research (CSLR), University of Colorado. Acoustic and language models were trained usi

  14. Student Modelling in an Intelligent Tutoring System for the Passive Voice of English Language

    Directory of Open Access Journals (Sweden)

    Dimitris Maras

    2000-01-01

    This paper describes an intelligent multimedia tutoring system for the passive voice of the English grammar. The system may be used to present theoretical issues about the passive voice and to provide exercises that the student may solve. The main focus of the tutor is on the student's error diagnosis process, which is performed by the student modelling component. When the student types the solution to an exercise, the system examines the correctness of the answer. If the student's answer has been erroneous it attempts to diagnose the underlying misconception of the mistake. In order to provide individualised help, the system holds a profile for every student, the long term student model. The student’s progress and his/her usual mistakes are recorded to this long term student model. This kind of information is used for the individualised error diagnosis of the student in subsequent sessions. In addition, the information stored about the student can also be used for the resolution of an arising ambiguity, as to what the underlying cause of a student error has been.

  15. Structural aspects of molecular recognition in the immune system. Part II: Pattern recognition receptors

    OpenAIRE

    2014-01-01

    The vertebrate immune system uses pattern recognition receptors (PRRs) to detect a large variety of molecular signatures (pathogen-associated molecular patterns, PAMPs) from a broad range of different invading pathogens. The PAMPs range in size from relatively small molecules, to others of intermediate size such as bacterial lipopolysaccharide, lipopeptides, and oligosaccharides, to macromolecules such as viral DNA, RNA, and pathogen-derived proteins such as flagellin. Underlying this functio...

  16. License Plate Recognition for Parking Control System by Mathematical Morphology

    Institute of Scientific and Technical Information of China (English)

    Javier Ortiz; Alberto Gómez

    2014-01-01

    Nowadays, license plate recognition for parking systems is a critical task to provide automatic control of customers and payment. This paper introduces a new method for automatic recognition of license plates of vehicles by mathematical morphology. The proposed method can provide the license plate number of the plates in different light conditions, colors, sizes, and inclination (angles). The algorithm can recognize the license plates of European Union vehicles quickly and correctly. The pattern learning of mathematical skeletons has high efficiency in the process. The performance of the algorithm is demonstrated well by the test in a parking control system.

  17. Object Recognition Using a 3D RFID System

    OpenAIRE

    Roh, Se-gon; Choi, Hyouk Ryeol

    2009-01-01

    Up to now, object recognition in robotics has been typically done by vision, ultrasonic sensors, laser range finders etc. Recently, RFID has emerged as a promising technology that can strengthen object recognition. In this chapter, the 3D RFID system and the 3D tag were presented. The proposed RFID system can determine if an object as well as other tags exists, and also can estimate the orientation and position of the object. This feature considerably reduces the dependence of the robot on o...

  18. Design of embedded intelligent monitoring system based on face recognition

    Science.gov (United States)

    Liang, Weidong; Ding, Yan; Zhao, Liangjin; Li, Jia; Hu, Xuemei

    2017-01-01

    In this paper, a new embedded intelligent monitoring system based on face recognition is proposed. The system uses a Raspberry Pi as the central processor. A sensor group has been designed with a Zigbee module to help the system work better, and two alarm modes have been proposed, using the Internet and a 3G modem. The experimental results show that the system can recognize human faces under various light intensities and send alarm information in real time.

  19. Colorimetric Sensor Arrays System Based on FPGA for Image Recognition

    Institute of Scientific and Technical Information of China (English)

    Rui Chen; Jian-Hua Xu; Ya-Dong Jiang

    2009-01-01

    An FPGA-based image recognition system is designed for a colorimetric sensor array in order to recognize a wide range of volatile organic compounds. The gas molecule is detected by the responsive sensor array and the responsive image is obtained. The image is decomposed into RGB color components using a CMOS image sensor. An embedded image recognition architecture based on a Xilinx Spartan-3 FPGA is designed to implement the image recognition algorithms. The color coherence vector algorithm is discussed in detail and compared with the color histogram algorithm, and experimental results demonstrate that both algorithms can effectively represent different volatile organic compounds according to their different responsive images in this system.
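
    To make the comparison in this abstract concrete, here is a minimal software sketch of the two descriptors. It is not the FPGA implementation from the record. It assumes the image has already been quantized into a small grid of color indices, uses 8-connectivity, and takes a hypothetical coherence threshold `tau`.

```python
from collections import deque


def color_histogram(quantized):
    """Count how many pixels fall into each quantized color bucket."""
    hist = {}
    for row in quantized:
        for color in row:
            hist[color] = hist.get(color, 0) + 1
    return hist


def color_coherence_vector(quantized, tau):
    """Map each color to (coherent, incoherent) pixel counts.

    A pixel is coherent if its 8-connected same-color region
    contains at least `tau` pixels.
    """
    h, w = len(quantized), len(quantized[0])
    seen = [[False] * w for _ in range(h)]
    ccv = {}
    for y in range(h):
        for x in range(w):
            if seen[y][x]:
                continue
            color = quantized[y][x]
            seen[y][x] = True
            queue = deque([(y, x)])
            size = 0
            # Breadth-first flood fill over the same-color region.
            while queue:
                cy, cx = queue.popleft()
                size += 1
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = cy + dy, cx + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and not seen[ny][nx]
                                and quantized[ny][nx] == color):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            coherent, incoherent = ccv.get(color, (0, 0))
            if size >= tau:
                coherent += size
            else:
                incoherent += size
            ccv[color] = (coherent, incoherent)
    return ccv


# Two hypothetical 2x4 images with the same colors in different layouts.
blocks = [[0, 0, 1, 1],
          [0, 0, 1, 1]]
stripes = [[0, 1, 0, 1],
           [0, 1, 0, 1]]
```

    The two toy images have identical color histograms, but their coherence vectors differ. That is the extra spatial information a coherence vector adds over a plain histogram.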

  20. 9th International Conference on Computer Recognition Systems

    CERN Document Server

    Jackowski, Konrad; Kurzyński, Marek; Woźniak, Michał; Żołnierek, Andrzej

    2016-01-01

    Computer recognition systems are nowadays one of the most promising directions in artificial intelligence. This book is the most comprehensive study of this field. It contains a collection of 79 carefully selected articles contributed by experts in pattern recognition. It reports on current research with respect to both methodology and applications. In particular, it includes the following sections: Features, learning, and classifiers; Biometrics; Data Stream Classification and Big Data Analytics; Image processing and computer vision; Medical applications; Applications; RGB-D perception: recent developments and applications. This book is a great reference tool for scientists who deal with the problems of designing computer pattern recognition systems. Its target readers include researchers as well as students of computer science, artificial intelligence or robotics.

  1. Forensic Speaker Recognition Law Enforcement and Counter-Terrorism

    CERN Document Server

    Patil, Hemant

    2012-01-01

    Forensic Speaker Recognition: Law Enforcement and Counter-Terrorism is an anthology of the research findings of 35 speaker recognition experts from around the world. The volume provides a multidimensional view of the complex science involved in determining whether a suspect’s voice truly matches forensic speech samples, collected by law enforcement and counter-terrorism agencies, that are associated with the commission of a terrorist act or other crimes. While addressing such topics as the challenges of forensic case work, handling speech signal degradation, analyzing features of speaker recognition to optimize voice verification system performance, and designing voice applications that meet the practical needs of law enforcement and counter-terrorism agencies, this material all sounds a common theme: how the rigors of forensic utility are demanding new levels of excellence in all aspects of speaker recognition. The contributors are among the most eminent scientists in speech engineering and signal process...

  2. System and method for characterizing voiced excitations of speech and acoustic signals, removing acoustic noise from speech, and synthesizing speech

    Energy Technology Data Exchange (ETDEWEB)

    Burnett, Greg C.; Holzrichter, John F.; Ng, Lawrence C.

    2006-02-14

    The present invention is a system and method for characterizing human (or animate) speech voiced excitation functions and acoustic signals, for removing unwanted acoustic noise which often occurs when a speaker uses a microphone in common environments, and for synthesizing personalized or modified human (or other animate) speech upon command from a controller. A low power EM sensor is used to detect the motions of windpipe tissues in the glottal region of the human speech system before, during, and after voiced speech is produced by a user. From these tissue motion measurements, a voiced excitation function can be derived. Further, the excitation function provides speech production information to enhance noise removal from human speech and it enables accurate transfer functions of speech to be obtained. Previously stored excitation and transfer functions can be used for synthesizing personalized or modified human speech. Configurations of EM sensor and acoustic microphone systems are described to enhance noise cancellation and to enable multiple articulator measurements.

  3. System and method for characterizing voiced excitations of speech and acoustic signals, removing acoustic noise from speech, and synthesizing speech

    Energy Technology Data Exchange (ETDEWEB)

    Burnett, Greg C. (Livermore, CA); Holzrichter, John F. (Berkeley, CA); Ng, Lawrence C. (Danville, CA)

    2006-08-08

    The present invention is a system and method for characterizing human (or animate) speech voiced excitation functions and acoustic signals, for removing unwanted acoustic noise which often occurs when a speaker uses a microphone in common environments, and for synthesizing personalized or modified human (or other animate) speech upon command from a controller. A low power EM sensor is used to detect the motions of windpipe tissues in the glottal region of the human speech system before, during, and after voiced speech is produced by a user. From these tissue motion measurements, a voiced excitation function can be derived. Further, the excitation function provides speech production information to enhance noise removal from human speech and it enables accurate transfer functions of speech to be obtained. Previously stored excitation and transfer functions can be used for synthesizing personalized or modified human speech. Configurations of EM sensor and acoustic microphone systems are described to enhance noise cancellation and to enable multiple articulator measurements.

  4. System and method for characterizing voiced excitations of speech and acoustic signals, removing acoustic noise from speech, and synthesizing speech

    Energy Technology Data Exchange (ETDEWEB)

    Burnett, Greg C.; Holzrichter, John F.; Ng, Lawrence C.

    2004-03-23

    The present invention is a system and method for characterizing human (or animate) speech voiced excitation functions and acoustic signals, for removing unwanted acoustic noise which often occurs when a speaker uses a microphone in common environments, and for synthesizing personalized or modified human (or other animate) speech upon command from a controller. A low power EM sensor is used to detect the motions of windpipe tissues in the glottal region of the human speech system before, during, and after voiced speech is produced by a user. From these tissue motion measurements, a voiced excitation function can be derived. Further, the excitation function provides speech production information to enhance noise removal from human speech and it enables accurate transfer functions of speech to be obtained. Previously stored excitation and transfer functions can be used for synthesizing personalized or modified human speech. Configurations of EM sensor and acoustic microphone systems are described to enhance noise cancellation and to enable multiple articulator measurements.

  5. System And Method For Characterizing Voiced Excitations Of Speech And Acoustic Signals, Removing Acoustic Noise From Speech, And Synthesizi

    Energy Technology Data Exchange (ETDEWEB)

    Burnett, Greg C. (Livermore, CA); Holzrichter, John F. (Berkeley, CA); Ng, Lawrence C. (Danville, CA)

    2006-04-25

    The present invention is a system and method for characterizing human (or animate) speech voiced excitation functions and acoustic signals, for removing unwanted acoustic noise which often occurs when a speaker uses a microphone in common environments, and for synthesizing personalized or modified human (or other animate) speech upon command from a controller. A low power EM sensor is used to detect the motions of windpipe tissues in the glottal region of the human speech system before, during, and after voiced speech is produced by a user. From these tissue motion measurements, a voiced excitation function can be derived. Further, the excitation function provides speech production information to enhance noise removal from human speech and it enables accurate transfer functions of speech to be obtained. Previously stored excitation and transfer functions can be used for synthesizing personalized or modified human speech. Configurations of EM sensor and acoustic microphone systems are described to enhance noise cancellation and to enable multiple articulator measurements.

  6. The expression and recognition of emotions in the voice across five nations: A lens model analysis based on acoustic features.

    Science.gov (United States)

    Laukka, Petri; Elfenbein, Hillary Anger; Thingujam, Nutankumar S; Rockstuhl, Thomas; Iraki, Frederick K; Chui, Wanda; Althoff, Jean

    2016-11-01

    This study extends previous work on emotion communication across cultures with a large-scale investigation of the physical expression cues in vocal tone. In doing so, it provides the first direct test of a key proposition of dialect theory, namely that greater accuracy of detecting emotions from one's own cultural group (known as in-group advantage) results from a match between culturally specific schemas in emotional expression style and culturally specific schemas in emotion recognition. Study 1 used stimuli from 100 professional actors from five English-speaking nations vocally conveying 11 emotional states (anger, contempt, fear, happiness, interest, lust, neutral, pride, relief, sadness, and shame) using standard-content sentences. Detailed acoustic analyses showed many similarities across groups, and yet also systematic group differences. This provides evidence for cultural accents in expressive style at the level of acoustic cues. In Study 2, listeners evaluated these expressions in a 5 × 5 design balanced across groups. Cross-cultural accuracy was greater than expected by chance. However, there was also in-group advantage, which varied across emotions. A lens model analysis of fundamental acoustic properties examined patterns in emotional expression and perception within and across groups. Acoustic cues were used relatively similarly across groups both to produce and judge emotions, and yet there were also subtle cultural differences. Speakers appear to have a culturally nuanced schema for enacting vocal tones via acoustic cues, and perceivers have a culturally nuanced schema in judging them. Consistent with dialect theory's prediction, in-group judgments showed a greater match between these schemas used for emotional expression and perception.

  7. Intelligent Facial Recognition Systems: Technology advancements for security applications

    Energy Technology Data Exchange (ETDEWEB)

    Beer, C.L.

    1993-07-01

    Insider problems such as theft and sabotage can occur within the security and surveillance realm of operations when unauthorized people obtain access to sensitive areas. A possible solution to these problems is a means to identify individuals (not just credentials or badges) in a given sensitive area and provide full time personnel accountability. One approach desirable at Department of Energy facilities for access control and/or personnel identification is an Intelligent Facial Recognition System (IFRS) that is non-invasive to personnel. Automatic facial recognition does not require the active participation of the enrolled subjects, unlike most other biological measurement (biometric) systems (e.g., fingerprint, hand geometry, or eye retinal scan systems). It is this feature that makes an IFRS attractive for applications other than access control such as emergency evacuation verification, screening, and personnel tracking. This paper discusses current technology that shows promising results for DOE and other security applications. A survey of research and development in facial recognition identified several companies and universities that were interested and/or involved in the area. A few advanced prototype systems were also identified. Sandia National Laboratories is currently evaluating facial recognition systems that are in the advanced prototype stage. The initial application for the evaluation is access control in a controlled environment with a constant background and with cooperative subjects. Further evaluations will be conducted in a less controlled environment, which may include a cluttered background and subjects that are not looking towards the camera. The outcome of the evaluations will help identify areas of facial recognition systems that need further development and will help to determine the effectiveness of the current systems for security applications.

  8. Method for secure electronic voting system: face recognition based approach

    Science.gov (United States)

    Alim, M. Affan; Baig, Misbah M.; Mehboob, Shahzain; Naseem, Imran

    2017-06-01

    In this paper, we propose a framework for a low-cost secure electronic voting system based on face recognition. Local Binary Pattern (LBP) is used for face feature characterization in texture format, and the chi-square distribution is then used for image classification. Two parallel systems are developed, based on smartphone and web applications, for the face learning and verification modules. The proposed system has two-tier security, using a person ID followed by face verification. A class-specific threshold controls the security level of face verification. Our system is evaluated on three standard databases and one real home-based database and achieves satisfactory recognition accuracies. Consequently, our proposed system provides a secure, hassle-free voting system that is less intrusive than other biometrics.
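
    The verification flow described here (a person ID check, then an LBP texture comparison against a class-specific threshold) can be sketched roughly as follows. This is an illustrative reading of the abstract, not the authors' implementation. The basic 3x3 LBP operator, the chi-square histogram distance, the function names and the threshold values are all assumptions.

```python
def lbp_histogram(img):
    """Normalized 256-bin histogram of basic 3x3 LBP codes.

    `img` is a 2D list of grey levels, at least 3x3; border pixels are skipped.
    """
    h, w = len(img), len(img[0])
    # The 8 neighbours, clockwise from the top-left corner.
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    hist = [0] * 256
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            code = 0
            for bit, (dy, dx) in enumerate(offsets):
                # Set the bit when the neighbour is at least as bright as the centre.
                if img[y + dy][x + dx] >= img[y][x]:
                    code |= 1 << bit
            hist[code] += 1
    total = sum(hist)
    return [count / total for count in hist]


def chi_square(h1, h2):
    """Chi-square distance between two histograms (0 means identical)."""
    return sum((a - b) ** 2 / (a + b) for a, b in zip(h1, h2) if a + b > 0)


def verify(person_id, probe_img, enrolled, thresholds):
    """Tier 1: the ID must be enrolled. Tier 2: the face must match within
    that person's own (class-specific) threshold."""
    if person_id not in enrolled:
        return False
    distance = chi_square(lbp_histogram(probe_img), enrolled[person_id])
    return distance <= thresholds[person_id]


# Hypothetical 5x5 "faces": a flat patch and a checkerboard texture.
flat = [[7] * 5 for _ in range(5)]
checker = [[(x + y) % 2 for x in range(5)] for y in range(5)]
enrolled = {"voter-001": lbp_histogram(flat)}
thresholds = {"voter-001": 0.1}
```

    A lower per-person threshold makes verification stricter for that identity, which is one plausible reading of how a class-specific threshold could control the security level.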

  9. Two Systems for Automatic Music Genre Recognition

    DEFF Research Database (Denmark)

    Sturm, Bob L.

    2012-01-01

    We re-implement and test two state-of-the-art systems for automatic music genre classification; but unlike past works in this area, we look closer than ever before at their behavior. First, we look at specific instances where each system consistently applies the same wrong label across multiple t...

  10. Additive attacks on speaker recognition

    Science.gov (United States)

    Farrokh Baroughi, Alireza; Craver, Scott

    2014-02-01

    Speaker recognition is used to identify a speaker's voice from among a group of known speakers. A common method of speaker recognition is a classification based on cepstral coefficients of the speaker's voice, using a Gaussian mixture model (GMM) to model each speaker. In this paper we try to fool a speaker recognition system using additive noise such that an intruder is recognized as a target user. Our attack uses a mixture selected from a target user's GMM model, inverting the cepstral transformation to produce noise samples. In our 5-speaker database, we achieve an attack success rate of 50% with a noise signal at 10 dB SNR, and 95% by increasing noise power to 0 dB SNR. The importance of this attack is its simplicity and flexibility: it can be employed in real time with no processing of an attacker's voice, and little computation is needed at the moment of detection, allowing the attack to be performed by a small portable device. For any target user, knowing that user's model or voice sample is sufficient to compute the attack signal, and it is enough that the intruder plays it while he/she is speaking to be classified as the victim.
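
    The baseline classifier that this attack targets (one GMM per speaker, scored on cepstral feature frames) can be sketched as below. This shows only the generic scoring rule, not the attack and not the authors' system. The diagonal covariances, the toy two-dimensional "cepstral" frames and the model parameters are hypothetical.

```python
import math


def gmm_log_likelihood(frames, weights, means, variances):
    """Average per-frame log-likelihood under a diagonal-covariance GMM.

    frames: list of feature vectors; weights: K mixture weights;
    means / variances: K vectors of the same dimension as a frame.
    """
    total = 0.0
    for x in frames:
        comp_logs = []
        for w, mu, var in zip(weights, means, variances):
            log_p = math.log(w)
            for xi, mi, vi in zip(x, mu, var):
                log_p += -0.5 * (math.log(2 * math.pi * vi) + (xi - mi) ** 2 / vi)
            comp_logs.append(log_p)
        # Log-sum-exp over the mixture components, for numerical stability.
        m = max(comp_logs)
        total += m + math.log(sum(math.exp(c - m) for c in comp_logs))
    return total / len(frames)


def identify(frames, speaker_models):
    """Return the enrolled speaker whose GMM scores the frames highest."""
    scores = {name: gmm_log_likelihood(frames, *params)
              for name, params in speaker_models.items()}
    return max(scores, key=scores.get)


# Hypothetical single-component models: (weights, means, variances).
speaker_models = {
    "alice": ([1.0], [[0.0, 0.0]], [[1.0, 1.0]]),
    "bob":   ([1.0], [[5.0, 5.0]], [[1.0, 1.0]]),
}
frames = [[4.8, 5.1], [5.2, 4.9]]
```

    Because the decision depends only on which model assigns the highest likelihood to the observed frames, added noise that shifts the frames toward a target speaker's high-likelihood region would be expected to change the outcome, which is the weakness the abstract describes.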

  11. Design of real-time voice over internet protocol system under bandwidth network

    Science.gov (United States)

    Zhang, Li; Gong, Lina

    2017-04-01

    With network bandwidth increasing and network convergence accelerating, VoIP communication is becoming increasingly popular. Real-time identification and analysis of VoIP flows over the backbone network has become an urgent need and a research hotspot in network operations management. Based on this, the paper proposes a VoIP business management system for backbone networks. The system first filters the VoIP data stream on the backbone network and then resolves the call signaling information and media voice. The system can also apply appropriately designed rules to perform real-time reduction and presentation of specific categories of calls. Experimental results show that the system can parse and process backbone VoIP calls in real time and that the results are presented accurately in the management interface, providing the necessary technical support for VoIP-based network traffic management and maintenance.

  12. Study on Unequal Error Protection for Distributed Speech Recognition System

    Institute of Scientific and Technical Information of China (English)

    XIE Xiang; WANG Si-yao; LIU Jia-kang

    2006-01-01

    Unequal error protection (UEP) is applied in a distributed speech recognition (DSR) system, and three schemes are proposed. All three schemes are evaluated on a GSM simulation platform for recognizing Mandarin digit strings and compared with the equal error protection (EEP) scheme. Experiments show that UEP protects the data transmitted in the DSR system more effectively, which results in a higher word accuracy rate for the DSR system.

  13. Auditory signal design for automatic number plate recognition system

    NARCIS (Netherlands)

    Heydra, C.G.; Jansen, R.J.; Van Egmond, R.

    2014-01-01

    This paper focuses on the design of an auditory signal for the Automatic Number Plate Recognition system of Dutch national police. The auditory signal is designed to alert police officers of suspicious cars in their proximity, communicating priority level and location of the suspicious car and takin

  14. A Vehicle License Plate Detection and Recognition System

    Directory of Open Access Journals (Sweden)

    Khalid W. Maglad

    2012-01-01

    Problem statement: Automatic vehicle license plate detection and recognition is a key technique in most traffic-related applications and is an active research topic in the image processing domain. Different methods, techniques and algorithms have been developed for license plate detection and recognition. Approach: Because the characteristics of license plates vary from country to country (numbering system, colors, language of characters, style (font) and size of the plate), further research is still needed in this area. Results: Most Middle East countries use a combination of Arabic and English letters, along with their country's logo. This makes localizing the plate number, differentiating between Arabic and English letters and the logo object, and finally recognizing those characters a more challenging research task. The use of artificial neural networks has proved beneficial for plate recognition, but it has not been applied to plate detection. A Radial Basis Function (RBF) neural network is used both for the detection and recognition of Saudi Arabian license plates. Conclusion/Recommendations: The proposed approach has been tested on 200 front images of national license plates of Saudi Arabia. A higher percentage of accuracy has been obtained, which shows the significance of this approach. The study could be further investigated in other Middle East countries.

  15. F-generation law and recognition of system law

    Institute of Scientific and Technical Information of China (English)

    Shi Kaiquan; Yao Bingxue

    2007-01-01

    If a system is not disturbed (or invaded) by some law, there is no doubt that each system will move according to the expected law and keep stable. Although such a fact often appears, some unknown law breaks into the system and leads it into turbulence. Using function one direction S-rough sets, this article gives the concept of the F-generation law in the system, the generation model of the F-generation law and the recognition method of the system law. Function one direction singular rough sets is a new theory and method in recognizing the disturbance law existing in the system and recognizing the system law.

  16. Admission Control of Integrated Voice and Data CDMA/TDD System Considering Asymmetric Traffic and Power Limit

    Institute of Scientific and Technical Information of China (English)

    CAO Yanbo; ZHOU Bin; LI Chengshu

    2004-01-01

    In this paper, we study an admission control scheme for an integrated voice and data CDMA/TDD (code division multiple access/time division duplex) system considering asymmetric traffic and power limit. A new user can access the system only if the outage probabilities it experiences on the uplink and downlink time slots are below a threshold value. Based on the power limit, the results show the voice and data blocking probabilities under different cell coverage, arrival rates and various uplink/downlink time slot allocation patterns. Furthermore, multicode and multislot schemes are also evaluated under the presented admission control scheme.

  17. Efficient Web-based Facial Recognition System Employing 2DHOG

    CERN Document Server

    Abdelwahab, Moataz M; Yousry, Islam

    2012-01-01

    In this paper, a system for facial recognition to identify missing and found people in Hajj and Umrah is described as a web portal. Specifically, we present a novel algorithm for recognition and classification of facial images based on applying 2DPCA to a 2D representation of the histogram of oriented gradients (2D-HOG), which maintains the spatial relation between pixels of the input images. This algorithm allows a compact representation of the images, which reduces the computational complexity and the storage requirements while maintaining the highest reported recognition accuracy. This makes the method suitable for use with very large datasets. A large dataset was collected for people in Hajj. Experimental results employing the ORL, UMIST, JAFFE, and HAJJ datasets confirm these excellent properties.

  18. Voice Disorders

    Science.gov (United States)

    Voice is the sound made by air passing from your lungs through your larynx, or voice box. In your larynx are your vocal cords, ... to make sound. For most of us, our voices play a big part in who we are, ...

  19. Every Voice

    Science.gov (United States)

    Patrick, Penny

    2008-01-01

    This article discusses how the author develops an approach that allows her students, who are part of the marginalized population, to learn the power of their own voices--not just their writing voices, but their oral voices as well. The author calls it "TWIST": Thoughts, Writing folder, Inquiring mind, Supplies, and Teamwork. It is where…

  1. Voice restoration

    NARCIS (Netherlands)

    Hilgers, F.J.M.; Balm, A.J.M.; van den Brekel, M.W.M.; Tan, I.B.; Remacle, M.; Eckel, H.E.

    2010-01-01

    Surgical prosthetic voice restoration is the best possible option for patients to regain oral communication after total laryngectomy. It is considered to be the present "gold standard" for voice rehabilitation of laryngectomized individuals. Surgical prosthetic voice restoration, in essence, is alwa

  2. Voice over Internet Protocol (VoIP) Technology as a Global Learning Tool: Information Systems Success and Control Belief Perspectives

    Science.gov (United States)

    Chen, Charlie C.; Vannoy, Sandra

    2013-01-01

    Voice over Internet Protocol (VoIP)-enabled online learning service providers struggle with high attrition rates and low customer loyalty despite VoIP's high degree of system fit for online global learning applications. Effective solutions to this prevalent problem rely on the understanding of system quality, information quality, and…

  3. A Real-Time Face Recognition System Using Eigenfaces

    Directory of Open Access Journals (Sweden)

    Daniel Georgescu

    2011-12-01

    A real-time system for recognizing faces in a video stream provided by a surveillance camera was implemented, including real-time face detection. Both face detection and face recognition techniques are briefly presented, without skipping the important technical aspects. The proposed approach was essentially to implement and verify the Eigenfaces for Recognition algorithm, which solves the recognition problem for two-dimensional representations of faces using principal component analysis. The snapshots, which are the input images for the proposed system, are projected into a face space (feature space) that best captures the variation among the face images in the training set. The face space is defined by the ‘eigenfaces’, which are the eigenvectors of the set of faces. Each eigenface contributes, with a corresponding weight, to the reconstruction of a new face image projected onto the face space. The projection of the new image in this feature space is then compared with the available projections of the training set to identify the person using the Euclidean distance. The implemented system performs real-time face detection and face recognition, and gives feedback by displaying a window with the subject's information from the database and sending an e-mail notification to interested institutions.
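
    The projection-and-compare step described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the use of SVD to obtain the eigenvectors, and the four-pixel toy "images" with the labels "alice" and "bob" are all assumptions made for illustration, and NumPy is assumed to be available.

```python
import numpy as np

def train_eigenfaces(faces, num_components):
    """faces: (n_samples, n_pixels) array of flattened training images."""
    mean_face = faces.mean(axis=0)
    centered = faces - mean_face
    # SVD of the centered data: the rows of vt are the eigenvectors of the
    # covariance matrix (the "eigenfaces"), ordered by decreasing variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    eigenfaces = vt[:num_components]
    weights = centered @ eigenfaces.T  # training-set projections in face space
    return mean_face, eigenfaces, weights

def recognize(image, mean_face, eigenfaces, weights, labels):
    """Return the label of the training face nearest in face space."""
    w = (image - mean_face) @ eigenfaces.T
    distances = np.linalg.norm(weights - w, axis=1)  # Euclidean distance
    return labels[int(np.argmin(distances))]

# Toy data (hypothetical): four synthetic 4-pixel "images" of two people.
faces = np.array([[10.0, 10.0, 0.0, 0.0],
                  [11.0, 9.0, 0.0, 1.0],
                  [0.0, 0.0, 10.0, 10.0],
                  [1.0, 0.0, 9.0, 11.0]])
labels = ["alice", "alice", "bob", "bob"]
mean_face, eigenfaces, weights = train_eigenfaces(faces, num_components=2)
probe = np.array([9.0, 10.0, 1.0, 0.0])
result = recognize(probe, mean_face, eigenfaces, weights, labels)
```

    Working in the reduced face space rather than on raw pixels keeps the per-frame comparison cheap, which is one plausible reason the technique suits a real-time setting.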

  4. Recognition of bacterial plant pathogens: local, systemic and transgenerational immunity.

    Science.gov (United States)

    Henry, Elizabeth; Yadeta, Koste A; Coaker, Gitta

    2013-09-01

    Bacterial pathogens can cause multiple plant diseases and plants rely on their innate immune system to recognize and actively respond to these microbes. The plant innate immune system comprises extracellular pattern recognition receptors that recognize conserved microbial patterns and intracellular nucleotide binding leucine-rich repeat (NLR) proteins that recognize specific bacterial effectors delivered into host cells. Plants lack the adaptive immune branch present in animals, but still afford flexibility to pathogen attack through systemic and transgenerational resistance. Here, we focus on current research in plant immune responses against bacterial pathogens. Recent studies shed light onto the activation and inactivation of pattern recognition receptors and systemic acquired resistance. New research has also uncovered additional layers of complexity surrounding NLR immune receptor activation, cooperation and sub-cellular localizations. Taken together, these recent advances bring us closer to understanding the web of molecular interactions responsible for coordinating defense responses and ultimately resistance.

  5. Connected digit speech recognition system for Malayalam language

    Indian Academy of Sciences (India)

    Cini Kurian; Kannan Balakrishnan

    2013-12-01

    Connected digit speech recognition is important in many applications such as automated banking systems, catalogue dialing and automatic data entry. This paper presents an optimum speaker-independent connected digit recognizer for the Malayalam language. The system employs Perceptual Linear Predictive (PLP) cepstral coefficients for speech parameterization and a continuous density Hidden Markov Model (HMM) in the recognition process. The Viterbi algorithm is used for decoding. The training database contains the utterances of 21 speakers in the age group of 20 to 40 years, recorded in a normal office environment, where each speaker was asked to read 20 sets of continuous digits. The system obtained an accuracy of 99.5% on unseen data.
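
    As a rough illustration of the Viterbi decoding step mentioned above, the sketch below finds the most likely state sequence of a small discrete HMM. The paper uses continuous density HMMs over PLP features; the discrete emission table, the two states "one" and "two", and the symbols "a" and "b" are simplifying assumptions chosen only to keep the example short.

```python
def viterbi(observations, states, start_p, trans_p, emit_p):
    """Return the most likely hidden-state sequence for the observations."""
    # best[t][s] = (probability of the best path ending in s at time t,
    #               predecessor state on that path)
    best = [{s: (start_p[s] * emit_p[s][observations[0]], None) for s in states}]
    for obs in observations[1:]:
        column = {}
        for s in states:
            column[s] = max(
                (best[-1][p][0] * trans_p[p][s] * emit_p[s][obs], p)
                for p in states
            )
        best.append(column)
    # Backtrack from the most probable final state.
    state = max(states, key=lambda s: best[-1][s][0])
    path = [state]
    for column in reversed(best[1:]):
        state = column[state][1]
        path.append(state)
    return path[::-1]

# Hypothetical two-digit model with two acoustic symbols.
states = ["one", "two"]
start_p = {"one": 0.5, "two": 0.5}
trans_p = {"one": {"one": 0.7, "two": 0.3}, "two": {"one": 0.3, "two": 0.7}}
emit_p = {"one": {"a": 0.9, "b": 0.1}, "two": {"a": 0.2, "b": 0.8}}
decoded = viterbi(["a", "a", "b", "b"], states, start_p, trans_p, emit_p)
```

    A practical recognizer would work with log probabilities to avoid numerical underflow on long utterances.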

  6. Traffic Sign Recognition System based on Cambridge Correlator Image Comparator

    OpenAIRE

    J. Turan; L. Ovsenik; T. Harasthy

    2012-01-01

    The paper presents basic information about the application of an Optical Correlator (OC), specifically the Cambridge Correlator, in a system for recognizing traffic signs. The Traffic Sign Recognition System consists of three main blocks: Preprocessing, Optical Correlator and Traffic Sign Identification. The Region of Interest (ROI) is defined and chosen in the preprocessing block and then passed to the Optical Correlator, where it is compared with a database of traffic signs. The output of the optical correlation is a correlation plane, w...

  7. Speech Acquisition and Automatic Speech Recognition for Integrated Spacesuit Audio Systems

    Science.gov (United States)

    Huang, Yiteng; Chen, Jingdong; Chen, Shaoyan

    2010-01-01

    A voice-command human-machine interface system has been developed for spacesuit extravehicular activity (EVA) missions. A multichannel acoustic signal processing method has been created for distant speech acquisition in noisy and reverberant environments. This technology reduces noise by exploiting differences in the statistical nature of signal (i.e., speech) and noise that exist in the spatial and temporal domains. As a result, the automatic speech recognition (ASR) accuracy can be improved to the level at which crewmembers would find the speech interface useful. The developed speech human/machine interface will enable both crewmember usability and operational efficiency. It can offer a fast rate of data/text entry and a small overall size, and can be lightweight. In addition, this design will free the hands and eyes of a suited crewmember. The system components and steps include beam forming/multi-channel noise reduction, single-channel noise reduction, speech feature extraction, feature transformation and normalization, feature compression, model adaptation, ASR HMM (Hidden Markov Model) training, and ASR decoding. A state-of-the-art phoneme recognizer can obtain an accuracy rate of 65 percent when the training and testing data are free of noise. When it is used in spacesuits, the rate drops to about 33 percent. With the developed microphone array speech-processing technologies, the performance is improved and the phoneme recognition accuracy rate rises to 44 percent. The recognizer can be further improved by combining the microphone array and HMM model adaptation techniques and using speech samples collected from inside spacesuits. In addition, arithmetic complexity models for the major HMM-based ASR components were developed. They can help real-time ASR system designers select proper tasks when facing constraints on computational resources.

  8. Speech Recognition: How Do We Teach It?

    Science.gov (United States)

    Barksdale, Karl

    2002-01-01

    States that growing use of speech recognition software has made voice writing an essential computer skill. Describes how to present the topic, develop basic speech recognition skills, and teach speech recognition outlining, writing, proofreading, and editing. (Contains 14 references.) (SK)

  9. Autonomous facial recognition system inspired by human visual system based logarithmical image visualization technique

    Science.gov (United States)

    Wan, Qianwen; Panetta, Karen; Agaian, Sos

    2017-05-01

    Autonomous facial recognition systems are widely used in real-life applications, such as homeland border security, law enforcement identification and authentication, and video-based surveillance analysis. Issues like low image quality, non-uniform illumination, and variations in poses and facial expressions can impair the performance of recognition systems. To address the non-uniform illumination challenge, we present a novel robust autonomous facial recognition system based on the so-called logarithmical image visualization technique, which is inspired by the human visual system. In this paper, the proposed method, for the first time, couples the logarithmical image visualization technique with the local binary pattern to perform discriminative feature extraction for a facial recognition system. The Yale database, the Yale-B database and the ATT database are used for computer simulation accuracy and efficiency testing. The extensive computer simulation demonstrates the method's efficiency, accuracy, and robustness to illumination variation for facial recognition.
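
    The local binary pattern (LBP) part of this pipeline can be sketched in a few lines. The example below is a basic 3x3 LBP in plain Python, not the authors' method: it omits the logarithmical image visualization step entirely, and the neighbour ordering and the tiny test image are assumptions made for illustration.

```python
def lbp_code(patch):
    """8-bit local binary pattern of the center pixel of a 3x3 patch."""
    center = patch[1][1]
    # Neighbours visited clockwise, starting at the top-left pixel.
    offsets = [(0, 0), (0, 1), (0, 2), (1, 2), (2, 2), (2, 1), (2, 0), (1, 0)]
    code = 0
    for bit, (r, c) in enumerate(offsets):
        if patch[r][c] >= center:
            code |= 1 << bit
    return code

def lbp_histogram(image):
    """256-bin histogram of LBP codes over all interior pixels."""
    hist = [0] * 256
    for r in range(1, len(image) - 1):
        for c in range(1, len(image[0]) - 1):
            patch = [row[c - 1:c + 2] for row in image[r - 1:r + 2]]
            hist[lbp_code(patch)] += 1
    return hist

# Hypothetical 3x4 grayscale image and a uniformly brighter copy of it.
image = [[9, 1, 9, 3],
         [1, 5, 1, 7],
         [9, 1, 9, 2]]
brighter = [[p + 50 for p in row] for row in image]
```

    Because each code depends only on comparisons with the center pixel, `lbp_histogram(image)` and `lbp_histogram(brighter)` are identical, which is the kind of tolerance to brightness shifts that makes LBP attractive under uneven lighting.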

  10. Face Recognition System based on SURF and LDA Technique

    Directory of Open Access Journals (Sweden)

    Narpat A. Singh

    2016-02-01

    Improving the quality of face recognition systems has been a challenge over the past decade. The problem is widely studied across different types of images in order to obtain the best quality of faces in real-life settings. The difficulties arise from illumination and pose effects, where lighting alters gradient features. Improving and optimizing human face recognition and detection is an important real-life problem that can be addressed by optimizing the error rate, accuracy, peak signal-to-noise ratio, mean square error, and structural similarity index. Several methods have been proposed to recognize faces under different conditions and to optimize these parameters. Many changes occur in human faces due to illumination and pose variations. In this paper we propose a novel face recognition method that improves the quality parameters using speeded-up robust features (SURF) and linear discriminant analysis (LDA) to optimize the result. SURF is used for feature matching, and linear discriminant analysis is used for edge dimension reduction on live faces from our datasets. A comparative analysis shows that the proposed method gives better results than previous ones, with better quality and better results on live face images.

  11. Computer recognition of slag property diagrams in ternary systems

    Institute of Scientific and Technical Information of China (English)

    Jinxiong Lu; Li Wang; Jiongming Zhang; Xinhua Wang

    2004-01-01

    In order to take data information from the slag property diagram in a ternary system automatically and actually, a picture recognition and drawing software has been developed by Visual Basic 6.0 based on the image coding principle of computer system and the graphics programming method of VB. This software can transform the ternary system isopleth diagram from bitmap format to data file and establish a corresponding database which can be applied to rapidly retrieve a mass of data and make correlative thermodynamics or kinetics calculation. Besides, it still has the function of drawing the ternary system diagram which can draw different kinds of property parameters in the same diagram.

  12. Design of an expert system for phonetic speech recognition

    Energy Technology Data Exchange (ETDEWEB)

    Carbonell, N.; Haton, J.P.; Pierrel, J.M.; Lonchamp, F.

    1983-07-01

    Expert systems have been extensively used as a means for integrating the expertise of a human being into an artificial intelligence system. The authors are presently designing an expert system which will integrate the strategy and the knowledge of a phonetician reading a speech spectrogram. Their goal is twofold, firstly to obtain a better insight into the acoustic-decoding of speech, and, secondly, to improve the efficiency of present automatic phonetic recognition systems. This paper presents a preliminary description of the project, especially the overall strategy of the expert and the role of duration parameters in the segmentation and identification processes.

  13. Blind Recognition Algorithm of Turbo Codes for Communication Intelligence Systems

    Directory of Open Access Journals (Sweden)

    Ali Naseri

    2011-11-01

    Turbo codes are widely used in land and space radio communication systems and, because of the complexity of their structure, are common in military communication systems. In electronic warfare, COMINT systems attempt to recognize codes blindly. In this paper, an algorithm is proposed for blind recognition of turbo code parameters such as code kind, code-word length, code rate, interleaver length and the number of delay blocks of the convolutional code. The computational volume of the algorithm is 0.5L^3 + 1.25L, so it is suitable for real-time systems.

  14. VoiceRelay: voice key operation using visual basic.

    Science.gov (United States)

    Abrams, Lise; Jennings, David T

    2004-11-01

    Using a voice key is a popular method for recording vocal response times in a variety of language production tasks. This article describes a class module called VoiceRelay that can be easily utilized in Visual Basic programs for voice key operation. This software-based voice key offers the precision of traditional voice keys (although accuracy is system dependent), as well as the flexibility of volume and sensitivity control. However, VoiceRelay is a considerably less expensive alternative for recording vocal response times because it operates with existing PC hardware and does not require the purchase of external response boxes or additional experiment-generation software. A sample project demonstrating implementation of the VoiceRelay class module may be downloaded from the Psychonomic Society Web archive, www.psychonomic.org/archive.

  15. DEVELOPMENT OF AN INTELLIGENT RECOGNITION AND SORTING SYSTEM

    Directory of Open Access Journals (Sweden)

    Zhi Li

    2012-01-01

    ENGLISH ABSTRACT: This paper presents the design of an intelligent recognition and sorting system. Intelligence is included in the system by using a multilayer feed-forward artificial neural network (ANN) for image recognition. Full duplex Bluetooth communication is used between the intelligent system and a robot-control computer. Image compression and principal component analysis (PCA) reduce the dimensionality of the data, and only the salient feature vectors of an image are used for image recognition. A control signal guides a robot arm to place an object into an allocated space. The system is relatively immune to noise, and can generalise when faced with missing data.

    AFRIKAANS SUMMARY: This article presents the design of an intelligent recognition and sorting system. Intelligence is built into the system by means of an artificial neural network for image recognition. Communication is established between the intelligent system and a robot-control computer. Image compression and principal component analysis reduce the dimensionality of the data, and only critical feature vectors are used for image recognition. A control signal directs the robot arm to place the object in a designated location. The system is relatively immune to noise and can generalise when confronted with missing data.

  16. Youth Advisors Driving Action: hearing the youth voice in mental health systems of care.

    Science.gov (United States)

    Davis-Brown, Karen; Carter, Naeemah; Miller, Bethany D

    2012-03-01

    The Children's Mental Health Initiative (CMHI) is funded by the Center for Mental Health Services within the Substance Abuse and Mental Health Services Administration. CMHI assists communities in developing comprehensive, coordinated services for children with serious emotional disturbances and their families. Broadly speaking, these systems are designed to be child centered, youth guided, family driven, community based, and culturally competent. To assure implementation of the "youth-guided" core value in the national evaluation, an advisory group of youth coordinator/youth teams representing communities across the country was developed. This group chose the name YADA-Youth Advisors Driving Action. YADA has made a substantive contribution to national evaluation efforts by bringing the youth perspective and voice to its audience at the community and national levels. This article describes YADA's founding and development, as well as related implications for psychosocial nurses.

  17. Automatic Vehicle License Recognition Based on Video Vehicular Detection System

    Institute of Scientific and Technical Information of China (English)

    YANG Zhaoxuan; CHEN Yang; HE Yinghua; WU Jun

    2006-01-01

    Traditional methods of license character extraction cannot meet the requirements of recognition accuracy and speed imposed by the video vehicular detection system. Therefore, a license plate localization method based on multi-scale edge detection and a character segmentation algorithm based on a Markov random field model are presented. Experimental results demonstrate that the method yields more accurate license character extraction than the traditional localization method based on edge detection by difference operator and character segmentation based on thresholding. The accuracy increases from 90% to 94% under favorable illumination, while under poor conditions it increases by more than 5%. When the two improved algorithms are used, the accuracy and speed of automatic license recognition meet the system's requirements even under noisy conditions or uneven illumination.

  18. Face Recognition System Based on Spectral Graph Wavelet Theory

    Directory of Open Access Journals (Sweden)

    R. Premalatha Kanikannan

    2014-09-01

    This study presents an efficient approach for automatic face recognition based on Spectral Graph Wavelet Theory (SGWT). SGWT is analogous to the wavelet transform, and the transform functions are defined on the vertices of a weighted graph. The given face image is first decomposed by SGWT. The energies of the obtained sub-bands are fused together and used as the feature vector for the corresponding image. The performance of the proposed system is analyzed on the ORL face database using a nearest neighbor classifier. The face images used in this study have variations in pose, expression and facial details. The results indicate that the proposed system based on SGWT is better than the wavelet transform, achieving 94% recognition accuracy.

  19. Advances in signal processing and intelligent recognition systems

    CERN Document Server

    Gelbukh, Alexander; Mukhopadhyay, Jayanta

    2014-01-01

    This edited volume contains a selection of refereed and revised papers originally presented at the International Symposium on Signal Processing and Intelligent Recognition Systems (SIRS-2014), March 13-15, 2014, Trivandrum, India. The program committee received 134 submissions from 11 countries. Each paper was peer reviewed by at least three independent referees of the program committee, and 52 papers were finally selected. The papers offer stimulating insights into Pattern Recognition, Machine Learning and Knowledge-Based Systems; Signal and Speech Processing; Image and Video Processing; Mobile Computing and Applications; and Computer Vision. The book is directed to researchers and scientists engaged in various fields of signal processing and related areas.

  20. Contribution of Solvation Energy in Protein-Peptide Recognition Systems

    Institute of Scientific and Technical Information of China (English)

    LI,Fei; LI,Wei; SHEN,Jia-Cong

    2001-01-01

    The contribution of solvation energy to guiding molecular recognition for six rigid protein-peptide systems was evaluated by the variation in the number of identified native-like configurations and in the driving force of specific interaction resulting from the addition of the explicit solvation term to the force field function. The AMBER force field energy and the total energy, including the force field energy and the WZS solvation energy, were calculated for sampled configurations. The results obtained by the calculations of both force field and total energies were compared with each other. It suggests that specific recognition of the systems in which the ligands possess larger hydrophobic or aromatic residues while the protein receptors provide the active surfaces with hydrophobic property.

  1. An Automatic Interference Recognition Method in Spread Spectrum Communication System

    Institute of Scientific and Technical Information of China (English)

    YANG Xiao-ming; TAO Ran

    2007-01-01

    An algorithm to detect and recognize interferences embedded in a direct sequence spread spectrum (DSSS) communication system is proposed. Based on Welch's averaging modified periodogram method and fractional Fourier transformation (FRFT), the paper proposes a decision tree-based algorithm in which a set of decision criteria for identifying different types of interferences is developed. Simulation results demonstrate that the proposed algorithm provides a high recognition rate and is robust for various ISR and SNR.

  2. Decision support system in an international-voice-services business company

    Science.gov (United States)

    Hadianti, R.; Uttunggadewa, S.; Syamsuddin, M.; Soewono, E.

    2017-01-01

    We consider a problem faced by an international telecommunication services company in maximizing its profit from voice services by controlling cost and business partnership. Competition in this industry is very intense, so any efficiency gained from controlling cost and business partnership can help the company survive. The company trades voice traffic with a large number of business partners. There are four trading schemes that the company can choose from, namely flat rate, class tiering, volume commitment, and revenue capped. Each scheme has a specific characteristic on the rate and volume deal, where the last three schemes are regarded as strategic schemes to be offered to business partners to ensure incoming traffic volume for both parties. The company and each business partner need to choose an optimal agreement for a certain period of time that maximizes the company's profit. In this agreement, both parties agree to use a certain trading scheme, rate and rate/volume/revenue deal. A decision support system is therefore needed to give comprehensive information to the sales officers who deal with the business partners. This paper discusses the mathematical model of the optimal decision for incoming traffic volume control, which is a part of the analysis needed to build the decision support system. The mathematical model is built by first performing data analysis to see how elastic the incoming traffic volume is. Once the level of elasticity is obtained, we derive a mathematical model that can simulate the impact of any trading decision on the revenue of the company. The optimal decision can be obtained from these simulation results. To evaluate the performance of the proposed method, we apply our decision model to the historical data. A software tool incorporating our methodology is currently under construction.

  3. Shape Recognition Using A CMAC Based Learning System

    Science.gov (United States)

    Glanz, F. H.; Miller, W. T.

    1988-02-01

    This paper discusses pattern recognition using a learning system which can learn an arbitrary function of the input and which has built-in generalization with the characteristic that similar inputs lead to similar outputs even for untrained inputs. The amount of similarity is controlled by a parameter of the program at compile time. Inputs and/or outputs may be vectors. The system is trained in a way similar to other pattern recognition systems using an LMS rule. Patterns in the input space are not separated by hyperplanes in the way they normally are using adaptive linear elements. As a result, linear separability is not the problem it is when using Perceptron or Adaline type elements. In fact, almost any shape category region is possible, and a region need not be simply connected nor convex. An example is given of geometric shape recognition using as features autoregressive model parameters representing the shape boundaries. These features are approximately independent of translation, rotation, and size of the shape. Results in the form of percent correct on test sets are given for eight different combinations of training and test sets derived from two groups of shapes.
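
    The built-in generalization described above, where similar inputs give similar outputs and the amount of similarity is set by a parameter, can be illustrated with a one-dimensional CMAC trained by an LMS rule. This is a minimal sketch, not the authors' system: the class, its parameter names, and the integer input range are assumptions, and the paper's inputs are multi-dimensional feature vectors.

```python
class CMAC:
    """One-dimensional CMAC with integer inputs in [0, input_range)."""

    def __init__(self, input_range, generalization=4, learning_rate=0.5):
        self.c = generalization  # number of overlapping layers
        self.lr = learning_rate
        cells = (input_range + generalization) // generalization + 1
        self.weights = [[0.0] * cells for _ in range(generalization)]

    def _active_cells(self, x):
        # Layer t is shifted by t input units, so inputs closer than
        # `generalization` apart share at least one cell.
        return [(t, (x + t) // self.c) for t in range(self.c)]

    def predict(self, x):
        return sum(self.weights[t][i] for t, i in self._active_cells(x))

    def train(self, x, target):
        # LMS rule: spread the correction equally over the active cells.
        error = target - self.predict(x)
        for t, i in self._active_cells(x):
            self.weights[t][i] += self.lr * error / self.c

cmac = CMAC(input_range=32, generalization=4, learning_rate=1.0)
cmac.train(10, 1.0)
near, far = cmac.predict(11), cmac.predict(14)
```

    After a single training step at input 10, the untrained neighbour 11 already responds with 0.75 while input 14, four units away, still returns 0.0; a larger `generalization` value would widen that neighbourhood.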

  4. Design of Speech Recognition System Based on Embedded Linux

    Institute of Scientific and Technical Information of China (English)

    钟豪; 张常年; 徐成波

    2014-01-01

    This paper presents the hardware and software design of a voice recognition system using Samsung's S3C2440 and the high-performance speech recognition chip LD3320 designed by ICRoute. Under the embedded Linux operating system, it uses a multi-process mechanism to control the speech recognition chip, ultrasonic ranging and the pan-tilt (PTZ) unit, and applies speech recognition technology to a multi-angle ultrasonic ranging system. Testing shows that the system can control the measurement direction by recognizing voice commands, without manual intervention, and finally plays back the measurement results by voice.

  5. An Efficient Vein Pattern-based Recognition System

    CERN Document Server

    Soni, Mohit; Rao, M S; Gupta, Phalguni

    2010-01-01

    This paper presents an efficient human recognition system based on vein pattern from the palma dorsa. A new absorption based technique has been proposed to collect good quality images with the help of a low cost camera and light source. The system automatically detects the region of interest from the image and does the necessary preprocessing to extract features. A Euclidean Distance based matching technique has been used for making the decision. It has been tested on a data set of 1750 image samples collected from 341 individuals. The accuracy of the verification system is found to be 99.26% with false rejection rate (FRR) of 0.03%.
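
    The Euclidean-distance matching decision and the false rejection rate quoted above can be made concrete with a short sketch. The threshold, the feature vectors and the distance lists below are invented for illustration and are not taken from the paper; only the definitions of FRR (genuine attempts rejected) and FAR (impostor attempts accepted) are standard.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def verify(probe, template, threshold):
    """Accept the claimed identity when the features are close enough."""
    return euclidean(probe, template) <= threshold

def error_rates(genuine_distances, impostor_distances, threshold):
    """Return (FAR, FRR) for a given decision threshold."""
    frr = sum(d > threshold for d in genuine_distances) / len(genuine_distances)
    far = sum(d <= threshold for d in impostor_distances) / len(impostor_distances)
    return far, frr

# Hypothetical matching distances from genuine and impostor comparisons.
genuine = [0.2, 0.4, 0.9, 0.3]
impostor = [1.5, 0.8, 2.0, 1.1]
far, frr = error_rates(genuine, impostor, threshold=0.85)
```

    Raising the threshold lowers FRR at the cost of a higher FAR, so a reported FRR is most informative when read together with the operating threshold or the matching FAR.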

  6. Design and implementation of Smart Voice system

    Institute of Scientific and Technical Information of China (English)

    徐俊芳; 田素贞

    2012-01-01

    To address the problem that speech produced by current speech synthesis systems sounds strongly mechanical and has poor quality, a Chinese TTS software system, Smart Voice, was designed and implemented, and the way Smart Voice is called is described. The system uses a hierarchical, modular program structure and performs marker analysis, syntax analysis, sustain processing and tone sandhi processing. In particular, the sustain processor makes innovative use of waveform concatenation to create a sustain module with internal waveform looping, giving sustained sounds a high degree of naturalness. Operation of the system shows that Smart Voice sounds natural and can provide fast, flexible Chinese speech support for network, game and other computer applications.

  7. Source Separation via Spectral Masking for Speech Recognition Systems

    Directory of Open Access Journals (Sweden)

    Gustavo Fernandes Rodrigues

    2012-12-01

    In this paper we present an insight into the use of spectral masking techniques in the time-frequency domain as a preprocessing step for speech signal recognition. The performance of speech recognition systems is negatively affected in noisy environments or in the presence of other speech signals. The limits of these masking techniques for different levels of the signal-to-noise ratio are discussed. We show the robustness of the spectral masking techniques against four types of noise: white, pink, brown and human speech noise (bubble noise). The main contribution of this work is to analyze the performance limits of recognition systems using spectral masking. We obtain an increase of 18% in the speech hit rate when the speech signals are corrupted by other speech signals or bubble noise, at signal-to-noise ratios of approximately 1, 10 and 20 dB. On the other hand, applying the ideal binary masks to mixtures corrupted by white, pink and brown noise results in an average increase of 9% in the speech hit rate at the same signal-to-noise ratios. The experimental results suggest that spectral masking techniques are more suitable for bubble noise, which is produced by human speech, than for white, pink and brown noise.
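
As a rough illustration of the ideal binary mask named in this abstract, the sketch below keeps a time-frequency cell when the local speech-to-noise power ratio exceeds a criterion and zeroes it otherwise. The 0 dB default criterion, the toy power values, and the function names are assumptions; the abstract does not give the authors' exact settings.

```python
import math


def ideal_binary_mask(speech_power, noise_power, criterion_db=0.0):
    """Return a 0/1 mask: 1 where the local SNR (in dB) exceeds criterion_db."""
    mask = []
    for s_row, n_row in zip(speech_power, noise_power):
        row = []
        for s, n in zip(s_row, n_row):
            if s == 0:
                keep = False          # no speech energy in this cell
            elif n == 0:
                keep = True           # speech only, no noise
            else:
                keep = 10 * math.log10(s / n) > criterion_db
            row.append(1 if keep else 0)
        mask.append(row)
    return mask


def apply_mask(mixture, mask):
    """Zero out the mixture cells that the mask marks as noise-dominated."""
    return [[m * k for m, k in zip(m_row, k_row)]
            for m_row, k_row in zip(mixture, mask)]


# Hypothetical 2x2 time-frequency power grids (rows: frames, columns: bins).
speech = [[4.0, 0.1], [0.5, 9.0]]
noise = [[1.0, 1.0], [1.0, 1.0]]
mixture = [[5.0, 1.1], [1.5, 10.0]]

mask = ideal_binary_mask(speech, noise)
masked = apply_mask(mixture, mask)
```

The mask is called "ideal" because it needs the clean speech and noise separately, so it measures an upper performance limit rather than a deployable separator.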

  8. Human activity recognition based on Evolving Fuzzy Systems.

    Science.gov (United States)

    Iglesias, Jose Antonio; Angelov, Plamen; Ledezma, Agapito; Sanchis, Araceli

    2010-10-01

    Environments equipped with intelligent sensors can be of much help if they can recognize the actions or activities of their users. If this activity recognition is done automatically, it can be very useful for different tasks such as future action prediction, remote health monitoring, or interventions. Although there are several approaches for recognizing activities, most of them do not consider the changes in how a human performs a specific activity. We present an automated approach to recognize daily activities from the sensor readings of an intelligent home environment. However, as the way to perform an activity is usually not fixed but it changes and evolves, we propose an activity recognition method based on Evolving Fuzzy Systems.

  9. FUZZY NEURAL NETWORK FOR MACHINE PARTS RECOGNITION SYSTEM

    Institute of Scientific and Technical Information of China (English)

    Luo Xiaobin; Yin Guofu; Chen Ke; Hu Xiaobing; Luo Yang

    2003-01-01

    The primary purpose is to develop a robust, adaptive machine parts recognition system. A fuzzy neural network classifier based on a fuzzy mapping model is proposed for machine parts classification. It is an efficient modeling method that, through learning, can approximate an arbitrary nonlinear function. The experimental machine parts classification system is introduced. A robust least square back-propagation (RLSBP) training algorithm, which combines robust least square (RLS) with the back-propagation (BP) algorithm, is put forward. Simulation and experimental results show that the learning property of RLSBP is superior to that of BP.

  10. A Systemic Functional Approach to the Passive Voice in English into Spanish Translation: Thematic Development in a Medical Research Article

    Directory of Open Access Journals (Sweden)

    Rodríguez-Vergara Daniel

    2017-01-01

    The purpose of this study was to explore, from the perspective of Systemic Functional Grammar, how passive clauses in a medical research article were translated into Spanish, specifically if they were kept in the passive voice, were changed into the active voice, or were turned into some other structure, and if voice change in the translated version affected the original thematic development. The medical paper chosen for this study was originally written in English and published in an Anglophone journal; it was then translated into Spanish and published in a Mexican journal. Both the original and the translated article were analyzed in terms of Theme and Rheme; all of the instances of passive and active voice were quantified and compared. The results show that in some cases the original thematic patterns were modified in the translation due to the use of the reflexive passive in Spanish, which results in the fronting of the verb in the sentences, thereby causing a change of Themes in the paragraphs with respect to the original structure. This study contributes to our understanding of the function of passive constructions in English and Spanish and its relationship with thematic progression.

  11. An Automatic Number Plate Recognition System under Image Processing

    Directory of Open Access Journals (Sweden)

    Sarbjit Kaur

    2016-03-01

    Automatic Number Plate Recognition (ANPR) is an application of computer vision and image processing that takes a photograph of a vehicle as input, extracts the number plate from the whole vehicle image, and displays the number plate information as text. An ANPR system mainly consists of four phases: acquisition and pre-processing of the vehicle image, extraction of the number plate area, character segmentation, and character recognition. The overall accuracy and efficiency of the whole ANPR system depend on the number plate extraction phase, because the character segmentation and character recognition phases depend on its output. The accuracy of the number plate extraction phase in turn depends on the quality of the captured vehicle image: the higher the quality of the input image, the better the chances of correctly extracting the number plate area. Existing ANPR methods work well for dark and bright images but not for low-contrast, blurred and noisy images; for these, detection of the exact number plate area fails even after existing filtering and enhancement techniques are applied. Because the number plate area is extracted incorrectly, character segmentation and character recognition also fail in these cases. To overcome these drawbacks, I propose an efficient approach to ANPR in which the input vehicle image is first pre-processed by iterative bilateral filtering and adaptive histogram equalization, and the number plate is then extracted from the pre-processed image using morphological operations, image subtraction, image binarization/thresholding, Sobel vertical edge detection and bounding box analysis. The extracted plate area sometimes also contains noise, bolts, frames, etc., so it is enhanced using morphological operations to improve the quality of

  12. Viral recognition by the innate immune system: the role of pattern recognition receptors

    OpenAIRE

    Silvia Torres Pedraza; Juan Guillermo Betancur; Silvio Urcuqui-Inchima

    2011-01-01

    Pattern recognition receptors are the main sensors of the innate immune response. Their function is to recognize pathogen-associated molecular patterns, which are molecules essential for the survival of microbial pathogens, but are not produced by the host. The recognition of pathogen-associated molecular patterns by pattern recognition receptors leads to the expression of cytokines, chemokines, and co-stimulatory molecules that eliminate pathogens, such as viruses, for the activation of anti...

  13. Applications of Artificial Intelligence in Voice Recognition Systems in Micro-Computers.

    Science.gov (United States)

    1982-03-01

    DELTAO THEN 1290 1050 IF ANS$(I) = "HAIN MENU THEN 320 1060 IF ANS$(I) - " ABORTO THEN 3150 1070 IF ANS$(I) - 󈧄 BACK’ THEN 3590 1080 NEXT I 1090... ABORTO THEN 3150 1660 NEXT I 1670 SOTO 3350 3 REM’ ERROR PACK 1680 STOP 1690 REM SHIPS MENU 1700 REM------------ 1710 HOME : VTAB 5 :HTAB 15 :PRINT...IF ANS*(I) - PROFILESO THEN 3100 2470 IF IS$(I) - "MIN MENU" THEN 320 24Sf IF NB$(I) - "G0 BACK" THEN 3590 2490 IF ANS$(I) - " ABORTO THEN 3150 2500

  14. Voice Recognition Vocabulary Lists for the Army’s TACFIRE System.

    Science.gov (United States)

    1983-01-01

    aie missi-on f ir ed c1ddtt c 212 Pa n. text cddddddrrtt 213 Pian text ipessage cdddtttttt 214 Vertical shift dd.-tt 92 2ozd MaAZ qr_ $p_okD 9atPS...cdddddttttttt:Xc 178 Rouzds.;impacted cddddddt 179 Date time group cdddddttt6 180 Caliber cdddddttttttt 181 Vertical zhift cdddttt 182 Tropical...Observ ~ion potcdddddtrrrrrrrOPc 134 Patrol ps cdlddddtrrrrrPTLc 135 WrpatcdddddtzrrrrrrWKPTlc 136 Wort partoe cdddddtrrrrrrrAPERSc ,at ers ne cdddtr

  15. Plant systems for recognition of pathogen-associated molecular patterns.

    Science.gov (United States)

    Postel, Sandra; Kemmerling, Birgit

    2009-12-01

    Research of the last decade has revealed that plant immunity consists of different layers of defense that have evolved through the co-evolutionary battle of plants with their pathogens. Particular light has been shed on PAMP- (pathogen-associated molecular pattern) triggered immunity (PTI) mediated by pattern recognition receptors. Striking similarities exist between the plant and animal innate immune systems that point to a common optimized mechanism that has evolved independently in both kingdoms. Pattern recognition receptors (PRRs) from both kingdoms consist of leucine-rich repeat receptor complexes that allow recognition of invading pathogens at the cell surface. In plants, PRRs like FLS2 and EFR are controlled by a co-receptor SERK3/BAK1, also a leucine-rich repeat receptor that dimerizes with the PRRs to support their function. Pathogens can inject effector proteins into the plant cells to suppress the immune responses initiated after perception of PAMPs by PRRs via inhibition or degradation of the receptors. Plants have acquired the ability to recognize the presence of some of these effector proteins, which leads to a quick and hypersensitive response to arrest and terminate pathogen growth.

  16. Image analysis in automatic system of pollen recognition

    Directory of Open Access Journals (Sweden)

    Piotr Rapiejko

    2012-12-01

    In allergology practice and research, it would be convenient to obtain pollen identification and monitoring results in a much shorter time than human identification allows. Image-based analysis is one approach to automated identification of pollen grains, and pattern recognition on such images is widely used as a powerful tool. The goal is to provide accurate, fast recognition, classification and counting of pollen grains by a computer system for monitoring. Isolated pollen grains are objects extracted from a microscopic image acquired with a CCD camera and a PC under proper conditions for further analysis. The algorithms are based on feature vector analysis of estimated parameters calculated from grain characteristics, including morphological features, surface features and other applicable estimated characteristics. Segmentation algorithms specially tailored to pollen object characteristics provide exact descriptions of the pollen characteristics (border and internal features) already used by human experts. The specific characteristics and their measures are statistically estimated for each object. Low-level statistics of the estimated local and global measures of the features establish the feature space. Special care should be taken in choosing these features and in constructing the feature space, to optimize the number of subspaces for higher recognition rates in low-level classification for type differentiation of pollen grains. Estimated feature vector parameters in a low-dimensional space are presented for some typical pollen types, together with effective and fast recognition results from experiments on different pollens. The findings provide evidence that properly chosen estimators of central and invariant moments (M21, NM2, NM3, NM8, NM9) of tailored characteristics give good enough classification measures (efficiency > 95%), even for low-dimensional classifiers.

  17. Human-inspired sound environment recognition system for assistive vehicles

    Science.gov (United States)

    González Vidal, Eduardo; Fredes Zarricueta, Ernesto; Auat Cheein, Fernando

    2015-02-01

    Objective. The human auditory system acquires environmental information under sound stimuli faster than visual or touch systems, which in turn, allows for faster human responses to such stimuli. It also complements senses such as sight, where direct line-of-view is necessary to identify objects, in the environment recognition process. This work focuses on implementing human reaction to sound stimuli and environment recognition on assistive robotic devices, such as robotic wheelchairs or robotized cars. These vehicles need environment information to ensure safe navigation. Approach. In the field of environment recognition, range sensors (such as LiDAR and ultrasonic systems) and artificial vision devices are widely used; however, these sensors depend on environment constraints (such as lighting variability or color of objects), and sound can provide important information for the characterization of an environment. In this work, we propose a sound-based approach to enhance the environment recognition process, mainly for cases that compromise human integrity, according to the International Classification of Functioning (ICF). Our proposal is based on a neural network implementation that is able to classify up to 15 different environments, each selected according to the ICF considerations on environment factors in the community-based physical activities of people with disabilities. Main results. The accuracy rates in environment classification range from 84% to 93%. This classification is later used to constrain assistive vehicle navigation in order to protect the user during daily activities. This work also includes real-time outdoor experimentation (performed on an assistive vehicle) by seven volunteers with different disabilities (but without cognitive impairment and experienced in the use of wheelchairs), statistical validation, comparison with previously published work, and a discussion section where the pros and cons of our system are evaluated. Significance

  18. Human-inspired sound environment recognition system for assistive vehicles.

    Science.gov (United States)

    Vidal, Eduardo González; Zarricueta, Ernesto Fredes; Cheein, Fernando Auat

    2015-02-01

    The human auditory system acquires environmental information under sound stimuli faster than visual or touch systems, which in turn, allows for faster human responses to such stimuli. It also complements senses such as sight, where direct line-of-view is necessary to identify objects, in the environment recognition process. This work focuses on implementing human reaction to sound stimuli and environment recognition on assistive robotic devices, such as robotic wheelchairs or robotized cars. These vehicles need environment information to ensure safe navigation. In the field of environment recognition, range sensors (such as LiDAR and ultrasonic systems) and artificial vision devices are widely used; however, these sensors depend on environment constraints (such as lighting variability or color of objects), and sound can provide important information for the characterization of an environment. In this work, we propose a sound-based approach to enhance the environment recognition process, mainly for cases that compromise human integrity, according to the International Classification of Functioning (ICF). Our proposal is based on a neural network implementation that is able to classify up to 15 different environments, each selected according to the ICF considerations on environment factors in the community-based physical activities of people with disabilities. The accuracy rates in environment classification range from 84% to 93%. This classification is later used to constrain assistive vehicle navigation in order to protect the user during daily activities. This work also includes real-time outdoor experimentation (performed on an assistive vehicle) by seven volunteers with different disabilities (but without cognitive impairment and experienced in the use of wheelchairs), statistical validation, comparison with previously published work, and a discussion section where the pros and cons of our system are evaluated. The proposed sound-based system is very efficient

  19. Military personnel recognition system using texture, colour, and SURF features

    Science.gov (United States)

    Irhebhude, Martins E.; Edirisinghe, Eran A.

    2014-06-01

    This paper presents an automatic, machine-vision-based military personnel identification and classification system. Classification is done using a Support Vector Machine (SVM) on datasets of personnel in Army, Air Force and Navy camouflage uniforms. In the proposed system, the arm of service of personnel is recognised by the camouflage of a person's uniform, the type of cap and the type of badge/logo. The detailed analysis includes: camouflage cap and plain cap differentiation using gray level co-occurrence matrix (GLCM) texture features; classification of Army, Air Force and Navy camouflaged uniforms using GLCM texture and colour histogram bin features; and plain cap badge classification into Army, Air Force and Navy using Speeded-Up Robust Features (SURF). The proposed method recognised the arm of service of camouflaged personnel on sets of data retrieved from Google Images and selected military websites. Correlation-based Feature Selection (CFS) was used to improve recognition and reduce dimensionality, thereby speeding up the classification process. Success rates recorded during the analysis include 93.8% for the camouflage appearance category and 100%, 90% and 100% for the plain cap and camouflage cap categories for Army, Air Force and Navy, respectively. Accurate recognition was recorded using SURF for the plain cap badge category. Substantial analysis has been carried out, and the results show that the proposed method can correctly classify military personnel into various arms of service. We show that the proposed method can be integrated into a face recognition system, which will recognise personnel in addition to determining the arm of service to which they belong. Such a system can be used to enhance the security of a military base or facility.
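
The gray level co-occurrence matrix (GLCM) texture feature named in this abstract can be sketched as follows. This is a minimal, non-symmetric GLCM for a single pixel offset plus one derived statistic (contrast); the 4-level toy image, the offset, and the function names are assumptions for illustration and do not reproduce the authors' feature set.

```python
def glcm(image, dx=1, dy=0, levels=4):
    """Count how often gray level i is followed by gray level j at offset (dx, dy)."""
    matrix = [[0] * levels for _ in range(levels)]
    rows, cols = len(image), len(image[0])
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dy, c + dx
            if 0 <= r2 < rows and 0 <= c2 < cols:
                matrix[image[r][c]][image[r2][c2]] += 1
    return matrix


def contrast(matrix):
    """GLCM contrast: mean squared gray-level difference of co-occurring pixels."""
    total = sum(sum(row) for row in matrix)
    weighted = sum((i - j) ** 2 * count
                   for i, row in enumerate(matrix)
                   for j, count in enumerate(row))
    return weighted / total


# Hypothetical 4x4 image quantized to 4 gray levels (0..3).
patch = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 2, 2, 2],
    [2, 2, 3, 3],
]

horizontal = glcm(patch)           # right-hand neighbour, offset (1, 0)
texture_contrast = contrast(horizontal)
```

A smooth plain cap would concentrate counts on the diagonal of the matrix and give low contrast, while a camouflage pattern spreads counts off the diagonal, which is the kind of difference a classifier such as an SVM can use.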

  20. Research on Text-independence for Voiceprint Recognition System

    Institute of Scientific and Technical Information of China (English)

    霍春宝; 张彩娟; 赵红敏

    2013-01-01

    Voiceprint recognition is divided into text-dependent and text-independent types according to the recognition mode. This work studies text-independent voiceprint recognition. To improve the recognition rate of the system, a recognition algorithm that combines multiple feature parameters is proposed and implemented. The algorithm uses the combination of LPCC and MFCC as the feature parameters and applies it in a voiceprint recognition system. Experimental results show that the combined feature parameters give better recognition than either parameter alone, because they make full use of both the correlation properties of the speech signal and the perceptual characteristics of human hearing.

  1. Iris Recognition System Using Fractal Dimensions of Haar Patterns

    Directory of Open Access Journals (Sweden)

    Patnala S. R. Chandra Murty

    2009-09-01

    Classification of iris templates based on their texture patterns is one of the most effective methods in iris recognition systems. This paper proposes a novel algorithm for automatic iris classification based on fractal dimensions of Haar wavelet transforms. Fractal dimensions obtained from multiple scale features are used to characterize the textures completely. The Haar wavelet is applied in order to extract the multiple scale features at different resolutions from the iris image. Fractal dimensions are estimated from these patterns, and a classifier is used to recognize the given image from a database. A performance comparison was made among different classifiers.

  2. A Development of Hybrid Drug Information System Using Image Recognition

    Directory of Open Access Journals (Sweden)

    HwaMin Lee

    2015-04-01

    In order to prevent drug abuse or misuse and avoid over-prescription, people taking medicine need to be provided with detailed information about it. In this paper, we propose a drug information system and develop an application that provides information through drug image recognition using a smartphone. We designed a content-based drug image search algorithm using the color, shape and imprint of the drug. Our convenient application can provide users with detailed information about drugs and prevent drug misuse.

  3. Autonomous Multiple Gesture Recognition System for Disabled People

    Directory of Open Access Journals (Sweden)

    Amarjot Singh

    2014-01-01

    The paper presents an intelligent multi-gesture spotting system that disabled people can use to communicate easily with machines, easing day-to-day tasks. The system uses pose estimation for 10 signs used by hearing-impaired people to communicate. Pose is extracted from silhouettes using the timed motion history image (tMHI), followed by gesture recognition with Hu moments. Signs involving motion are recognized with the help of optical flow. Based on the recognized gestures, particular instructions are sent to the robot connected to the system, resulting in an appropriate action or movement by the robot. The system is unique in that it can act as an assistive device and can communicate over local as well as wide areas to assist the disabled person.

  4. Matrix sentence intelligibility prediction using an automatic speech recognition system.

    Science.gov (United States)

    Schädler, Marc René; Warzybok, Anna; Hochmuth, Sabine; Kollmeier, Birger

    2015-01-01

    The feasibility of predicting the outcome of the German matrix sentence test for different types of stationary background noise using an automatic speech recognition (ASR) system was studied. Speech reception thresholds (SRT) of 50% intelligibility were predicted in seven noise conditions. The ASR system used Mel-frequency cepstral coefficients as a front-end and employed whole-word Hidden Markov models on the back-end side. The ASR system was trained and tested with noisy matrix sentences on a broad range of signal-to-noise ratios. The ASR-based predictions were compared to data from the literature ( Hochmuth et al, 2015 ) obtained with 10 native German listeners with normal hearing and predictions of the speech intelligibility index (SII). The ASR-based predictions showed a high and significant correlation (R² = 0.95, p speech and noise signals. Minimum assumptions were made about human speech processing already incorporated in a reference-free ordinary ASR system.

  5. Electronic system with memristive synapses for pattern recognition

    Science.gov (United States)

    Park, Sangsu; Chu, Myonglae; Kim, Jongin; Noh, Jinwoo; Jeon, Moongu; Hun Lee, Byoung; Hwang, Hyunsang; Lee, Boreom; Lee, Byung-Geun

    2015-05-01

    Memristive synapses, the most promising passive devices for synaptic interconnections in artificial neural networks, are the driving force behind recent research on hardware neural networks. Despite significant efforts to utilize memristive synapses, progress to date has only shown the possibility of building a neural network system that can classify simple image patterns. In this article, we report a high-density cross-point memristive synapse array with improved synaptic characteristics. The proposed PCMO-based memristive synapse exhibits the necessary gradual and symmetrical conductance changes, and has been successfully adapted to a neural network system. The system learns, and later recognizes, the human thought pattern corresponding to three vowels, i.e. /a /, /i /, and /u/, using electroencephalography signals generated while a subject imagines speaking vowels. Our successful demonstration of a neural network system for EEG pattern recognition is likely to intrigue many researchers and stimulate a new research direction.

  6. Named entity recognition for bacterial Type IV secretion systems.

    Directory of Open Access Journals (Sweden)

    Sophia Ananiadou

    Research on specialized biological systems is often hampered by a lack of consistent terminology, especially across species. In bacterial Type IV secretion systems genes within one set of orthologs may have over a dozen different names. Classifying research publications based on biological processes, cellular components, molecular functions, and microorganism species should improve the precision and recall of literature searches allowing researchers to keep up with the exponentially growing literature, through resources such as the Pathosystems Resource Integration Center (PATRIC, patricbrc.org). We developed named entity recognition (NER) tools for four entities related to Type IV secretion systems: 1) bacteria names, 2) biological processes, 3) molecular functions, and 4) cellular components. These four entities are important to pathogenesis and virulence research but have received less attention than other entities, e.g., genes and proteins. Based on an annotated corpus, large domain terminological resources, and machine learning techniques, we developed recognizers for these entities. High accuracy rates (>80%) are achieved for bacteria, biological processes, and molecular function. Contrastive experiments highlighted the effectiveness of alternate recognition strategies; results of term extraction on contrasting document sets demonstrated the utility of these classes for identifying T4SS-related documents.

  7. Named entity recognition for bacterial Type IV secretion systems.

    Science.gov (United States)

    Ananiadou, Sophia; Sullivan, Dan; Black, William; Levow, Gina-Anne; Gillespie, Joseph J; Mao, Chunhong; Pyysalo, Sampo; Kolluru, Balakrishna; Tsujii, Junichi; Sobral, Bruno

    2011-03-29

    Research on specialized biological systems is often hampered by a lack of consistent terminology, especially across species. In bacterial Type IV secretion systems genes within one set of orthologs may have over a dozen different names. Classifying research publications based on biological processes, cellular components, molecular functions, and microorganism species should improve the precision and recall of literature searches allowing researchers to keep up with the exponentially growing literature, through resources such as the Pathosystems Resource Integration Center (PATRIC, patricbrc.org). We developed named entity recognition (NER) tools for four entities related to Type IV secretion systems: 1) bacteria names, 2) biological processes, 3) molecular functions, and 4) cellular components. These four entities are important to pathogenesis and virulence research but have received less attention than other entities, e.g., genes and proteins. Based on an annotated corpus, large domain terminological resources, and machine learning techniques, we developed recognizers for these entities. High accuracy rates (>80%) are achieved for bacteria, biological processes, and molecular function. Contrastive experiments highlighted the effectiveness of alternate recognition strategies; results of term extraction on contrasting document sets demonstrated the utility of these classes for identifying T4SS-related documents.

  8. Design of household control system based on speech recognition

    Institute of Scientific and Technical Information of China (English)

    黄辉健; 程良鸿; 黄明杰; 林垣华; 李志杰

    2014-01-01

    This paper studies speaker-dependent speech recognition and control based on the Sunplus SPCE061A and applies speech recognition to a home control system. A control scheme is proposed that is simple to operate, easy to extend, and suitable for household use. The hardware composition and the software design flow of the system are analyzed. Control software for Android smartphones based on Bluetooth communication, built on the Google App Inventor platform, is also described. Practical tests show that the system successfully implements voice control of household appliances and remote control from an Android smartphone.

  9. Application of AI techniques to a voice-actuated computer system for reconstructing and displaying magnetic resonance imaging data

    Science.gov (United States)

    Sherley, Patrick L.; Pujol, Alfonso, Jr.; Meadow, John S.

    1990-07-01

    To provide a means of rendering complex computer architectures, languages, and input/output modalities transparent to experienced and inexperienced users, research is being conducted to develop a voice-driven/voice-response computer graphics imaging system. The system will be used for reconstructing and displaying computed tomography and magnetic resonance imaging scan data. In conjunction with this study, an artificial intelligence (AI) control strategy was developed to interface the voice components and support software to the computer graphics functions implemented on the Sun Microsystems 4/280 color graphics workstation. Based on generated text and converted renditions of verbal utterances by the user, the AI control strategy determines the user's intent and develops and validates a plan. The program type and parameters within the plan are used as input to the graphics system for reconstructing and displaying medical image data corresponding to that perceived intent. If the plan is not valid, the control strategy queries the user for additional information. The control strategy operates in a conversation mode and vocally provides system status reports. A detailed examination of the various AI techniques is presented, with major emphasis placed on their specific roles within the total control strategy structure.

  10. CHARACTER DETECTION AND RECOGNITION SYSTEM OF BEER BOTTLES

    Institute of Scientific and Technical Information of China (English)

    Shen Bangxing; Wu Wenjun; Zhang Yepeng; Shen Gang; Yang Liangen

    2005-01-01

    An optical imaging system and a configuration characteristic algorithm are presented to reduce the difficulties in extracting intact character images with weak contrast and in recognizing characters on fast-moving beer bottles. The system consists of a hardware subsystem, including a rotating device, a CCD, a 16 mm focus lens, a frame grabber card, penetrating lighting and a computer, and a software subsystem. The software subsystem performs pretreatment, character segmentation and character recognition. In the pretreatment, the original image is filtered with a preset threshold to remove isolated spots. Then the horizontal projection and the vertical projection are used in turn to segment the characters. Subsequently, the configuration characteristic algorithm is applied to recognize the characters. The experimental results demonstrate that this system can recognize the characters on beer bottles accurately and effectively; the algorithm is shown to be fast, stable and robust, making it suitable for the industrial environment.
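
The projection-based segmentation step described in this abstract can be sketched as follows: the horizontal projection (row sums) finds text lines, and the vertical projection (column sums) within each line finds individual characters. The binary toy image, the `min_count` parameter, and the function names are assumptions for illustration; the abstract does not specify the authors' thresholds.

```python
def runs(profile, min_count=1):
    """Return (start, end) spans, end exclusive, where profile >= min_count."""
    spans, start = [], None
    for i, value in enumerate(profile):
        if value >= min_count and start is None:
            start = i
        elif value < min_count and start is not None:
            spans.append((start, i))
            start = None
    if start is not None:
        spans.append((start, len(profile)))
    return spans


def segment_characters(binary):
    """Return (top, bottom, left, right) boxes of characters in a 0/1 image."""
    row_profile = [sum(row) for row in binary]            # horizontal projection
    boxes = []
    for top, bottom in runs(row_profile):
        band = binary[top:bottom]
        col_profile = [sum(row[c] for row in band)        # vertical projection
                       for c in range(len(band[0]))]
        for left, right in runs(col_profile):
            boxes.append((top, bottom, left, right))
    return boxes


# Hypothetical binarized image containing two small "characters".
image = [
    [0, 0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 1, 0, 0],
    [0, 1, 0, 0, 1, 1, 0],
    [0, 0, 0, 0, 0, 0, 0],
]

boxes = segment_characters(image)
```

Projection profiles only separate characters that have a clear gap between them, which is one reason the isolated-spot filtering in the pretreatment step matters: a single stray pixel in a gap would merge two neighbouring spans.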

  11. Traffic Sign Recognition System based on Cambridge Correlator Image Comparator

    Directory of Open Access Journals (Sweden)

    J. Turan

    2012-06-01

    Full Text Available This paper presents basic information about the application of an Optical Correlator (OC), specifically the Cambridge Correlator, in a system for recognizing traffic signs. The Traffic Sign Recognition System consists of three main blocks: Preprocessing, Optical Correlator, and Traffic Sign Identification. The Region of Interest (ROI) is defined and chosen in the preprocessing block and then passed to the Optical Correlator, where it is compared with a database of traffic signs. The output of the optical correlation is a correlation plane, which consists of highly localized intensities known as correlation peaks. The intensity of the spots provides a measure of similarity, and the position of the spots indicates how the images (traffic signs) are aligned relative to each other in the input scene. Several experiments were carried out with the proposed system, and the results and conclusions are discussed.
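
    The idea of a correlation plane whose peak height measures similarity and whose peak position gives alignment can be sketched digitally. This is only a software stand-in under stated assumptions: the Cambridge Correlator performs the comparison optically, and the function names, the tiny binary scene, and the template below are all hypothetical.

```python
def cross_correlate(scene, template):
    """Slide `template` over `scene` and return the correlation plane.

    Only positions where the template fits entirely inside the scene are used.
    """
    scene_h, scene_w = len(scene), len(scene[0])
    tmpl_h, tmpl_w = len(template), len(template[0])
    plane = []
    for y in range(scene_h - tmpl_h + 1):
        row = []
        for x in range(scene_w - tmpl_w + 1):
            row.append(sum(scene[y + j][x + i] * template[j][i]
                           for j in range(tmpl_h) for i in range(tmpl_w)))
        plane.append(row)
    return plane


def correlation_peak(plane):
    """Return (peak_value, (row, col)) for the brightest spot in the plane."""
    return max((value, (y, x))
               for y, row in enumerate(plane)
               for x, value in enumerate(row))


# Hypothetical template (an L-shaped mark) placed in the scene at row 1, col 2.
template = [
    [1, 0],
    [1, 1],
]
scene = [
    [0, 0, 0, 0, 0],
    [0, 0, 1, 0, 0],
    [0, 0, 1, 1, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
]
peak_value, peak_position = correlation_peak(cross_correlate(scene, template))
print(peak_value, peak_position)
```

    In a recognition setting the same scene would be correlated against every template in the sign database and the template with the strongest peak would be reported; normalizing by template energy is usually needed before peaks from different templates can be compared.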

  12. Design and implementation of a user-oriented speech recognition interface: the synergy of technology and human factors

    NARCIS (Netherlands)

    Kloosterman, Sietse H.

    1994-01-01

    The design and implementation of a user-oriented speech recognition interface are described. The interface enables the use of speech recognition in so-called interactive voice response systems which can be accessed via a telephone connection. In the design of the interface a synergy of technology

  13. DESIGN AND IMPLEMENTATION OF A USER-ORIENTED SPEECH RECOGNITION INTERFACE - THE SYNERGY OF TECHNOLOGY AND HUMAN-FACTORS

    NARCIS (Netherlands)

    KLOOSTERMAN, SH

    The design and implementation of a user-oriented speech recognition interface are described. The interface enables the use of speech recognition in so-called interactive voice response systems which can be accessed via a telephone connection. In the design of the interface a synergy of technology

  14. Business model for sensor-based fall recognition systems.

    Science.gov (United States)

    Fachinger, Uwe; Schöpke, Birte

    2014-01-01

    AAL systems require, in addition to sophisticated and reliable technology, adequate business models for their launch and sustainable establishment. This paper presents the basic features of alternative business models for a sensor-based fall recognition system which was developed within the context of the "Lower Saxony Research Network Design of Environments for Ageing" (GAL). The models were developed parallel to the R&D process with successive adaptation and concretization. An overview of the basic features (i.e. nine partial models) of the business model is given and the mutual exclusive alternatives for each partial model are presented. The partial models are interconnected and the combinations of compatible alternatives lead to consistent alternative business models. However, in the current state, only initial concepts of alternative business models can be deduced. The next step will be to gather additional information to work out more detailed models.

  15. Memristor-MOS analog correlator for pattern recognition system.

    Science.gov (United States)

    Han, Ca-Ram; Lee, Sang-Jin; Oh, Kwang-Seok; Cho, Kyoungrok

    2013-05-01

    Emergence of new materials having significant improved properties continues to influence the formulation of novel architectures and as such new developments pave the way for innovative circuits and systems such as those required in visual imaging and recognition systems. In this paper we introduce a novel approach for the design of an analog comparator suitable for pattern matching using two Memristors as part of both the stored image data as well as that of the input signal. Our proposed comparator based on Memristor-CMOS fabrication process generates a signal indicating similarity/dissimilarity between two pattern data derived from image sensor and the corresponding Memristor-based template memory. For convenience, we also present an overview of a simplified Memristor model and hence provide simulation results for comparison with that of a conventional analog CMOS comparator.

  16. Familiarity and Voice Representation: From Acoustic-Based Representation to Voice Averages

    Directory of Open Access Journals (Sweden)

    Maureen Fontaine

    2017-07-01

    Full Text Available The ability to recognize an individual from their voice is a widespread ability with a long evolutionary history. Yet, the perceptual representation of familiar voices is ill-defined. In two experiments, we explored the neuropsychological processes involved in the perception of voice identity. We specifically explored the hypothesis that familiar voices (trained-to-familiar voices in Experiment 1, and famous voices in Experiment 2) are represented as a whole complex pattern, well approximated by the average of multiple utterances produced by a single speaker. In Experiment 1, participants learned three voices over several sessions, and performed a three-alternative forced-choice identification task on original voice samples and several "speaker averages," created by morphing across varying numbers of different vowels (e.g., [a] and [i]) produced by the same speaker. In Experiment 2, the same participants performed the same task on voice samples produced by familiar speakers. The two experiments showed that for famous voices, but not for trained-to-familiar voices, identification performance increased and response times decreased as a function of the number of utterances in the averages. This study sheds light on the perceptual representation of familiar voices, and demonstrates the power of averages in recognizing familiar voices. The speaker average captures the unique characteristics of a speaker, and thus retains the information essential for recognition; it acts as a prototype of the speaker.

  17. Perturbation Measures of Voice: A Comparative Study between Multi-Dimensional Voice Program and Praat

    National Research Council Canada - National Science Library

    Maryn, Youri; Corthals, Paul; De Bodt, Marc; Van Cauwenberge, Paul; Deliyski, Dimitar

    2009-01-01

    .... In the present study, perturbation measures provided by two computer systems (a purpose-built professional voice analysis apparatus and a personal computer-based system for acoustic voice assessment...

  18. Pattern-Recognition System for Approaching a Known Target

    Science.gov (United States)

    Huntsberger, Terrance; Cheng, Yang

    2008-01-01

    A closed-loop pattern-recognition system is designed to provide guidance for maneuvering a small exploratory robotic vehicle (rover) on Mars to return to a landed spacecraft to deliver soil and rock samples that the spacecraft would subsequently bring back to Earth. The system could be adapted to terrestrial use in guiding mobile robots to approach known structures that humans could not approach safely, for such purposes as reconnaissance in military or law-enforcement applications, terrestrial scientific exploration, and removal of explosive or other hazardous items. The system has been demonstrated in experiments in which the Field Integrated Design and Operations (FIDO) rover (a prototype Mars rover equipped with a video camera for guidance) is made to return to a mockup of Mars-lander spacecraft. The FIDO rover camera autonomously acquires an image of the lander from a distance of 125 m in an outdoor environment. Then under guidance by an algorithm that performs fusion of multiple line and texture features in digitized images acquired by the camera, the rover traverses the intervening terrain, using features derived from images of the lander truss structure. Then by use of precise pattern matching for determining the position and orientation of the rover relative to the lander, the rover aligns itself with the bottom of ramps extending from the lander, in preparation for climbing the ramps to deliver samples to the lander. The most innovative aspect of the system is a set of pattern-recognition algorithms that govern a three-phase visual-guidance sequence for approaching the lander. During the first phase, a multifeature fusion algorithm integrates the outputs of a horizontal-line-detection algorithm and a wavelet-transform-based visual-area-of-interest algorithm for detecting the lander from a significant distance. The horizontal-line-detection algorithm is used to determine candidate lander locations based on detection of a horizontal deck that is part of the

  19. Entrance C - New Automatic Number Plate Recognition System

    CERN Multimedia

    2013-01-01

    Entrance C (Satigny) is now equipped with a latest-generation Automatic Number Plate Recognition (ANPR) system and a fast-action road gate.   During the month of August, Entrance C will be continuously open from 7.00 a.m. to 7.00 p.m. (working days only). The security guards will open the gate as usual from 7.00 a.m. to 9.00 a.m. and from 5.00 p.m. to 7.00 p.m. For the rest of the working day (9.00 a.m. to 5.00 p.m.) the gate will operate automatically. Please observe the following points:       Stop at the STOP sign on the ground     Position yourself next to the card reader for optimal recognition     Motorcyclists must use their CERN card     Cyclists may not activate the gate and should use the bicycle turnstile     Keep a safe distance from the vehicle in front of you   If access is denied, please check that your vehicle regist...

  1. WIFI-based Telephone Voice Monitoring System

    Institute of Scientific and Technical Information of China (English)

    苏建志; 杨惠山

    2016-01-01

    This article designs a new wireless telephone voice monitoring system based on a WIFI module. The system samples and records telephone voice using the SC6138 main control chip produced by SiLan Company. The telephone voice signal is saved as files on storage media such as SD cards and USB flash drives. At the same time, the stored voice files are transferred in real time to the remote monitoring terminal through the WIFI module when needed. First, the hardware circuit of the system is designed and the working principle of each function module is explained in detail. Then the software for the major operational modules and its flow chart are designed. The test results show that the system is highly mobile and that, without any host machine, it can store voice content and be monitored remotely via WIFI.

  2. Does knowing speaker sex facilitate vowel recognition at short durations?

    Science.gov (United States)

    Smith, David R R

    2014-05-01

    A man, woman or child saying the same vowel do so with very different voices. The auditory system solves the complex problem of extracting what the man, woman or child has said despite substantial differences in the acoustic properties of their voices. Much of the acoustic variation between the voices of men and women is due to changes in the underlying anatomical mechanisms for producing speech. If the auditory system knew the sex of the speaker then it could potentially correct for speaker sex related acoustic variation thus facilitating vowel recognition. This study measured the minimum stimulus duration necessary to accurately discriminate whether a brief vowel segment was spoken by a man or woman, and the minimum stimulus duration necessary to accurately recognise what vowel was spoken. Results showed that reliable vowel recognition precedes reliable speaker sex discrimination, thus questioning the use of speaker sex information in compensating for speaker sex related acoustic variation in the voice. Furthermore, the pattern of performance across experiments where the fundamental frequency and formant frequency information of speakers' voices were systematically varied, was markedly different depending on whether the task was speaker-sex discrimination or vowel recognition. This argues for there being little relationship between perception of speaker sex (indexical information) and perception of what has been said (linguistic information) at short durations. Copyright © 2014 Elsevier B.V. All rights reserved.

  3. Keeping Your Voice Healthy

    Science.gov (United States)

    ... Find an ENT Doctor Near You Keeping Your Voice Healthy Keeping Your Voice Healthy Patient Health Information ... heavily voice-related. Key Steps for Keeping Your Voice Healthy Drink plenty of water. Moisture is good ...

  4. Robot Speech Recognition System Based on Julius

    Institute of Scientific and Technical Information of China (English)

    付维; 刘冬; 闵华松

    2011-01-01

    With the continuous development of robot technology, this paper proposes robot speech recognition as an intelligent form of human-computer interaction. After studying the basic principles of HMM-based speech recognition, an isolated-word speech recognition system was built on the laboratory's robot platform using the open-source HTK and Julius platforms. With this speech recognition system, voice commands can be extracted for robot control.

  5. Design of NodeJS-based Smart Home Voice-control System Server

    Institute of Scientific and Technical Information of China (English)

    单振华; 王舒憬; 陈凯; 强杰

    2016-01-01

    This paper introduces the overall structure, technical means, and implementation of the main function modules of the server side of a NodeJS-based smart home voice control system. The server side mainly implements real-time speech recognition: it sends the received voice data to the Baidu speech cloud for recognition and returns the recognition result to the client. In addition, NodeJS uses event-driven, asynchronous programming, and its outstanding advantages enable the program to handle high concurrency. The non-blocking I/O model of NodeJS gives it high performance and excellent load capacity at relatively low system resource consumption.

  6. A Voice Modification System for Book Reading Robot

    Institute of Scientific and Technical Information of China (English)

    邓杰; 房宁; 赵群飞

    2012-01-01

    Voice modification based on a single speech database is proposed in order to increase the voice diversity of the book reading robot (JoyTon). The original sounds synthesized by the book reading robot's TTS (text-to-speech) engine are decomposed into an excitation signal and a vocal tract filter signal, both of which are modified in the frequency domain. Excitation reconstruction from the short-time Fourier transform magnitude and modification of the vocal tract filter parameters are used to change tempo, pitch, and timbre. The modified excitation and vocal tract filter are then resynthesized to generate new voice signals. This voice modification system lets the book reading robot read a text with a variety of emotions and tones without extra memory for the speech library.

  7. Automated Degradation Diagnosis in Character Recognition System Subject to Camera Vibration

    Directory of Open Access Journals (Sweden)

    Chunmei Liu

    2014-01-01

    Full Text Available Degradation diagnosis plays an important role for degraded character processing, which can tell the recognition difficulty of a given degraded character. In this paper, we present a framework for automated degraded character recognition system by statistical syntactic approach using 3D primitive symbol, which is integrated by degradation diagnosis to provide accurate and reliable recognition results. Our contribution is to design the framework to build the character recognition submodels corresponding to degradation subject to camera vibration or out of focus. In each character recognition submodel, statistical syntactic approach using 3D primitive symbol is proposed to improve degraded character recognition performance. In the experiments, we show attractive experimental results, highlighting the system efficiency and recognition performance by statistical syntactic approach using 3D primitive symbol on the degraded character dataset.

  8. Face recognition system and method using face pattern words and face pattern bytes

    Energy Technology Data Exchange (ETDEWEB)

    Zheng, Yufeng

    2014-12-23

    The present invention provides a novel system and method for identifying individuals and for face recognition utilizing facial features for face identification. The system and method of the invention comprise creating facial features or face patterns called face pattern words and face pattern bytes for face identification. The invention also provides for pattern recognitions for identification other than face recognition. The invention further provides a means for identifying individuals based on visible and/or thermal images of those individuals by utilizing computer software implemented by instructions on a computer or computer system and a computer readable medium containing instructions on a computer system for face recognition and identification.

  9. Multimodal recognition of emotions

    NARCIS (Netherlands)

    Datcu, D.

    2009-01-01

    This thesis proposes algorithms and techniques to be used for automatic recognition of six prototypic emotion categories by computer programs, based on the recognition of facial expressions and emotion patterns in voice. Considering the applicability in real-life conditions, the research is carried

  10. Using Face Recognition System in Ship Protection Process

    Directory of Open Access Journals (Sweden)

    Miroslav Bača

    2006-03-01

    Full Text Available The process of security improvement is a huge problem, especially in large ships. Terrorist attacks and everyday threats against life and property destroy transport and tourist companies, especially large tourist ships. Every person on a ship can be recognized and identified using something that the person knows or by means of something the person possesses. The best results will be obtained by using a combination of the person's knowledge with one biometric characteristic. Analyzing the problem of biometrics in ITS security, we can conclude that a face recognition process supported by one or two traditional biometric characteristics can give very good results regarding ship security. In this paper we will describe a biometric system based on face recognition. Special focus will be given to crew members' biometric security in crisis situations such as kidnapping, robbery or illness.

  11. An overview of the SPHINX speech recognition system

    Science.gov (United States)

    Lee, Kai-Fu; Hon, Hsiao-Wuen; Reddy, Raj

    1990-01-01

    A description is given of SPHINX, a system that demonstrates the feasibility of accurate, large-vocabulary, speaker-independent, continuous speech recognition. SPHINX is based on discrete hidden Markov models (HMMs) with linear-predictive-coding derived parameters. To provide speaker independence, knowledge was added to these HMMs in several ways: multiple codebooks of fixed-width parameters, and an enhanced recognizer with carefully designed models and word-duration modeling. To deal with coarticulation in continuous speech, yet still adequately represent a large vocabulary, two new subword speech units are introduced: function-word-dependent phone models and generalized triphone models. With grammars of perplexity 997, 60, and 20, SPHINX attained word accuracies of 71, 94, and 96 percent, respectively, on a 997-word task.

  12. FaceID: A face detection and recognition system

    Energy Technology Data Exchange (ETDEWEB)

    Shah, M.B.; Rao, N.S.V.; Olman, V.; Uberbacher, E.C.; Mann, R.C.

    1996-12-31

    A face detection system that automatically locates faces in gray-level images is described. Also described is a system which matches a given face image with faces in a database. Face detection in an Image is performed by template matching using templates derived from a selected set of normalized faces. Instead of using original gray level images, vertical gradient images were calculated and used to make the system more robust against variations in lighting conditions and skin color. Faces of different sizes are detected by processing the image at several scales. Further, a coarse-to-fine strategy is used to speed up the processing, and a combination of whole face and face component templates are used to ensure low false detection rates. The input to the face recognition system is a normalized vertical gradient image of a face, which is compared against a database using a set of pretrained feedforward neural networks with a winner-take-all fuser. The training is performed by using an adaptation of the backpropagation algorithm. This system has been developed and tested using images from the FERET database and a set of images obtained from Rowley, et al and Sung and Poggio.

  13. A preliminary analysis of human factors affecting the recognition accuracy of a discrete word recognizer for C3 systems

    Science.gov (United States)

    Yellen, H. W.

    1983-03-01

    Literature pertaining to Voice Recognition abounds with information relevant to the assessment of transitory speech recognition devices. In the past, engineering requirements have dictated the path this technology followed. But other factors do exist that influence recognition accuracy. This thesis explores the impact of Human Factors on the successful recognition of speech, principally addressing the differences or variability among users. A Threshold Technology T-600 was used with a 100-utterance vocabulary to test 44 subjects. A statistical analysis was conducted on 5 generic categories of Human Factors: Occupational, Operational, Psychological, Physiological and Personal. How the equipment is trained and the experience level of the speaker were found to be key characteristics influencing recognition accuracy. To a lesser extent computer experience, time or week, accent, vital capacity and rate of air flow, speaker cooperativeness and anxiety were found to affect overall error rates.

  14. Digital image pattern recognition system using normalized Fourier transform and normalized analytical Fourier-Mellin transform

    Science.gov (United States)

    Vélez-Rábago, Rodrigo; Solorza-Calderón, Selene; Jordan-Aramburo, Adina

    2016-12-01

    This work presents an image pattern recognition system invariant to translation, scale and rotation. The system uses the Fourier transform to achieve invariance to translation and the analytical Fourier-Mellin transform for invariance to scale and rotation. According to the statistical theory of box plots, the pattern recognition system has a confidence level of at least 95.4%.
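
    The translation invariance attributed to the Fourier transform in this abstract comes from a standard property: shifting a signal only changes the phase of its spectrum, so the magnitude stays the same. The sketch below checks this on a 1D signal with a circular shift; the signal values and the helper name `dft_magnitude` are hypothetical, and the paper's normalization and Fourier-Mellin stages are not reproduced here.

```python
import cmath


def dft_magnitude(signal):
    """Return the magnitude of the discrete Fourier transform of `signal`."""
    n = len(signal)
    return [
        abs(sum(signal[t] * cmath.exp(-2j * cmath.pi * k * t / n)
                for t in range(n)))
        for k in range(n)
    ]


signal = [0, 1, 3, 2, 0, 0, 0, 0]
shifted = signal[-3:] + signal[:-3]   # same pattern, circularly shifted by 3

original_spectrum = dft_magnitude(signal)
shifted_spectrum = dft_magnitude(shifted)

# The two magnitude spectra agree up to floating-point rounding.
largest_gap = max(abs(a - b) for a, b in zip(original_spectrum, shifted_spectrum))
print(largest_gap < 1e-9)
```

    The same argument carries over to 2D images, which is why a descriptor built from the spectrum magnitude does not change when the object moves within the frame.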

  15. 42 CFR 403.322 - Termination of agreements for Medicare recognition of State systems.

    Science.gov (United States)

    2010-10-01

    ... 42 Public Health 2 2010-10-01 2010-10-01 false Termination of agreements for Medicare recognition of State systems. 403.322 Section 403.322 Public Health CENTERS FOR MEDICARE & MEDICAID SERVICES... Reimbursement Control Systems § 403.322 Termination of agreements for Medicare recognition of State systems....

  16. Poka Yoke system based on image analysis and object recognition

    Science.gov (United States)

    Belu, N.; Ionescu, L. M.; Misztal, A.; Mazăre, A.

    2015-11-01

    Poka Yoke is a quality management method aimed at preventing faults from arising during production processes. It deals with "fail-safing" or "mistake-proofing". The Poka Yoke concept was generated and developed by Shigeo Shingo for the Toyota Production System. Poka Yoke is used in many fields, especially in monitoring production processes. In many cases, identifying faults in a production process involves a higher cost than the cost of disposal. Usually, Poka Yoke solutions are based on multiple sensors that identify some nonconformities. This means the presence of different equipment (mechanical, electronic) on the production line. As a consequence, and because the method itself is invasive and affects the production process, the cost of diagnostics increases. The machines through which a Poka Yoke system can be implemented become bulkier and more sophisticated. In this paper we propose a solution for the Poka Yoke system based on image analysis and identification of faults. The solution consists of a module for image acquisition, mid-level processing and an object recognition module using associative memory (Hopfield network type). All are integrated into an embedded system with an AD (Analog to Digital) converter and Zynq 7000 (22 nm technology).
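
    A Hopfield-type associative memory, as named in this abstract, stores reference patterns and pulls a noisy input toward the closest stored one. The following is a minimal textbook sketch with Hebbian weights and asynchronous updates, not the authors' embedded implementation; the two 8-element patterns standing in for "conforming" and "faulty" part images are hypothetical.

```python
def train(patterns):
    """Build Hebbian weights for patterns made of +1/-1 values (no self-connections)."""
    n = len(patterns[0])
    weights = [[0.0] * n for _ in range(n)]
    for pattern in patterns:
        for i in range(n):
            for j in range(n):
                if i != j:
                    weights[i][j] += pattern[i] * pattern[j] / n
    return weights


def recall(weights, state, sweeps=5):
    """Update one unit at a time for a fixed number of sweeps and return the state."""
    state = list(state)
    for _ in range(sweeps):
        for i in range(len(state)):
            field = sum(weights[i][j] * state[j] for j in range(len(state)))
            state[i] = 1 if field >= 0 else -1
    return state


# Hypothetical reference patterns (for example, flattened binary part images).
conforming = [1, 1, 1, 1, -1, -1, -1, -1]
faulty = [1, -1, 1, -1, 1, -1, 1, -1]
weights = train([conforming, faulty])

# An observed image of a conforming part with one corrupted pixel.
observed = [-1, 1, 1, 1, -1, -1, -1, -1]
restored = recall(weights, observed)
print(restored == conforming)
```

    The capacity of such a network is limited to a small fraction of the number of units, so a practical system would need far larger patterns than this toy example.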

  17. Human Iris Recognition System using Wavelet Transform and LVQ

    Energy Technology Data Exchange (ETDEWEB)

    Lee, Kwan Yong; Lim, Shin Young [Electronics and Telecommunications Research Institute (Korea); Cho, Seong Won [Hongik University (Korea)

    2000-07-01

    The popular methods to check the identity of individuals include passwords and ID cards. These conventional methods for user identification and authentication are not altogether reliable because they can be stolen or forgotten. As an alternative to the existing methods, biometric technology has received much attention for the last few decades. In this paper, we propose an efficient system for recognizing the identity of a living person by analyzing iris patterns, which have a higher level of stability and distinctiveness than other biometric measurements. The proposed system is based on wavelet transform and a competitive neural network with improved mechanisms. After preprocessing the iris data acquired through a CCD camera, feature vectors are extracted by using the Haar wavelet transform. LVQ (Learning Vector Quantization) is exploited to classify these feature vectors. We improve the overall performance of the proposed system by optimizing the size of feature vectors and by introducing an efficient initialization of the weight vectors and a new method for determining the winner in order to increase the recognition accuracy of LVQ. From the experiments, we confirmed that the proposed system has great potential for being applied to real applications in an efficient and effective way. (author). 14 refs., 13 figs., 7 tabs.
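
    The two building blocks named in this abstract, Haar wavelet feature extraction and LVQ classification, can be sketched in their plain textbook forms. This assumes one averaging Haar level and the basic LVQ1 update rule; the improved initialization and winner selection that the authors describe are not reproduced, and all data, labels, and function names below are hypothetical.

```python
def haar_step(signal):
    """One Haar level: pairwise averages followed by pairwise half-differences."""
    pairs = list(zip(signal[0::2], signal[1::2]))
    return [(a + b) / 2 for a, b in pairs] + [(a - b) / 2 for a, b in pairs]


def nearest(prototypes, x):
    """Index of the prototype closest to x (squared Euclidean distance)."""
    return min(range(len(prototypes)),
               key=lambda k: sum((p - v) ** 2 for p, v in zip(prototypes[k][0], x)))


def lvq1_train(prototypes, samples, lr=0.1, epochs=20):
    """LVQ1: pull the winning prototype toward a same-class sample, push it away otherwise."""
    protos = [(list(vector), label) for vector, label in prototypes]
    for _ in range(epochs):
        for x, label in samples:
            vector, proto_label = protos[nearest(protos, x)]
            sign = 1 if proto_label == label else -1
            for d in range(len(vector)):
                vector[d] += sign * lr * (x[d] - vector[d])
    return protos


def classify(prototypes, x):
    """Label of the nearest prototype."""
    return prototypes[nearest(prototypes, x)][1]


print(haar_step([4, 2, 6, 8]))

# Hypothetical 2D feature vectors for two enrolled identities "A" and "B".
samples = [([0.0, 0.0], "A"), ([0.2, 0.1], "A"), ([1.0, 1.0], "B"), ([0.9, 1.1], "B")]
initial = [([0.4, 0.4], "A"), ([0.6, 0.6], "B")]
trained = lvq1_train(initial, samples)
print(classify(trained, [0.1, 0.0]), classify(trained, [1.0, 0.9]))
```

    In a full pipeline the feature vectors would come from repeated Haar decomposition of the normalized iris image rather than from hand-written numbers.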

  18. Design and implementation of E-Kanban voice system based on Wifi

    Institute of Scientific and Technical Information of China (English)

    赵慧元; 孙鲁; 苏禹; 彭文亮

    2012-01-01

    E-Kanban is widely used in industrial production. This paper introduces a new type of ARM-based embedded industrial E-Kanban. Compared with traditional E-Kanban systems, it adds IC card authentication, video monitoring, and voice calls. Voice calls strengthen on-site management, and voice broadcast is also implemented, which can provide real-time training for on-site staff. The voice system consists of two parts, a host computer and an embedded system; voice is compressed with ADPCM and transmitted over Wifi. Testing shows that the voice output is clear and free of delay.

  19. Vermont STep Ahead Recognition System: QRS Profile. The Child Care Quality Rating System (QRS) Assessment

    Science.gov (United States)

    Child Trends, 2010

    2010-01-01

    This paper presents a profile of Vermont's STep Ahead Recognition System (STARS) prepared as part of the Child Care Quality Rating System (QRS) Assessment Study. The profile consists of several sections and their corresponding descriptions including: (1) Program Information; (2) Rating Details; (3) Quality Indicators for All Child Care Programs;…

  20. Contextual System of Symbol Structural Recognition based on an Object-Process Methodology

    OpenAIRE

    Delalandre, Mathieu

    2005-01-01

    We present in this paper a symbol recognition system for the graphic documents. This one is based on a contextual approach for symbol structural recognition exploiting an Object-Process Methodology. It uses a processing library composed of structural recognition processings and contextual evaluation processings. These processings allow our system to deal with the multi-representation of symbols. The different processings are controlled, in an automatic way, by an inference engine during the r...

  1. Modularized reconfigurable system for target recognition with multi-DSP processing

    Science.gov (United States)

    Li, Yun; Li, Huili; Xie, Xiaoming

    2013-10-01

    A modularized reconfigurable system for target recognition with multi-DSP processing is designed to reconfigure the target recognition modules and update the distributed target feature libraries through the serial channel to adapt to the varied application. The system is separated into three independent modules and two work modes running at different time slides based on project switch. The modularized reconfiguration module is designed as a minimum security kernel separated from the target recognition module to decrease their coupling and interrelationship. This kind of multi-project design based on cyclic redundancy check presents a more independent and reliable target recognition system with modularized reconfiguration ability.

  2. RESEARCH AND DEVELOPMENT OF DSP-BASED FACE RECOGNITION SYSTEM FOR ROBOTIC REHABILITATION NURSING BEDS

    Directory of Open Access Journals (Sweden)

    Ming XING

    2016-04-01

    Full Text Available This article describes the development of a face recognition system with a DSP at its core. Building on a review of the background, significance, and current state of face recognition research at home and abroad, it studies in depth face detection, image preprocessing, facial structure feature extraction, facial expression feature extraction, classification, and other issues in face recognition, and reports the research and development of a DSP-based face recognition system for robotic rehabilitation nursing beds. The system uses a fixed-point TMS320DM642 DSP as its central processing unit, which offers strong processing performance, high flexibility, and programmability.

  3. Development of an automated speech recognition interface for personal emergency response systems

    Directory of Open Access Journals (Sweden)

    Mihailidis Alex

    2009-07-01

    Full Text Available Abstract. Background: Demands on long-term-care facilities are predicted to increase at an unprecedented rate as the baby boomer generation reaches retirement age. Aging-in-place (i.e. aging at home) is the desire of most seniors and is also a good option to reduce the burden on an over-stretched long-term-care system. Personal Emergency Response Systems (PERSs) help enable older adults to age-in-place by providing them with immediate access to emergency assistance. Traditionally they operate with push-button activators that connect the occupant via speaker-phone to a live emergency call-centre operator. If occupants do not wear the push button or cannot access the button, then the system is useless in the event of a fall or emergency. Additionally, a false alarm or failure to check-in at a regular interval will trigger a connection to a live operator, which can be unwanted and intrusive to the occupant. This paper describes the development and testing of an automated, hands-free, dialogue-based PERS prototype. Methods: The prototype system was built using a ceiling mounted microphone array, an open-source automatic speech recognition engine, and a 'yes' and 'no' response dialog modelled after an existing call-centre protocol. Testing compared a single microphone versus a microphone array with nine adults in both noisy and quiet conditions. Dialogue testing was completed with four adults. Results and discussion: The microphone array demonstrated improvement over the single microphone. In all cases, dialog testing resulted in the system reaching the correct decision about the kind of assistance the user was requesting. Further testing is required with elderly voices and under different noise conditions to ensure the appropriateness of the technology. Future developments include integration of the system with an emergency detection method as well as communication enhancement using features such as barge-in capability. Conclusion: The use of an automated

  4. Iris Recognition System Based on Feature Level Fusion

    Directory of Open Access Journals (Sweden)

    Dr. S. R. Ganorkar

    2013-11-01

    Full Text Available Multibiometric systems utilize the evidence presented by multiple biometric sources (e.g., face and fingerprint, multiple fingers of a single user, multiple matchers, etc.) in order to determine or verify the identity of an individual. Information from multiple sources can be consolidated at several distinct levels. However, fusing two different biometric traits is difficult because (i) the feature sets of multiple modalities may be incompatible (e.g., the minutiae set of fingerprints and the eigen-coefficients of a face); (ii) the relationship between the feature spaces of different biometric systems may not be known; and (iii) concatenating two feature vectors may result in a feature vector with very large dimensionality, leading to the 'curse of dimensionality' problem, huge storage requirements, and the need for a different processing algorithm. Also, if we use multiple images of a single biometric trait, they do not show much variation. In this paper, we therefore present an efficient technique of feature-based fusion in a multimodal system where the left eye and right eye are used as input. Iris recognition basically comprises iris location, feature extraction, and identification. The algorithm uses Canny edge detection to identify the inner and outer boundaries of the iris. The image is then fed to a Gabor wavelet transform to extract the features, and finally matching is done using an indexing algorithm. The results from the analysis of works indicate that the proposed technique can lead to substantial improvement in performance.

  5. Syllogisms delivered in an angry voice lead to improved performance and engagement of a different neural system compared to neutral voice

    Directory of Open Access Journals (Sweden)

    Kathleen Walton Smith

    2015-05-01

    Full Text Available Despite the fact that most real-world reasoning occurs in some emotional context, very little is known about the underlying behavioral and neural implications of such context. To further understand the role of emotional context in logical reasoning we scanned 15 participants with fMRI while they engaged in logical reasoning about neutral syllogisms presented through the auditory channel in a sad, angry, or neutral tone of voice. Exposure to angry voice led to improved reasoning performance compared to exposure to sad and neutral voice. A likely explanation for this effect is that exposure to expressions of anger increases selective attention toward the relevant features of target stimuli, in this case the reasoning task. Supporting this interpretation, reasoning in the context of angry voice was accompanied by activation in the superior frontal gyrus, a region known to be associated with selective attention. Our findings contribute to a greater understanding of the neural processes that underlie reasoning in an emotional context by demonstrating that two emotional contexts, despite being of the same (negative) valence, have different effects on reasoning.

  6. On Assisting a Visual-Facial Affect Recognition System with Keyboard-Stroke Pattern Information

    Science.gov (United States)

    Stathopoulou, I.-O.; Alepis, E.; Tsihrintzis, G. A.; Virvou, M.

    Towards realizing a multimodal affect recognition system, we are considering the advantages of assisting a visual-facial expression recognition system with keyboard-stroke pattern information. Our work is based on the assumption that the visual-facial and keyboard modalities are complementary to each other and that their combination can significantly improve the accuracy in affective user models. Specifically, we present and discuss the development and evaluation process of two corresponding affect recognition subsystems, with emphasis on the recognition of 6 basic emotional states, namely happiness, sadness, surprise, anger and disgust as well as the emotion-less state which we refer to as neutral. We find that emotion recognition by the visual-facial modality can be aided greatly by keyboard-stroke pattern information and the combination of the two modalities can lead to better results towards building a multimodal affect recognition system.

  7. Ethical aspects of face recognition systems in public places.

    NARCIS (Netherlands)

    Brey, Philip A.E.

    2004-01-01

    This essay examines ethical aspects of the use of facial recognition technology for surveillance purposes in public and semipublic areas, focusing particularly on the balance between security and privacy and civil liberties. As a case study, the FaceIt facial recognition engine of Identix Corporatio

  8. Indonesian Automatic Speech Recognition For Command Speech Controller Multimedia Player

    Directory of Open Access Journals (Sweden)

    Vivien Arief Wardhany

    2014-12-01

    Full Text Available The purpose of multimedia device development is control through voice. Nowadays, voice can be recognized only in English. To overcome this issue, recognition is performed using an Indonesian language model, acoustic model, and dictionary. The automatic speech recognizer is built using the CMU Sphinx engine, with the English language database modified to an Indonesian language database, and XBMC is used as the multimedia player. The experiment used 10 volunteers testing items based on 7 commands. The volunteers were classified by gender: 5 male and 5 female. Ten samples were taken for each command, after which each volunteer performed 10 test commands. Each volunteer also had to try all 7 commands provided. Based on the percentage classification table, the word “Kanan” was recognized most often, at 83%, while “pilih” was the lowest. The word with the most wrong classifications was “kembali”, at 67%, while the word “kanan” had the fewest. From the recognition rate (RR) results for male voices, several commands, such as “Kembali”, “Utama”, “Atas” and “Bawah”, have a low recognition rate. In particular, “kembali” could not be recognized as a command in the female voices, while in the male voices it had an RR of 4%; this is because there is no English word that sounds similar to “kembali”, so the system fails to recognize the command. The command “Pilih” had an RR of 80% for the female voices but only 4% for the male voices. This problem is mostly due to the different voice characteristics of adult males and females: males have lower voice frequencies (85 to 180 Hz) than females (165 to 255 Hz). The results of the experiment showed that each man had a different recognition rate, caused by differences in tone, pronunciation, and speed of speech. Further work is needed to improve the accuracy of the Indonesian automatic speech recognition system.

  9. Social context predicts recognition systems in ant queens

    DEFF Research Database (Denmark)

    Dreier, Stéphanie Agnès Jeanine; d'Ettorre, Patrizia

    2009-01-01

    Recognition of group-members is a key feature of sociality. Ants use chemical communication to discriminate nestmates from intruders, enhancing kin cooperation and preventing parasitism. The recognition code is embedded in their cuticular chemical profile, which typically varies between colonies....... We predicted that ants might be capable of accurate recognition in unusual situations when few individuals interact repeatedly, as new colonies started by two to three queens. Individual recognition would be favoured by selection when queens establish dominance hierarchies, because repeated fights...... for dominance are costly; but it would not evolve in absence of hierarchies. We previously showed that Pachycondyla co-founding queens, which form dominance hierarchies, have accurate individual recognition based on chemical cues. Here, we used the ant Lasius niger to test the null hypothesis that individual...

  10. Playful Interaction with Voice Sensing Modular Robots

    DEFF Research Database (Denmark)

    Heesche, Bjarke; MacDonald, Ewen; Fogh, Rune

    2013-01-01

    This paper describes a voice sensor, suitable for modular robotic systems, which estimates the energy and fundamental frequency, F0, of the user’s voice. Through a number of example applications and tests with children, we observe how the voice sensor facilitates playful interaction between...... children and two different robot configurations. In future work, we will investigate if such a system can motivate children to improve voice control and explore how to extend the sensor to detect emotions in the user’s voice....
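
    The two quantities the sensor estimates, frame energy and fundamental frequency (F0), can be illustrated with a short sketch. The abstract does not say how the authors compute them; the autocorrelation method, the function names, and the 80 to 500 Hz search range below are assumptions chosen for illustration only.

```python
import numpy as np

def frame_energy(frame):
    """Mean squared amplitude of one audio frame."""
    frame = np.asarray(frame, dtype=float)
    return float(np.mean(frame ** 2))

def estimate_f0(frame, sample_rate, f_min=80.0, f_max=500.0):
    """Estimate F0 in Hz from the autocorrelation peak within [f_min, f_max].

    Assumption: the frame is voiced and longer than sample_rate / f_min samples.
    """
    frame = np.asarray(frame, dtype=float)
    frame = frame - frame.mean()
    # Full autocorrelation; keep the non-negative lags only.
    acf = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
    lag_min = int(sample_rate / f_max)
    lag_max = min(int(sample_rate / f_min), len(acf) - 1)
    best_lag = lag_min + int(np.argmax(acf[lag_min:lag_max + 1]))
    return sample_rate / best_lag
```

    A real sensor would also need a voiced/unvoiced decision before trusting the F0 value; that step is omitted here.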

  11. Sustainable Consumer Voices

    DEFF Research Database (Denmark)

    Klitmøller, Anders; Rask, Morten; Jensen, Nevena

    2011-01-01

    Aiming to explore how user driven innovation can inform high level design strategies, an in-depth empirical study was carried out, based on data from 50 observations of private vehicle users. This paper reports the resulting 5 consumer voices: Technology Enthusiast, Environmentalist, Design Lover......, Pragmatist and Status Seeker. Expedient use of the voices in creating design strategies is discussed, thus contributing directly to the practice of high level design managers. The main academic contribution of this paper is demonstrating how applied anthropology can be used to generate insights...... into disruptive emergence of product service systems, where quantitative user analyses rely on historical continuation....

  12. Distributed Speech Recognition Systems and Some Key Factors Affecting It's Performance

    Institute of Scientific and Technical Information of China (English)

    YE Lei; YANG Zhen

    2003-01-01

    In this paper we first analyze the Distributed Speech Recognition (DSR) system and the key factors that affect its performance, and then focus on the relationship between the length of the test speech and the recognition accuracy of the system. Finally, some experimental results are given.

  13. The effect of image resolution on the performance of a face recognition system

    NARCIS (Netherlands)

    Boom, B.J.; Beumer, G.M.; Spreeuwers, L.J.; Veldhuis, R.N.J.

    2006-01-01

    In this paper we investigate the effect of image resolution on the error rates of a face verification system. We do not restrict ourselves to the face recognition algorithm only, but we also consider the face registration. In our face recognition system, the face registration is done by finding land

  14. Personal recognition using head-top image for health-monitoring system in the home.

    Science.gov (United States)

    Nakajima, K; Sasaki, K

    2004-01-01

    Automatic health-monitoring systems for the smart house are being developed for the elderly. An automatic health-monitoring system needs a way of personal recognition when two or more aged persons live together. We propose a personal recognition method based on the space spectrum of the head-top image. We examined 33 head-top images from eleven subjects and achieved a personal recognition rate of 86.4 percent. When one subject with thinning hair was excluded, the personal recognition rate was 90.0 percent in 30 head-top images from ten subjects.

  15. Songs induced mood recognition system using EEG signals.

    Science.gov (United States)

    Janvale, G B; Gawali, B W; Deore, Rakesh S; Mehrotra, Suresh C; Deshmukh, Sachin N; Marwale, Arun V

    2010-04-01

    Brain computer interfacing is a system that acquires and analyzes neural signals to create a communication channel directly between the brain and the computer. The EEG records the electrical fields generated by the nerve cells. With the help of the Fourier transform, the EEG signals are classified into four different frequency bands. The main purpose of the present paper is to report results related to the classification of EEG signals of different people subjected to different conditions. The experiment was done on 10 subjects performing activities related to hearing music chosen from the categories of patriotic, happy, romantic and sad songs, along with a relaxation activity. 19 electrodes were used under the (10-20) International Standard. The δ, θ, α and β components of the EEG signals for these activities were determined. Different statistical methods, including linear discriminant analysis, were tested for classification. The Linear Discriminant Analysis (LDA) produced four groups of all modes (Relaxation, Happy, Sad, Patriotic and Romantic Song), labeled Group1, Group2, Group3 and Group4, of all ten electrodes for the delta, theta, alpha and beta frequencies. The study may be used for the development of an activities induced mood recognition (AIMR) system based on the EEG signal.
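
    As an illustration of splitting an EEG trace into the four bands with a Fourier transform, the sketch below sums spectral power per band. The band edges are conventional values and are an assumption, since the abstract does not state the limits the authors used; `band_powers` is a hypothetical helper name.

```python
import numpy as np

# Conventional band edges in Hz (assumed; not taken from the abstract).
BANDS = {
    "delta": (0.5, 4.0),
    "theta": (4.0, 8.0),
    "alpha": (8.0, 13.0),
    "beta": (13.0, 30.0),
}

def band_powers(signal, sample_rate):
    """Return the summed spectral power of a 1-D signal in each band."""
    signal = np.asarray(signal, dtype=float)
    signal = signal - signal.mean()          # remove the DC offset
    spectrum = np.abs(np.fft.rfft(signal)) ** 2
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / sample_rate)
    return {
        name: float(spectrum[(freqs >= lo) & (freqs < hi)].sum())
        for name, (lo, hi) in BANDS.items()
    }
```

    Per-electrode band powers of this kind could then serve as the feature vector passed to a classifier such as LDA.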

  16. Vowel recognition by fuzzy inference and application to recognition of continuous Korean speech. Fuzzy suiron ni yoru boin ninshiki to kankokugo renzoku onsei eno oyo

    Energy Technology Data Exchange (ETDEWEB)

    Choi, W.K.; Akizuki, K. (Waseda Univ., Tokyo (Japan)); Lee, H.H. (Fukuoka Inst. of Tech., Fukuoka (Japan))

    1991-05-20

    The target of voice recognition is to recognize continuous speech which is effective for speech recognition of unspecified persons. As a new matching method, the variations of feature parameters of speakers are represented as fuzzy variables to express the variation by membership functions. It is a new pattern matching method of fuzzy inference using feature parameters, fuzzy relation and synthesis of each formant, and the fuzzy rule. It is a recognition method for the inference of best formant which matches the fact by providing each characteristic quantity and fuzzy rule for composite calculation. For consonant recognition, pitch, logarithmic energies, zero crossing rates, etc. are used which represent features of each formant. KOSRES 2, recognition system for continuous Korean speech, was structured using this method which was subjected to recognition experiments on continuous Korean speech, and the recognition method by fuzzy inference is found to be effective for speech recognition of unspecified persons. 8 refs., 9 figs., 3 tabs.

  17. Design of intelligent voice control system based on DSP

    Institute of Scientific and Technical Information of China (English)

    郑微; 李正周; 田蕾

    2012-01-01

    Using voice commands to interact with intelligent equipment has become one of the hot topics in modern control theory. A voice command control and processing system based on a DSP, a voice acquisition module, a wireless transceiver module, and on-chip peripheral resources is designed and realized. The voice control signals are acquired by the voice acquisition module, the voice commands are recognized by the DSP and on-chip peripherals, and the recognized commands are passed to the wireless transceiver module to control the intelligent equipment. The design has a wide range of applications and provides a feasible and practical reference for human-computer interaction.

  18. A REVIEW ON THE DEVELOPMENT OF INDONESIAN SIGN LANGUAGE RECOGNITION SYSTEM

    OpenAIRE

    Sutarman; Mazlina Abdul Majid; Jasni Mohamad Zain

    2013-01-01

    Sign language is mainly employed by hearing-impaired people to communicate with each other. However, communication with normal people is a major handicap for them since normal people do not understand their sign language. Sign language recognition is needed for realizing a human oriented interactive system that can perform an interaction like normal communication. Sign language recognition basically uses two approaches: (1) computer vision-based gesture recognition, in which a camera is used ...

  19. Video based Parallel Face recognition using Gabor filter on homogeneous distributed systems

    DEFF Research Database (Denmark)

    Ali, Usman; Bilal, Muhammad

    This research aimed at building a fast video, parallel face recognition system based on the well known Gabor filtering approach. Face recognition is done after face detection in each frame of the video, individually. The master-slave technique is employed as the parallel computing model. Each frame...... is processed by different slave personal computers (PC) attached to the master, which acquire and distribute frames. It is believed that this approach can be used for practical face recognition applications with some further optimization...

  20. Listen to a voice

    DEFF Research Database (Denmark)

    Hølge-Hazelton, Bibi

    2001-01-01

    Listen to the voice of a young girl Lonnie, who was diagnosed with Type 1 diabetes at 16. Imagine that she is deeply involved in the social security system. She lives with her mother and two siblings in a working class part of a small town. She is at a special school for problematic youth, and he...

  1. What the voice reveals.

    NARCIS (Netherlands)

    Ko, Sei Jin

    2007-01-01

    Given that the voice is our main form of communication, we know surprisingly little about how it impacts judgment and behavior. Furthermore, the modern advancement in telecommunication systems, such as cellular phones, has meant that a large proportion of our everyday interactions are conducted voca

  2. Voices for Careers.

    Science.gov (United States)

    York, Edwin G.; Kapadia, Madhu

    Listed in this annotated bibliography are 502 cassette tapes of value to career exploration for Grade 7 through the adult level, whether as individualized instruction, small group study, or total class activity. Available to New Jersey educators at no charge, this Voices for Careers System is also available for duplication on request from the New…

  3. What the voice reveals

    NARCIS (Netherlands)

    Ko, Sei Jin

    2007-01-01

    Given that the voice is our main form of communication, we know surprisingly little about how it impacts judgment and behavior. Furthermore, the modern advancement in telecommunication systems, such as cellular phones, has meant that a large proportion of our everyday interactions are conducted voca

  4. Towards very large vocabulary word recognition

    Science.gov (United States)

    Waibel, A.

    1982-11-01

    In this paper, preliminary considerations and some experimental results are presented in an effort to design Very Large Vocabulary Recognition (VLVR) systems. We will first consider the applicability of current recognition techniques and argue their inadequacy for VLVR. Possible alternate strategies will be explored and their potential usefulness statistically evaluated. Our results indicate that suprasegmental cues such as syllabification, stress patterns, rhythmic patterns and the voiced-unvoiced patterns in the syllables of a word provide powerful mechanisms for search space reduction. Suprasegmental features could thus operate in a complementary fashion to segmental features.

  5. Eusocial evolution and the recognition systems in social insects.

    Science.gov (United States)

    Krasnec, Michelle O; Breed, Michael D

    2012-01-01

    Eusocial species, animals which live in colonies with a reproductive division of labor, typically have closed societies, in which colony members are allowed entry and nonmembers, including animals of the same species, are excluded. This implies an ability to discriminate colony members ("self") from nonmembers ("nonself"). We draw analogies between this type of discrimination and MHC-mediated cellular recognition in vertebrates. Recognition of membership in eusocial colonies is typically mediated by differences in the surface chemistry between members and nonmembers and we review studies which support this hypothesis. In rare instances, visual signals mediate recognition. We highlight the need for better understanding of which surface compounds actually mediate recognition and for further work on how differences between colony members and nonmembers are perceived.

  6. Pattern recognition for the anti PANDA forward tracking systems

    Energy Technology Data Exchange (ETDEWEB)

    Galuska, Martin J.; Hu, Jifeng; Kuehn, Wolfgang; Lange, J. Soeren; Liang, Yutie; Muenchow, David; Spruck, Bjoern [Giessen Univ. (Germany). 2. Physikalisches Inst.; Collaboration: PANDA-Collaboration

    2013-07-01

    The anti PANDA experiment is planned to start operation in 2017 as part of the future FAIR facility, which will be built at the site of GSI in Darmstadt. It will utilize antiproton beams with beam momentum resolutions of Δp/p ≤ 2 × 10⁻⁵. anti PANDA is particularly suited to perform resonance scans of exclusively produced charmonium(-like) states, and thus provide absolute measurements of resonance widths. As it is a fixed target experiment, a large fraction of final state particles will be boosted toward forward angles in aforementioned reactions facilitating the importance of forward tracking for the success of the experiment. The key challenges for forward tracking arise from the beam momentum dependent magnetic fields: anti PANDA is comprised of a barrel part with a solenoid field of B_z = 2 T and a forward detector with a dipole field of B·L = 2 Tm. The interference of the aforementioned magnetic fields leads to complex particle tracks making accurate matching of hits challenging. A Hough Transform algorithm for pattern recognition in the Forward Tracking System based upon a parabola track model was developed. The performance of a proof-of-concept implementation was studied with detailed PandaRoot simulations. The algorithm is presented in the poster alongside results for momentum resolution, efficiency and ghost rate. Alternative approaches are discussed. Results for momentum resolution, efficiency and ghost rate are discussed.
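
    The idea of a Hough transform over a parabola track model can be sketched as follows. This is not the PandaRoot implementation: the two-parameter model x = a·z² + b·z (a track through the origin), the binning, and the function name `hough_parabola` are all assumptions made for illustration.

```python
import numpy as np

def hough_parabola(hits, a_values, b_min, b_max, n_b_bins):
    """Vote in (a, b) space for tracks x = a*z**2 + b*z.

    hits: iterable of (z, x) pairs. Returns (a, b, votes) of the fullest cell.
    """
    hits = np.asarray(hits, dtype=float)
    a_values = np.asarray(a_values, dtype=float)
    accumulator = np.zeros((len(a_values), n_b_bins), dtype=int)
    bin_width = (b_max - b_min) / n_b_bins
    for z, x in hits:
        if z == 0:
            continue  # a hit at z = 0 does not constrain (a, b) in this model
        # Each hit maps to the line b = x/z - a*z in parameter space.
        b = x / z - a_values * z
        cols = np.floor((b - b_min) / bin_width).astype(int)
        valid = (cols >= 0) & (cols < n_b_bins)
        accumulator[np.nonzero(valid)[0], cols[valid]] += 1
    i, j = np.unravel_index(np.argmax(accumulator), accumulator.shape)
    b_best = b_min + (j + 0.5) * bin_width   # report the bin centre
    return float(a_values[i]), float(b_best), int(accumulator[i, j])
```

    Hits belonging to one track vote for the same (a, b) cell, so a peak in the accumulator marks a track candidate; spurious peaks are one source of the ghost tracks whose rate the abstract reports.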

  7. Implementation of age and gender recognition system for intelligent digital signage

    Science.gov (United States)

    Lee, Sang-Heon; Sohn, Myoung-Kyu; Kim, Hyunduk

    2015-12-01

    Intelligent digital signage systems transmit customized advertising and information by analyzing users and customers, unlike existing systems that present advertising in broadcast form without regard to the type of customer. Currently, development of intelligent digital signage systems is being pushed forward vigorously. In this study, we designed a system capable of analyzing the gender and age of customers based on images obtained from a camera, although there are many different methods for analyzing customers. We conducted age and gender recognition experiments using a public database. The experiments were performed with a histogram matching method, extracting local binary pattern (LBP) features after the facial area of the input image was normalized. The results showed that the gender recognition rate was approximately 97% on average. Age recognition was based on categorization into 5 age classes. Age recognition rates for women and men were about 67% and 68%, respectively, when conducted separately for each gender.
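
    The LBP-plus-histogram-matching pipeline described above might look roughly like the sketch below. The basic 3x3 LBP operator, the histogram-intersection similarity, and the helper names are assumptions; the abstract does not specify which LBP variant or matching measure was used.

```python
import numpy as np

def lbp_histogram(gray):
    """Normalized 256-bin histogram of basic 3x3 LBP codes of a 2-D image."""
    gray = np.asarray(gray, dtype=float)
    h, w = gray.shape
    center = gray[1:-1, 1:-1]
    codes = np.zeros(center.shape, dtype=np.int64)
    # Eight neighbours, clockwise from top-left; each contributes one bit.
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    for bit, (dy, dx) in enumerate(offsets):
        neighbour = gray[1 + dy:h - 1 + dy, 1 + dx:w - 1 + dx]
        codes |= (neighbour >= center).astype(np.int64) << bit
    hist = np.bincount(codes.ravel(), minlength=256).astype(float)
    return hist / hist.sum()

def histogram_intersection(h1, h2):
    """Similarity in [0, 1] between two normalized histograms."""
    return float(np.minimum(h1, h2).sum())

def classify(query_hist, gallery):
    """gallery: dict mapping a class label to its reference histogram."""
    return max(gallery,
               key=lambda label: histogram_intersection(query_hist, gallery[label]))
```

    In practice the face is usually divided into a grid of cells with one histogram per cell, which preserves spatial layout; a single global histogram is used here to keep the sketch short.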

  8. Design of an Optical Character Recognition System for Camera-based Handheld Devices

    CERN Document Server

    Mollah, Ayatullah Faruk; Basu, Subhadip; Nasipuri, Mita

    2011-01-01

    This paper presents a complete Optical Character Recognition (OCR) system for camera captured image/graphics embedded textual documents for handheld devices. At first, text regions are extracted and skew corrected. Then, these regions are binarized and segmented into lines and characters. Characters are passed into the recognition module. Experimenting with a set of 100 business card images, captured by cell phone camera, we have achieved a maximum recognition accuracy of 92.74%. Compared to Tesseract, an open source desktop-based powerful OCR engine, present recognition accuracy is worth contributing. Moreover, the developed technique is computationally efficient and consumes low memory so as to be applicable on handheld devices.

  9. Structural insight into RNA recognition motifs: versatile molecular Lego building blocks for biological systems.

    Science.gov (United States)

    Muto, Yutaka; Yokoyama, Shigeyuki

    2012-01-01

    'RNA recognition motifs (RRMs)' are common domain-folds composed of 80-90 amino-acid residues in eukaryotes, and have been identified in many cellular proteins. At first they were known as RNA binding domains. Through discoveries over the past 20 years, however, the RRMs have been shown to exhibit versatile molecular recognition activities and to behave as molecular Lego building blocks to construct biological systems. Novel RNA/protein recognition modes by RRMs are being identified, and more information about the molecular recognition by RRMs is becoming available. These RNA/protein recognition modes are strongly correlated with their biological significance. In this review, we would like to survey the recent progress on these versatile molecular recognition modules.

  10. Diagonal Based Feature Extraction for Handwritten Alphabets Recognition System using Neural Network

    CERN Document Server

    Pradeep, J; Himavathi, S; 10.5121/ijcsit.2011.3103

    2011-01-01

    An off-line handwritten alphabetical character recognition system using multilayer feed forward neural network is described in the paper. A new method, called, diagonal based feature extraction is introduced for extracting the features of the handwritten alphabets. Fifty data sets, each containing 26 alphabets written by various people, are used for training the neural network and 570 different handwritten alphabetical characters are used for testing. The proposed recognition system performs quite well yielding higher levels of recognition accuracy compared to the systems employing the conventional horizontal and vertical methods of feature extraction. This system will be suitable for converting handwritten documents into structural text form and recognizing handwritten names.

  11. Comparing Speech Recognition Systems (Microsoft API, Google API And CMU Sphinx)

    Directory of Open Access Journals (Sweden)

    Veton Këpuska

    2017-03-01

    Full Text Available The idea of this paper is to design a tool that will be used to test and compare commercial speech recognition systems, such as Microsoft Speech API and Google Speech API, with open-source speech recognition systems such as Sphinx-4. The best way to compare automatic speech recognition systems in different environments is by using audio recordings selected from different sources and calculating the word error rate (WER). Although the WERs of the three aforementioned systems were acceptable, it was observed that the Google API is superior.
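
    For reference, the word error rate is conventionally the word-level edit distance (substitutions, deletions and insertions) divided by the number of words in the reference transcript. A minimal sketch, assuming whitespace tokenization and no text normalization (the paper's own tool may differ):

```python
def word_error_rate(reference, hypothesis):
    """WER = word-level Levenshtein distance / number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between ref[:i-1] and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr[j] = min(prev[j] + 1,         # deletion
                          curr[j - 1] + 1,     # insertion
                          prev[j - 1] + cost)  # substitution or match
        prev = curr
    return prev[-1] / len(ref)
```

    Because insertions are counted, the WER can exceed 1.0 when the hypothesis is much longer than the reference.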

  12. Design of an Intelligent Voice Trolley System

    Institute of Scientific and Technical Information of China (English)

    曹斌芳; 李建奇; 胡惟文

    2012-01-01

    The design of an intelligent voice-controlled vehicle system is introduced, which uses Sunplus's SPCE061A microcontroller as the core control component and the L298N motor driver chip as the main control module. According to recorded voice commands, the system can control the car to start, stop, return, and turn. The running state of the intelligent vehicle shows that the design is feasible.

  13. Text-Independent Speaker Recognition for Low SNR Environments with Encryption

    CERN Document Server

    Chadha, Aman; Roja, M Mani; 10.5120/3864-5394

    2011-01-01

    Recognition systems are commonly designed to authenticate users at the access control levels of a system. A number of voice recognition methods have been developed using a pitch estimation process which are very vulnerable in low Signal to Noise Ratio (SNR) environments thus, these programs fail to provide the desired level of accuracy and robustness. Also, most text independent speaker recognition programs are incapable of coping with unauthorized attempts to gain access by tampering with the samples or reference database. The proposed text-independent voice recognition system makes use of multilevel cryptography to preserve data integrity while in transit or storage. Encryption and decryption follow a transform based approach layered with pseudorandom noise addition whereas for pitch detection, a modified version of the autocorrelation pitch extraction algorithm is used. The experimental results show that the proposed algorithm can decrypt the signal under test with exponentially reducing Mean Square Error ...

  14. Designing a Low-Resolution Face Recognition System for Long-Range Surveillance

    NARCIS (Netherlands)

    Peng, Y.; Spreeuwers, Lieuwe Jan; Veldhuis, Raymond N.J.

    2016-01-01

    Most face recognition systems deal well with high-resolution facial images, but perform much worse on low-resolution facial images. In low-resolution face recognition, there is a specific but realistic surveillance scenario: a surveillance camera monitoring a large area. In this scenario, usually

  15. AUTOMATIC SPEECH RECOGNITION SYSTEM CONCERNING THE MOROCCAN DIALECTE (Darija and Tamazight)

    OpenAIRE

    A. EL GHAZI; Daoui, C.; Idrissi, N

    2012-01-01

    In this work we present an automatic speech recognition system for Moroccan dialect, mainly Darija (Arab dialect) and Tamazight. Many approaches have been used to model the Arabic and Tamazight phonetic units. In this paper, we propose to use the hidden Markov model (HMM) for modeling these phonetic units. Experimental results show that the proposed approach further improves the recognition.

  16. Predicting performance of a face recognition system based on image quality

    NARCIS (Netherlands)

    Dutta, Abhishek

    2015-01-01

    In this dissertation, we present a generative model to capture the relation between facial image quality features (like pose, illumination direction, etc) and face recognition performance. Such a model can be used to predict the performance of a face recognition system. Since the model is based sole

  17. Design and optimization of voice coil actuator for six degree of freedom active vibration isolation system using Halbach magnet array.

    Science.gov (United States)

    Kim, MyeongHyeon; Kim, Hyunchang; Gweon, Dae-Gab

    2012-10-01

    This paper describes the design, modeling, optimization, and validation of an active vibration isolation system using a voice coil motor. The active vibration isolating method was constructed with a passive isolator and an active isolator. A spring was used for passive isolating; an actuator was used for active isolating. The proposed active vibration isolation system (AVIS) can isolate disturbances for many kinds of instruments. Until now, developed AVIS were able to isolate a six degree-of-freedom disturbance effectively. This paper proposes the realization of such a six degree-of-freedom active vibration isolation system that can work as a bench top device for precision measuring machines such as atomic force microscope, scanning probe microscope, etc.

  18. 2nd International Symposium on Signal Processing and Intelligent Recognition Systems

    CERN Document Server

    Bandyopadhyay, Sanghamitra; Krishnan, Sri; Li, Kuan-Ching; Mosin, Sergey; Ma, Maode

    2016-01-01

    This Edited Volume contains a selection of refereed and revised papers originally presented at the second International Symposium on Signal Processing and Intelligent Recognition Systems (SIRS-2015), December 16-19, 2015, Trivandrum, India. The program committee received 175 submissions. Each paper was peer reviewed by at least three independent referees of the program committee, and 59 papers were finally selected. The papers offer stimulating insights into biometrics, digital watermarking, recognition systems, image and video processing, signal and speech processing, pattern recognition, machine learning and knowledge-based systems. The book is directed to researchers and scientists engaged in various fields of signal processing and related areas.

  19. Face Recognition Based Door Lock System Using Opencv and C# with Remote Access and Security Features

    Directory of Open Access Journals (Sweden)

    Prathamesh Timse

    2014-04-01

    This paper investigates the accuracy and effectiveness of face detection and recognition algorithms using OpenCV and the C# programming language. The AdaBoost algorithm [2] is used for face detection and the PCA algorithm [1] is used for face recognition. This paper also investigates the robustness of the face recognition system when an unknown person is detected, in which case the system sends an email to the owner of the system using SMTP [7]. The door lock can also be accessed remotely from any part of the world by using a Dropbox [8] account.

  20. Hardware based segmentation in iris recognition and authentication systems

    Science.gov (United States)

    Ulis, Bradley J.; Broussard, Randy P.; Rakvic, Ryan N.; Ives, Robert W.; Steiner, Neil; Ngo, Hau

    2009-05-01

    Iris recognition algorithms depend on image processing techniques for proper segmentation of the iris. In the Ridge Energy Direction (RED) iris recognition algorithm, the initial step in the segmentation process searches for the pupil by thresholding and using binary morphology functions to rectify artifacts obfuscating the pupil. These functions take substantial processing time in software on the order of a few hundred million operations. Alternatively, a hardware version of the binary morphology functions is implemented to assist in the segmentation process. The hardware binary morphology functions have negligible hardware footprint and power consumption while achieving speed up of 200 times compared to the original software functions.

  1. Rotation, scale and translation invariant pattern recognition system for color images

    Science.gov (United States)

    Barajas-García, Carolina; Solorza-Calderón, Selene; Álvarez-Borrego, Josué

    2016-12-01

    This work presents a color image pattern recognition system invariant to rotation, scale and translation. The system works with three 1D signatures, one for each RGB color channel. The signatures are constructed based on the Fourier transform, the analytic Fourier-Mellin transform and a Hilbert binary rings mask. According to the statistical theory of box plots, the pattern recognition system has a confidence level of at least 95.4%.

  2. Improved sensitivity of wearable nanogenerators made of electrospun Eu3+ doped P(VDF-HFP)/graphene composite nanofibers for self-powered voice recognition

    Science.gov (United States)

    Adhikary, Prakriti; Biswas, Anirban; Mandal, Dipankar

    2016-12-01

    Composite nanofibers of Eu3+ doped poly(vinylidene fluoride-co-hexafluoropropylene) (P(VDF-HFP))/graphene are prepared by the electrospinning technique for the fabrication of ultrasensitive wearable piezoelectric nanogenerators (WPNGs) where the post-poling technique is not necessary. It is found that the complete conversion of the piezoelectric β-phase and the improvement of the degree of crystallinity is governed by the incorporation of Eu3+ and graphene sheets into P(VDF-HFP) nanofibers. The flexible nanocomposite fibers are associated with a hypersensitive electronic transition that results in an intense red light emission, and WPNGs also have the capability of detecting external pressure as low as ~23 Pa with a higher degree of acoustic sensitivity, ~11 V Pa-1, than has ever been previously reported. This means that ultrasensitive WPNGs can be utilized to recognize human voices, which suggests they could be a potential tool in the biomedical and national security sectors. The capacitor’s ability to charge from abundant environmental vibrations, such as music, wind, body motion, etc, drives WPNGs as a power source for portable electronics. This fact may open up the prospect of using the Eu3+ doped P(VDF-HFP)/graphene composite electrospun nanofibers, with their multifunctional properties such as vibration sensitivity, wearability, red light emission capability and piezoelectric energy harvesting, for various promising applications in portable electronics, health care monitoring, noise detection and security monitoring.

  3. Multi-message Voice Record and Playback System Based on MCU

    Institute of Scientific and Technical Information of China (English)

    温洪昌; 黄应强; 傅贵兴

    2011-01-01

    A multi-message voice record and playback system is designed based on the STC89C52RC MCU and the ISD1730 voice chip. The paper introduces the implementation, including voice circuit design, multi-message voice recording, locating, editing and combination output. A temperature measuring instrument with a voice output function is also produced. Experiments indicate that this instrument is easier to operate and use.

  4. The design of a digital voice data compression technique for orbiter voice channels

    Science.gov (United States)

    1975-01-01

    Voice bandwidth compression techniques were investigated to anticipate link margin difficulties in the shuttle S-band communication system. It was felt that by reducing the data rate on each voice channel from the baseline 24 (or 32) Kbps to 8 Kbps, additional margin could be obtained. The feasibility of such an alternate voice transmission system was studied. Several factors of prime importance that were addressed are: (1) achieving high quality voice at 8 Kbps; (2) performance in the presence of the anticipated shuttle cabin environmental noise; (3) performance in the presence of the anticipated channel error statistics; and (4) minimal increase in size, weight, and power over the current baseline voice processor.

  5. Research on gesture recognition of augmented reality maintenance guiding system based on improved SVM

    Science.gov (United States)

    Zhao, Shouwei; Zhang, Yong; Zhou, Bin; Ma, Dongxi

    2014-09-01

    Interaction is one of the key techniques of an augmented reality (AR) maintenance guiding system. Because of the complexity of the maintenance guiding system's image background and the high dimensionality of gesture characteristics, the whole process of gesture recognition can be divided into three stages: gesture segmentation, gesture characteristic feature modeling and trick recognition. In the segmentation stage, to solve the misrecognition of skin-like regions, a segmentation algorithm combining a background model and skin color is adopted to exclude some skin-like regions. In the gesture characteristic feature modeling stage, many characteristic features are analyzed and acquired, such as structure characteristics, Hu invariant moment features and Fourier descriptors. In the trick recognition stage, a classifier based on the Support Vector Machine (SVM) is introduced into the augmented reality maintenance guiding process. SVM is a novel learning method based on statistical learning theory, with a solid theoretical foundation and excellent learning ability; it addresses many issues in the machine learning area and has special advantages in dealing with small samples and high-dimensional non-linear pattern recognition. The gesture recognition of the augmented reality maintenance guiding system is realized by SVM after the granulation of all the characteristic features. The experimental results of the simulation of number gesture recognition and its application in the augmented reality maintenance guiding system show that the real-time performance and robustness of gesture recognition of the AR maintenance guiding system can be greatly enhanced by the improved SVM.

  6. INTEGRATED EXPRESSIONAL AND COLOR INVARIANT FACIAL RECOGNITION SCHEME FOR HUMAN BIOMETRIC SYSTEM

    Directory of Open Access Journals (Sweden)

    M.Punithavalli

    2013-09-01

    In many practical applications such as biometrics, video surveillance and human-computer interaction, face recognition plays a major role. Previous works focused on recognizing and enhancing biometric systems based on the facial components of the system. In this work, we build an Integrated Expressional and Color Invariant Facial Recognition scheme for human biometric recognition suited to different security-provisioning public participation areas. First, the features of the face are identified and processed using a Bayes classifier with RGB and HSV color bands. Second, psychological emotional variances are identified and linked with the respective human facial expression based on the facial action code system. Finally, an integrated expressional and color invariant facial recognition is proposed for varied conditions of illumination, pose, transformation, etc. These conditions on the color invariant model are suited to an easier and more efficient biometric recognition system in the public domain and in highly confidential security zones. The integration is derived through a genetic operation on the color and expression components of the facial feature system. Experimental evaluation is planned with public face databases (DBs) such as CMU-PIE, Color FERET, XM2VTSDB, SCface, and FRGC 2.0 to estimate the performance of the proposed integrated expressional facial and color invariant recognition scheme (IEFCIRS). Performance evaluation is based on criteria such as recognition rate, security and evaluation time.

  7. Design of Embedded Voice Storage System Based on ARM

    Institute of Scientific and Technical Information of China (English)

    于春雪

    2012-01-01

    In order to effectively save the transmission bandwidth of voice data and the disk space of the storage system, the encoding bit rate must be lowered as far as possible while ensuring voice quality. The design uses an optimized G.729 voice compression encoding and decoding algorithm and an ARM processor. The embedded voice storage system can achieve mass storage of voice signals. It has fast processing speed, good reliability and convenient expansion. Through rigorous testing and evaluation, the system achieves the compression and recording of large amounts of voice data, and the indicators largely reach the expected level.

  8. Design of miniature hybrid target recognition system with combination of FPGA+DSP

    Science.gov (United States)

    Luo, Shishang; Li, Xiujian; Jia, Hui; Hu, Wenhua; Nie, Yongming; Chang, Shengli

    2010-10-01

    With advantages of flexibility, high bandwidth, high spatial resolution and high-speed parallel operation, the opto-electronic hybrid target recognition system can be applied in many civil and military areas, such as video surveillance, intelligent navigation and robot vision. A miniature opto-electronic hybrid target recognition system based on FPGA+DSP is designed, which employs only a single Fourier lens with one focal length. With the precise timing control of the FPGA and the image preprocessing of the DSP, the system performs both the Fourier transform and the inverse Fourier transform in an all-optical process, which can improve recognition speed and reduce the system volume remarkably. We analyzed the system performance, and a method to achieve scale-invariant pattern recognition was proposed on the basis of many experiments.

  9. Flow of information in the spoken word recognition system

    NARCIS (Netherlands)

    McQueen, J.M.; Cutler, A.; Norris, D.

    2003-01-01

    Spoken word recognition consists of two major component processes. At the prelexical stage, information in the speech signal is used to generate an abstract description of the utterance which can then be used to access stored lexical knowledge. The lexical stage is characterized by multiple activati

  10. Fuzzy Pattern Recognition System for Detection of Alga Distribution

    Institute of Scientific and Technical Information of China (English)

    2001-01-01

    To realize the on-line measurement and make analysis on the density of algae and their cluster distribution, the fluorescent detection and fuzzy pattern recognition techniques are used. The principle of fluorescent fiber-optic detection is given as well as the method of fuzzy feature extraction using a class of neural network.

  11. System Accuracy Evaluation of the GlucoRx Nexus Voice TD-4280 Blood Glucose Monitoring System

    Directory of Open Access Journals (Sweden)

    Muhammad Khan

    2014-01-01

    Use of blood glucose (BG) meters in the self-monitoring of blood glucose (SMBG) significantly lowers the risk of diabetic complications. With several BG meters now commercially available, the International Organization for Standardization (ISO) ensures that each BG meter conforms to a set degree of accuracy. Although adherence to ISO guidelines is a prerequisite for commercialization in Europe, several BG meters claim to meet the ISO guidelines yet fail to do so on internal validation. We conducted a study to determine whether the accuracy of the GlucoRx Nexus TD-4280 meter, utilized by our department for its cost-effectiveness, complied with ISO guidelines. 105 patients requiring laboratory blood glucose analysis were randomly selected and reference measurements were determined by the UniCel DxC 800 clinical system. Overall the BG meter failed to adhere to the ≥95% accuracy criterion required by both the 15197:2003 (overall accuracy 92.4%) and 15197:2013 (overall accuracy 86.7%) protocols. Inaccurate meters have an inherent risk of over- and/or underestimating the true BG concentration, thereby exposing patients to incorrect therapeutic interventions. Our study demonstrates the importance of internally validating the accuracy of BG meters to ensure that their accuracy meets standardized guidelines.
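
    The ≥95% criterion mentioned in this abstract can be illustrated with a short sketch. The abstract does not state the per-reading limits, so the values below are the commonly cited ISO 15197 system-accuracy limits and should be treated as assumptions: for the 2003 edition, ±15 mg/dL when the reference is below 75 mg/dL and ±20% otherwise; for the 2013 edition, ±15 mg/dL below 100 mg/dL and ±15% otherwise. The function names and the sample readings are hypothetical and are not data from the study.

```python
# Minimal sketch of an ISO 15197-style system accuracy check.
# Limits are assumptions taken from the commonly cited editions of the
# standard, not from the abstract; readings are in mg/dL.

LIMITS = {
    # edition: (reference threshold, absolute limit, relative limit)
    2003: (75.0, 15.0, 0.20),
    2013: (100.0, 15.0, 0.15),
}


def within_limit(meter, reference, edition):
    """Return True if a single meter reading is acceptably close to the reference."""
    threshold, abs_limit, rel_limit = LIMITS[edition]
    error = abs(meter - reference)
    if reference < threshold:
        return error <= abs_limit          # absolute limit at low glucose
    return error <= rel_limit * reference  # relative limit otherwise


def accuracy_percent(pairs, edition):
    """Percentage of (meter, reference) pairs that fall within the limits."""
    hits = sum(within_limit(m, r, edition) for m, r in pairs)
    return 100.0 * hits / len(pairs)


def meets_criterion(pairs, edition, required=95.0):
    """True if at least `required` percent of readings are within the limits."""
    return accuracy_percent(pairs, edition) >= required


# Hypothetical (meter, reference) readings, for illustration only.
sample = [(90, 100), (118, 100), (60, 70), (200, 180)]
print(accuracy_percent(sample, 2003), accuracy_percent(sample, 2013))
```

    The same readings can pass under one edition and fail under the other, which is consistent with the abstract reporting a lower overall accuracy under the stricter 2013 protocol.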

  12. Improving the recognition of fingerprint biometric system using enhanced image fusion

    Science.gov (United States)

    Alsharif, Salim; El-Saba, Aed; Stripathi, Reshma

    2010-04-01

    Fingerprint recognition systems have been widely used by financial institutions, law enforcement, border control and visa issuing, to mention a few. Biometric identifiers can be counterfeited, but are considered more reliable and secure than traditional ID cards or personal password methods. Fingerprint pattern fusion improves the performance of a fingerprint recognition system in terms of accuracy and security. This paper presents digital enhancement and fusion approaches that improve the biometric performance of the fingerprint recognition system. It is a two-step approach. In the first step, raw fingerprint images are enhanced using high-frequency-emphasis filtering (HFEF). The second step is a simple linear fusion process between the raw images and the HFEF ones. It is shown that the proposed approach improves the verification and identification performance of the fingerprint biometric recognition system, where any improvement is justified using the correlation performance metrics of the matching algorithm.

  13. Developing Autonomic Properties for Distributed Pattern-Recognition Systems with ASSL: A Distributed MARF Case Study

    CERN Document Server

    Vassev, Emil

    2011-01-01

    In this paper, we discuss our research towards developing special properties that introduce autonomic behavior in pattern-recognition systems. In our approach we use ASSL (Autonomic System Specification Language) to formally develop such properties for DMARF (Distributed Modular Audio Recognition Framework). These properties enhance DMARF with an autonomic middleware that manages the four stages of the framework's pattern-recognition pipeline. DMARF is a biologically inspired system employing pattern recognition, signal processing, and natural language processing, helping us process audio, textual, or imagery data needed by a variety of scientific applications, e.g., biometric applications. In that context, the notion of autonomic DMARF (ADMARF) can be employed by autonomous and robotic systems that theoretically require little to no human intervention other than data collection for pattern analysis and observing the results. In this article, we explain the ASSL specification models for the autonomic propertie...

  14. Image Quality Enhancement Using the Direction and Thickness of Vein Lines for Finger-Vein Recognition

    OpenAIRE

    Young Ho Park; Kang Ryoung Park

    2012-01-01

    On the basis of the increased emphasis placed on the protection of privacy, biometric recognition systems using physical or behavioural characteristics such as fingerprints, facial characteristics, iris and finger‐vein patterns or the voice have been introduced in applications including door access control, personal certification, Internet banking and ATM machines. Among these, finger‐vein recognition is advantageous in that it involves the use of inexpensive and small devices that are diffic...

  15. Review of Data Preprocessing Methods for Sign Language Recognition Systems based on Artificial Neural Networks

    Directory of Open Access Journals (Sweden)

    Zorins Aleksejs

    2016-12-01

    The article presents an introductory analysis of a research topic relevant to the Latvian deaf society: the development of the Latvian Sign Language Recognition System. More specifically, data preprocessing methods are discussed, and several approaches are shown with a focus on systems based on artificial neural networks, which are one of the most successful solutions for the sign language recognition task.

  16. On model architecture for a children's speech recognition interactive dialog system

    OpenAIRE

    Kraleva, Radoslava; Kralev, Velin

    2016-01-01

    This report presents a general model of the architecture of information systems for the speech recognition of children. It presents a model of the speech data stream and how it works. The results of these studies and the presented architectural model show that research needs to be focused on acoustic-phonetic modeling in order to improve the quality of children's speech recognition and the robustness of the systems to noise and changes in the transmission environment. Another important aspe...

  17. Efficacy of a New Medical Information system, Ubiquitous Healthcare Service with Voice Inception Technique in Elderly Diabetic Patients.

    Science.gov (United States)

    Kim, Kyoung Min; Park, Kyeong Seon; Lee, Hyun Ju; Lee, Yun Hee; Bae, Ji Seon; Lee, Young Joon; Choi, Sung Hee; Jang, Hak Chul; Lim, Soo

    2015-12-11

    We have demonstrated previously that an individualized health management system using advanced medical information technology, named ubiquitous (u)-healthcare, was helpful in achieving better glycemic control than routine care. Recently, we generated a new u-healthcare system using a voice inception technique for elderly diabetic patients to communicate information about their glucose control, physical activity, and diet more easily. In a randomized clinical trial, 70 diabetic patients aged 60-85 years were assigned randomly to a standard care group or u-healthcare group for 6 months. The primary end points were the changes in glycated hemoglobin (HbA1c) and glucose fluctuation assessed by the mean amplitude glycemic excursion (MAGE). Changes in body weight, lifestyle, and knowledge about diabetes were also investigated. After 6 months, the HbA1c levels decreased significantly in the u-healthcare group (from 8.6 ± 1.0% to 7.5 ± 0.6%) compared with the standard care group (from 8.7 ± 0.9% to 8.2 ± 1.1%, P < 0.01). The MAGE decreased more in the u-healthcare group than in the standard care group. Systolic blood pressure and body weight decreased and liver functions improved in the u-healthcare group, but not in the standard care group. The u-healthcare system with voice inception technique was effective in achieving glycemic control without hypoglycemia in elderly diabetic patients (Clinicaltrials.gov: NCT01891474).

  18. Development of an Environment-Aware Locomotion Mode Recognition System for Powered Lower Limb Prostheses.

    Science.gov (United States)

    Liu, Ming; Wang, Ding; Helen Huang, He

    2016-04-01

    This paper aimed to develop and evaluate an environment-aware locomotion mode recognition system for volitional control of powered artificial legs. A portable terrain recognition (TR) module, consisting of an inertial measurement unit and a laser distance meter, was built to identify the type of terrain in front of the wearer while walking. A decision tree was used to classify the terrain types and provide either coarse or refined information about the walking environment. Then, the obtained environmental information was modeled as an a priori probability and was integrated with a neuromuscular-mechanical-fusion-based locomotion mode (LM) recognition system. The designed TR module and environment-aware LM recognition system were evaluated separately on able-bodied subjects and a transfemoral amputee online. The results showed that the TR module provided high-quality environmental information: TR accuracy was above 98% and terrain transitions were detected over 500 ms before the time required to switch the prosthesis control mode. This enabled smooth locomotion mode transitions for the wearers. The obtained environmental information further improved the performance of the LM recognition system, regardless of whether coarse or refined information was used. In addition, the environment-aware LM recognition system produced reliable online performance when the TR output was relatively noisy, which indicated the potential of this system to operate in unstructured environments. This paper demonstrated that environmental information should be considered for operating wearable lower limb robotic devices, such as prosthetics and orthotics.
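
    The idea of treating environmental information as an a priori probability can be sketched with Bayes' rule. This is a generic illustration, not the authors' implementation: the abstract does not describe how the prior is derived from the TR module or which classifier produces the likelihoods, so the function names, mode labels and all numbers below are hypothetical.

```python
# Minimal sketch: fuse per-mode classifier likelihoods with a terrain-derived
# prior using Bayes' rule. All names and numbers are illustrative assumptions.

def fuse_with_prior(likelihoods, prior):
    """Return the normalized posterior over locomotion modes."""
    unnormalized = {
        mode: likelihoods[mode] * prior.get(mode, 0.0) for mode in likelihoods
    }
    total = sum(unnormalized.values())
    if total == 0.0:
        raise ValueError("prior and likelihoods have no overlapping support")
    return {mode: value / total for mode, value in unnormalized.items()}


def most_likely_mode(posterior):
    """Pick the locomotion mode with the highest posterior probability."""
    return max(posterior, key=posterior.get)


# Hypothetical likelihoods from a neuromuscular-mechanical classifier: the
# signals alone slightly favor level walking.
likelihoods = {"level_walking": 0.45, "stair_ascent": 0.40, "ramp_ascent": 0.15}

# Hypothetical prior from terrain recognition: stairs detected ahead.
terrain_prior = {"level_walking": 0.2, "stair_ascent": 0.7, "ramp_ascent": 0.1}

posterior = fuse_with_prior(likelihoods, terrain_prior)
print(most_likely_mode(posterior))
```

    In this toy case the terrain prior shifts the decision from level walking to stair ascent, which is the kind of effect the abstract attributes to adding environmental information.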

  19. Improved ASL based Gesture Recognition using HMM for System Application

    Directory of Open Access Journals (Sweden)

    Shalini Anand

    2014-03-01

    Gesture recognition is a growing field of research, and among the various forms of human-computer interaction, hand gesture recognition is very popular for interaction between humans and machines. It is a non-verbal way of communication, and this research area is full of innovative approaches. This project aims at recognizing 34 basic static hand gestures based on American Sign Language (ASL), including letters of the alphabet as well as numbers (0 to 9). We have not considered the two letters J and Z, because the project aims at recognizing static hand gestures, whereas in ASL these two are dynamic. The main features used are optimization of the database using a neural network and a Hidden Markov Model (HMM). The algorithm relies on shape-based features, on the premise that the shape of the human hand is the same for all human beings except in some situations.

  20. A new accurate pill recognition system using imprint information

    Science.gov (United States)

    Chen, Zhiyuan; Kamata, Sei-ichiro

    2013-12-01

    Great achievements in modern medicine benefit human beings. They have also brought about an explosive growth in the number of pharmaceuticals currently on the market. In daily life, pharmaceuticals sometimes confuse people when they are found unlabeled. In this paper, we propose an automatic pill recognition technique to solve this problem. It relies mainly on the imprint feature of the pills, which is extracted by the proposed MSWT (modified stroke width transform) and described by WSC (weighted shape context). Experiments show that our proposed pill recognition method can reach an accuracy of up to 92.03% within the top 5 ranks when classifying more than 10 thousand query pill images into around 2000 categories.

  1. Modular Neural Networks and Type-2 Fuzzy Systems for Pattern Recognition

    CERN Document Server

    Melin, Patricia

    2012-01-01

    This book describes hybrid intelligent systems using type-2 fuzzy logic and modular neural networks for pattern recognition applications. Hybrid intelligent systems combine several intelligent computing paradigms, including fuzzy logic, neural networks, and bio-inspired optimization algorithms, which can be used to produce powerful pattern recognition systems. Type-2 fuzzy logic is an extension of traditional type-1 fuzzy logic that enables managing higher levels of uncertainty in complex real world problems, which are of particular importance in the area of pattern recognition. The book is organized in three main parts, each containing a group of chapters built around a similar subject. The first part consists of chapters with the main theme of theory and design algorithms, which are basically chapters that propose new models and concepts, which are the basis for achieving intelligent pattern recognition. The second part contains chapters with the main theme of using type-2 fuzzy models and modular neural ne...

  2. A REVIEW ON THE DEVELOPMENT OF INDONESIAN SIGN LANGUAGE RECOGNITION SYSTEM

    Directory of Open Access Journals (Sweden)

    Sutarman

    2013-01-01

    Sign language is mainly employed by hearing-impaired people to communicate with each other. However, communication with hearing people is a major handicap for them, since hearing people do not understand their sign language. Sign language recognition is needed for realizing a human-oriented interactive system that can perform an interaction like normal communication. Sign language recognition basically uses two approaches: (1) computer vision-based gesture recognition, in which a camera is used as input and videos are captured and stored as video files before being processed using image processing; (2) an approach based on sensor data, which uses a series of sensors integrated with gloves to get the motion features of finger grooves and hand movements. Different sign languages exist around the world, each with its own vocabulary and gestures. Some examples are American Sign Language (ASL), Chinese Sign Language (CSL), British Sign Language (BSL), Indonesian Sign Language (ISL) and so on. The structure of Indonesian Sign Language (ISL) is different from the sign languages of other countries, in that words can be formed from a prefix and/or a suffix. In order to improve recognition accuracy, researchers use methods such as the hidden Markov model, artificial neural networks and dynamic time warping. Effective algorithms for segmentation, matching, classification and pattern recognition have evolved. The main objective of this study is to review the sign language recognition methods in order to choose the best method for developing the Indonesian sign language recognition system.

  3. Keyboard With Voice Output

    Science.gov (United States)

    Huber, W. C.

    1986-01-01

    Voice synthesizer tells what key is about to be depressed. Verbal feedback useful for blind operators or where dim light prevents sighted operator from seeing keyboard. Also used where operator is busy observing other things while keying data into control system. Used as training aid for touch typing, and to train blind operators to use both standard and braille keyboards. Concept adapted to such equipment as typewriters, computers, calculators, telephones, cash registers, and on/off controls.

  4. Health Care in Home Automation Systems with Speech Recognition and Mobile Technology

    Directory of Open Access Journals (Sweden)

    Jasmin Kurti

    2016-08-01

    Home automation systems use technology to facilitate the lives of the people using them, and they are especially useful for assisting the elderly and persons with special needs. These kinds of systems have been a popular research subject in the last few years. In this work, I present the design and development of a system that provides a life assistant service in a home environment: a smart home-based healthcare system controlled with speech recognition and mobile technology. This includes developing software with speech recognition, speech synthesis, face recognition, controls for Arduino hardware, and a smartphone application for remotely controlling the system. With the developed system, the elderly and persons with special needs can stay independently in their own home, secure and with care facilities. This system is tailored towards the elderly and disabled, but it can also be embedded in any home and used by anybody. It provides healthcare, security, entertainment, and total local and remote control of the home.

  5. Feature based sliding window technique for face recognition

    Science.gov (United States)

    Javed, Muhammad Younus; Mohsin, Syed Maajid; Anjum, Muhammad Almas

    2010-02-01

    Human beings are commonly identified by biometric schemes, which identify individuals by their unique physical characteristics. Passwords and personal identification numbers have been used to identify people for years. The disadvantages of these schemes are that someone else may use them or that they can easily be forgotten. In view of these problems, biometric approaches such as face recognition, fingerprint, iris/retina and voice recognition have been developed, which provide a far better solution for identifying individuals. A number of methods have been developed for face recognition. This paper illustrates the use of Gabor filters for extracting facial features by constructing a sliding window frame. Classification is done by assigning to the unknown image the class label of the class whose stored database image has the maximum number of similar features. The proposed system gives a recognition rate of 96%, which is better than many of the similar techniques being used for face recognition.

  6. Accuracy, security, and processing time comparisons of biometric fingerprint recognition system using digital and optical enhancements

    Science.gov (United States)

    Alsharif, Salim; El-Saba, Aed; Jagapathi, Rajendarreddy

    2011-06-01

    Fingerprint recognition is one of the most commonly used forms of biometrics and has been widely used in daily life due to its feasibility, distinctiveness, permanence, accuracy, reliability, and acceptability. Besides cost, issues related to accuracy, security, and processing time in practical biometric recognition systems represent the most critical factors that make these systems widely acceptable. Accurate and secure biometric systems often require sophisticated enhancement and encoding techniques that burden the overall processing time of the system. In this paper we present a comparison between common digital and optical enhancement/encoding techniques with respect to their accuracy, security and processing time, when applied to biometric fingerprint systems.

  7. The Glasgow Voice Memory Test: Assessing the ability to memorize and recognize unfamiliar voices.

    Science.gov (United States)

    Aglieri, Virginia; Watson, Rebecca; Pernet, Cyril; Latinus, Marianne; Garrido, Lúcia; Belin, Pascal

    2017-02-01

    One thousand one hundred and twenty subjects as well as a developmental phonagnosic subject (KH) along with age-matched controls performed the Glasgow Voice Memory Test, which assesses the ability to encode and immediately recognize, through an old/new judgment, both unfamiliar voices (delivered as vowels, making language requirements minimal) and bell sounds. The inclusion of non-vocal stimuli allows the detection of significant dissociations between the two categories (vocal vs. non-vocal stimuli). The distributions of accuracy and sensitivity scores (d') reflected a wide range of individual differences in voice recognition performance in the population. As expected, KH showed a dissociation between the recognition of voices and bell sounds, her performance being significantly poorer than matched controls for voices but not for bells. By providing normative data of a large sample and by testing a developmental phonagnosic subject, we demonstrated that the Glasgow Voice Memory Test, available online and accessible from all over the world, can be a valid screening tool (~5 min) for a preliminary detection of potential cases of phonagnosia and of "super recognizers" for voices.
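
    The sensitivity score (d') reported in this abstract comes from signal detection theory. As a minimal sketch, assuming the standard equal-variance Gaussian model, d' is the difference between the z-transformed hit rate and false-alarm rate in an old/new judgment task. The function name and the log-linear correction (adding 0.5 to each count so that perfect scores do not produce infinite z-values) are assumptions for illustration; the abstract does not say how the test itself computes d'.

```python
# Minimal sketch of d' for an old/new recognition task, assuming the
# equal-variance Gaussian signal detection model. The log-linear correction
# is an illustrative choice, not taken from the abstract.
from statistics import NormalDist


def d_prime(hits, misses, false_alarms, correct_rejections):
    """Sensitivity index d' = z(hit rate) - z(false-alarm rate)."""
    hit_rate = (hits + 0.5) / (hits + misses + 1)
    fa_rate = (false_alarms + 0.5) / (false_alarms + correct_rejections + 1)
    z = NormalDist().inv_cdf  # inverse of the standard normal CDF
    return z(hit_rate) - z(fa_rate)


# Hypothetical listener: 7 of 8 old voices recognized, 1 of 8 new voices
# wrongly judged as old.
print(round(d_prime(7, 1, 1, 7), 2))
```

    A listener who answers at chance (equal hit and false-alarm rates) gets d' = 0, while larger values indicate better discrimination between old and new voices.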

  8. Psychometric assessments of life quality and voice for teachers within the municipal system, in Bauru, SP, Brazil

    Directory of Open Access Journals (Sweden)

    Janaína Gheissa Martinello

    2011-12-01

    Studies show a high prevalence of vocal alterations among teachers. One of the criteria for the establishment of the prevalence of vocal alteration is based on teachers' self-perception. Objective: This study aimed at comparing voice-disordered quality of life measures between a group of teachers who reported vocal alteration and a group of teachers who did not, by verifying the teachers' perception regarding the impact of vocal alteration in the different dimensions of voice quality of life. Material and Methods: Ninety-seven (97) teachers answered three psychometric protocols of voice quality of life: Voice Handicap Index (VHI), Voice-Related Quality of Life (V-RQOL), and the Voice Activity Participation Profile (VAPP), in addition to a questionnaire for characterization of the sample. Results: The results were that 39.8% of the teachers reported vocal alteration. When comparing voice measures between the groups (with and without vocal alteration), statistically significant differences were observed: the total score of VHI, total score of V-RQOL and total score of VAPP and its dimensions. It was also verified that the physical dimension of VHI has a greater impact among the dimensions of this protocol. For V-RQOL, the most striking dimension was the physical functioning domain, both indicating the laryngeal discomfort and the impact of voice on communication, in teachers with and without complaints. As for VAPP, no domain prevailed over the others in the group with no complaints. For teachers with complaints, three domains, i.e., daily communication, work, and emotions, have a greater impact than social communication. The limitation and restriction scores were calculated as well, and it was observed that the limitation of activities is greater than the restriction of activities, both in the group with and the group without complaints. Conclusion: One may conclude that the teachers who reported vocal alterations better realize the impact of voice in

  9. Low-Complexity Hand Gesture Recognition System for Continuous Streams of Digits and Letters.

    Science.gov (United States)

    Poularakis, Stergios; Katsavounidis, Ioannis

    2016-09-01

    In this paper, we propose a complete gesture recognition framework based on maximum cosine similarity and fast nearest neighbor (NN) techniques, which offers high-recognition accuracy and great computational advantages for three fundamental problems of gesture recognition: 1) isolated recognition; 2) gesture verification; and 3) gesture spotting on continuous data streams. To support our arguments, we provide a thorough evaluation on three large publicly available databases, examining various scenarios, such as noisy environments, limited number of training examples, and time delay in system's response. Our experimental results suggest that this simple NN-based approach is quite accurate for trajectory classification of digits and letters and could become a promising approach for implementations on low-power embedded systems.
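
    The core idea of classification by maximum cosine similarity can be sketched as follows. This is an illustrative nearest-neighbor lookup under simplifying assumptions, not the authors' framework: trajectories are taken to be already resampled into equal-length, non-zero feature vectors, and the names `cosine_similarity` and `classify` are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def classify(query, templates):
    """Return the label of the template most similar to the query.

    templates is a list of (label, vector) pairs; a plain linear scan
    stands in for the fast nearest-neighbor search the paper refers to.
    """
    best_label, _ = max(templates, key=lambda t: cosine_similarity(query, t[1]))
    return best_label


if __name__ == "__main__":
    templates = [("0", [1.0, 0.0, 1.0]), ("1", [0.0, 1.0, 0.0])]
    print(classify([0.9, 0.1, 0.8], templates))
```

    Because cosine similarity ignores vector magnitude, a gesture drawn at a different scale maps to the same score, which is one plausible reason the measure suits trajectory data.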

  10. Man-system interface based on automatic speech recognition: integration to a virtual control desk

    Energy Technology Data Exchange (ETDEWEB)

    Jorge, Carlos Alexandre F.; Mol, Antonio Carlos A.; Pereira, Claudio M.N.A.; Aghina, Mauricio Alves C., E-mail: calexandre@ien.gov.b, E-mail: mol@ien.gov.b, E-mail: cmnap@ien.gov.b, E-mail: mag@ien.gov.b [Instituto de Engenharia Nuclear (IEN/CNEN-RJ), Rio de Janeiro, RJ (Brazil); Nomiya, Diogo V., E-mail: diogonomiya@gmail.co [Universidade Federal do Rio de Janeiro (UFRJ), RJ (Brazil)

    2009-07-01

    This work reports the implementation of a man-system interface based on automatic speech recognition, and its integration into a virtual nuclear power plant control desk. The latter aims to reproduce a real control desk using virtual reality technology, for operator training and ergonomic evaluation purposes. An automatic speech recognition system was developed to serve as a new interface with users, replacing the computer keyboard and mouse. Users can operate this virtual control desk in front of a computer monitor or a projection screen through spoken commands. The automatic speech recognition interface developed is based on a well-known signal processing technique named cepstral analysis, and on artificial neural networks. The speech recognition interface is described, along with its integration with the virtual control desk, and results are presented. (author)

  11. FPGA IMPLEMENTATION OF ADAPTIVE INTEGRATED SPIKING NEURAL NETWORK FOR EFFICIENT IMAGE RECOGNITION SYSTEM

    Directory of Open Access Journals (Sweden)

    T. Pasupathi

    2014-05-01

    Image recognition is a technology which can be used in various applications such as medical image recognition systems, security, defense video tracking, and factory automation. In this paper we present a novel pipelined architecture of an adaptive integrated Artificial Neural Network for image recognition. In our proposed work we have combined the spiking neuron concept with an ANN to achieve an efficient architecture for image recognition. The set of training images is trained by the ANN and the target output is identified. Real-time videos are captured and then converted into frames for testing purposes, and the images are recognized. The machine can operate at up to 40 frames/sec using images acquired from the camera. The system has been implemented on an XC3S400 SPARTAN-3 Field Programmable Gate Array.

  12. Named Entity Recognition in a Hungarian NL Based QA System

    Science.gov (United States)

    Tikkl, Domonkos; Szidarovszky, P. Ferenc; Kardkovacs, Zsolt T.; Magyar, Gábor

    In the WoW project, our purpose is to create a complex search interface with the following features: search in the deep web content of contracted partners' databases, processing Hungarian natural language (NL) questions and transforming them to SQL queries for database access, and image search supported by a visual thesaurus that describes in a structural form the visual content of images (also in Hungarian). This paper primarily focuses on a particular problem of the question processing task: entity recognition. Before going into details we give a short overview of the project's aims.

  13. BRAF inhibition improves tumor recognition by the immune system

    DEFF Research Database (Denmark)

    Donia, Marco; Fagone, Paolo; Nicoletti, Ferdinando

    2012-01-01

    , which represents one of the most promising approaches currently in clinical development for the treatment of metastatic melanoma. Here we show that blocking the BRAF-MAPK pathway in BRAF signaling-addicted melanoma cells significantly increases the ability of T cells contained in clinical grade tumor-infiltrating lymphocytes to recognize autologous BRAF(V600) mutant melanoma cell lines in vitro. Antitumor reactivity was improved regardless of the class of antigen recognized by tumor-specific CD8(+) T cells. Microarray data suggests that improved tumor recognition is associated with modified expression of MHC Class I...

  14. Adaptive Compensation Algorithm in Open Vocabulary Mandarin Speaker-Independent Speech Recognition

    Institute of Scientific and Technical Information of China (English)

    2002-01-01

    In speech recognition systems, the physiological characteristics of the speech production model cause the voiced sections of the speech signal to have an attenuation of approximately 20 dB per decade. Many speech recognition algorithms have been developed to solve this problem by filtering the input signal with a single-zero high pass filter. Unfortunately, this technique increases the noise energy at high frequencies above 4 kHz, which in some cases degrades the recognition accuracy. This paper solves the problem using a pre-emphasis filter in the front end of the recognizer. The aim is to develop a modified parameterization approach taking into account the whole energy zone in the spectrum to improve the performance of the existing baseline recognition system in the acoustic phase. The results show that a large vocabulary speaker-independent continuous speech recognition system using this approach has a greatly improved recognition rate.
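
    For orientation, the conventional single-zero high-pass (pre-emphasis) filter referred to above is usually written as y[n] = x[n] - a·x[n-1]. The sketch below shows only that baseline filter, not the modified parameterization the paper proposes; the coefficient 0.97 is a commonly used default and an assumption here, and the function name `pre_emphasis` is hypothetical.

```python
def pre_emphasis(signal, alpha=0.97):
    """Apply the first-order filter y[n] = x[n] - alpha * x[n-1].

    The first sample is passed through unchanged because it has no
    predecessor. alpha=0.97 is an assumed, commonly used value.
    """
    if not signal:
        return []
    out = [signal[0]]
    for prev, cur in zip(signal, signal[1:]):
        out.append(cur - alpha * prev)
    return out


if __name__ == "__main__":
    # A constant (zero-frequency) input is strongly attenuated after the first sample.
    print(pre_emphasis([1.0, 1.0, 1.0, 1.0]))
```

    The filter boosts high frequencies relative to low ones, which compensates for the spectral roll-off of voiced speech but also amplifies high-frequency noise, the drawback the abstract points out.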

  15. Multipath/RFI/modulation study for DRSS-RFI problem: Voice coding and intelligibility testing for a satellite-based air traffic control system

    Science.gov (United States)

    Birch, J. N.; Getzin, N.

    1971-01-01

    Analog and digital voice coding techniques for application to an L-band satellite-based air traffic control (ATC) system for over-ocean deployment are examined. In addition to performance, the techniques are compared on the basis of cost, size, weight, power consumption, availability, reliability, and multiplexing features. Candidate systems are chosen on the basis of minimum required RF bandwidth and received carrier-to-noise density ratios. A detailed survey of automated and nonautomated intelligibility testing methods and devices is presented and comparisons given. Subjective evaluation of speech systems by preference tests is considered. Conclusions and recommendations are developed regarding the selection of the voice system. Likewise, conclusions and recommendations are developed for the appropriate use of intelligibility tests, speech quality measurements, and preference tests within the framework of the proposed ATC system.

  16. Automatic micropropagation of plants--the vision-system: graph rewriting as pattern recognition

    Science.gov (United States)

    Schwanke, Joerg; Megnet, Roland; Jensch, Peter F.

    1993-03-01

    The automation of plant micropropagation is necessary to produce large amounts of biomass. Plants have to be dissected at particular cutting points. A vision system is needed to recognize the cutting points on the plants. Against this background, this contribution addresses the underlying formalism for determining cutting points on abstract plant models. We show the usefulness of pattern recognition by graph rewriting, along with some examples in this context.

  17. A New Multimodal Biometric System Based on Finger Vein and Hand Vein Recognition

    OpenAIRE

    Randa Boukhris Trabelsi; Alima Damak Masmoudi; Dorra Sellami Masmoudi

    2013-01-01

    As a reliable and robust biological characteristic, the vein pattern is attracting growing attention in biometric research. Generally, it has been shown that recognition based on a single biometric modality cannot achieve high performance. In this paper, we propose a new multimodal biometric system based on fusion of both hand vein and finger vein modalities. For finger vein recognition, we employ the Monogenic Local Binary Pattern (MLBP), and for hand vein recognition, an Improved Gaussian Matche...

  18. CHARACTERIZING HABITUATION USING THE TIME-ON-TASK METRIC IN AN IRIS RECOGNITION SYSTEM

    OpenAIRE

    Hasselgren, Jacob A.

    2014-01-01

    This thesis presents a characterization of biometric habituation in an iris recognition study using qualitative analysis of a distributed habituation survey and quantitative analysis of iris images collected in 2010 and 2012. The performed analyses answered the following two questions: a) How consistently does the biometric community define habituation?; and b) Does the time-on-task variable provide enough evidence to indicate the existence of habituation in an iris recognition system? The qu...

  19. Hand gesture recognition system based in computer vision and machine learning

    OpenAIRE

    Trigueiros, Paulo; Ribeiro, António Fernando; Reis, L.P.

    2015-01-01

    "Lecture notes in computational vision and biomechanics series, ISSN 2212-9391, vol. 19" Hand gesture recognition is a natural way of human computer interaction and an area of very active research in computer vision and machine learning. This is an area with many different possible applications, giving users a simpler and more natural way to communicate with robots/systems interfaces, without the need for extra devices. So, the primary goal of gesture recognition research applied to Hum...

  20. Towards events recognition in a distributed fiber-optic sensor system: Kolmogorov-Zurbenko filtering

    CERN Document Server

    Fedorov, Aleksey; Zhirnov, Andrey; Nesterov, Evgeniy; Namiot, Dmitry; Pnev, Alexey; Karasik, Valery

    2015-01-01

    The paper is about de-noising procedures aimed at event recognition in signals from a distributed fiber-optic vibration sensor system based on phase-sensitive optical time-domain reflectometry. We report experimental results on recognition of several classes of events in a seismic background. The de-noising procedure uses the framework of time-series analysis and Kolmogorov-Zurbenko filtering. We demonstrate that this approach reveals signatures of several classes of events.
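
    As background, a Kolmogorov-Zurbenko (KZ) filter is k repeated passes of a centered moving average of window length m. The sketch below illustrates that general definition only; the window length, iteration count, and edge handling (averaging over whatever samples are available near the boundaries) are assumptions, not the settings used in the paper, and the function names are hypothetical.

```python
def moving_average(x, m):
    """Centered moving average with an odd window length m.

    Near the edges the window is truncated and the mean is taken over
    the samples that exist (an assumed edge-handling choice).
    """
    half = m // 2
    out = []
    for i in range(len(x)):
        window = x[max(0, i - half): i + half + 1]
        out.append(sum(window) / len(window))
    return out


def kz_filter(x, m, k):
    """Kolmogorov-Zurbenko filter: k iterations of an m-point moving average."""
    for _ in range(k):
        x = moving_average(x, m)
    return x


if __name__ == "__main__":
    noisy = [0.0, 1.0, 0.0, 1.0, 10.0, 1.0, 0.0, 1.0, 0.0]
    print([round(v, 2) for v in kz_filter(noisy, m=3, k=2)])
```

    Repeating the moving average sharpens the low-pass behavior compared with a single pass, which is why the method is used to suppress background noise while retaining slower signal components.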

  1. Digital voice storage and replay system based on ADPCM

    Institute of Scientific and Technical Information of China (English)

    2013-01-01

    With a single-chip microcontroller and an FPGA as the control core, the system implements voice storage and replay. It can acquire analog voice signals and stereo signals from an earphone, and it raises the utilization rate of memory through the use of ADPCM (adaptive differential pulse-code modulation), which means the voice can be stored for more than 2 minutes. Based on the short-time Fourier transform principle, it also performs spectral analysis of voice signals with real-time display. Voice is played back through a stereo audio amplifier; the volume of each channel can be adjusted and muted. Furthermore, measures such as pre-emphasis, de-emphasis and anti-aliasing filtering are used in this system to increase the SNR effectively, giving good playback quality and a longer storage time.

  2. Person Recognition System Based on a Combination of Body Images from Visible Light and Thermal Cameras.

    Science.gov (United States)

    Nguyen, Dat Tien; Hong, Hyung Gil; Kim, Ki Wan; Park, Kang Ryoung

    2017-03-16

    The human body contains identity information that can be used for the person recognition (verification/recognition) problem. In this paper, we propose a person recognition method using the information extracted from body images. Our research is novel in the following three ways compared to previous studies. First, we use images of the human body for recognizing individuals. To overcome the limitations of previous studies on body-based person recognition that use only visible light images for recognition, we use human body images captured by two different kinds of camera, a visible light camera and a thermal camera. The use of two different kinds of body image helps us to reduce the effects of noise, background, and variation in the appearance of a human body. Second, we apply a state-of-the-art method, the convolutional neural network (CNN), among various available methods, for image feature extraction in order to overcome the limitations of traditional hand-designed image feature extraction methods. Finally, with the extracted image features from body images, the recognition task is performed by measuring the distance between the input and enrolled samples. The experimental results show that the proposed method is efficient for enhancing recognition accuracy compared to systems that use only visible light or thermal images of the human body.

  3. Fusion of Visible and Thermal Descriptors Using Genetic Algorithms for Face Recognition Systems

    Directory of Open Access Journals (Sweden)

    Gabriel Hermosilla

    2015-07-01

    The aim of this article is to present a new face recognition system based on the fusion of visible and thermal features obtained from the most current local matching descriptors by maximizing face recognition rates through the use of genetic algorithms. The article considers a comparison of the performance of the proposed fusion methodology against five current face recognition methods and classic fusion techniques used commonly in the literature. These were selected by considering their performance in face recognition. The five local matching methods and the proposed fusion methodology are evaluated using the standard visible/thermal database, the Equinox database, along with a new database, the PUCV-VTF, designed for visible-thermal studies in face recognition and described for the first time in this work. The latter is created considering visible and thermal image sensors with different real-world conditions, such as variations in illumination, facial expression, pose, occlusion, etc. The main conclusions of this article are that two variants of the proposed fusion methodology surpass current face recognition methods and the classic fusion techniques reported in the literature, attaining recognition rates of over 97% and 99% for the Equinox and PUCV-VTF databases, respectively. The fusion methodology is very robust to illumination and expression changes, as it combines thermal and visible information efficiently by using genetic algorithms, thus allowing it to choose optimal face areas where one spectrum is more representative than the other.

  4. Fusion of Visible and Thermal Descriptors Using Genetic Algorithms for Face Recognition Systems.

    Science.gov (United States)

    Hermosilla, Gabriel; Gallardo, Francisco; Farias, Gonzalo; San Martin, Cesar

    2015-07-23

    The aim of this article is to present a new face recognition system based on the fusion of visible and thermal features obtained from the most current local matching descriptors by maximizing face recognition rates through the use of genetic algorithms. The article considers a comparison of the performance of the proposed fusion methodology against five current face recognition methods and classic fusion techniques used commonly in the literature. These were selected by considering their performance in face recognition. The five local matching methods and the proposed fusion methodology are evaluated using the standard visible/thermal database, the Equinox database, along with a new database, the PUCV-VTF, designed for visible-thermal studies in face recognition and described for the first time in this work. The latter is created considering visible and thermal image sensors with different real-world conditions, such as variations in illumination, facial expression, pose, occlusion, etc. The main conclusions of this article are that two variants of the proposed fusion methodology surpass current face recognition methods and the classic fusion techniques reported in the literature, attaining recognition rates of over 97% and 99% for the Equinox and PUCV-VTF databases, respectively. The fusion methodology is very robust to illumination and expression changes, as it combines thermal and visible information efficiently by using genetic algorithms, thus allowing it to choose optimal face areas where one spectrum is more representative than the other.

  5. Using the Voice to Design Ceramics

    DEFF Research Database (Denmark)

    Hansen, Flemming Tvede; Jensen, Kristoffer

    2011-01-01

    SoundShaping, a system to create ceramics from the human voice. Based on a generic audio feature extraction system, and the principal component analysis to ensure that the pertinent information in the voice is used, a 3D shape is created using simple geometric rules. This shape is output to a 3D printer...

  7. Speaking and Nonspeaking Voice Professionals: Who Has the Better Voice?

    Science.gov (United States)

    Chitguppi, Chandala; Raj, Anoop; Meher, Ravi; Rathore, P K

    2017-04-18

    Voice professionals can be classified into two major subgroups: the primarily speaking and the primarily nonspeaking voice professionals. Nonspeaking voice professionals mainly include singers, whereas speaking voice professionals include the rest of the voice professionals. Although both of these groups have high vocal demands, it is currently unknown whether both groups show similar voice changes after their daily voice use. Comparison of these two subgroups of voice professionals has never been done before. This study aimed to compare the speaking voice of speaking and nonspeaking voice professionals with no obvious vocal fold pathology or voice-related complaints on the day of assessment. After obtaining relevant voice-related history, voice analysis and videostroboscopy were performed in 50 speaking and 50 nonspeaking voice professionals. Speaking voice professionals showed significantly higher incidence of voice-related complaints as compared with nonspeaking voice professionals. Voice analysis revealed that most acoustic parameters including fundamental frequency, jitter percent, and harmonic-to-noise ratio were significantly higher in speaking voice professionals, whereas videostroboscopy did not show any significant difference between the two groups. This is the first study of its kind to analyze the effect of daily voice use in the two subgroups of voice professionals with no obvious vocal fold pathology. We conclude that voice professionals should not be considered as a homogeneous group. The detrimental effects of excessive voice use were observed to occur more significantly in speaking voice professionals than in nonspeaking voice professionals. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  8. CCNA Voice Study Guide, Exam 640-460

    CERN Document Server

    Froehlich, Andrew

    2010-01-01

    The ultimate guide to the new CCNA voice network administrator certification exam. The new CCNA Voice exam tests candidates on their ability to implement a Cisco VoIP solution. Network administrators of voice systems will appreciate that the CCNA Voice Study Guide focuses completely on the information required by the exam. Along with hands-on labs and an objective map showing where each objective is covered, this guide includes a CD with the Sybex Test Engine, flashcards, and the entire book in PDF format. The new CCNA Voice certification will be valuable for administrators of voice network syste

  9. A Russian Keyword Spotting System Based on Large Vocabulary Continuous Speech Recognition and Linguistic Knowledge

    Directory of Open Access Journals (Sweden)

    Valentin Smirnov

    2016-01-01

    The paper describes the key concepts of a word spotting system for Russian based on large vocabulary continuous speech recognition. Key algorithms and system settings are described, including the pronunciation variation algorithm, and experimental results on real-life telecom data are provided. A description of the system architecture and the user interface is provided. The system is based on the CMU Sphinx open-source speech recognition platform and on the linguistic models and algorithms developed by Speech Drive LLC. The effective combination of baseline statistical methods, real-world training data, and the intensive use of linguistic knowledge led to a quality result applicable to industrial use.

  10. Leveraging voice

    DEFF Research Database (Denmark)

    Frølunde, Lisbeth

    2017-01-01

    This paper speculates on how researchers share research without diluting our credibility and how to make strategies for the future. It also calls for consideration of new traditions and practices for communicating knowledge to a wider audience across multiple media platforms. How might we researchers improve our practices and how could digital online video help offer more positive stories about research and higher education? How can academics in higher education be better to tell about our research, thereby reclaiming and leveraging our voice in a post-factual era? As higher education continues to engage with digital and networked technologies it becomes increasingly relevant to question why and how academics could (re)position research knowledge in the digital and online media landscape of today and the future. The paper highlights methodological issues that arise in relation...

  11. Feeling voices.

    Directory of Open Access Journals (Sweden)

    Paolo Ammirante

    Two experiments investigated deaf individuals' ability to discriminate between same-sex talkers based on vibrotactile stimulation alone. Nineteen participants made same/different judgments on pairs of utterances presented to the lower back through voice coils embedded in a conforming chair. Discrimination of stimuli matched for F0, duration, and perceived magnitude was successful for pairs of spoken sentences in Experiment 1 (median percent correct = 83%) and pairs of vowel utterances in Experiment 2 (median percent correct = 75%). Greater difference in spectral tilt between "different" pairs strongly predicted their discriminability in both experiments. The current findings support the hypothesis that discrimination of complex vibrotactile stimuli involves the cortical integration of spectral information filtered through frequency-tuned skin receptors.

  12. Determining adaptive thresholds for image segmentation for a license plate recognition system

    Directory of Open Access Journals (Sweden)

    Siti Norul Huda Sheikh Abdullah

    2016-06-01

    A vehicle license plate recognition (LPR) system is useful in many applications, such as entrance admission, security, parking control, airport and cargo, and traffic and speed control. This paper describes an adaptive threshold for image segmentation applied to a system for Malaysian intelligent license plate recognition (MyiLPR). Due to the different types of license plates used, the requirements of an automatic LPR system are rather different for each country. Upon receiving the input car image, this system (MyiLPR) detects and segments the license plate based on the proposed adaptive threshold via image and blob histograms and blob agglomeration, and finally, it extracts geometric character features and classifies them using a neural network. The use of the proposed adaptive threshold increased the detection, segmentation and recognition rates to 99%, 94.98% and 90%, respectively, from the 95%, 78.27% and 71.08% obtained with the fixed threshold used in the originally proposed system.

  13. Spoof Detection for Finger-Vein Recognition System Using NIR Camera

    Directory of Open Access Journals (Sweden)

    Dat Tien Nguyen

    2017-10-01

    Finger-vein recognition, a new and advanced biometrics recognition method, is attracting the attention of researchers because of its advantages such as high recognition performance and lesser likelihood of theft and inaccuracies occurring on account of skin condition defects. However, as reported by previous researchers, it is possible to attack a finger-vein recognition system by using presentation attack (fake) finger-vein images. As a result, spoof detection, known as presentation attack detection (PAD), is necessary in such recognition systems. Previous attempts to establish PAD methods primarily focused on designing feature extractors by hand (handcrafted feature extractors) based on the observations of the researchers about the difference between real (live) and presentation attack finger-vein images. Therefore, the detection performance was limited. Recently, the deep learning framework has been successfully applied in computer vision and delivered superior results compared to traditional handcrafted methods on various computer vision applications such as image-based face recognition, gender recognition and image classification. In this paper, we propose a PAD method for a near-infrared (NIR) camera-based finger-vein recognition system using a convolutional neural network (CNN) to enhance the detection ability of previous handcrafted methods. Using the CNN method, we can derive a more suitable feature extractor for PAD than the other handcrafted methods using a training procedure. We further process the extracted image features to enhance the presentation attack finger-vein image detection ability of the CNN method using principal component analysis (PCA) for dimensionality reduction of the feature space and a support vector machine (SVM) for classification. Through extensive experimental results, we confirm that our proposed method is adequate for presentation attack finger-vein image detection and it can deliver superior detection results compared

  14. Automated alignment system for optical wireless communication systems using image recognition.

    Science.gov (United States)

    Brandl, Paul; Weiss, Alexander; Zimmermann, Horst

    2014-07-01

    In this Letter, we describe the realization of a tracked line-of-sight optical wireless communication system for indoor data distribution. We built a laser-based transmitter with adaptive focus and ray steering by a microelectromechanical systems mirror. To execute the alignment procedure, we used a CMOS image sensor at the transmitter side and developed an algorithm for image recognition to localize the receiver's position. The receiver is based on a self-developed optoelectronic integrated chip with low requirements on the receiver optics to make the system economically attractive. With this system, we were able to set up the communication link automatically without any back channel and to perform error-free (bit error rate <10⁻⁹) data transmission over a distance of 3.5 m with a data rate of 3 Gbit/s.

  15. Design of an Optical Character Recognition System for Camera-based Handheld Devices

    Directory of Open Access Journals (Sweden)

    Ayatullah Faruk Mollah

    2011-07-01

    This paper presents a complete Optical Character Recognition (OCR) system for camera-captured image/graphics-embedded textual documents for handheld devices. At first, text regions are extracted and skew corrected. Then, these regions are binarized and segmented into lines and characters. Characters are passed into the recognition module. Experimenting with a set of 100 business card images, captured by cell phone camera, we have achieved a maximum recognition accuracy of 92.74%. Compared to Tesseract, a powerful open source desktop-based OCR engine, the present recognition accuracy is a worthwhile contribution. Moreover, the developed technique is computationally efficient and consumes little memory, so as to be applicable on handheld devices.

  16. Neuropeptide S interacts with the basolateral amygdala noradrenergic system in facilitating object recognition memory consolidation.

    Science.gov (United States)

    Han, Ren-Wen; Xu, Hong-Jiao; Zhang, Rui-San; Wang, Pei; Chang, Min; Peng, Ya-Li; Deng, Ke-Yu; Wang, Rui

    2014-01-01

    The noradrenergic activity in the basolateral amygdala (BLA) was reported to be involved in the regulation of object recognition memory. As the BLA expresses high density of receptors for Neuropeptide S (NPS), we investigated whether the BLA is involved in mediating NPS's effects on object recognition memory consolidation and whether such effects require noradrenergic activity. Intracerebroventricular infusion of NPS (1nmol) post training facilitated 24-h memory in a mouse novel object recognition task. The memory-enhancing effect of NPS could be blocked by the β-adrenoceptor antagonist propranolol. Furthermore, post-training intra-BLA infusions of NPS (0.5nmol/side) improved 24-h memory for objects, which was impaired by co-administration of propranolol (0.5μg/side). Taken together, these results indicate that NPS interacts with the BLA noradrenergic system in improving object recognition memory during consolidation.

  17. From Birdsong to Human Speech Recognition: Bayesian Inference on a Hierarchy of Nonlinear Dynamical Systems

    Science.gov (United States)

    Yildiz, Izzet B.; von Kriegstein, Katharina; Kiebel, Stefan J.

    2013-01-01

    Our knowledge about the computational mechanisms underlying human learning and recognition of sound sequences, especially speech, is still very limited. One difficulty in deciphering the exact means by which humans recognize speech is that there are scarce experimental findings at a neuronal, microscopic level. Here, we show that our neuronal-computational understanding of speech learning and recognition may be vastly improved by looking at an animal model, i.e., the songbird, which faces the same challenge as humans: to learn and decode complex auditory input, in an online fashion. Motivated by striking similarities between the human and songbird neural recognition systems at the macroscopic level, we assumed that the human brain uses the same computational principles at a microscopic level and translated a birdsong model into a novel human sound learning and recognition model with an emphasis on speech. We show that the resulting Bayesian model with a hierarchy of nonlinear dynamical systems can learn speech samples such as words rapidly and recognize them robustly, even in adverse conditions. In addition, we show that recognition can be performed even when words are spoken by different speakers and with different accents—an everyday situation in which current state-of-the-art speech recognition models often fail. The model can also be used to qualitatively explain behavioral data on human speech learning and derive predictions for future experiments. PMID:24068902

  18. A multi-agent system simulating human splice site recognition.

    Science.gov (United States)

    Vignal, L; Lisacek, F; Quinqueton, J; d'Aubenton-Carafa, Y; Thermes, C

    1999-06-15

    The present paper describes a method for detecting splice sites automatically on the basis of sequence data and models of site/signal recognition supported by experimental evidence. The method is designed to simulate splicing and, while doing so, to track prediction failures and missing information and possibly to test corrective hypotheses. Correlations between nucleotides in the splice site regions and the various elements of the acceptor region are evaluated and combined to assess compensating interactions between elements of the splicing machinery. A scanning model of the acceptor region and a model of interaction between the splicing complexes (exon definition model) are also incorporated in the detection process. Subsets of sites deficient in several splice site elements could be identified. Further examination of these sites helps to determine the missing elements and refine the models.

  19. An Efficient Face Recognition System Based On the Hybridization of Pose Invariant and Illumination Process

    Directory of Open Access Journals (Sweden)

    S. Muruganantham

    2012-07-01

    Over the past decade, human face recognition has been one of the most successful applications of image analysis and understanding, and it has attracted significant attention. Face recognition is one of several techniques used to identify an individual. Image variations caused by a change in face identity are normally smaller than the variations between images of the same face under different illumination and viewing angles. Among the factors that affect face recognition, illumination and pose are the two major challenges, and variations in them severely degrade recognition performance. Although several algorithms have been proposed for face recognition from fixed viewpoints, considerably less effort has gone into the problem of combined pose and illumination variation. In this paper we propose a face recognition method that is robust to pose and illumination variations. We first put forward a simple pose estimation method based on 2D images, which uses a suitable classification rule and image representation to classify the pose of a face image. The image can then be assigned to a pose class by a classification rule in a low-dimensional subspace constructed by a feature extraction method. We also offer a shadow compensation method that compensates for illumination variation in a face image so that the image can be recognized by a face recognition system designed for images under normal illumination. The implementation results show that the proposed technique, based on this hybridization, recognizes face images effectively.

  20. Influence of recording system on voiceprint recognition

    Institute of Scientific and Technical Information of China (English)

    达钊; 李倩; 郭霞生; 章东

    2011-01-01

    Voiceprint recognition has been applied in the field of identity verification. With the wide use of digital recording systems, the processing and analysis of recorded voice samples have become important, which motivates the study of how recording systems affect voice samples and which parameters remain stable. In this study, samples are recorded with three kinds of recording system: voice pen, microphone, and mobile phone. After speech preprocessing, the pitch, the formants (extracted by the linear prediction coefficient method), and the Mel cepstrum coefficients of the samples are extracted and analyzed. The results show that differences between recording systems affect the extracted parameters, especially through formant loss and differences in the Mel Frequency Cepstrum Coefficient (MFCC) parameters. The differences in pitch and MFCC among the three recording systems are smaller than the differences in linear prediction coefficients. It is suggested that (1) high-quality recording equipment suffers less formant loss, and its bandwidth is clearly higher than that of low-quality equipment; (2) although MFCC parameters are robust, the MFCC information extracted from low-quality recording equipment shows weaker noise resistance; and (3) since the recording system has little influence on pitch, pitch parameters can be used alongside MFCC parameters as identification parameters when dealing with speech samples from low-quality equipment.

  1. An event-based neurobiological recognition system with orientation detector for objects in multiple orientations

    Directory of Open Access Journals (Sweden)

    Hanyu Wang

    2016-11-01

    This paper proposes a new multi-orientation, event-based neurobiological recognition system that integrates recognition and tracking functions and is intended for asynchronous address-event representation (AER) image sensors. The system can recognize objects in multiple orientations using only training samples moving in a single orientation. It extracts multi-scale and multi-orientation line features inspired by models of the primate visual cortex. An orientation detector based on a modified Gaussian blob tracking algorithm is introduced for object tracking and orientation detection. The orientation detector and the feature extraction block work simultaneously, without any increase in categorization time. An address lookup table (address LUT) is also presented to adjust the feature maps by address mapping and reordering; the adjusted feature maps are then categorized by the trained spiking neural network. The recognition system is evaluated on the MNIST dataset, which has played an important role in the development of computer vision, and accuracy increases owing to the use of both ON and OFF events. AER data acquired by a DVS, such as moving digits, poker cards, and vehicles, are also tested on the system. The experimental results show that the proposed system can realize event-based multi-orientation recognition. The work makes a number of contributions to event-based vision processing for multi-orientation object recognition: it develops a new tracking-recognition architecture for feedforward categorization systems and an address reordering approach for classifying multi-orientation objects from event-based data, and it provides a new way to recognize objects in multiple orientations with only single-orientation samples.

  2. Face Recognition for Access Control Systems Combining Image-Difference Features Based on a Probabilistic Model

    Science.gov (United States)

    Miwa, Shotaro; Kage, Hiroshi; Hirai, Takashi; Sumi, Kazuhiko

    We propose a probabilistic face recognition algorithm for Access Control Systems (ACSs). Compared with existing ACSs that use low-cost IC cards, face recognition has advantages in usability and security: it does not require people to hold cards over scanners, and it does not accept impostors carrying authorized cards. Face recognition therefore attracts more interest in security markets than IC cards. However, in security markets where low-cost ACSs exist, price competition is important, and the quality of available cameras and image control is limited. ACSs using face recognition must therefore handle much lower-quality images, such as defocused images and images with poor gain control, than high-security systems such as immigration control. To tackle these image quality problems, we developed a face recognition algorithm based on a probabilistic model that combines a variety of image-difference features trained by Real AdaBoost with their prior probability distributions. This makes it possible to evaluate and use only the reliable features among the trained ones during each authentication, and to achieve high recognition performance. A field evaluation using a pseudo Access Control System installed in our office shows that the proposed system achieves consistently high recognition performance regardless of face image quality: its EER (Equal Error Rate) under a variety of image conditions is about four times lower than that of a system without any prior probability distributions. By contrast, image-difference features used without any prior probabilities are sensitive to image quality. We also evaluated PCA, which has worse but constant performance because of its general optimization over the overall data. Compared with PCA, Real AdaBoost without any prior distribution performs twice as well under good image conditions but degrades to the level of PCA under poor image conditions.
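
    The Equal Error Rate used as the figure of merit in this record is the operating point at which the false acceptance rate equals the false rejection rate. The following is a minimal sketch of how an EER can be estimated from match scores; the function name, the score lists, and the simple threshold sweep are illustrative assumptions and are not taken from the paper.

```python
def equal_error_rate(genuine, impostor):
    """Estimate (eer, threshold) from similarity scores; higher score = same person.

    Sweeps every observed score as a candidate threshold and keeps the one
    where the false acceptance and false rejection rates are closest.
    """
    best = None
    for t in sorted(set(genuine) | set(impostor)):
        frr = sum(s < t for s in genuine) / len(genuine)      # genuine users rejected
        far = sum(s >= t for s in impostor) / len(impostor)   # impostors accepted
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            best = (gap, (far + frr) / 2, t)
    return best[1], best[2]


# Hypothetical scores, made up for illustration only.
genuine_scores = [0.9, 0.8, 0.75, 0.6, 0.4]
impostor_scores = [0.7, 0.5, 0.3, 0.2, 0.1]
eer, threshold = equal_error_rate(genuine_scores, impostor_scores)
print(f"EER = {eer:.2f} at threshold {threshold}")
```

    With real data the two rates rarely cross exactly at an observed score, so the sketch reports the midpoint of the two rates at the closest threshold.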

  3. Localization and recognition of traffic signs for automated vehicle control systems

    Science.gov (United States)

    Zadeh, Mahmoud M.; Kasvand, T.; Suen, Ching Y.

    1998-01-01

    We present a computer vision system for detection and recognition of traffic signs. Such systems are required to assist drivers and for guidance and control of autonomous vehicles on roads and city streets. For experiments we use sequences of digitized photographs and off-line analysis. The system contains four stages. The first is region segmentation based on color pixel classification, called SRSM, which limits the search to regions of interest in the scene. Second, we use edge tracing to find parts of the outer edges of signs that are circular or straight, corresponding to the geometrical shapes of traffic signs. The third stage is geometrical analysis of the outer edge and preliminary recognition of each candidate region, which may be a potential traffic sign. The final stage of recognition uses color combinations within each region and model matching. This system may be used for recognition of other types of objects, provided that the geometrical shape and color content remain reasonably constant. The method is reliable, easy to implement, and fast. It differs from the road sign recognition method in PROMETHEUS. The overall structure of the approach is sketched.

  4. A computerized recognition system for the home-based physiotherapy exercises using an RGBD camera.

    Science.gov (United States)

    Ar, Ilktan; Akgul, Yusuf Sinan

    2014-11-01

    Computerized recognition of home-based physiotherapy exercises has many benefits, and it has attracted considerable interest in the computer vision community. However, most methods in the literature view this task as a special case of motion recognition. In contrast, we propose to treat the three main components of a physiotherapy exercise (the motion patterns, the stance knowledge, and the exercise object) as different recognition tasks and embed them separately into the recognition system. The low-level information about each component is gathered using machine learning methods. Then, we use a generative Bayesian network to recognize the exercise types by combining the information from these sources at an abstract level, which takes advantage of domain knowledge for a more robust system. Finally, a novel postprocessing step is employed to estimate the exercise repetition counts. The performance of the system is evaluated on a new dataset that contains RGB (red, green, and blue) and depth videos of home-based exercise sessions for commonly applied shoulder and knee exercises. The proposed system works without any body-part segmentation, body-part tracking, joint detection, or temporal segmentation methods. Favorable exercise recognition rates and encouraging results on the estimation of repetition counts are obtained.

  5. Mental Disorder Diagnostic System Based on Logical-Combinatorial Methods of Pattern Recognition

    Directory of Open Access Journals (Sweden)

    Anna Yankovskaya

    2013-11-01

    The authors describe a mental disorder diagnostic system based on logical-combinatorial methods of pattern recognition, called the intelligent system DIAPROD-LOG. The system is designed for the diagnosis and prevention of depression. The mathematical apparatus underlying the system, based on a matrix model of data and knowledge representation, is presented, along with various kinds of regularities in data and knowledge. A description of the system is given.

  6. Low-cost speech recognition system for small vocabulary and independent speaker

    Science.gov (United States)

    Teh, Chih Chiang; Jong, Ching C.; Siek, Liter

    2000-10-01

    This paper presents an ASIC implementation of a low-cost, speaker-independent speech recognition system for a small vocabulary of 15 isolated words. The IC is a digital block whose input is 12-bit samples at a sampling rate of 11.025 kHz. It runs on a 10 MHz system clock and targets a 0.35 micrometer CMOS process. The whole chip, which includes the speech recognition core, RAM, and ROM, contains about 61,000 gates. The die size is 1.5 mm by 3 mm. The current design has been coded in VHDL for hardware implementation, and its functionality is identical to that of the Matlab simulation. The average speech recognition rate for this IC is 89 percent for 15 isolated words.
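
    The figures quoted in this record imply a few derived quantities that help put the design in context. The short calculation below uses only the numbers stated in the abstract; the variable names are illustrative, and the derived values are back-of-the-envelope estimates, not results reported by the authors.

```python
# Figures stated in the abstract.
SAMPLE_BITS = 12            # bits per input sample
SAMPLE_RATE_HZ = 11_025     # 11.025 kHz sampling rate
CLOCK_HZ = 10_000_000       # 10 MHz system clock
GATE_COUNT = 61_000         # approximate gate count of the whole chip
DIE_MM = (1.5, 3.0)         # die dimensions in millimetres

# Derived quantities (estimates only).
input_bit_rate = SAMPLE_BITS * SAMPLE_RATE_HZ       # raw input data rate in bit/s
cycles_per_sample = CLOCK_HZ // SAMPLE_RATE_HZ      # whole clock cycles per sample
die_area_mm2 = DIE_MM[0] * DIE_MM[1]                # die area in mm^2
gates_per_mm2 = GATE_COUNT / die_area_mm2           # average gate density

print(input_bit_rate, cycles_per_sample, die_area_mm2, round(gates_per_mm2))
```

    The cycle budget of roughly 900 clock cycles per incoming sample is the quantity that bounds how much per-sample processing such a design could perform in real time.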

  7. Surface imprinted thin polymer film systems with selective recognition for bovine serum albumin.

    Science.gov (United States)

    Kryscio, David R; Peppas, Nicholas A

    2012-03-09

    Molecularly imprinted polymers are synthetic antibody mimics formed by the crosslinking of organic or inorganic polymers in the presence of an analyte which yields recognitive polymer networks with specific binding pockets for that biomolecule. Surface imprinted polymers were synthesized via a novel technique for the specific recognition of bovine serum albumin (BSA). Thin films of recognitive networks based on 2-(dimethylamino)ethyl methacrylate (DMAEMA) as the functional monomer and varying amounts of either N,N'-methylenebisacrylamide (MBA) or poly(ethylene glycol) (400) dimethacrylate (PEG400DMA) as the crosslinking agent were synthesized via UV free-radical polymerization and characterized. A clear and reproducible increase in recognition of the template BSA was demonstrated for these systems at 1.6-2.5 times more BSA recognized by the MIP sample relative to the control polymers. Additionally, these polymers exhibited selective recognition of the template relative to competing proteins with up to 2.9 times more BSA adsorbed than either glucose oxidase or bovine hemoglobin. These synthetic antibody mimics hold significant promise as the next generation of robust recognition elements in a wide range of bioassay and biosensor applications.

  8. A Gesture Recognition System for Detecting Behavioral Patterns of ADHD.

    Science.gov (United States)

    Bautista, Miguel Ángel; Hernández-Vela, Antonio; Escalera, Sergio; Igual, Laura; Pujol, Oriol; Moya, Josep; Violant, Verónica; Anguera, María T

    2016-01-01

    We present an application of gesture recognition using an extension of dynamic time warping (DTW) to recognize behavioral patterns of attention deficit hyperactivity disorder (ADHD). We propose an extension of DTW using one-class classifiers in order to encode the variability of a gesture category and thus perform an alignment between a gesture sample and a gesture class. We model the set of gesture samples of a given gesture category using either Gaussian mixture models or an approximation of convex hulls. Thus, we add a theoretical contribution to the classical warping path in DTW by including local modeling of intraclass gesture variability. This methodology is applied in a clinical context, detecting a group of ADHD behavioral patterns defined by experts in psychology/psychiatry, to support clinicians in the diagnostic procedure. The proposed methodology is tested on a novel multimodal dataset (RGB plus depth) of recordings of children with ADHD showing behavioral patterns. We obtain satisfying results when compared to standard state-of-the-art approaches in the DTW context.
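
    For reference, the classical DTW that this record extends can be sketched in a few lines. The example below is the textbook sample-to-sample algorithm on one-dimensional sequences with an absolute-difference local cost; it does not include the authors' one-class-classifier extension, and the function name and test sequences are illustrative assumptions.

```python
def dtw_distance(a, b):
    """Classical dynamic time warping cost between two numeric sequences.

    cost[i][j] holds the minimum accumulated cost of aligning a[:i] with b[:j].
    """
    inf = float("inf")
    n, m = len(a), len(b)
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])        # local distance between samples
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances, b stays
                cost[i][j - 1],      # b advances, a stays
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


# The second sequence repeats one sample; warping absorbs the repetition.
print(dtw_distance([1, 2, 3], [1, 2, 2, 3]))
```

    In the extension described in the abstract, the second argument would be a model of a whole gesture class rather than a single sample, with the local cost taken from that model.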

  9. An improved cortex-like neuromorphic system for target recognitions

    Science.gov (United States)

    Tsitiridis, Aristeidis; Yuen, Peter; Hong, Kan; Chen, Tong; Ibrahim, Izzati; Jackman, James; James, David; Richardson, Mark

    2010-10-01

    This paper reports on the enhancement of biologically inspired machine vision through a rotation invariance mechanism. Research over the years has suggested that rotation invariance is one of the fundamental generic elements of object constancy, a known generic visual ability of the human brain. Cortex-like vision, unlike conventional pixel-based machine vision, is achieved by mimicking neuromorphic mechanisms of the primate brain. In this preliminary study, rotation invariance is implemented through histograms of the Gabor features of an object. The performance of rotation invariance in the neuromorphic algorithm is assessed by the classification accuracy on a test data set consisting of image objects in five different orientations. The neuromorphic algorithm with integrated rotation invariance achieves much more consistent classification results over these five orientations than the algorithm without it. In addition, the issue of varying aspect ratios of the input images is addressed, in an attempt to make the algorithm robust to a wider variability of input data. Future work will aim to improve recognition accuracy while applying the approach to a series of real-world scenarios that challenge it.

  10. Pattern recognition receptors and central nervous system repair.

    Science.gov (United States)

    Kigerl, Kristina A; de Rivero Vaccari, Juan Pablo; Dietrich, W Dalton; Popovich, Phillip G; Keane, Robert W

    2014-08-01

    Pattern recognition receptors (PRRs) are part of the innate immune response and were originally discovered for their role in recognizing pathogens by ligating specific pathogen associated molecular patterns (PAMPs) expressed by microbes. Now the role of PRRs in sterile inflammation is also appreciated, responding to endogenous stimuli referred to as "damage associated molecular patterns" (DAMPs) instead of PAMPs. The main families of PRRs include Toll-like receptors (TLRs), Nod-like receptors (NLRs), RIG-like receptors (RLRs), AIM2-like receptors (ALRs), and C-type lectin receptors. Broad expression of these PRRs in the CNS and the release of DAMPs in and around sites of injury suggest an important role for these receptor families in mediating post-injury inflammation. Considerable data now show that PRRs are among the first responders to CNS injury and activation of these receptors on microglia, neurons, and astrocytes triggers an innate immune response in the brain and spinal cord. Here we discuss how the various PRR families are activated and can influence injury and repair processes following CNS injury.

  11. A simple and efficient optical character recognition system for basic symbols in printed Kannada text

    Indian Academy of Sciences (India)

    R Sanjeev Kunte; R D Sudhaker Samuel

    2007-10-01

    Optical Character Recognition (OCR) systems have been effectively developed for the recognition of printed characters of non-Indian languages. Efforts are under way to develop efficient OCR systems for Indian languages, especially for Kannada, a popular South Indian language. We present an OCR system for the recognition of basic characters (vowels and consonants) in printed Kannada text that can handle different font sizes and font types. Hu’s invariant moments and Zernike moments, which have been widely used in pattern recognition, are used to extract the features of printed Kannada characters. Neural classifiers are used to classify the characters based on these moment features. An encouraging recognition rate of 96.8% has been obtained. The methodology can be extended to the recognition of other South Indian languages, especially Telugu.

  12. Body posture recognition and turning recording system for the care of bed bound patients.

    Science.gov (United States)

    Hsiao, Rong-Shue; Mi, Zhenqiang; Yang, Bo-Ru; Kau, Lih-Jen; Bitew, Mekuanint Agegnehu; Li, Tzu-Yu

    2015-01-01

    This paper proposes a body posture recognition and turning recording system to assist in the care of bed-bound patients in nursing homes. The system continuously detects the patient's body posture and records the length of time spent in each posture. If the patient remains in the same posture long enough to develop pressure ulcers, the system notifies caregivers to change the patient's posture. The purpose of the recording is to provide a log of body turning that patients' family members can query. To detect the patient's body posture accurately, we developed a novel pressure-sensing pad containing force-sensing resistor sensors. Based on this pad, we developed a bed posture recognition module that includes a bed posture recognition algorithm based on fuzzy theory. The algorithm can detect whether the patient's bed posture is right lateral decubitus, left lateral decubitus, or supine. The detected posture information can then be transmitted by the communication module to the server of the healthcare center to perform the recording and notification functions. Experimental results showed that the average posture recognition accuracy of the proposed module is 92%.

  13. A Single-System Model Predicts Recognition Memory and Repetition Priming in Amnesia

    Science.gov (United States)

    Kessels, Roy P.C.; Wester, Arie J.; Shanks, David R.

    2014-01-01

    We challenge the claim that there are distinct neural systems for explicit and implicit memory by demonstrating that a formal single-system model predicts the pattern of recognition memory (explicit) and repetition priming (implicit) in amnesia. In the current investigation, human participants with amnesia categorized pictures of objects at study and then, at test, identified fragmented versions of studied (old) and nonstudied (new) objects (providing a measure of priming), and made a recognition memory judgment (old vs new) for each object. Numerous results in the amnesic patients were predicted in advance by the single-system model, as follows: (1) deficits in recognition memory and priming were evident relative to a control group; (2) items judged as old were identified at greater levels of fragmentation than items judged new, regardless of whether the items were actually old or new; and (3) the magnitude of the priming effect (the identification advantage for old vs new items) overall was greater than that of items judged new. Model evidence measures also favored the single-system model over two formal multiple-systems models. The findings support the single-system model, which explains the pattern of recognition and priming in amnesia primarily as a reduction in the strength of a single dimension of memory strength, rather than a selective explicit memory system deficit. PMID:25122896

  14. The design and implementation of effective face detection and recognition system

    Science.gov (United States)

    Sun, Yigui

    2011-06-01

    This paper proposes a face detection and recognition system (FDRS) based on video sequences and still images. It uses the AdaBoost algorithm to detect human faces in an image or frame and adopts the Discrete Cosine Transform (DCT) for feature extraction and recognition in face images. The related technologies are outlined first. Then the system requirements and UML use case diagram are described. The paper mainly introduces the design solution and key procedures. The FDRS source code is built with VC++, the Standard Template Library (STL), and the Intel Open Source Computer Vision Library (OpenCV).
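
    The record names the DCT as the feature extractor without giving details. As a rough illustration of why a DCT suits feature extraction, the sketch below computes an unnormalized one-dimensional DCT-II in plain Python and keeps the first few coefficients, which concentrate most of the energy of a smooth signal. The DCT variant, the normalization, the one-dimensional setting, and the helper names are all assumptions; the paper's implementation uses OpenCV and is not reproduced here.

```python
import math


def dct2(x):
    """Unnormalized DCT-II: X[u] = sum_k x[k] * cos(pi * (k + 0.5) * u / n)."""
    n = len(x)
    return [
        sum(x[k] * math.cos(math.pi * (k + 0.5) * u / n) for k in range(n))
        for u in range(n)
    ]


def dct_features(x, keep):
    """Keep only the first `keep` low-frequency coefficients as a feature vector."""
    return dct2(x)[:keep]


# A constant signal has all of its energy in the zero-frequency coefficient.
coeffs = dct2([1.0, 1.0, 1.0, 1.0])
features = dct_features([10.0, 11.0, 12.0, 11.0, 10.0, 9.0, 8.0, 9.0], keep=3)
print(coeffs[0], len(features))
```

    For images, the same transform is applied along rows and then columns, and the low-frequency block of the result is typically what is retained.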

  15. A smart pattern recognition system for the automatic identification of aerospace acoustic sources

    Science.gov (United States)

    Cabell, R. H.; Fuller, C. R.

    1989-01-01

    An intelligent air-noise recognition system is described that uses pattern recognition techniques to distinguish noise signatures of five different types of acoustic sources, including jet planes, propeller planes, a helicopter, train, and wind turbine. Information for classification is calculated using the power spectral density and autocorrelation taken from the output of a single microphone. Using this system, as many as 90 percent of test recordings were correctly identified, indicating that the linear discriminant functions developed can be used for aerospace source identification.

  16. Design and Implementation of a Voice Portal System

    Institute of Scientific and Technical Information of China (English)

    周宽久; 曾琳铖曦; 李瑶

    2006-01-01

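    A voice portal is an important component that uses CTI technology to integrate the telephone network with the Internet, allowing users to access the Internet and obtain information through an ordinary telephone. It consists of four subsystems: IVR (Interactive Voice Response), TTS (Text To Speech), ASR (Automatic Speech Recognition), and Voice XML. Based on a practical voice portal system, this paper discusses the system architecture and the design and implementation of the four modules. The system design uses object-oriented and automaton techniques to integrate boards, channels, and resources such as speech synthesis and recognition into a single system, which facilitates system design and functional extension.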

  17. Development of Uyghur Voice Control System Based on Smart Phone

    Institute of Scientific and Technical Information of China (English)

    米尔阿迪力江·麦麦提; 吾守尔·斯拉木; 努尔麦麦提·尤鲁瓦斯; 热依曼·吐尔逊; 艾尼宛尔·托乎提

    2016-01-01

    With the aim of implementing Uyghur command word recognition, we studied in detail the development and implementation of a Uyghur command word recognition system on the Android platform, and we introduce the development difficulties, core technologies, and several typical functions of the system. The system was developed mainly with the Android SDK, the Eclipse integrated development environment, and API interfaces. It correctly displays and processes Uyghur, Chinese, and English text through an automatic style selection rule. To accommodate the different speaking styles of the majority of users, we rebuilt the Uyghur voice grammar files, which addresses the problem of regional dialects. In experiments under ordinary laboratory conditions, we obtained a correct recognition rate of 90.56% and a successful execution rate of 85.00%. This shows that, in research on speaker-independent Uyghur command word recognition, the structure and construction of the grammar files affect system performance.

  18. Perceiving a stranger's voice as being one's own: a 'rubber voice' illusion?

    Directory of Open Access Journals (Sweden)

    Zane Z Zheng

    We describe an illusion in which a stranger's voice, when presented as the auditory concomitant of a participant's own speech, is perceived as a modified version of their own voice. When the congruence between utterance and feedback breaks down, the illusion is also broken. Compared to a baseline condition in which participants heard their own voice as feedback, hearing a stranger's voice induced robust changes in the fundamental frequency (F0) of their production. Moreover, the shift in F0 appears to be feedback dependent, since shift patterns depended reliably on the relationship between the participant's own F0 and the stranger-voice F0. The shift in F0 was evident both when the illusion was present and after it was broken, suggesting that auditory feedback from production may be used separately for self-recognition and for vocal motor control. Our findings indicate that self-recognition of voices, like other body attributes, is malleable and context dependent.

  19. Eye-hand Hybrid Gesture Recognition System for Human Machine Interface

    Directory of Open Access Journals (Sweden)

    N. R. Raajan

    2013-04-01

    Gesture recognition has become a way for computers to recognise and understand human body language. It bridges the gap between machines and human beings and makes primitive interfaces such as keyboards and mice redundant. This paper suggests a hybrid gesture recognition system for computer interface and wireless robot control. The real-time eye-hand gesture recognition system can be used for computer drawing, navigating cursors and simulating mouse clicks, playing games, controlling a wireless robot with commands, and more. The robot illustrated in this paper is controlled by an RF module. Playing a ping-pong game using the gestures is also demonstrated. Haar cascade classifiers and template matching are used to detect eye gestures, and the convex hull is used to find defects and count the number of fingers in a given region.

  20. Music Information Retrieval from a Singing Voice Using Lyrics and Melody Information

    Directory of Open Access Journals (Sweden)

    Shozo Makino

    2007-01-01

    Recently, several music information retrieval (MIR) systems that retrieve musical pieces from the user's singing voice have been developed. All of these systems use only melody information for retrieval, although lyrics information is also useful. In this paper, we propose a new MIR system that uses both lyrics and melody information. First, we propose a new lyrics recognition method. A finite state automaton (FSA) is used as the recognition grammar, and about 86% retrieval accuracy was obtained. We also develop an algorithm for verifying a hypothesis output by a lyrics recognizer. Melody information is extracted from an input song using several pieces of information from the hypothesis, and a total score is calculated from the recognition score and the verification score. In the experiments, 95.0% retrieval accuracy was obtained with a query consisting of five words.
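
    The idea of using a finite state automaton as a recognition grammar can be illustrated with a small sketch: the automaton accepts only word sequences that correspond to known lyric phrases, which constrains what a recognizer may output. The record does not describe how the authors build their FSA, so the trie-shaped construction over whole phrases, the function names, and the example phrases below are all assumptions made for illustration.

```python
def build_fsa(phrases):
    """Build a trie-shaped automaton from phrases.

    Returns (transitions, accepting), where transitions maps
    state -> {word: next_state} and state 0 is the start state.
    """
    transitions = {0: {}}
    accepting = set()
    for phrase in phrases:
        state = 0
        for word in phrase.split():
            if word not in transitions[state]:
                new_state = len(transitions)
                transitions[state][word] = new_state
                transitions[new_state] = {}
            state = transitions[state][word]
        accepting.add(state)  # end of a complete phrase
    return transitions, accepting


def accepts(fsa, words):
    """Return True if the word sequence follows a path ending in an accepting state."""
    transitions, accepting = fsa
    state = 0
    for word in words:
        state = transitions[state].get(word)
        if state is None:  # no transition for this word: sequence is outside the grammar
            return False
    return state in accepting


# Hypothetical lyric phrases used only to exercise the sketch.
grammar = build_fsa(["twinkle twinkle little star", "row row row your boat"])
print(accepts(grammar, "twinkle twinkle little star".split()))
print(accepts(grammar, "twinkle row little star".split()))
```

    A grammar of this kind rejects word sequences that mix phrases from different songs, which is one plausible reason a grammar-constrained recognizer helps retrieval.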

  1. Music Information Retrieval from a Singing Voice Using Lyrics and Melody Information

    Directory of Open Access Journals (Sweden)

    Suzuki Motoyuki

    2007-01-01

    Full Text Available Recently, several music information retrieval (MIR) systems which retrieve musical pieces by the user's singing voice have been developed. All of these systems use only melody information for retrieval, although lyrics information is also useful for retrieval. In this paper, we propose a new MIR system that uses both lyrics and melody information. First, we propose a new lyrics recognition method. A finite state automaton (FSA) is used as recognition grammar, and about 86% retrieval accuracy was obtained. We also develop an algorithm for verifying a hypothesis output by a lyrics recognizer. Melody information is extracted from an input song using several pieces of information of the hypothesis, and a total score is calculated from the recognition score and the verification score. From the experimental results, 95.0% retrieval accuracy was obtained with a query consisting of five words.

  2. Segment, Track, Extract, Recognize and Convert Sign Language Videos to Voice/Text

    Directory of Open Access Journals (Sweden)

    P.V.V.Kishore

    2012-06-01

    Full Text Available This paper summarizes various algorithms used to design a sign language recognition system. Sign language is the language used by deaf people to communicate among themselves and with hearing people. We designed a real-time sign language recognition system that can recognize gestures of sign language from videos under complex backgrounds. Segmenting and tracking of the non-rigid hands and head of the signer in sign language videos is achieved by using active contour models. Active contour energy minimization is done using the signer's hand and head skin colour, texture, boundary and shape information. Classification of signs is done by an artificial neural network using the error backpropagation algorithm. Each sign in the video is converted into a voice and text command. The system has been implemented successfully for 351 signs of Indian Sign Language under different possible video environments. The recognition rates are calculated for different video environments.

  3. Dimensionality in voice quality.

    Science.gov (United States)

    Bele, Irene Velsvik

    2007-05-01

    This study concerns speaking voice quality in a group of male teachers (n = 35) and male actors (n = 36), as the purpose was to investigate normal and supranormal voices. The goal was the development of a method of valid perceptual evaluation for normal to supranormal and resonant voices. The voices (text reading at two loudness levels) had been evaluated by 10 listeners, for 15 vocal characteristics using VA scales. In this investigation, the results of an exploratory factor analysis of the vocal characteristics used in this method are presented, reflecting four dimensions of major importance for normal and supranormal voices. Special emphasis is placed on the effects on voice quality of a change in the loudness variable, as two loudness levels are studied. Furthermore, the vocal characteristics Sonority and Ringing voice quality are paid special attention, as the essence of the term "resonant voice" was a basic issue throughout a doctoral dissertation where this study was included.

  4. Voice box (image)

    Science.gov (United States)

    The larynx, or voice box, is located in the neck and performs several important functions in the body. The larynx is involved in swallowing, breathing, and voice production. Sound is produced when the air which ...

  5. Voice and Aging

    Science.gov (United States)

    ... dramatic voice changes are those during childhood and adolescence. The larynx (or voice box) and vocal cord tissues do not fully mature until late teenage years. Hormone-related changes during adolescence are ...

  6. Shifting voices with participant roles: Voice qualities and speech registers in Mesoamerica

    OpenAIRE

    Sicoli, M.

    2010-01-01

    Although an increasing number of sociolinguistic researchers consider functions of voice qualities as stylistic features, few studies consider cases where voice qualities serve as the primary signs of speech registers. This article addresses this gap through the presentation of a case study of Lachixio Zapotec speech registers indexed through falsetto, breathy, creaky, modal, and whispered voice qualities. I describe the system of contrastive speech registers in Lachixio Zapotec and then track...

  7. WiFi Voice Communication System Based on QT

    Institute of Scientific and Technical Information of China (English)

    赵付轩; 杨斌

    2012-01-01

    Aiming at speech signal acquisition in embedded systems, the design uses the UDP protocol for data transmission and Socket programming to ensure the reliability of data transmission, realizing end-to-end speech communication over a LAN. This paper describes how to design and develop real-time LAN voice communication software based on QT on the Linux development platform, using existing audio programming and network programming knowledge.
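
    UDP by itself does not guarantee delivery or ordering, so an application that sends voice over it usually adds its own bookkeeping. The sketch below shows one common approach, a sequence number in front of each audio chunk. The frame layout and the function names are assumptions for illustration; the abstract does not say how the authors handle reliability.

```python
import struct
from typing import List, Tuple

# Hypothetical frame layout (an assumption, not taken from the paper):
# a 4-byte big-endian sequence number followed by raw audio bytes.
HEADER = struct.Struct("!I")


def pack_frame(seq: int, audio: bytes) -> bytes:
    """Prefix an audio chunk with its sequence number."""
    return HEADER.pack(seq) + audio


def unpack_frame(datagram: bytes) -> Tuple[int, bytes]:
    """Split a received datagram into (sequence number, audio bytes)."""
    (seq,) = HEADER.unpack_from(datagram)
    return seq, datagram[HEADER.size:]


def missing_sequences(received: List[int]) -> List[int]:
    """Sequence numbers absent between the lowest and highest seen."""
    if not received:
        return []
    seen = set(received)
    return [s for s in range(min(seen), max(seen) + 1) if s not in seen]
```

    Each packed frame would then be handed to a datagram socket (`socket.socket(socket.AF_INET, socket.SOCK_DGRAM)` and `sendto`), and the receiver can use `missing_sequences` to decide whether to conceal or skip lost audio.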

  8. Voice and endocrinology

    OpenAIRE

    KVS Hari Kumar; Anurag Garg; Ajai Chandra, N. S.; Singh, S. P.; Rakesh Datta

    2016-01-01

    Voice is one of the advanced features of natural evolution that differentiates human beings from other primates. The human voice is capable of conveying the thoughts into spoken words along with a subtle emotion to the tone. This extraordinary character of the voice in expressing multiple emotions is the gift of God to the human beings and helps in effective interpersonal communication. Voice generation involves close interaction between cerebral signals and the peripheral apparatus consistin...

  9. An Automatic System of Vehicle Number-Plate Recognition Based on Neural Networks

    Institute of Scientific and Technical Information of China (English)

    2001-01-01

    This paper presents an automatic system of vehicle number-plate recognition based on neural networks. In this system, location of the number plate and recognition of the characters on it are completed automatically. Pixel colors of the number-plate area are classified using a neural network, then color features are extracted by analyzing scanning lines of the cross-section of the number plate. The system makes full use of number-plate color features to locate the number plate. Characters on the number plate can be effectively recognized using the neural networks. Experimental results show that the correct rate of number-plate location is close to 100%, and the time of number-plate location is less than 1 second. Moreover, the recognition rate of characters is improved because the number-plate type is known. It is also observed that this system is not sensitive to variations of weather, illumination and vehicle speed. In addition, the size of the number plate need not be known in advance. This system is of crucial significance for the application and spread of automatic vehicle number-plate recognition.

  10. Effects of Emotional and Perceptual-Motor Stress on a Voice Recognition System’s Accuracy: An Applied Investigation.

    Science.gov (United States)

    1984-02-01

    ... 3. RESULTS; 3.1 Overview; 3.2 Total Errors; 3.3 Nonrecognitions; 3.4 Misrecognitions; 3.5 Sinus Arrhythmia; 3.6 Heartrate ... Dynograph outside the sound chamber and the experimenter recalibrated the machine until heartbeat and heartrate were being measured and recorded ... Sinus arrhythmia by test condition ... 3.6 Heartrate: An analysis of variance on heartrate in the test conditions yielded significant main effects

  11. Using Continuous Voice Recognition Technology as an Input Medium to the Naval Warfare Interactive Simulation System (NWISS).

    Science.gov (United States)

    1984-06-01

    grammars cannot properly characterize major subsets of English sentences, unless sentence complexity is severely limited, they are quite appropriate for...a menu-driven facility, called GFIL, for creating grammars which is basically user friendly. With GRID, the designer inputs potential grammars for

  12. Writing with Voice

    Science.gov (United States)

    Kesler, Ted

    2012-01-01

    In this Teaching Tips article, the author argues for a dialogic conception of voice, based in the work of Mikhail Bakhtin. He demonstrates a dialogic view of voice in action, using two writing examples about the same topic from his daughter, a fifth-grade student. He then provides five practical tips for teaching a dialogic conception of voice in…

  13. Tips for Healthy Voices

    Science.gov (United States)

    ... social interaction as well as for most people’s occupation. Proper care and use of your voice will give you the best chance for having a healthy voice for your entire lifetime. Hoarseness or roughness in your voice is often ...

  14. Voice over IP Security

    CERN Document Server

    Keromytis, Angelos D

    2011-01-01

    Voice over IP (VoIP) and Internet Multimedia Subsystem technologies (IMS) are rapidly being adopted by consumers, enterprises, governments and militaries. These technologies offer higher flexibility and more features than traditional telephony (PSTN) infrastructures, as well as the potential for lower cost through equipment consolidation and, for the consumer market, new business models. However, VoIP systems also represent a higher complexity in terms of architecture, protocols and implementation, with a corresponding increase in the potential for misuse. In this book, the authors examine the

  15. Performance analysis and code recognition for dual N-ary orthogonal hybrid modulation systems

    Institute of Scientific and Technical Information of China (English)

    Qiao Xiaoqiang; Zhao Hangsheng; Cai Yueming

    2008-01-01

    A dual N-ary orthogonal hybrid modulation system is introduced in this paper, which can increase the data rate greatly compared with conventional N-ary orthogonal spread spectrum system, so it can be used for high rate data communication. Then, three code recognition algorithms are presented for dual N-ary orthogonal hybrid modulation system and the analytic bit error rate (BER) performance of the system in additive white Gaussian noise (AWGN) and flat Rayleigh fading channel is derived. Finally, the computer simulation of the system with three code recognition algorithms is performed, which shows that the simplified maximum a posteriori (MAP) algorithm is the best for the system with a compromise between the performance and the complexity.

  16. The Usefulness and Feasibility of Mobile Interface in Tuberculosis Notification (MITUN) Voice Based System for Notification of Tuberculosis by Private Medical Practitioners – A Pilot Project

    Science.gov (United States)

    Velayutham, Banurekha; Thomas, Beena; Nair, Dina; Thiruvengadam, Kannan; Prashant, Suma; Kittusami, Sathyapriya; Vijayakumar, Harivanzan; Chidambaram, Meenachi; Shivakumar, Shri Vijay Bala Yogendra; Jayabal, Lavanya; Jhunjhunwala, Ashok; Swaminathan, Soumya

    2015-01-01

    Introduction: Tuberculosis (TB) is a notifiable disease and health care providers are required to notify every TB case to local authorities. We conducted a pilot study to determine the usefulness and feasibility of mobile interface in TB notification (MITUN) voice based system for notification of TB cases by private medical practitioners. Methodology: The study was conducted during September 2013 to October 2014 in three zones of Chennai, an urban setting in South India. Private clinics wherein services are provided by single private medical practitioners were approached. The steps involved in MITUN included: Registration of the practitioners and notification of TB cases by them through voice interactions. Pre and post-intervention questionnaires were administered to collect information on TB notification practices and feasibility of MITUN after an implementation period of 6 months. Results: A total of 266 private medical practitioners were approached for the study. Of them, 184 (69%) participated in the study; of whom 11 (6%) practitioners used MITUN for TB notification. Reasons for not using MITUN include lack of time, referral of patients to government facility, issues related to patient confidentiality and technical problems. Suggestions for making mobile phone based TB notification process user-friendly included reducing call duration, including only crucial questions and using missed call or SMS options. Conclusion: The performance (feasibility and usefulness) of MITUN voice based system for TB notification in the present format was sub-optimal. Perceived problems, logistical and practical issues preclude scale-up of notification of TB by private practitioners. PMID:26376197

  17. The Usefulness and Feasibility of Mobile Interface in Tuberculosis Notification (MITUN) Voice Based System for Notification of Tuberculosis by Private Medical Practitioners--A Pilot Project.

    Directory of Open Access Journals (Sweden)

    Banurekha Velayutham

    Full Text Available Tuberculosis (TB) is a notifiable disease and health care providers are required to notify every TB case to local authorities. We conducted a pilot study to determine the usefulness and feasibility of mobile interface in TB notification (MITUN) voice based system for notification of TB cases by private medical practitioners. The study was conducted during September 2013 to October 2014 in three zones of Chennai, an urban setting in South India. Private clinics wherein services are provided by single private medical practitioners were approached. The steps involved in MITUN included: Registration of the practitioners and notification of TB cases by them through voice interactions. Pre and post-intervention questionnaires were administered to collect information on TB notification practices and feasibility of MITUN after an implementation period of 6 months. A total of 266 private medical practitioners were approached for the study. Of them, 184 (69%) participated in the study; of whom 11 (6%) practitioners used MITUN for TB notification. Reasons for not using MITUN include lack of time, referral of patients to government facility, issues related to patient confidentiality and technical problems. Suggestions for making mobile phone based TB notification process user-friendly included reducing call duration, including only crucial questions and using missed call or SMS options. The performance (feasibility and usefulness) of MITUN voice based system for TB notification in the present format was sub-optimal. Perceived problems, logistical and practical issues preclude scale-up of notification of TB by private practitioners.

  18. Real Time Multiple Hand Gesture Recognition System for Human Computer Interaction

    Directory of Open Access Journals (Sweden)

    Siddharth S. Rautaray

    2012-05-01

    Full Text Available With the increasing use of computing devices in day-to-day life, the need for user-friendly interfaces has led to the evolution of different types of interfaces for human-computer interaction. Real-time vision-based hand gesture recognition affords users the ability to interact with computers in more natural and intuitive ways. Direct use of the hands as an input device is an attractive method that can communicate much more information by itself than mice, joysticks, etc., allowing for a greater number of recognition systems that can be used in a variety of human-computer interaction applications. The gesture recognition system consists of three main modules: hand segmentation, hand tracking, and gesture recognition from hand features. The designed system is further integrated with different applications, such as an image browser and a virtual game, opening possibilities for human-computer interaction. Computer-vision-based systems have the potential to provide more natural, non-contact solutions. The present research work focuses on designing and developing a practical framework for real-time hand gesture recognition.

  19. Predicting Performance of a Face Recognition System Based on Image Quality

    NARCIS (Netherlands)

    Dutta, A.

    2015-01-01

    In this dissertation, we focus on several aspects of models that aim to predict performance of a face recognition system. Performance prediction models are commonly based on the following two types of performance predictor features: a) image quality features; and b) features derived solely from

  20. ISOLATED SPEECH RECOGNITION SYSTEM FOR TAMIL LANGUAGE USING STATISTICAL PATTERN MATCHING AND MACHINE LEARNING TECHNIQUES

    Directory of Open Access Journals (Sweden)

    VIMALA C.

    2015-05-01

    Full Text Available In recent years, speech technology has become a vital part of our daily lives. Various techniques have been proposed for developing Automatic Speech Recognition (ASR) systems and have achieved great success in many applications. Among them, Template Matching techniques like Dynamic Time Warping (DTW), Statistical Pattern Matching techniques such as Hidden Markov Models (HMM) and Gaussian Mixture Models (GMM), and Machine Learning techniques such as Neural Networks (NN), Support Vector Machines (SVM), and Decision Trees (DT) are most popular. The main objective of this paper is to design and develop a speaker-independent isolated speech recognition system for the Tamil language using the above speech recognition techniques. The background of ASR systems, the steps involved in ASR, the merits and demerits of the conventional and machine learning algorithms, and the observations made based on the experiments are presented in this paper. For the developed system, the highest word recognition accuracy is achieved with the HMM technique, which offered 100% accuracy during training and 97.92% during testing.
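
    Of the techniques listed in this abstract, Dynamic Time Warping is the easiest to show compactly. The sketch below is a textbook DTW on sequences of scalar features with an absolute-difference cost; a real recognizer would compare frames of feature vectors (for example MFCCs) instead. The function names and the template words are hypothetical and not taken from the paper.

```python
def dtw_distance(a, b):
    """Classic DTW cost between two sequences of numbers."""
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = best alignment cost of a[:i] against b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j],      # stretch b
                                 cost[i][j - 1],      # stretch a
                                 cost[i - 1][j - 1])  # advance both
    return cost[n][m]


def classify(query, templates):
    """Return the label of the stored template closest to the query."""
    return min(templates, key=lambda word: dtw_distance(query, templates[word]))


# Hypothetical one-dimensional "templates" for two isolated words.
templates = {"word_a": [1, 2, 3], "word_b": [3, 2, 1]}
label = classify([1, 2, 3, 3], templates)
```

    Because the warping path may repeat elements, an utterance spoken more slowly than its template can still align with zero cost, which is the property that makes DTW suited to isolated-word template matching.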

  1. Action recognition system based on human body tracking with depth images

    Directory of Open Access Journals (Sweden)

    M. Martínez-Zarzuela

    Full Text Available When tracking a human body, action recognition tasks can be performed to determine what kind of movement the person is performing. Although a lot of implementations have emerged, state-of-the-art technology such as depth cameras and intelligent systems ca ...

  2. Cherry Picking Robot Vision Recognition System Based on OpenCV

    Directory of Open Access Journals (Sweden)

    Zhang Qi Rong

    2016-01-01

    Full Text Available Using OpenCV functions, an image of cherries in a natural environment is passed through image preprocessing, color recognition, threshold segmentation, morphological filtering, edge detection, and a circle Hough transform, which yields each cherry's center and circular contour for use in machine picking. The system is simple and effective.

  3. Evaluating Automatic Speech Recognition-Based Language Learning Systems: A Case Study

    Science.gov (United States)

    van Doremalen, Joost; Boves, Lou; Colpaert, Jozef; Cucchiarini, Catia; Strik, Helmer

    2016-01-01

    The purpose of this research was to evaluate a prototype of an automatic speech recognition (ASR)-based language learning system that provides feedback on different aspects of speaking performance (pronunciation, morphology and syntax) to students of Dutch as a second language. We carried out usability reviews, expert reviews and user tests to…

  4. Interactions of the humoral pattern recognition molecule PTX3 with the complement system

    DEFF Research Database (Denmark)

    Doni, Andrea; Garlanda, Cecilia; Bottazzi, Barbara

    2012-01-01

    The innate immune system comprises a cellular and a humoral arm. The long pentraxin PTX3 is a fluid phase pattern recognition molecule, which acts as an essential component of the humoral arm of innate immunity. PTX3 has antibody-like properties including interactions with complement components. ...

  6. DEVELOPMENT OF AUTOMATED SPEECH RECOGNITION SYSTEM FOR EGYPTIAN ARABIC PHONE CONVERSATIONS

    Directory of Open Access Journals (Sweden)

    A. N. Romanenko

    2016-07-01

    Full Text Available The paper describes several speech recognition systems for Egyptian Colloquial Arabic. The research is based on the CALLHOME Egyptian corpus. Two kinds of system are described: a classic one based on Hidden Markov and Gaussian Mixture Models, and a state-of-the-art one based on deep neural network acoustic models. We have demonstrated the contribution from the usage of speaker-dependent bottleneck features; for their extraction three extractors based on neural networks were trained. For their training three datasets in several languages were used: Russian, English and different Arabic dialects. We have studied the possibility of applying a small Modern Standard Arabic (MSA) corpus to derive phonetic transcriptions. The experiments have shown that applying the extractor obtained from the Russian dataset significantly increases the quality of Arabic speech recognition. We have also found that using phonetic transcriptions based on Modern Standard Arabic decreases recognition quality. Nevertheless, the system's results remain applicable in practice. In addition, we have studied the application of the obtained models to the keyword search problem. The systems obtained demonstrate good results as compared to those published before. Some ways to improve speech recognition are offered.

  7. Switching Systems: Active Mode Recognition, Identification of the Switching Law

    OpenAIRE

    Elom Ayih Domlan; José Ragot; Didier Maquin

    2007-01-01

    http://www.hindawi.com/journals/jcse/raa.50796.html; International audience; The problem of the estimation of the discrete state of a switching system is studied. The knowledge of the switching law is essential for this kind of system as it simplifies their manipulation for control purposes. This paper investigates the use of a model-based diagnosis method for the determination of the active mode at each timepoint based on the system input/output data. The issue of the parametric identificati...

  8. Neuro-parity pattern recognition system and method

    Science.gov (United States)

    Gross, Kenneth C.; Singer, Ralph M.; Van Alstine, Rollin G.; Wegerich, Stephan W.; Yue, Yong

    2000-01-01

    A method and system for monitoring a process and determining its condition. Initial data is sensed, a first set of virtual data is produced by applying a system state analyzation to the initial data, a second set of virtual data is produced by applying a neural network analyzation to the initial data and a parity space analyzation is applied to the first and second set of virtual data and also to the initial data to provide a parity space decision about the condition of the process. A logic test can further be applied to produce a further system decision about the state of the process.

  9. A dynamic gesture recognition system for the Korean sign language (KSL).

    Science.gov (United States)

    Kim, J S; Jang, W; Bien, Z

    1996-01-01

    Sign language is a method of communication for deaf and mute people. Articulated gestures and postures of hands and fingers are commonly used in sign language. This paper presents a system which recognizes the Korean sign language (KSL) and translates it into ordinary Korean text. A pair of data-gloves is used as the sensing device for detecting motions of hands and fingers. For efficient recognition of gestures and postures, a technique of efficient classification of motions is proposed and a fuzzy min-max neural network is adopted for on-line pattern recognition.

  10. Automated Multiple Object Optical Tracking and Recognition System Project

    Data.gov (United States)

    National Aeronautics and Space Administration — OPTRA proposes to develop an optical tracking system that is capable of recognizing and tracking up to 50 different objects within an approximately 2 degree x 3...

  11. A graph based system for multi-stage attacks recognition

    Institute of Scientific and Technical Information of China (English)

    Safaa O. Al-Mamory; Zhai Jianhong; Zhang Hongli

    2008-01-01

    Building attack scenarios is one of the most important aspects of network security. This paper proposes a system which collects intrusion alerts, clusters them as sub-attacks using alert abstraction, aggregates the similar sub-attacks, and then correlates them and generates correlation graphs. The scenarios are represented by alert classes instead of the alerts themselves, so as to reduce the number of required rules and to be able to detect new variations of attacks. The proposed system is capable of passing some of the missed attacks. To evaluate the system's effectiveness, it was tested with different datasets which contain multi-step attacks. Compressed and easily understandable correlation graphs which reflect attack scenarios were generated. The proposed system can correlate related alerts, uncover the attack strategies, and detect new variations of attacks.

  12. Review Paper on Performance Evaluation of Nut and Bolt Recognition System Using Artificial Neural Network

    Directory of Open Access Journals (Sweden)

    Shruti Paunikar

    2013-09-01

    Full Text Available There is constant research in the field of recognition by means of artificial intelligence to enhance productivity. The automotive industry requires an automated system to sort nuts and bolts of different sizes and shapes, which are the most commonly used components in the industry, to improve overall productivity. This review paper deals with some feature extraction techniques and their impact on the efficiency of an artificial neural network for the recognition of nuts and bolts. The main feature extraction techniques analysed in this review are the stationary wavelet transform, principal component analysis and radius analysis. These techniques have already been tested, with simulation done in MATLAB. The results obtained vary depending on the pre-processing techniques used for nut and bolt recognition.

  13. Summary of the transfer of optical processing to systems: optical pattern recognition program

    Science.gov (United States)

    Lindell, Scott D.

    1995-06-01

    Martin Marietta has successfully completed a TOPS optical pattern recognition program. The program culminated in August 1994 with an automatic target recognition flight demonstration in which an M60A2 tank was acquired, identified, and tracked with a visible seeker from a UH-1 helicopter flying a fiber optic guided missile (FOG-M) mission profile. The flight demonstration was conducted by the US Army Missile Command (MICOM) and supported by Martin Marietta. The pattern recognition system performance for acquiring and identifying the M60A2 tank, which was positioned among an array with five other vehicle types, was 90% probability of correct identification and a 4% false identification for over 40,000 frames of imagery processed. Imagery was processed at a 15 Hz input rate with a 1 ft³, 76 W, 4 GFLOP processor performing up to 800 correlations per second.

  14. Robust Sign Language Recognition System Using ToF Depth Cameras

    CERN Document Server

    Zahedi, Morteza

    2011-01-01

    Sign language recognition is a difficult task, yet many applications require it at real-time speed. Using RGB cameras for recognition of sign languages is not very successful in practical situations, and accurate 3D imaging requires expensive and complex instruments. With the introduction of Time-of-Flight (ToF) depth cameras in recent years, it has become easier to scan the environment for accurate yet fast depth images of objects without the need for any extra calibrating object. In this paper, a robust system for sign language recognition using ToF depth cameras is presented for converting the recorded signs to a standard and portable XML sign language named SiGML, for easy transfer and conversion to real-time 3D virtual character animations. Feature extraction using moments and classification using a nearest neighbor classifier are used to track hand gestures, and a significant result of 100% is achieved for the proposed approach.

  15. VERIFICATION OF GRAPHEMES USING NEURAL NETWORKS IN AN HMM-BASED ON-LINE KOREAN HANDWRITING RECOGNITION SYSTEM

    NARCIS (Netherlands)

    So, S.J.; Kim, J.; Kim, J.H.

    2004-01-01

    This paper presents a neural network based verification method in an HMM-based on-line Korean handwriting recognition system. It penalizes unreasonable grapheme hypotheses and complements global and structural information to the HMM-based recognition system, which is intrinsically based on local inf

  16. Current trends in small vocabulary speech recognition for equipment control

    Science.gov (United States)

    Doukas, Nikolaos; Bardis, Nikolaos G.

    2017-09-01

    Speech recognition systems allow human-machine communication to acquire an intuitive nature that approaches the simplicity of inter-human communication. Small vocabulary speech recognition is a subset of the overall speech recognition problem, where only a small number of words need to be recognized. Speaker-independent small vocabulary recognition can find significant applications in field equipment used by military personnel. Such equipment may typically be controlled by a small number of commands that need to be given quickly and accurately, under conditions where delicate manual operations are difficult to achieve. This type of application could hence benefit significantly from the use of robust voice-operated control components, as they would facilitate the interaction with their users and render it much more reliable in times of crisis. This paper presents current challenges involved in attaining efficient and robust small vocabulary speech recognition. These challenges concern feature selection, classification techniques, speaker diversity and noise effects. A state machine approach is presented that facilitates the voice guidance of different equipment in a variety of situations.

  17. Vision-based obstacle recognition system for automated lawn mower robot development

    Science.gov (United States)

    Mohd Zin, Zalhan; Ibrahim, Ratnawati

    2011-06-01

    Digital image processing (DIP) techniques have recently been widely used in various types of application. Classification and recognition of a specific object using a vision system involve challenging tasks in the fields of image processing and artificial intelligence. The ability and efficiency of a vision system to capture and process images is very important for any intelligent system such as an autonomous robot. This paper addresses the development of a vision system that could contribute to the development of an automated vision-based lawn mower robot. The work involves the implementation of DIP techniques to detect and recognize three different types of obstacles that usually exist on a football field. The focus was on the study of different types and sizes of obstacles, the development of a vision-based obstacle recognition system and the evaluation of the system's performance. Image processing techniques such as image filtering, segmentation, enhancement and edge detection have been applied in the system. The results have shown that the developed system is able to detect and recognize various types of obstacles on a football field with a recognition rate of more than 80%.

  18. Performance Evaluation on the Effect of Combining DCT and LBP on Face Recognition System

    Directory of Open Access Journals (Sweden)

    Dasari Haritha

    2012-12-01

    In this paper, we introduce a face recognition algorithm based on a doubly truncated multivariate Gaussian mixture model with the Discrete Cosine Transform (DCT) and Local Binary Pattern (LBP). The input face image is first transformed to the local binary pattern domain. The resulting local binary pattern image is divided into non-overlapping blocks. The DCT coefficients are then computed from each block and a feature vector is extracted. Assuming that the feature vector follows a doubly truncated multivariate Gaussian mixture distribution, the face image is modelled. The model parameters are estimated using the Expectation-Maximization algorithm. The model parameters are initialized using either the K-means algorithm or a hierarchical clustering algorithm, together with the moment method of estimation. The face recognition system is developed with the likelihood function under a Bayesian framework. The efficiency of the developed face recognition system is evaluated by experiments on the JNTUK and Yale face image databases. Performance measures such as the half total error rate and recognition rates are computed, and the ROC curves are plotted. A comparative study of the developed algorithm against some earlier algorithms shows that this system performs better because it uses both local and global information of the face.
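
    The feature extraction steps described in the abstract above (LBP transform, non-overlapping blocks, per-block DCT) can be sketched as follows. This is a minimal pure-Python illustration under stated assumptions: a basic 3x3 LBP operator, an orthonormal 2-D DCT-II, and hypothetical parameters `block` and `keep` for the block size and the number of low-frequency coefficients retained. The abstract does not specify any of these choices, and the doubly truncated Gaussian mixture modelling and EM estimation are not covered here.

```python
import math


def lbp_image(img):
    """Basic 3x3 local binary pattern of the interior pixels.

    img is a list of rows of grey levels. A neighbour contributes a 1 bit
    when it is >= the centre pixel. The bit order (clockwise from the
    top-left neighbour) is a convention chosen for this sketch.
    """
    h, w = len(img), len(img[0])
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    out = []
    for r in range(1, h - 1):
        row = []
        for c in range(1, w - 1):
            code = 0
            for bit, (dr, dc) in enumerate(offsets):
                if img[r + dr][c + dc] >= img[r][c]:
                    code |= 1 << bit
            row.append(code)
        out.append(row)
    return out


def dct2(block):
    """Orthonormal 2-D DCT-II of a square block (direct O(n^4) form)."""
    n = len(block)

    def alpha(k):
        return math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)

    result = [[0.0] * n for _ in range(n)]
    for u in range(n):
        for v in range(n):
            s = 0.0
            for x in range(n):
                for y in range(n):
                    s += (block[x][y]
                          * math.cos((2 * x + 1) * u * math.pi / (2 * n))
                          * math.cos((2 * y + 1) * v * math.pi / (2 * n)))
            result[u][v] = alpha(u) * alpha(v) * s
    return result


def block_dct_features(img, block=4, keep=2):
    """LBP image -> non-overlapping blocks -> low-frequency DCT coefficients.

    Keeps the top-left keep x keep coefficients of every block and
    concatenates them into one feature vector. Partial blocks at the
    right and bottom edges are dropped.
    """
    lbp = lbp_image(img)
    h, w = len(lbp), len(lbp[0])
    features = []
    for r in range(0, h - block + 1, block):
        for c in range(0, w - block + 1, block):
            blk = [row[c:c + block] for row in lbp[r:r + block]]
            coeffs = dct2(blk)
            features.extend(coeffs[u][v]
                            for u in range(keep) for v in range(keep))
    return features
```

    In this sketch the LBP codes capture local texture while the low-frequency DCT coefficients summarize each block, which is one way to read the abstract's remark about combining local and global information.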

  19. Enhancement Performance of Road Recognition System of Autonomous Robots in Shadow Scenario

    Directory of Open Access Journals (Sweden)

    Olusanya Y. Agunbiade

    2013-12-01

    Road region recognition is a key capability that is gaining increasing attention from researchers because it helps autonomous vehicles navigate successfully without accidents. Various researchers have used different camera-based techniques and achieved outstanding results. Despite this success, environmental noise such as shadow leads to inaccurate recognition of the road region, which can eventually cause an autonomous vehicle to have an accident. In this research, we investigated shadow and its effects and optimized the road region recognition system of an autonomous vehicle by introducing an algorithm capable of detecting and eliminating the effects of shadow. The experimental performance of our system was tested and compared using the following measures: Total Positive Rate (TPR), False Negative Rate (FNR), Total Negative Rate (TNR), Error Rate (ERR) and False Positive Rate (FPR). The system's road recognition performance improved in shadow scenarios, and this advance contributes substantially to successful navigation approaches for autonomous vehicles.
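
    The abstract above lists its evaluation measures without defining them. The sketch below assumes the standard confusion-matrix definitions (where TPR and TNR are more commonly called the true positive and true negative rates) applied to per-pixel road/non-road labels; the paper's own definitions may differ. The function names and the flat 0/1 mask representation are illustrative assumptions.

```python
def count_pixels(pred, truth):
    """Count TP, FP, TN, FN for two equal-length sequences of 0/1 labels.

    1 means "road" and 0 means "not road" (an assumption of this sketch).
    """
    tp = fp = tn = fn = 0
    for p, t in zip(pred, truth):
        if p == 1 and t == 1:
            tp += 1
        elif p == 1 and t == 0:
            fp += 1
        elif p == 0 and t == 0:
            tn += 1
        else:
            fn += 1
    return tp, fp, tn, fn


def _ratio(num, den):
    """Safe division: returns 0.0 when the denominator is zero."""
    return num / den if den else 0.0


def detection_rates(tp, fp, tn, fn):
    """Standard rates derived from confusion-matrix counts."""
    return {
        "TPR": _ratio(tp, tp + fn),   # road pixels correctly found
        "FNR": _ratio(fn, tp + fn),   # road pixels missed
        "TNR": _ratio(tn, tn + fp),   # non-road pixels correctly rejected
        "FPR": _ratio(fp, tn + fp),   # non-road pixels labelled as road
        "ERR": _ratio(fp + fn, tp + fp + tn + fn),  # overall error rate
    }
```

    Under these definitions TPR + FNR = 1 and TNR + FPR = 1, so reporting all five measures is partly redundant but makes the comparison between shadow and non-shadow scenarios easy to read.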

  20. A region finding method to remove the noise from the images of the human hand gesture recognition system

    Science.gov (United States)

    Khan, Muhammad Jibran; Mahmood, Waqas

    2015-12-01

    The performance of human hand gesture recognition systems depends on the quality of the images presented to the system. Because these systems work in a real-time environment, the images may be corrupted by environmental noise. Removing the noise can enhance the performance of the system. Many studies have presented different noise removal methods, but each has its own limitations. We present a region finding method for dealing with environmental noise that gives better results and enhances the performance of human hand gesture recognition systems, so that the recognition rate of the system can be improved.